TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | screenpipe | rewind.ai x cursor.com = your AI assistant that has all the context | 8.8K | |
2 | llama-models | Utilities intended for use with Llama models. | 4.7K | |
3 | DictionaryByGPT4 | 一本 GPT4 生成的单词书📚,超过 8000 个单词分析,涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事 | 3.7K | |
4 | RouteLLM | A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! | 3.2K | |
5 | gptpdf | Using GPT to parse PDF | 3.0K | |
6 | datachain | AI-data warehouse to enrich, transform and analyze unstructured data | 1.8K | |
7 | cambrian | Cambrian-1 is a family of multimodal LLMs with a vision-centric design. | 1.8K | |
8 | TEN-Agent | TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities. | 1.4K | |
9 | SwarmUI | SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility. | 1.3K | |
10 | WordLlama | Things you can do with the token embeddings of an LLM | 1.3K | |
11 | LlamaGen | Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation | 1.3K | |
12 | korvus | Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python, JavaScript, Rust and C. | 1.3K | |
13 | ShareGPT4Video | [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions | 1.3K | |
14 | ragbuilder | A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data | 1.3K | |
15 | colpali | The code used to train and run inference with the ColPali architecture. | 1.0K | |
16 | ttt-lm-pytorch | Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States | 1.0K | |
17 | LazyLLM | Easiest and laziest way for building multi-agent LLMs applications. | 1.0K | |
18 | Index-1.9B | A SOTA lightweight multilingual LLM | 898 | |
19 | Speech-AI-Forge | 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI. | 829 | |
20 | latitude-llm | Latitude is the open-source prompt engineering platform to build, evaluate, and refine your prompts with AI | 828 | |
21 | LLM-workshop-2024 | A 4-hour coding workshop to understand how LLMs are implemented and used | 737 | |
22 | ComfyUI-Florence2 | Inference Microsoft Florence2 VLM | 732 | |
23 | Agentless | Agentless🐱: an agentless approach to automatically solve software development problems | 707 | |
24 | llm-compressor | Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM | 659 | |
25 | dingllm.nvim | Yacine's LLM nvim scripts | 646 | |
26 | EAGLE | EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders | 535 | |
27 | voicechat2 | Local SRT/LLM/TTS Voicechat | 533 | |
28 | buffer-of-thought-llm | [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models | 530 | |
29 | ChatGPT-Mirror | 🚀 一键部署个人的 ChatGPT 镜像站 | 514 | |
30 | PyramidKV | The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling | 512 | |
31 | nerve | Instrument any LLM to do actual stuff. | 512 | |
32 | Ovis | A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings. | 507 | |
33 | LARS | An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. | 487 | |
34 | magpie | Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline! | 476 | |
35 | flash-diffusion | Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | 470 | |
36 | chatgpt-artifacts | Bring Claude's Artifacts feature to ChatGPT | 444 | |
37 | StableNormal | [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal | 425 | |
38 | awesome-production-llm | A curated list of awesome open-source libraries for production LLM | 372 | |
39 | mlx-gpt2 | gpt-2 from scratch in mlx | 358 | |
40 | meme_search | Index your memes by their content and text, making them easily retrievable for your meme warfare pleasures. Find funny fast. | 357 | |
41 | llama3.cuda | llama3.cuda is a pure C/CUDA implementation for Llama 3 model. | 305 | |
42 | esp-ai | The simplest and most cost-effective AI integration solution, enabling any device to achieve intelligent conversation functionality (based on ESP development boards). If you like this project, please give it a Star! | 最简单、最低成本的AI接入方案,让任何物品都能实现智能对话功能(基于ESP开发板)。喜欢本项目的话点个 Star 吧,您的一个 Star 对目前的仓库发展非常重要 | 288 | |
43 | llama.ttf | A font for writing tiny stories | 286 | |
44 | llamanet | Replace OpenAI with Llama.cpp Automagically. | 285 | |
45 | LiveBench | LiveBench: A Challenging, Contamination-Free LLM Benchmark | 281 | |
46 | Awesome-Jailbreak-on-LLMs | Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses. | 279 | |
47 | chrome-ai | Vercel AI provider for Chrome built-in model (Gemini Nano) | 273 | |
48 | llama-zip | LLM-powered lossless compression tool | 252 | |
49 | swiftide | Fast, streaming indexing and query library for AI (RAG) applications, written in Rust | 250 | |
50 | HuatuoGPT-Vision | Medical Multimodal LLMs | 249 | |
51 | Foundations-of-LLMs | 249 | ||
52 | EVE | [NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models | 225 | |
53 | VideoGPT-plus | Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | 211 | |
54 | GoMaxAI-ChatGPT-Midjourney-Pro | 基于Node.js、Vue3、uniapp的ChatGPT+Midjourney绘画+Suno音乐+Pika/Runway/Sora视频 网页服务 | 个人、团队、企业私有化AIGC平台 | 210 | |
55 | rust-genai | Rust multiprovider generative AI client (Ollama, OpenAi, Anthropic, Groq, Gemini, Cohere, ...) | 202 | |
56 | Awesome-Model-Merging-Methods-Theories-Applications | Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666. | 202 | |
57 | LLM-Finetune | 大语言模型微调,Qwen2、GLM4指令微调 | 201 | |
58 | flute | Fast Matrix Multiplications for Lookup Table-Quantized LLMs | 183 | |
59 | GPT-SoVITS2 | GPT-SoVITS2 | 178 | |
60 | llm-router | Tutorial for building LLM router | 157 | |
61 | Train-llm-from-scratch | 使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力 | 154 | |
62 | rai | RAI is a multi-vendor agent framework for robotics, utilizing Langchain and ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and more. | 154 | |
63 | LLaRA | LLaRA: Large Language and Robotics Assistant | 153 | |
64 | ceLLama | Cell type annotation with local Large Language Models (LLMs) - Ensuring privacy and speed with extensive customized reports | 143 | |
65 | StableFace | Build your own Face App with Stable Diffusion 2.1 | 140 | |
66 | TypeGPT | Integrate LLM's into your OS. For any issues or ideas, message us in the discord server below! | 138 | |
67 | GPTQModel | Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang. | 118 | |
68 | poixe | Platform of Open Intelligence eXperiences for Everyone. 面向所有人的开放智能体验平台,一站式AI对话工具聚合 | 114 | |
69 | Belullama | Belullama is a comprehensive AI application that bundles Ollama, Open WebUI, and Automatic1111 (Stable Diffusion WebUI) into a single, easy-to-use package. | 113 | |
70 | The-Creator-AI | Code like an architect | 105 | |
71 | appworld | 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper. | 104 | |
72 | MMTrustEval | A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks) | 101 | |
73 | MM-NIAH | [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents. | 98 | |
74 | voice-chat-ai | 🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs | 93 | |
75 | semantic-kernel-java | Semantic Kernel for Java. Integrate cutting-edge LLM technology quickly and easily into your Java based apps. See https://aka.ms/semantic-kernel. | 91 | |
76 | laravel-ai-translator | Automatic translate your language files into many languages using AI like Claude, GPT and etc. | 87 | |
77 | A3VLM | [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` | 87 | |
78 | ik_llama.cpp | llama.cpp fork with additional SOTA quants and improved performance | 86 | |
79 | llm-interface | A simple NPM interface for seamlessly interacting with 36 Large Language Model (LLM) providers, including OpenAI, Anthropic, Google Gemini, Cohere, Hugging Face Inference, NVIDIA AI, Mistral AI, AI21 Studio, LLaMA.CPP, and Ollama, and hundreds of models. | 85 | |
80 | glm4v-assistant | Sample GLM4V + ChatTTS AI assistant | 84 | |
81 | VoCo-LLaMA | VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models". | 81 | |
82 | Claude-React-Jumpstart | 📖 A step-by-step guide for beginners to running Claude-generated React code locally. | 80 | |
83 | awesome-ai-repositories | A curated list of open source repositories for AI Engineers | 80 | |
84 | sd_embed | Generate long weighted prompt embeddings for Stable Diffusion | 78 | |
85 | dive-into-spring-ai | 《动手学SpringAI》包含SSE流/Agent智能体/知识图谱RAG/FunctionCall/历史消息/图片生成/图片理解/Embedding/VectorDatabase/RAG | 76 | |
86 | tiny-ai-client | Tiny client for LLMs with vision and tool calling. As simple as it gets. | 75 | |
87 | LLaMA-Factory-Doc | LLaMA Factory Document | 73 | |
88 | GenerativeAI | GenAI & LLM usecases and applications | 73 | |
89 | ComfyUI-ppm | Fixed Attention Couple, NegPip(negative weights in prompts) for SDXL and FLUX, more CFG++ and SMEA DY samplers, etc. | 73 | |
90 | LLM_Categorical_Hierarchical_Representations | 72 | ||
91 | AIReceiptScanner | Swift library that utilize GPT-4o for scanning receipt and its items | 69 | |
92 | FFAIVideo | A node.js project that generates short videos using popular AI LLM. | 67 | |
93 | YoLLaVA | 🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant | 66 | |
94 | VLM-Visualizer | Visualizing the attention of vision-language models | 65 | |
95 | Retrochat-v2 | RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for engaging with different chat providers while offering robust features for managing and customizing your conversations. The code in this repo is 100% AI generated. Nothing has been written by a human. | 65 | |
96 | coir | A Comprehensive Benchmark for Code Information Retrieval. | 63 | |
97 | LiteMultiAgent | The Library for LLM-based multi-agent applications | 62 | |
98 | openshield | OpenShield is a new generation security layer for AI models | 59 | |
99 | chromegemini | Chrome AI Test Page, running Gemini Nano locally in your browser. | 59 | |
100 | SpeechLLM | This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingface. | 58 |