TOP AI Developers by monthly star count
Top AI Organization Accounts by AI repo star count
Top AI Projects by Category star count
Fastest-Growing list, ranked by the speed of gaining stars
Top list of developers who created influential repos but remain little known
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Related Project | Project intro | Star count |
---|---|---|---|
1 | browser-use | 🌐 Make websites accessible for AI agents. Automate tasks online with ease. | 61.6K | |
2 | Janus | Janus-Series: Unified Multimodal Understanding and Generation Models | 16.5K | |
3 | LightRAG | "LightRAG: Simple and Fast Retrieval-Augmented Generation" | 12.4K | |
4 | verl | verl: Volcano Engine Reinforcement Learning for LLMs | 8.5K | |
5 | open-canvas | 📃 A better UX for chat, writing content, and coding with LLMs. | 3.6K | |
6 | GLM-4-Voice | GLM-4-Voice: an end-to-end Chinese-English spoken dialogue model | 2.9K | |
7 | Sidekick | A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp. | 2.9K | |
8 | text-extract-api | Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown | 2.1K | |
9 | mini-omni2 | Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities. | 1.8K | |
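10 | Build-A-Large-Language-Model-CN | "Build a Large Language Model (From Scratch)" is an e-book that explores the principles and implementation of large language models in depth, suited to learners who want to understand the architecture, training process, and application development of large models such as GPT. To make this valuable textbook accessible to more Chinese readers, I decided to translate it into Chinese and share it as open source on GitHub. | 1.3K | |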
11 | WritingTools | The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more. | 1.3K | |
12 | open-notebook | An Open Source implementation of Notebook LM with more flexibility and features | 1.3K | |
13 | llama.vim | Vim plugin for LLM-assisted code/text completion | 1.3K | |
14 | FreeChatGpt-4o-2024 | Use ChatGpt 4o for free - no API key needed. Also ChatGpt 3 and ChatGpt 3.5. Experience the power of ChatGPT with a user-friendly interface, enhanced jailbreaks, and completely free. | 1.0K | |
15 | VideoChat | Real-time voice-interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice with no training required, supporting voice cloning, with first-packet latency as low as 3s. | 943 | |
16 | llama-swap | Model swapping for llama.cpp (or any local OpenAPI compatible server) | 806 | |
17 | LLaMA-O1 | Large Reasoning Models | 801 | |
18 | aoai-realtime-audio-sdk | Azure OpenAI code resources for using gpt-4o-realtime capabilities. | 776 | |
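19 | RIME-LMDG | End-to-end tutorial for building a grammar model for the Rime input method, with a fully tone-marked dictionary and the most complete tone-annotation toolchain: LMDG - Language, Model, Dictionary, Grammar. QQ group: 11033572 | 640 | |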
20 | VisRAG | Parsing-free RAG supported by VLMs | 611 | |
21 | mlx-engine | Apple MLX engine for LM Studio | 520 | |
22 | localGPT-Vision | Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs | 513 | |
23 | simplemind | Python API client for AI providers that intends to replace LangChain and LangGraph for most common use cases. | 479 | |
24 | SDXL_EcomID_ComfyUI | | 453 | |
25 | ghostwriter | Use the reMarkable2 as an interface to vision-LLMs (ChatGPT, Claude, Gemini). Ghost in the machine! | 436 | |
26 | pearai-master | VSCode for the new age of AI. | 433 | |
27 | CoI-Agent | Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents | 427 | |
28 | PodCastLM | Generate Chinese podcasts from PDFs | 417 | |
29 | promptwright | Generate large synthetic data using an LLM | 377 | |
30 | joycaption | JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models. | 349 | |
31 | codai | Codai is an AI code assistant that helps developers through a session-based CLI, providing intelligent code suggestions, refactoring, and code reviews based on the full context of the project. | 327 | |
32 | agent-as-a-judge | 🤠 Agent-as-a-Judge and DevAI dataset | 319 | |
33 | aider.el | aider emacs plugin for https://github.com/paul-gauthier/aider | 297 | |
34 | nGPT-pytorch | Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI | 271 | |
35 | NotebookMLX | 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama) | 252 | |
36 | Kolosal | Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device. | 227 | |
37 | engy | Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process from idea to working prototype. | 223 | |
38 | mediachain | AI toolkit for making Shorts/Tiktoks | 214 | |
39 | Mirror | An LLM-powered programming-by-example programming language. | 193 | |
40 | chat-ui | Chat UI components for LLM apps | 184 | |
41 | MagicPIG | [ICLR2025] MagicPIG: LSH Sampling for Efficient LLM Generation | 184 | |
42 | ai-analyst | Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts. | 175 | |
43 | MPLSandbox | MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs. | 169 | |
44 | oat | 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. | 167 | |
45 | phantasm | Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time. | 156 | |
46 | PixelLlama | Ollama client written in Python | 155 | |
47 | VLM2Vec | This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25] | 152 | |
48 | AgentSquare | The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space" | 151 | |
49 | llm-jq | Write and execute jq programs with the help of LLM | 149 | |
50 | FreeChatGpt-4o-2024 | | 136 | |
51 | Hands-On-LLM-Fine-Tuning | Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques | 130 | |
52 | meta-prompt | For LLMs to better code with Jina API | 129 | |
53 | backtrack_sampler | An easy-to-understand framework for LLM samplers that rewind and revise generated tokens | 129 | |
54 | gemini-ai-code-reviewer | A GitHub Action that automatically reviews pull requests using Google's Gemini AI. | 129 | |
55 | Fast-LLM | Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research | 129 | |
56 | LearnGo | LearnGo, a versatile iOS learning tool with RAG and prompting on LLMs such as GPT-4o-mini, enabling subject Q&A, smart note generation from uploaded materials, offline photo (or AR) object recognition with translations, and multilingual voice input (Cantonese, English, Mandarin). It also features customizable themes for a personalized experience. | 124 | |
57 | qapyq | An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and LoRA. | 124 | |
58 | lite_llama | A light llama-like llm inference framework based on the triton kernel. | 115 | |
59 | jarvis | A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life. | 115 | |
60 | comanda | Execute agentic workflows defined in simple YAML files | 108 | |
61 | StructRAG | StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization | 103 | |
62 | SG-Nav | [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | 101 | |
63 | UniversalBackrooms | A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o, and o1-preview | 100 | |
64 | Modality-Integration-Rate | The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate". | 96 | |
65 | graph-constrained-reasoning | Official Implementation of "Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models". | 96 | |
66 | intelligence | A Ruby gem for seamlessly and uniformly interacting with large language and vision model (LLM) APIs served by numerous services, including those of OpenAI, Anthropic, Google and others. | 95 | |
67 | Fira | Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | 92 | |
68 | superagentx | Lightweight Multi Agent AI Orchestrator Framework with AGI Capabilities. | 91 | |
69 | FlipAttack | [arXiv 2024] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping". | 89 | |
70 | ai-chat-android | 💬 AI Chat Bot demo app showcasing the integration of Gemini SDK with Firebase Realtime Database for real-time chat functionality. | 89 | |
71 | TimeMarker | A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability | 86 | |
72 | VLM-Grounder | [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | 85 | |
73 | pal | LLM assistants for R | 80 | |
74 | Grounded-Video-LLM | Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | 79 | |
75 | SparseVLMs | Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference". | 77 | |
76 | From_News_to_Forecast | This repository is for the paper entitled: From News to Forecast: Integrating Event Analysis in LLM-based Time Series Forecasting with Reflection (NeurIPS 2024) | 75 | |
77 | chat_with_pdf | Chat with PDF using Llama 3.3 | 75 | |
78 | Quest-Best-Tokens | An introduction to LLM Sampling | 75 | |
79 | STRING | [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" | 69 | |
80 | NotebookLlama-Groq | NotebookLlama powered by Groq - Create podcasts on any topic lightning fast | 67 | |
81 | Web-Agent | Web Agent is an automation tool driven by AI. Designed for seamless navigation and task execution on the web, it intelligently interacts with dynamic web elements, performs searches, downloads files, and adapts to page changes. | 66 | |
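82 | chatgpt-plus-hezu | Latest guide to sharing a ChatGPT Plus subscription: a recommendation of the most reliable ChatGPT Plus account-sharing platform in China (only 27 yuan per month)! Gives access to GPT-4o image generation and the GPT-4.1 model series, and also supports the full-strength DeepSeek-R1, Musk's Grok-3, and Google's Gemini-2.5 Pro! If you cannot get around network restrictions, or find the 20 USD monthly membership fee too high, you can consider a shared ChatGPT Plus account. This approach lowers the cost of use and avoids the complexity of circumventing network restrictions. | 61 | |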
83 | Build-a-Large-Language-Model-from-Scratch | Building a GPT-like LLM from scratch with PyTorch. | 60 | |
84 | sdxl-unbox | Sparse Autoencoders for Stable Diffusion XL models. | 60 | |
85 | Awesome-Neuro-Symbolic-Learning-with-LLM | ✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models | 54 | |
86 | sd-webui-ux | Frontend Engine Extension for Stable Diffusion Web UI and Stable Diffusion Web UI Forge. | 53 | |
87 | Differential-Transformer-PyTorch | PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incorporates a novel Differential Attention mechanism, Multi-Head structure, RMSNorm, and SwiGLU. | 52 | |
88 | SubgraphRAG | [ICLR 2025] Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation | 51 | |
89 | Graphy-v1 | Graphy v1: A Realtime GraphRAG App using Langchain, Neo4j, GPT-4o, and Streamlit. | 49 | |
90 | podcast-llm | Automatically generate engaging AI podcasts from nothing but an episode title. | 49 | |
91 | ReachQA | Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" | 48 | |
92 | chat2note | This is a tool that can automatically translate the chat log into a note using LLM API | 47 | |
93 | robot-3dlotus | Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." | 47 | |
94 | WololoGPT | | 46 | |
95 | gpt-bert | Official implementation of "GPT or BERT: why not both?" | 46 | |
96 | LMforImageGeneration | Codebase for the paper "Elucidating the design space of language models for image generation" | 45 | |
97 | AutoGLM | | 44 | |
98 | google-gemini-app-clone | | 43 | |
99 | SyllableLM | Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models | 42 | |
100 | SAE-based-representation-engineering | [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering | 42 |