Top AI Developers by monthly star count
Top AI Organization Accounts by AI repo star count
Top AI Projects by category star count
Fastest-Growing list by rate of star gain
Top list of little-known developers who create influential repos
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Related Project | Project intro | Star count |
---|---|---|---|
1 | browser-use | 🌐 Make websites accessible for AI agents. Automate tasks online with ease. | 64.4K | |
2 | LightRAG | "LightRAG: Simple and Fast Retrieval-Augmented Generation" | 17.7K | |
3 | Janus | Janus-Series: Unified Multimodal Understanding and Generation Models | 16.5K | |
4 | suna | Suna - Open Source Generalist AI Agent | 14.5K | |
5 | verl | verl: Volcano Engine Reinforcement Learning for LLMs | 10.2K | |
6 | LatentSync | Taming Stable Diffusion for Lip Sync! | 4.4K | |
7 | preswald | Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document. | 4.3K | |
8 | open-canvas | 📃 A better UX for chat, writing content, and coding with LLMs. | 3.6K | |
9 | forge | AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models | 3.2K | |
10 | GLM-4-Voice | GLM-4-Voice: End-to-end Chinese-English spoken dialogue model | 3.0K | |
11 | text-extract-api | Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown | 2.1K | |
12 | multimodal-live-api-web-console | A React-based starter app for using the Multimodal Live API over websockets with Gemini | 1.8K | |
13 | mini-omni2 | Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities. | 1.8K | |
14 | llama.vim | Vim plugin for LLM-assisted code/text completion | 1.3K | |
15 | basic-memory | Basic Memory is a knowledge management system that allows you to build a persistent semantic graph from conversations with AI assistants, stored in standard Markdown files on your computer. Integrates directly with Obsidian.md | 1.0K | |
16 | meeting-minutes | A free and open-source, self-hosted, AI-based live meeting note taker and minutes summary generator that can run entirely on your local device (macOS is supported; Windows and Linux support is planned) | 978 | |
17 | LLaMA-Mesh | Unifying 3D Mesh Generation with Language Models | 910 | |
18 | ZhiLight | A highly optimized LLM inference acceleration engine for Llama and its variants. | 897 | |
19 | LLaMA-O1 | Large Reasoning Models | 801 | |
20 | starter-applets | Google AI Studio Starter Apps | 799 | |
21 | VisRAG | Parsing-free RAG supported by VLMs | 611 | |
22 | groq-appgen | Project showcasing Llama 3.3 70B HTML codegen abilities | 607 | |
23 | PocketFlow | Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves. | 586 | |
24 | UniWorld-V1 | UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation | 583 | |
25 | mlx-engine | Apple MLX engine for LM Studio | 520 | |
26 | vlmrun-hub | A hub for various industry-specific schemas to be used with VLMs. | 510 | |
27 | SDXL_EcomID_ComfyUI | | 465 | |
28 | daydreams | Daydreams is a generative agent framework for executing anything onchain | 457 | |
29 | OmniThink | OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking | 445 | |
30 | swift-chat | A lightning-fast, cross-platform AI chat application built with React Native. | 445 | |
31 | pearai-master | VSCode for the new age of AI. | 433 | |
32 | CoI-Agent | Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents | 427 | |
33 | WorkflowAI | WorkflowAI is an open-source platform where product and engineering teams collaborate to build and iterate on AI features. | 412 | |
34 | chat-ui | Chat UI components for LLM apps | 392 | |
35 | clickclickclick | A framework to enable autonomous android and computer use using any LLM (local or remote) | 382 | |
36 | promptwright | Generate large synthetic data using an LLM | 377 | |
37 | codegate | CodeGate: CodeGen Privacy and Security | 326 | |
38 | ai-gateway | Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI services. | 322 | |
39 | agent-as-a-judge | 🤠 Agent-as-a-Judge and DevAI dataset | 319 | |
40 | RoboVLMs | | 285 | |
41 | VLM2Vec | This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025] | 274 | |
42 | awesome-open-source-lms | Friends of OLMo and their links. | 266 | |
43 | dingo | Dingo: A Comprehensive AI Data Quality Evaluation Tool | 256 | |
44 | Kolosal | Kolosal AI is an open-source, lightweight alternative to LM Studio for running LLMs 100% offline on your device. | 227 | |
45 | airweave | Turn any app into agent knowledge | 226 | |
46 | VLABench | Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. | 225 | |
47 | mediachain | AI toolkit for making Shorts/Tiktoks | 214 | |
48 | notte | The agentic internet | 189 | |
49 | MagicPIG | [ICLR2025] MagicPIG: LSH Sampling for Efficient LLM Generation | 184 | |
50 | fabrice-ai | A lightweight, functional, and composable framework for building AI agents. No PhD required. | 181 | |
51 | ai-analyst | Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts. | 175 | |
52 | oat | 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. | 167 | |
53 | ffpa-attn-mma | 📚FFPA(Split-D): Yet another Faster Flash Attention with O(1) GPU SRAM complexity for large headdim, 1.8x~3x↑🎉 faster than SDPA EA. | 162 | |
54 | ChatRex | Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding | 156 | |
55 | phantasm | Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents' workflows in real time. | 156 | |
56 | AgentSquare | The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space" | 151 | |
57 | GLM-Edge | GLM Series Edge Models | 142 | |
58 | llm4ad | LLM4AD: A Platform for Algorithm Design with Large Language Model | 131 | |
59 | meta-prompt | For LLMs to better code with Jina API | 129 | |
60 | Fast-LLM | Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research | 129 | |
61 | BALROG | Benchmarking Agentic LLM and VLM Reasoning On Games | 117 | |
62 | FreeScale | Code for FreeScale, a tuning-free method for higher-resolution visual generation | 115 | |
63 | Awesome-LLM-Reasoning-with-NeSy | ✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models | 107 | |
64 | TrustEval-toolkit | Toolkit for evaluating the trustworthiness of generative foundation models. | 105 | |
65 | intelligence | A Ruby gem for seamlessly and uniformly interacting with large language and vision model (LLM) APIs served by numerous services, including those of OpenAI, Anthropic, Google and others. | 95 | |
66 | superagentx | Lightweight Multi Agent AI Orchestrator Framework with AGI Capabilities. | 91 | |
67 | ai-chat-android | 💬 AI Chat Bot demo app showcasing the integration of Gemini SDK with Firebase Realtime Database for real-time chat functionality. | 89 | |
68 | VLM-Grounder | [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | 85 | |
69 | flair | [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations | 84 | |
70 | OSWorld-G | Scaling Computer-Use Grounding via UI Decomposition and Synthesis | 79 | |
71 | mcp-server-llamacloud | An MCP server connecting to managed indexes on LlamaCloud | 77 | |
72 | AI-Blueprints | 📁 This repository contains end-to-end AI blueprint projects designed to run effortlessly across a wide range of use cases, including data science, machine learning, deep learning, and generative AI. 🛠️ All projects are built using HP AI Studio with ❤️. If you find this useful, please don’t forget to star the repository ⭐ and support our work 🚀 | 76 | |
73 | SepLLM | [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator" | 75 | |
74 | Quest-Best-Tokens | An introduction to LLM Sampling | 75 | |
75 | one | Build AI powered websites with Astro, Shadcn and Vercel AI SDK | 73 | |
76 | STRING | [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" | 69 | |
77 | Web-Agent | Web Agent is an automation tool driven by AI. Designed for seamless navigation and task execution on the web, it intelligently interacts with dynamic web elements, performs searches, downloads files, and adapts to page changes. | 66 | |
78 | 3d-conditioning | Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset. | 61 | |
79 | DiGIT | [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | 59 | |
80 | ID-Patch | Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposes ID-Patch, a fast and robust method that links identity features to 2D positions via visual patches and embeddings. | 58 | |
81 | sparse_transformers | Sparse Inferencing for transformer based LLMs | 58 | |
82 | SeeDo | [IROS 2025] Human Demo Videos to Robot Action Plans | 54 | |
83 | SubgraphRAG | [ICLR 2025] Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation | 51 | |
84 | OLA-VLM | OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024 | 51 | |
85 | GEO-Bench-VLM | GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks | 49 | |
86 | VLMnav | End-to-End Navigation with VLMs | 48 | |
87 | chat2note | A tool that automatically converts a chat log into a note using an LLM API | 47 | |
88 | robot-3dlotus | Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." | 47 | |
89 | gpt-bert | Official implementation of "GPT or BERT: why not both?" | 46 | |
90 | Montessori-Instruct | Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] | 41 | |
91 | vscode-copilot-vision | Exploration into leveraging vision capabilities of an LLM | 40 | |
92 | FinGLM2 | Repository for the Zhipu AI 2024 Financial Industry Large Model Challenge | 40 | |
93 | Emma-X | Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | 39 | |
94 | vlm-knowledge-conflict | Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." | 39 | |
95 | LLM4SR | LLM for Scientific Research Survey | 39 | |
96 | ApolloMoE | ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | 38 | |
97 | docspedia | Chat with your PDF using your local LLM via the Ollama client (incomplete). | 36 | |
98 | DPO_pLM | | 36 | |
99 | duckdb-extension-openprompt | DuckDB Community Extension to prompt LLMs from SQL | 34 | |
100 | gptme-webui | Web UI for gptme, built with lovable.dev | 34 |