Top AI developers by monthly star count
Top AI organization accounts by AI repo star count
Top AI projects by category star count
Top growing-speed list, ranked by how fast stars are gained
Top list of little-known developers who created influential repos
Rankings | Related Project | Project intro | Star count |
---|---|---|---|
1 | BitNet | Official inference framework for 1-bit LLMs | 12.7K | |
2 | CosyVoice | Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 10.4K | |
3 | RagaAI-Catalyst | Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view | 10.4K | |
4 | void | | 9.9K | |
5 | zerox | PDF to Markdown with vision models | 9.4K | |
6 | goose | An open-source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM | 7.3K | |
7 | midscene | Let AI be your browser operator. | 6.1K | |
8 | MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 5.9K | |
9 | Liger-Kernel | Efficient Triton Kernels for LLM Training | 4.4K | |
10 | nexa-sdk | Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities. | 4.3K | |
11 | SenseVoice | Multilingual Voice Understanding Model | 4.3K | |
12 | fragments | Open-source Next.js template for building apps that are fully generated by AI. By E2B. | 3.9K | |
13 | LLaMA-Omni | LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level. | 2.8K | |
14 | database-build | In-browser Postgres sandbox with AI assistance (formerly postgres.new) | 2.7K | |
15 | tensorzero | TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models. | 2.6K | |
16 | aide | Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 💪 | 2.5K | |
17 | ichigo | Local realtime voice AI | 2.2K | |
18 | OmAgent | Build multimodal language agents for fast prototype and production | 1.7K | |
19 | docetl | A system for agentic LLM-powered data processing and ETL | 1.7K | |
20 | lmnr | Laminar - open-source all-in-one platform for engineering AI products. Create a data flywheel for your AI app. Traces, Evals, Datasets, Labels. YC S24. | 1.6K | |
21 | archgw | AI-native (edge and LLM) proxy for agents. Engineered with fast ⚡️ LLMs for task (query) routing, rich observability, and the seamless integration of prompts with your APIs for agentic tasks. Built by the contributors of Envoy proxy. | 1.6K | |
22 | mastra | The TypeScript AI framework. | 1.5K | |
23 | pyspur | Graph UI for AI Agents in Python | 1.5K | |
24 | claude-coder | Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding agents | 1.2K | |
25 | MobileLLM | MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. | 1.2K | |
26 | sage | Chat with any codebase in under two minutes. Fully local or via third-party APIs | 1.2K | |
27 | prompt-poet | Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach. | 1.0K | |
28 | rosa | ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots. | 956 | |
29 | e2m | E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution. | 920 | |
30 | BaseAI | BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command. | 887 | |
31 | spiritlm | Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". | 873 | |
32 | dynamiq | Dynamiq is an orchestration framework for agentic AI and LLM applications | 715 | |
33 | Hexabot | Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. | 641 | |
34 | huggingface-llama-recipes | | 602 | |
35 | humanlayer | HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and more. Bring your LLM and Framework of choice and start giving your AI agents safe access to the world. Agentic Workflows, human in the loop, tool calling | 583 | |
36 | PromptChains | Prompt chains maximize intelligence and results when using LLMs | 563 | |
37 | Starmoon | An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, healthcare, IoT applications, AI-enhanced robotics application services, and DIY robotics. Built with Python, NextJS, Arduino, ESP32, LLMs (GPT-4o), STT, TTS, AI agent. | 478 | |
38 | llama-assistant | AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephrasing sentences, answering questions, writing emails, and more. | 475 | |
39 | LongCite | LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | 462 | |
40 | LLM2CLIP | LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. | 455 | |
41 | multi-agent-concierge | An example of multi-agent orchestration with llama-index | 390 | |
42 | fastagency | The fastest way to bring multi-agent workflows to production. | 386 | |
43 | mem0-chrome-extension | Claude Memory: Long-term memory for Claude | 378 | |
44 | aisearch-openai-rag-audio | A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model. | 357 | |
45 | ellmer | Call LLM APIs from R | 340 | |
46 | CleanS2S | High-quality and streaming Speech-to-Speech interactive agent in a single file. A streaming, full-duplex voice interaction prototype agent implemented in just one file! | 328 | |
47 | FunAudioLLM-APP | | 323 | |
48 | gemini-api-quickstart | Get up and running with the Gemini API in under 5 minutes (with Python) | 312 | |
49 | Awesome-Attention-Heads | An awesome repository & A comprehensive survey on interpretability of LLM attention heads. | 303 | |
50 | trustgraph | Deploy agentic reasoning in a scalable and reliable platform in minutes. Become an on demand subject matter expert by loading portable cognitive cores for the most complex knowledge work. 🧠 | 288 | |
51 | premsql | End-to-End Local-First Text-to-SQL Pipelines | 281 | |
52 | DataHorse | Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs. | 252 | |
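53 | chatwiki | An out-of-the-box, LLM-based intelligent customer-service chatbot Q&A system built on an enterprise's private knowledge base. Supports private deployment; the code is free, open source, and available for commercial use. Officially released by 芝麻小客服. | 246 | |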
54 | apo | APO is a one-stop observability platform combining OpenTelemetry with eBPF. Leveraging LLM capabilities to enable auto-pilot analyzing and troubleshooting 🚀. | 244 | |
55 | LlamaVoice | LlamaVoice is a llama-based large voice generation model, providing inference and training ability. | 230 | |
56 | swarmzero | SwarmZero's SDK for building AI agents, swarms of agents and much more. | 221 | |
57 | ProX | Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" | 209 | |
58 | co-op-translator | Easily generate multilingual translations for your project with a single command, powered by Azure AI Services. | 208 | |
59 | VideoGen-Eval | The Dawn of Video Generation: Preliminary Explorations with SORA-like Models | 205 | |
60 | TapeAgents | TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle | 205 | |
61 | GraphRag.Net | A dotnet implementation modeled on GraphRag and built with Semantic Kernel; it can be integrated into a project out of the box via NuGet | 203 | |
62 | MooER | MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition. | 190 | |
63 | AUITestAgent | AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification. | 182 | |
64 | bolna | Conversational voice AI agents | 165 | |
65 | langfair | LangFair is a Python library for conducting use-case level LLM bias and fairness assessments | 165 | |
66 | TrafficLLM | The repository of TrafficLLM, a universal LLM adaptation framework to learn robust traffic representations for all open-source LLMs in real-world scenarios and enhance generalization across diverse traffic analysis tasks. | 161 | |
67 | marly | Context-aware structured outputs. Search your documents or the web for specific data and get it back in JSON or Markdown. | 145 | |
68 | gateway-api-inference-extension | Gateway API Inference Extension | 143 | |
69 | awesome_LLM-harmful-fine-tuning-papers | A survey on harmful fine-tuning attacks for large language models | 133 | |
70 | DenseFusion | DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | 131 | |
71 | llama-stack-client-python | Python SDK for Llama Stack | 126 | |
72 | transformer-ranker | Efficiently find the best-suited language model (LM) for your NLP task | 116 | |
73 | llama_extract | | 113 | |
74 | effective_llm_alignment | Effective LLM Alignment Toolkit | 113 | |
75 | TEAL | | 110 | |
76 | ai-shifu | LLM-powered AI guide that leads and drives intelligent conversations | 109 | |
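77 | KB-Builder | Knowledge Base Builder is an open-source, LLM-based knowledge-base text processing system. It is an open-source tool from 「滨电智言」 that aims to become the hub for building enterprise knowledge bases. | 107 | |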
78 | eureka-ml-insights | A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. | 106 | |
79 | multimodal-ai-llm-processing-accelerator | Build multimodal data processing pipelines with Azure AI Services + LLMs | 106 | |
80 | finetuning | Finetune Llama-3-8b on the MathInstruct dataset | 105 | |
81 | LLaVA-MORE | LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 | 97 | |
82 | flockmtl | FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs) | 95 | |
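83 | grps_trtllm | [High-performance OpenAI LLM service] A pure C++ high-performance OpenAI LLM service implemented with GRPS + TensorRT-LLM + Tokenizers.cpp. Supports chat and function call modes, AI agents, distributed multi-GPU inference, multimodal input, and a Gradio chat interface. | 93 | |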
84 | LawGLM | Exploring the potential of LLM applications in the legal industry | 80 | |
85 | proxy-to-gemini | A proxy sidecar to access Gemini models via OpenAI and Ollama APIs | 74 | |
86 | LLMServingSim | LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | 69 | |
87 | Awesome-Multimodal-LLM-for-Math-STEM | Paper collections of multi-modal LLM for Math/STEM/Code. | 68 | |
88 | multilspy | multilspy is an LSP client library in Python intended to be used to build applications around language servers. | 66 | |
89 | MoE-PEFT | An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT | 65 | |
90 | GhostOS | A framework that offers an OS simulator within a Python code interface for AI agents | 57 | |
91 | flow-judge | Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafted for accuracy, speed, and customization. | 57 | |
92 | barq | Dabarqus is incredibly fast RAG that runs everywhere. | 56 | |
93 | VLM | | 56 | |
94 | gLM2 | | 55 | |
95 | study-drift-lms | A modern learning management system to place learning in the hands of the students | 54 | |
96 | KDPL | [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | 53 | |
97 | gpt-rag-agentic | | 51 | |
98 | openai-chat-vision-quickstart | A demonstration of chatting with uploaded images using OpenAI vision models like gpt-4o. | 51 | |
99 | LiteWebAgent | The Library for LLM-based web-agent applications | 50 | |
100 | agent-openai-java-banking-assistant | Multi-agent banking assistant with Java and Semantic Kernel | 49