TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | Auto_Jobs_Applier_AI_Agent | Auto_Jobs_Applier_AI_Agent by AIHawk is an AI Agent that automates the jobs application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way. | 22.2K | |
2 | BitNet | Official inference framework for 1-bit LLMs | 11.1K | |
3 | void | 8.1K | ||
4 | CosyVoice | Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 6.2K | |
5 | zerox | PDF to Markdown with vision models | 6.1K | |
6 | MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 5.1K | |
7 | nexa-sdk | Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities. | 3.4K | |
8 | Liger-Kernel | Efficient Triton Kernels for LLM Training | 3.4K | |
9 | fragments | Open-source Next.js template for building apps that are fully generated by AI. By E2B. | 3.4K | |
10 | SenseVoice | Multilingual Voice Understanding Model | 3.4K | |
11 | LLaMA-Omni | LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level. | 2.6K | |
12 | postgres-new | In-browser Postgres sandbox with AI assistance | 2.4K | |
13 | aide | Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 在 VSCode 中征服任何代码:一键注释、转换、UI 图生成代码、AI 批量处理文件!💪 | 2.3K | |
14 | ichigo | Llama3.1 learns to Listen | 1.8K | |
15 | docetl | A system for agentic LLM-powered data processing and ETL | 1.3K | |
16 | OmAgent | A Multimodal Native Agent Framework for Smart Hardware and More | 1.2K | |
17 | MobileLLM | MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. | 1.1K | |
18 | lmnr | Laminar - open-source all-in-one platform for engineering AI products. Traces, Evals, Datasets, Labels. YC S24. | 1.1K | |
19 | sage | Chat with any codebase in under two minutes | Fully local or via third-party APIs | 1.0K | |
20 | AgentNeo | Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view | 988 | |
21 | prompt-poet | Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach. | 911 | |
22 | spiritlm | Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". | 768 | |
23 | rosa | ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots. | 645 | |
24 | arch | Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with APIs - all outside business logic. Built by the core contributors of Envoy proxy, on Envoy. | 597 | |
25 | BaseAI | BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command. | 530 | |
26 | huggingface-llama-recipes | 529 | ||
27 | tensorzero | TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models. | 480 | |
28 | midscene | An AI-powered automation SDK can control the page, perform assertions, and extract data in JSON format using natural language. | 477 | |
29 | Hexabot | Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. | 477 | |
30 | lotus | LOTUS: A semantic query engine - process data with LLMs as easily as writing pandas code | 421 | |
31 | llama-assistant | AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more. | 412 | |
32 | LongCite | LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | 407 | |
33 | Starmoon | An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics application services, research, and DIY robotics kit development using Python, NextJs, Arduino, ESP32, LLMs (GPT), STT, TTS, Emotion Analysis, AI agent | 402 | |
34 | mem0-chrome-extension | Claude Memory: Long-term memory for Claude | 353 | |
35 | PromptChains | Prompt chains maximize intelligence and results when using LLMs | 340 | |
36 | multi-agent-concierge | An example of multi-agent orchestration with llama-index | 320 | |
37 | FunAudioLLM-APP | 287 | ||
38 | e2m | E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution. | 286 | |
39 | claude-coder | Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding agents | 283 | |
40 | fastagency | The fastest way to bring multi-agent workflows to production. | 280 | |
41 | gemini-api-quickstart | Get up and running with the Gemini API in under 5 minutes (with Python) | 276 | |
42 | Awesome-Attention-Heads | An awesome repository & A comprehensive survey on interpretability of LLM attention heads. | 264 | |
43 | humanlayer | HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and more. Bring your LLM and Framework of choice and start giving your AI agents safe access to the world. Agentic Workflows, human in the loop, tool calling | 257 | |
44 | aisearch-openai-rag-audio | A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model. | 253 | |
45 | DataHorse | Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs. | 241 | |
46 | dynamiq | Dynamiq is an orchestration framework for agentic AI and LLM applications | 237 | |
47 | LlamaVoice | LlamaVoice is a llama-based large voice generation model, providing inference and training ability. | 221 | |
48 | CleanS2S | High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体! | 207 | |
49 | mastra | The TypeScript AI framework. | 192 | |
50 | GraphRag.Net | 参考GraphRag使用 Semantic Kernel 来实现的dotnet版本,可以使用NuGet开箱即用集成到项目中 | 184 | |
51 | trustgraph | Connect Data Silos with Reliable AI⚡🚀 | 166 | |
52 | MooER | MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition. | 157 | |
53 | chatwiki | 开箱即用的基于企业私有知识库的LLM大语言模型的智能客服机器人问答系统,支持私有化部署,代码免费开源且可商用,由芝麻小客服官方推出。 | 150 | |
54 | AUITestAgent | AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification. | 148 | |
55 | premsql | End-to-End Local-First Text-to-SQL Pipelines | 139 | |
56 | TapeAgents | TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle | 122 | |
57 | KB-Builder | Knowledge Base Builder,是一款基于LLM大语言模型的开源知识库生成管理优化构建系统,是「滨电智言」的一款开源工具,旨在成为企业的知识库构建中枢。 | 118 | |
58 | DenseFusion | DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | 117 | |
59 | VideoGen-Eval | The Dawn of Video Generation: Preliminary Explorations with SORA-like Models | 116 | |
60 | marly | Contextualized Structured Outputs. Search your documents or the web for specific data and get it back in JSON or Markdown. | 112 | |
61 | TrafficLLM | The repository of TrafficLLM, a universal LLM adaptation framework to learn robust traffic representation for all open-sourced LLM in real-world scenarios and enhance the generalization across diverse traffic analysis tasks. | 110 | |
62 | llama_extract | 105 | ||
63 | finetuning | Finetune Llama-3-8b on the MathInstruct dataset | 97 | |
64 | TEAL | 95 | ||
65 | grps_trtllm | 【grps接入trtllm】通过GPRS+TensorRT-LLM+Tokenizers.cpp实现纯C++版高性能OpenAI LLM服务,支持chat和function call模式,支持ai agent,支持分布式多卡推理,支持多模态,支持gradio聊天界面。 | 89 | |
66 | effective_llm_alignment | Effective LLM Alignment Toolkit | 87 | |
67 | LLaVA-MORE | LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 | 84 | |
68 | eureka-ml-insights | A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. | 83 | |
69 | transformer-ranker | Efficiently find the best-suited language model (LM) for your NLP task | 77 | |
70 | awesome_LLM-harmful-fine-tuning-papers | A survey on harmful fine-tuning attack for large language model | 76 | |
71 | swarmzero | SwarmZero's SDK for building AI agents, swarms of agents and much more. | 74 | |
72 | multimodal-ai-llm-processing-accelerator | Build multimodal data processing pipelines with Azure AI Services + LLMs | 73 | |
73 | bolna | Full stack tools for building voice agents | 71 | |
74 | llama-stack-client-python | Python SDK for Llama Stack | 69 | |
75 | llm-instance-gateway | LLM Instance gateway implementation. | 69 | |
76 | flockmtl-duckdb | FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs) | 65 | |
77 | co-op-translator | Easily automate multilingual translations for your projects with co-op-translator, powered by advanced LLM technology and Azure AI Services. | 60 | |
78 | proxy-to-gemini | A proxy sidecar to access Gemini models via OpenAI and Ollama APIs | 56 | |
79 | langfair | LangFair is a Python library for conducting use-case level LLM bias and fairness assessments | 54 | |
80 | ai-shifu | LLM-powered AI guide that leads and drives intelligent conversations | 53 | |
81 | flow-judge | Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafted for accuracy, speed, and customization. | 53 | |
82 | gLM2 | 52 | ||
83 | GhostOS | An agent framework offering a Python code interface for LLM-driven agents and meta-agents to do everything by code generation. | 51 | |
84 | study-drift-lms | A modern learning management system to place learning in the hands of the students | 50 | |
85 | KDPL | [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | 46 | |
86 | MoE-PEFT | An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT | 43 | |
87 | ros2_nanollm | ROS2 nodes for LLM, VLM, VLA | 38 | |
88 | barq | Dabarqus is a stand alone application that implements a complete RAG solution. | 37 | |
89 | raft-distillation-recipe | A recipe that will walk you through using either Meta Llama 3.1 405B or GPT-4o deployed on Azure AI to generate a synthetic dataset using UC Berkeley's Gorilla project RAFT method. | 33 | |
90 | openai-chat-vision-quickstart | A demonstration of chatting with uploaded images using OpenAI vision models like gpt-4o. | 31 | |
91 | LiteWebAgent | The Library for LLM-based web-agent applications | 31 | |
92 | Parrot | 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch. | 30 | |
93 | dify-google-cloud-terraform | Terraform configuration for deploying Dify on Google Cloud with scalability, high availability, and production-level readiness. | 29 | |
94 | VLM | 29 | ||
95 | GMAI-MMBench | GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. | 29 | |
96 | agent-openai-java-banking-assistant | multi-agents banking assistant with Java and Semantic Kernel | 28 | |
97 | Awesome-Multimodal-LLM-for-Math-STEM | Paper collections of multi-modal LLM for Math/STEM/Code. | 28 | |
98 | stable-diffusion-webui-forge | Stable Diffusion WebUI Forge docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience. | 27 | |
99 | moorellm | Finite State Machine based approach to create Agentic LLM Apps! | 27 | |
100 | PodGPT | PodGPT: A multilingual audio-augmented large language model for research and education | 27 |