Top AI Developers by monthly star count
Top AI Organization Accounts by AI repo star count
Top AI Projects by category star count
Fastest-growing list, ranked by speed of gaining stars
Top list of little-known developers who created influential repos
Rankings | Related Project | Project intro | Star count | Developers |
---|---|---|---|---|
1 | Auto_Jobs_Applier_AI_Agent | Auto_Jobs_Applier_AI_Agent by AIHawk is an AI Agent that automates the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way. | 22.5K | |
2 | BitNet | Official inference framework for 1-bit LLMs | 11.4K | |
3 | void | | 8.2K | |
4 | zerox | PDF to Markdown with vision models | 6.7K | |
5 | CosyVoice | Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 6.4K | |
6 | MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 5.2K | |
7 | nexa-sdk | Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities. | 4.2K | |
8 | SenseVoice | Multilingual Voice Understanding Model | 3.5K | |
9 | fragments | Open-source Next.js template for building apps that are fully generated by AI. By E2B. | 3.5K | |
10 | Liger-Kernel | Efficient Triton Kernels for LLM Training | 3.5K | |
11 | LLaMA-Omni | LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level. | 2.6K | |
12 | postgres-new | In-browser Postgres sandbox with AI assistance | 2.5K | |
13 | aide | Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 💪 | 2.3K | |
14 | ichigo | Local realtime voice AI | 2.0K | |
15 | OmAgent | A Multimodal Native Agent Framework for Smart Hardware and More | 1.3K | |
16 | docetl | A system for agentic LLM-powered data processing and ETL | 1.3K | |
17 | lmnr | Laminar - open-source all-in-one platform for engineering AI products. Create a data flywheel for your AI app. Traces, Evals, Datasets, Labels. YC S24. | 1.2K | |
18 | MobileLLM | MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. | 1.2K | |
19 | AgentNeo | Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, LLM, and tool tracing, debugging of multi-agentic systems, a self-hosted dashboard, and advanced analytics with timeline and execution graph views. | 1.1K | |
20 | sage | Chat with any codebase in under two minutes, fully local or via third-party APIs | 1.1K | |
21 | prompt-poet | Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach. | 928 | |
22 | archgw | Arch is an intelligent gateway for agents. Engineered with (fast) LLMs for the secure handling, rich observability, and seamless integration of prompts with your APIs - all outside business logic. Built by the core contributors of Envoy proxy, on Envoy. | 809 | |
23 | spiritlm | Inference code for the paper "Spirit-LM: Interleaved Spoken and Written Language Model". | 805 | |
24 | tensorzero | TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models. | 697 | |
25 | mastra | The TypeScript AI framework. | 672 | |
26 | rosa | ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots. | 671 | |
27 | BaseAI | BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command. | 577 | |
28 | huggingface-llama-recipes | | 535 | |
29 | claude-coder | Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding agents | 503 | |
30 | Hexabot | Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. | 502 | |
31 | midscene | An AI-powered automation SDK that can control the page, perform assertions, and extract data in JSON format using natural language. | 494 | |
32 | dynamiq | Dynamiq is an orchestration framework for agentic AI and LLM applications | 486 | |
33 | lotus | LOTUS: A semantic query engine - process data with LLMs as easily as writing pandas code | 435 | |
34 | llama-assistant | AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephrasing sentences, answering questions, writing emails, and more. | 423 | |
35 | PromptChains | Prompt chains maximize intelligence and results when using LLMs | 423 | |
36 | Starmoon | An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics application services, research, and DIY robotics kit development using Python, NextJs, Arduino, ESP32, LLMs (GPT), STT, TTS, Emotion Analysis, AI agent | 419 | |
37 | LongCite | LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | 414 | |
38 | mem0-chrome-extension | Claude Memory: Long-term memory for Claude | 360 | |
39 | multi-agent-concierge | An example of multi-agent orchestration with llama-index | 338 | |
40 | e2m | E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution. | 303 | |
41 | fastagency | The fastest way to bring multi-agent workflows to production. | 300 | |
42 | humanlayer | HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across Slack, email, and more. Bring your LLM and framework of choice and start giving your AI agents safe access to the world. Agentic workflows, human in the loop, tool calling | 297 | |
43 | FunAudioLLM-APP | | 288 | |
44 | LLM2CLIP | LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. | 285 | |
45 | gemini-api-quickstart | Get up and running with the Gemini API in under 5 minutes (with Python) | 279 | |
46 | Awesome-Attention-Heads | An awesome repository & A comprehensive survey on interpretability of LLM attention heads. | 266 | |
47 | aisearch-openai-rag-audio | A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model. | 265 | |
48 | CleanS2S | High-quality and streaming Speech-to-Speech interactive agent in a single file. A streaming, full-duplex voice interaction prototype agent implemented in just one file! | 243 | |
49 | DataHorse | Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs. | 242 | |
50 | LlamaVoice | LlamaVoice is a llama-based large voice generation model, providing inference and training ability. | 223 | |
51 | elmer | Call LLM APIs from R | 196 | |
52 | GraphRag.Net | A dotnet implementation modeled on GraphRag and built with Semantic Kernel; it can be integrated into projects out of the box via NuGet. | 187 | |
53 | trustgraph | Connect Data Silos with Explainable AI⚡🚀 | 181 | |
54 | premsql | End-to-End Local-First Text-to-SQL Pipelines | 170 | |
55 | MooER | MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition. | 165 | |
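56 | chatwiki | An out-of-the-box intelligent customer-service chatbot Q&A system powered by LLMs and built on an enterprise's private knowledge base. Supports private deployment; the code is free, open source, and available for commercial use. Officially released by 芝麻小客服. | 161 | |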
57 | AUITestAgent | AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification. | 151 | |
58 | swarmzero | SwarmZero's SDK for building AI agents, swarms of agents and much more. | 145 | |
59 | TapeAgents | TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle | 135 | |
60 | VideoGen-Eval | The Dawn of Video Generation: Preliminary Explorations with SORA-like Models | 130 | |
61 | marly | Context-aware structured outputs. Search your documents or the web for specific data and get it back in JSON or Markdown. | 121 | |
62 | DenseFusion | DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | 120 | |
63 | TrafficLLM | The repository of TrafficLLM, a universal LLM adaptation framework to learn robust traffic representation for all open-sourced LLMs in real-world scenarios and enhance the generalization across diverse traffic analysis tasks. | 118 | |
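64 | KB-Builder | Knowledge Base Builder is an open-source, LLM-based system for generating, managing, optimizing, and building knowledge bases. It is an open-source tool from 「滨电智言」 that aims to become the hub for enterprise knowledge-base construction. | 118 | |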
65 | llama_extract | | 105 | |
66 | finetuning | Finetune Llama-3-8b on the MathInstruct dataset | 98 | |
67 | TEAL | | 96 | |
68 | transformer-ranker | Efficiently find the best-suited language model (LM) for your NLP task | 94 | |
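69 | grps_trtllm | [grps integration with trtllm] A pure C++, high-performance OpenAI LLM service built with GRPS + TensorRT-LLM + Tokenizers.cpp. Supports chat and function call modes, AI agents, distributed multi-GPU inference, multimodality, and a Gradio chat interface. | 92 | |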
70 | eureka-ml-insights | A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. | 90 | |
71 | effective_llm_alignment | Effective LLM Alignment Toolkit | 89 | |
72 | LLaVA-MORE | LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 | 86 | |
73 | multimodal-ai-llm-processing-accelerator | Build multimodal data processing pipelines with Azure AI Services + LLMs | 82 | |
74 | awesome_LLM-harmful-fine-tuning-papers | A survey on harmful fine-tuning attacks for large language models | 82 | |
75 | bolna | Full stack tools for building voice agents | 82 | |
76 | llm-instance-gateway | LLM Instance gateway implementation. | 80 | |
77 | llama-stack-client-python | Python SDK for Llama Stack | 75 | |
78 | ai-shifu | LLM-powered AI guide that leads and drives intelligent conversations | 75 | |
79 | flockmtl-duckdb | FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs) | 68 | |
80 | langfair | LangFair is a Python library for conducting use-case level LLM bias and fairness assessments | 67 | |
81 | co-op-translator | Easily generate multilingual translations for your project with a single command, powered by Azure AI Services. | 64 | |
82 | proxy-to-gemini | A proxy sidecar to access Gemini models via OpenAI and Ollama APIs | 56 | |
83 | gLM2 | | 53 | |
84 | flow-judge | Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafted for accuracy, speed, and customization. | 53 | |
85 | GhostOS | An agent framework offering a Python code interface for LLM-driven agents and meta-agents to do everything by code generation. | 51 | |
86 | study-drift-lms | A modern learning management system to place learning in the hands of the students | 50 | |
87 | KDPL | [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | 48 | |
88 | MoE-PEFT | An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT | 47 | |
89 | barq | Dabarqus is a standalone application that implements a complete RAG solution. | 47 | |
90 | ros2_nanollm | ROS2 nodes for LLM, VLM, VLA | 39 | |
91 | openai-chat-vision-quickstart | A demonstration of chatting with uploaded images using OpenAI vision models like gpt-4o. | 35 | |
92 | raft-distillation-recipe | A recipe that will walk you through using either Meta Llama 3.1 405B or GPT-4o deployed on Azure AI to generate a synthetic dataset using UC Berkeley's Gorilla project RAFT method. | 35 | |
93 | LiteWebAgent | The Library for LLM-based web-agent applications | 35 | |
94 | VLM | | 35 | |
95 | Parrot | 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch. | 33 | |
96 | agent-openai-java-banking-assistant | Multi-agent banking assistant with Java and Semantic Kernel | 31 | |
97 | GMAI-MMBench | GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. | 31 | |
98 | dify-google-cloud-terraform | Terraform configuration for deploying Dify on Google Cloud with scalability, high availability, and production-level readiness. | 30 | |
99 | moorellm | Finite State Machine based approach to create Agentic LLM Apps! | 28 | |
100 | GPT-Talker | [ACMMM'2024] Generative Expressive Conversational Speech Synthesis | 28 |