TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | Auto_Jobs_Applier | Auto_Jobs_Applier by AIHawk is an Agen that automates the jobs application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way. | 21.9K | |
2 | BitNet | Official inference framework for 1-bit LLMs | 10.9K | |
3 | void | 8.0K | ||
4 | CosyVoice | Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 6.1K | |
5 | zerox | PDF to Markdown with vision models | 5.9K | |
6 | MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 5.1K | |
7 | Liger-Kernel | Efficient Triton Kernels for LLM Training | 3.4K | |
8 | fragments | Open-source Next.js template for building apps that are fully generated by AI. By E2B. | 3.4K | |
9 | SenseVoice | Multilingual Voice Understanding Model | 3.3K | |
10 | nexa-sdk | Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities. | 3.1K | |
11 | LLaMA-Omni | LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level. | 2.5K | |
12 | postgres-new | In-browser Postgres sandbox with AI assistance | 2.4K | |
13 | aide | Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 在 VSCode 中征服任何代码:一键注释、转换、UI 图生成代码、AI 批量处理文件!💪 | 2.3K | |
14 | ichigo | Llama3.1 learns to Listen | 1.7K | |
15 | docetl | A system for agentic LLM-powered data processing and ETL | 1.2K | |
16 | MobileLLM | MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. | 1.1K | |
17 | sage | Chat with any codebase in under two minutes | Fully local or via third-party APIs | 1.0K | |
18 | lmnr | Laminar - open-source all-in-one platform for engineering AI products. Traces, Evals, Datasets, Labels. YC S24. | 960 | |
19 | prompt-poet | Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach. | 910 | |
20 | AgentNeo | Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view | 862 | |
21 | spiritlm | Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". | 752 | |
22 | rosa | ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots. | 620 | |
23 | arch | Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with APIs - all outside business logic. Built by the core contributors of Envoy proxy, on Envoy. | 537 | |
24 | huggingface-llama-recipes | 522 | ||
25 | midscene | An AI-powered automation SDK can control the page, perform assertions, and extract data in JSON format using natural language. | 461 | |
26 | Hexabot | Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. | 443 | |
27 | tensorzero | make LLMs improve through experience | 413 | |
28 | lotus | LOTUS: A semantic query engine - process data with LLMs as easily as writing pandas code | 411 | |
29 | llama-assistant | AI-powered assistant to help you with your daily tasks, powered by Llama 3.2. It can recognize your voice, process natural language, and perform various actions based on your commands: summarizing text, rephasing sentences, answering questions, writing emails, and more. | 403 | |
30 | LongCite | LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | 402 | |
31 | Starmoon | An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics application services, research, and DIY robotics kit development using Python, NextJs, Arduino, ESP32, LLMs (GPT), STT, TTS, Emotion Analysis, AI agent | 396 | |
32 | mem0-chrome-extension | Claude Memory: Long-term memory for Claude | 343 | |
33 | PromptChains | Prompt chains maximize intelligence and results when using LLMs | 319 | |
34 | multi-agent-concierge | An example of multi-agent orchestration with llama-index | 313 | |
35 | FunAudioLLM-APP | 283 | ||
36 | e2m | E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution. | 273 | |
37 | gemini-api-quickstart | Get up and running with the Gemini API in under 5 minutes (with Python) | 272 | |
38 | fastagency | The fastest way to bring multi-agent workflows to production. | 261 | |
39 | Awesome-Attention-Heads | An awesome repository & A comprehensive survey on interpretability of LLM attention heads. | 257 | |
40 | humanlayer | HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and more. Bring your LLM and Framework of choice and start giving your AI agents safe access to the world. Agentic Workflows, human in the loop, tool calling | 247 | |
41 | aisearch-openai-rag-audio | A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model. | 243 | |
42 | DataHorse | Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs. | 242 | |
43 | claude-coder | Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding agents | 234 | |
44 | dynamiq | Dynamiq is an orchestration framework for agentic AI and LLM applications | 229 | |
45 | LlamaVoice | LlamaVoice is a llama-based large voice generation model, providing inference and training ability. | 219 | |
46 | mastra | The TypeScript AI framework. | 191 | |
47 | CleanS2S | High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体! | 182 | |
48 | GraphRag.Net | 参考GraphRag使用 Semantic Kernel 来实现的dotnet版本,可以使用NuGet开箱即用集成到项目中 | 182 | |
49 | MooER | MooER: Moore-threads Open Omni model for spech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition. | 150 | |
50 | trustgraph | Connect Data Silos with Reliable AI⚡🚀 | 150 | |
51 | AUITestAgent | AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification. | 148 | |
52 | chatwiki | 开箱即用的基于企业私有知识库的LLM大语言模型的智能客服机器人问答系统,支持私有化部署,代码免费开源且可商用,由芝麻小客服官方推出。 | 146 | |
53 | premsql | End-to-End Local-First Text-to-SQL Pipelines | 136 | |
54 | KB-Builder | Knowledge Base Builder,是一款基于LLM大语言模型的开源知识库生成管理优化构建系统,是「滨电智言」的一款开源工具,旨在成为企业的知识库构建中枢。 | 118 | |
55 | DenseFusion | DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | 115 | |
56 | TapeAgents | TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle | 115 | |
57 | VideoGen-Eval | The Dawn of Video Generation: Preliminary Explorations with SORA-like Models | 111 | |
58 | marly | The open-source, model-agnostic alternative to OpenAI's structured outputs for your own documents or the web. | 108 | |
59 | TrafficLLM | The repository of TrafficLLM, a universal LLM adaptation framework to learn robust traffic representation for all open-sourced LLM in real-world scenarios and enhance the generalization across diverse traffic analysis tasks. | 106 | |
60 | llama_extract | 105 | ||
61 | finetuning | Finetune Llama-3-8b on the MathInstruct dataset | 97 | |
62 | TEAL | 95 | ||
63 | grps_trtllm | 【grps接入trtllm】通过接入TensorRT-LLM以及Tokenizers.cpp实现纯c++版本高性能LLM服务,兼容OpenAI接口协议,支持chat和function call模式,支持ai agent,支持分布式多卡推理,支持多模态,支持gradio聊天界面。 | 87 | |
64 | LLaVA-MORE | LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 | 84 | |
65 | eureka-ml-insights | A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. | 82 | |
66 | effective_llm_alignment | Effective LLM Alignment Toolkit | 82 | |
67 | multimodal-ai-llm-processing-accelerator | Build multimodal data processing pipelines with Azure AI Services + LLMs | 73 | |
68 | awesome_LLM-harmful-fine-tuning-papers | A survey on harmful fine-tuning attack for large language model | 67 | |
69 | llama-stack-client-python | Python SDK for Llama Stack | 66 | |
70 | bolna | Full stack tools for building voice agents | 66 | |
71 | flockmtl-duckdb | FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs) | 64 | |
72 | llm-instance-gateway | LLM Instance gateway implementation. | 64 | |
73 | co-op-translator | Easily automate multilingual translations for your projects with co-op-translator, powered by advanced LLM technology and Azure AI Services. | 60 | |
74 | proxy-to-gemini | A proxy sidecar to access Gemini models via OpenAI and Ollama APIs | 54 | |
75 | gLM2 | 52 | ||
76 | flow-judge | Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafted for accuracy, speed, and customization. | 52 | |
77 | GhostOS | An agent framework offering a Python code interface for LLM-driven agents and meta-agents to do everything by code generation. | 51 | |
78 | study-drift-lms | A modern learning management system to place learning in the hands of the students | 49 | |
79 | ai-shifu | LLM-powered AI guide that leads and drives intelligent conversations | 47 | |
80 | langfair | LangFair is a Python library for conducting use-case level LLM bias and fairness assessments | 45 | |
81 | KDPL | [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | 44 | |
82 | MoE-PEFT | An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT | 37 | |
83 | ros2_nanollm | ROS2 nodes for LLM, VLM, VLA | 37 | |
84 | barq | Dabarqus is a stand alone application that implements a complete RAG solution. | 37 | |
85 | raft-distillation-recipe | A recipe that will walk you through using either Meta Llama 3.1 405B or GPT-4o deployed on Azure AI to generate a synthetic dataset using UC Berkeley's Gorilla project RAFT method. | 32 | |
86 | LiteWebAgent | The Library for LLM-based web-agent applications | 30 | |
87 | Parrot | 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch. | 29 | |
88 | GMAI-MMBench | GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. | 29 | |
89 | agent-openai-java-banking-assistant | multi-agents banking assistant with Java and Semantic Kernel | 27 | |
90 | VLM | 27 | ||
91 | moorellm | Finite State Machine based approach to create Agentic LLM Apps! | 27 | |
92 | Awesome-Multimodal-LLM-for-Math-STEM | Paper collections of multi-modal LLM for Math/STEM/Code. | 27 | |
93 | openai-chat-vision-quickstart | A demonstration of chatting with uploaded images using OpenAI vision models like gpt-4o. | 26 | |
94 | dify-google-cloud-terraform | Terraform configuration for deploying Dify on Google Cloud with scalability, high availability, and production-level readiness. | 26 | |
95 | PodGPT | PodGPT: A multilingual audio-augmented large language model for research and education | 26 | |
96 | stable-diffusion-webui-forge | Stable Diffusion WebUI Forge docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience. | 25 | |
97 | GPT-Talker | [ACMMM'2024] Generative Expressive Conversational Speech Synthesis | 25 | |
98 | PrimeDepth | PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage | 22 | |
99 | LawGLM | 探索 LLM 在法律行业的应用潜力 | 21 | |
100 | dolomite-engine | Dolomite Engine is a library for pretraining/finetuning LLMs | 21 |