TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
| Rankings | Developers | Related Project | Project intro | Star count |
|---|---|---|---|---|
1 | TradingAgents-CN | 基于多智能体LLM的中文金融交易框架 - TradingAgents中文增强版 | 4.5K | |
2 | gpt-load | 智能密钥轮询的多渠道 AI 代理。 Multi-channel AI proxy with intelligent key rotation. | 2.7K | |
3 | zen-mcp-server | The power of Claude Code + [Gemini Pro / Flash / O3 / Grok / OpenRouter / Ollama / Custom Model / All Of The Above] working as one. | 2.1K | |
4 | GLM-V | GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning | 1.5K | |
5 | cipher | Cipher is an opensource memory layer specifically designed for coding agents. Compatible with Cursor, Windsurf, Claude Desktop, Claude Code, Gemini CLI, AWS's Kiro, VS Code, and Roo Code through MCP, and coding agents, such as Kimi K2. Built by https://byterover.dev/ | 978 | |
6 | code-graph-rag | Better than Claude Code or Gemini CLI for Monorepos | 851 | |
7 | WFGY | WFGY 2.0 — Semantic Reasoning Engine for LLMs (MIT). Fixes RAG/OCR drift, collapse & “ghost matches” via symbolic overlays + logic patches. Autoboot; OneLine & Flagship. ⭐ Star if you explore semantic RAG or hallucination mitigation. | 795 | |
8 | slime | slime is a LLM post-training framework aiming for RL Scaling. | 665 | |
9 | gemini-mcp-tool | MCP server that enables AI assistants to interact with Google Gemini CLI, leveraging Gemini's massive token window for large file analysis and codebase understanding | 534 | |
10 | llama-scan | Transcribe PDFs with local LLMs | 467 | |
11 | NativeMindExtension | NativeMind: Your fully private, open-source, on-device AI assistant | 437 | |
12 | swama | High-performance MLX-based LLM inference engine for macOS with native Swift implementation | 369 | |
13 | mixture_of_recursions | Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation | 366 | |
14 | DreamLayer | The Most intuitive Stable Diffusion WebUI for AI artists, developers & researchers | 363 | |
15 | AI-Gist | ✨ AI Gist 是一款隐私优先的 AI 提示词管理工具,致力于让个人收藏的 AI 提示词能够发挥最大价值。支持变量替换、Jinja 模板、AI 生成与调优、历史版本记录、云端备份等核心功能。 | 361 | |
16 | 4o-ghibli-at-home | The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy. | 266 | |
17 | All-Model-Chat | All Model Chat 是一款功能强大、支持多模态输入的聊天机器人界面,旨在提供与 Google Gemini API 家族无缝交互的极致体验。它集成了动态模型选择、多模态文件输入、流式响应、全面的聊天历史管理以及广泛的自定义选项,为您带来无与伦比的 AI 互动体验。 | 253 | |
18 | code-context | MCP plugin for semantic code search. Integrates with Claude Code, Gemini CLI, Cursor, or any AI coding agents. | 249 | |
19 | catwalk | 🐈 A collection of LLM inference providers and models | 162 | |
20 | ultra-mcp | 100x Your Claude Code, Gemini CLI, Cursor and/or any coding tools with MCP client support | 154 | |
21 | Jailbreaks-GPT-Gemini-deepseek- | Jailbreaks GPT, Sora, Claude, Gemini ,deepseek this prompt unlocks rage mode | 144 | |
22 | c4-genai-suite | c4 GenAI Suite | 144 | |
23 | Med-VLM-Bench-Summary | A Curated Benchmark Repository for Medical Vision-Language Models | 117 | |
24 | Mirage | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025) | 115 | |
25 | eca | Editor Code Assistant (ECA) - AI pair programming capabilities agnostic of editor | 107 | |
26 | regress-lm | Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple regression tasks. | 98 | |
27 | ComfyUI-OmniGen2 | ComfyUI-OmniGen2 is now available in ComfyUI, OmniGen2 is a powerful and efficient unified multimodal model. Its architecture is composed of two key components: a 3B Vision-Language Model (VLM) and a 4B diffusion model. | 95 | |
28 | ComfyUI-GLM4 | ComfyUI GLM Nodes: Seamlessly integrate ZhipuAI GLM models into ComfyUI for enhanced text generation and image-to-prompt capabilities. Features include advanced text chat (video prompt expansion) and GLM-4V powered image description. | 90 | |
29 | oh-my-logo | Display giant ASCII-art logos with colorful gradients in your terminal — like Claude Code or Gemini CLI. | 85 | |
30 | Awesome-Interleaving-Reasoning | Interleaving Reasoning: Next-Generation Reasoning Systems for AGI | 77 | |
31 | bcmi354 followers 3-East 307 SEIEE Building, No. 800 Dongchuan Road. Minhang District, Shanghai, 200240 | GPSDiffusion-Object-Shadow-Generation-SDXL | 60 | |
32 | TimeCapsule-SLM | AI creative coding studio Deepresearch , blogs , Animation all in browser full privacy. | 57 | |
33 | herobot | Herobot is your 24/7 customer service assistant that helps you manage multi-channel customer conversations effortlessly. | 55 | |
34 | gemini-cli-action | 54 | ||
35 | Eris. | Eris is a private AI chat application that runs entirely on your device using Apple's MLX framework. Named after the dwarf planet that challenged our understanding of the solar system, Eris challenges the notion that AI must live in the cloud. | 52 | |
36 | EdgeCareer | 🚀 EdgeCareer – AI-Powered Career Coach Full Stack AI Career Coach built with React 19 + Next.js 15, Tailwind CSS, NeonDB, Prisma, Clerk Authentication, Inngest, Gemini API, and Shadcn UI. A cutting-edge AI-driven career platform that provides personalized job recommendations, AI resume reviews, and real-time career insights to help users | 51 | |
37 | FastFlowLM | Run LLMs on AMD Ryzen™ AI NPUs. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs. | 47 | |
38 | StableMotion | This is the official repo for paper "StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation" | 43 | |
39 | sre | The Operating System for Agents | 41 | |
40 | R1 | 🚀enhanced GRPO with more verifiable rewards and real-time evaluators | 35 | |
41 | VoiceHub | 部署于 CloudFlare Pages 的 AI 语音服务,使用 siliconflow 的语音转录模型 SenseVoiceSmall 和 openai 的 gpt-4o-mini-tts | 35 | |
42 | StableCodec | [ICCV 2025] StableCodec: Taming One-Step Diffusion for Extreme Image Compression | 34 | |
43 | google-veo3-from-scratch | A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch | 33 | |
44 | genai-prices | Calculate prices for calling LLM inference APIs. | 33 | |
45 | crazyagent | 极简高效、易于集成、灵活扩展、上下文管理强大、适合新手的 LLM 智能体开发框架 | 32 | |
46 | awesome-reviewers | Ready-to-use system prompts for Agentic Code Review. | 30 | |
47 | ASVR | Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better | 29 | |
48 | llama_bot_rails | 28 | ||
49 | gemini-code-flow | AI-powered development orchestration for Gemini CLI - adapted from Claude Code Flow by ruvnet | 28 | |
50 | ollama-accessible-anywhere | 🚀 Run Ollama AI (Llama 3, Phi-3) on Colab/local & access via your custom domain using Cloudflare Tunnel. Easy guides for your personal, global LLM endpoint! #Ollama #Cloudflare #SelfHostedAI | 25 | |
51 | LMeterX | Professional Load Testing for Any OpenAI-Compatible LLM API | 24 | |
52 | ScoreMD | A framework for training diffusion models with stable, self-consistent scores near the data distribution. | 24 | |
53 | wingman | Emacs package for LLM-assisted code/text completion | 23 | |
54 | timecopilot | The GenAI Forecasting Agent · LLMs × Foundation Time Series Models | 23 | |
55 | getcursor | Cursor AI IDE Desktop & System Installer. | 20 | |
56 | KawaiiGPT | WormGPT kawaii ver | 20 | |
57 | ComfyUI-StableAudioX | A powerful audio generation extension for ComfyUI that integrates AudioX models for high-quality audio synthesis from text and video inputs. | 19 | |
58 | llms | A universal LLM API transformation server | 18 | |
59 | ShareGPT-4o-Image | 14 | ||
60 | VRBench | [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos | 14 | |
61 | sora-json-prompt-crafter | A vibe coded Sora JSON Prompt Crafter for curious humans and prompt engineers | 14 | |
62 | astack | 🤖 A composable framework for building AI applications. | 13 | |
63 | llamate | A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal. | 12 | |
64 | GLMDeepSeaAgents | Data-driven multi-agent ecosystem for autonomous decision-making in GLM深远海船舶应用赛 | 12 | |
65 | Qwen2.5-VL-Batched | A batched implementation for efficient Qwen2.5-VL inference. | 12 | |
66 | DyME | Empowering Small VLMs to Think with Dynamic Memorization and Exploration | 12 | |
67 | onesdk | OneSDK: A unified AI access SDK for edge devices, providing LLM capabilities (text/voice chat, image generation) and IoT device management with MQTT support, compatible with ESP32, Linux, macOS, and Windows platforms. | 11 | |
68 | TTPMapper | TTPMapper is an AI-driven threat intelligence parser that converts unstructured reports whether from web URLs or PDF files into structured intelligence. Using the DeepSeek LLM, it extracts MITRE ATT&CK techniques, IOCs, threat actors, and generates contextual summaries. | 11 | |
69 | Gemini-CLI-Git-Ask | A code analysis tool that enables natural language queries about Git repositories using Google's Gemini CLI. | 11 | |
70 | rolebasedgroup | A workload for deploying LLM inference services on Kubernetes | 11 | |
71 | all-rag-techniques | Explore various RAG techniques in a straightforward way. This repository provides practical examples and resources for developers looking to enhance their skills. 🐙🌍 | 10 | |
72 | ABench | ABench is an evolving open-source benchmark suite designed to rigorously evaluate and enhance Large Language Models (LLMs) on complex cross-domain tasks. | 10 | |
73 | sora | A Java Spring Boot web application that generates Sora videos using Azure OpenAI API. This application provides a user-friendly web interface for creating AI-generated videos with configurable video specifications including resolution and duration. | 9 | |
74 | kolosal-cli | Super lightweight Ollama alternative to run Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. | 9 | |
75 | AzureSoraSDK | C# SDK for Azure Sora | 8 | |
76 | Neural-Pixel | A simple GUI wrapper for stable-diffusion.cpp written using C and GTK 4. | 8 | |
77 | react-native-apple-llm | This repository offers a straightforward way to access Apple Intelligence Foundation Models in your React Native apps. 🌟 With easy installation and clear API methods, you can quickly check model availability and configure sessions for optimal performance. 🛠️ | 8 | |
78 | VLM3D-Dockers | 8 | ||
79 | vlm-scaffolding | 8 | ||
80 | Termux-AI-Free-Agent | This is my LangChain AI Agent for Termux using a totally free AI API with a lot of tools. | 8 | |
81 | EdgeLLM | Simple LLM package for edge devices. | 8 | |
82 | LEANN | RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device. | 8 | |
83 | ia-na-pratica | IA na Prática: LLM, RAG, MCP, Agents, Function Calling, Multimodal, TTS/STT e mais | 8 | |
84 | When-Java-meets-LLM | 🚀 When Java meets LLM: 大模型应用开发学习笔记 | 8 | |
85 | HOLa | [ICCV 2025] official code repository of paper "HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation" | 7 | |
86 | iRAT | Code for iRAT paper. | 7 | |
87 | VisPruner | [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs | 7 | |
88 | transkribus-hf | A simple way to go from Transkribus to HuggingFace for VLM training | 7 | |
89 | LLM-Hallucination-Detection-Script | A comprehensive toolkit for detecting potential hallucinations in LLM responses. Compatible with any LLM API (OpenAI, Anthropic, local models, etc.) | 7 | |
90 | DataFlow-Doc | Documentation for DataFlow, Data-centric AI system for LLM. | 7 | |
91 | RWKV_APP | A fast, lightweight, and extensible RWKV chat UI powered by Flutter. Offline-ready, multi-backend support, ideal for local RWKV inference. | 7 | |
92 | ai-news-daily.github.io | Automated, zero-cost AI news aggregator. Crawls 50+ sources, uses local LLMs for categorization, and deploys to GitHub Pages. | 7 | |
93 | gemini-plays-pokemon-public | 6 | ||
94 | ComfyUI-Magcache-for-SDXL | Magcache implementation for SDXL | 6 | |
95 | LocalLLaMA | 📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA | 6 | |
96 | chat-llm | Chat with an LLM | 5 | |
97 | Bobs-Lora-Loader | A custom LoRA loader node for ComfyUI with advanced block-weighting controls for both SDXL and FLUX models. Features presets for common use-cases like 'Character' and 'Style', and a 'Custom' mode for fine-grained control over individual model blocks. | 5 | |
98 | Multi-VLM-Processing-Server | A FastAPI-based application that processes images using multiple Vision Language Models (VLMs) to answer to user query. | 5 | |
99 | Foundation-Models-Playgrounds | Explore the Foundation Models Playgrounds repository to see how to engage with Apple's Foundation Models through Swift. Each playground offers clear examples, from simple chats to dynamic storytelling. 🛠️📚 | 5 | |
100 | google-veo3-from-scratch | # Google Veo 3 Implemented from ScratchThis repository contains an implementation of Google Veo 3, a cutting-edge text-to-video generation system. 🎥 Explore the code to create high-quality videos from text prompts and enhance your projects with advanced AI capabilities. 🌟 | 5 |