TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | gemini-fullstack-langgraph-quickstart | Get started with building Fullstack Agents using Gemini 2.5 and LangGraph | 14.9K | |
2 | system_prompts_leaks | Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini | 7.1K | |
3 | nanoVLM | The simplest, fastest repository for training/finetuning small-sized VLMs. | 3.6K | |
4 | PandaWiki | PandaWiki 是一款 AI 大模型驱动的开源知识库搭建系统,帮助你快速构建智能化的 产品文档、技术文档、FAQ、博客系统,借助大模型的力量为你提供 AI 创作、AI 问答、AI 搜索等能力。 | 3.0K | |
5 | mcp-context-forge | A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be accessed by MCP-compatible LLM applications. Converts REST API endpoints to MCP, composes virtual MCP servers with added security and observability, and converts between protocols (stdio, SSE, Streamable HTTP). | 549 | |
6 | Narratium.ai | Open-source platform for AI-driven storytelling, worldbuilding, and immersive roleplay | 441 | |
7 | lemonai | The world's first Full-Stack Open-Source General AI Agent | 401 | |
8 | CloudFlare-AI-Insight-Daily | AI 洞察日报项目,每日为您精选 AI 领域的最新动态,包括行业新闻、热门开源项目和前沿学术论文以及科技大V推文,并通过 Google Gemini 模型进行智能处理与日报生成,最终自动发布到 GitHub Pages 生成AI日报。 | 386 | |
9 | VLM-3R | VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction | 191 | |
10 | agent-builder | An example agent demonstrating streaming, tool use, and interactivity from your terminal. This agent builder can help you to build your own agents and tools. | 159 | |
11 | lemonade | Local LLM Server with GPU and NPU Acceleration | 157 | |
12 | Pixel-Reasoner | Pixel-Level Reasoning Model trained with RL | 142 | |
13 | Bench2Drive-VL | Adapting VLMs to Bench2Drive. | 127 | |
14 | CHATS | CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation (ICML2025) | 113 | |
15 | AetherLink | AetherLink移动应用是一个基于现代Web技术构建的跨平台AI助手应用。该应用支持与多种AI模型(如OpenAI、Google Gemini、Anthropic Claude、Grok、硅基流动、火山方舟等)的交互,提供流畅的对话体验,并支持Android平台部署。应用采用React、TypeScript和Capacitor框架开发,具有高度可定制的模型配置、多主题聊天管理、AI思考过程可视化、语音合成、语音识别、MCP工具支持、知识库管理等特色功能。 | 111 | |
16 | VTool-R1 | Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use" | 89 | |
17 | mlx-lm-lora | Train Large Language Models on MLX. | 81 | |
18 | GeminiImageApp | 基于 Google Gemini AI 的全功能图像处理应用 | 71 | |
19 | attachments | Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+text for large language models context by only adding 2 lines to your python code. | 70 | |
20 | Awesome-spatial-visual-reasoning-MLLMs | Repository for awesome spatial/visual reasoning MLLMs. (focus more on embodied applications) | 54 | |
21 | VLM_Merging | Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) | 47 | |
22 | vlms-are-biased | Vision Language Models are Biased | 42 | |
23 | Code2Logic | Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning | 38 | |
24 | sdxl-turbo-interpretability | 35 | ||
25 | VLM_GRPO | An implementation of GRPO for Unsloth's VLMs training | 33 | |
26 | vision-ai-checkup | Take your LLM to the optometrist. | 31 | |
27 | Awesome-LM-AD-Decision | A comprehensive list of awesome research, resources, and tools for leveraging LLMs, VLMs, and VLA models in autonomous driving decision-making and motion planning. | 30 | |
28 | gc-qa-rag | A RAG (Retrieval-Augmented Generation) solution Based on Pre-generated QA Pairs. 基于预生成 QA 对的 RAG 知识库解决方案 | 22 | |
29 | llama-runner | Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends | 20 | |
30 | SurgVLM | 19 | ||
31 | cadrille | cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning | 19 | |
32 | Scalable-Imitation-Learning-for-LMAPF | This is code repo for the paper, Depolying Ten Thousand Robots: Scalable Imitation Learninig for Lifelong Mulit-Agent Path Finding, which won the ICRA 2025 best paper on multi-robot systems and the best student paper. | 18 | |
33 | awesome-money-platforms | Awesome Free Platforms for Money Making daily updated by LLM | 17 | |
34 | awesome-system-prompts | system prompts of LLMs, AI Tools, AI Products, DeepSeek,ChatGPT, Gemini, Grok, Qwen | 16 | |
35 | node-memory-system | Concept for a node-based memory system for LLMs | 15 | |
36 | SunnyLand | A platformer game developed in C++ with SDL3, glm, nlohmann-json and Tiled. | 15 | |
37 | RAIF | A Recipe for Building LLM Reasoners to Solve Complex Instructions | 14 | |
38 | agentic-difusion | a comprehensive diffusion-based code refinement model | 13 | |
39 | DarkGPT-Lite | DarkGPT Lite is a specialized CLI tool providing unrestricted conversations with AI for cybersecurity research purposes | 12 | |
40 | cz_ai | A Commitizen plugin that leverages OpenAI's GPT-4o to automatically generate clear, concise, and conventional commit messages based on your staged git changes. | 11 | |
41 | ProEmacs | A modern, fast, and beautiful Emacs configuration for developers who want power without the complexity. | 11 | |
42 | IR3D-Bench | Official Code of IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering | 11 | |
43 | sdialog | Synthetic Dialog Generation and Analysis with LLMs | 10 | |
44 | mlx-grpo | 🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀 | 9 | |
45 | triage.flow | Transform any repository into an intelligent code assistant. Upload, analyze, and chat with your codebase using advanced AI with full repository context. | 9 | |
46 | ICML25-TimeVLM | 8 | ||
47 | smyth-docs | Everything you need to build, deploy, and collaborate with agents. Ride the llama, avoid the drama. | 8 | |
48 | ComfyUI-AniSora | ComfyUI-AniSora is now available in ComfyUI, Index-AniSora is the most powerful open-source animated video generation model. It enables one-click creation of video shots across diverse anime styles including series episodes, Chinese original animations, manga adaptations, VTuber content, anime PVs, mad-style parodies(鬼畜动画), and more! | 7 | |
49 | TIME | TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario | 7 | |
50 | llama-stack-k8s-operator | 7 | ||
51 | ComfyUI-JoyCaption | ComfyUI custom node powered by LLaVA VLM for advanced image captioning with customizable styles and memory-efficient inference. | 7 | |
52 | mlx-stable-diffusion | A simplified mlx implementation of the original stable diffusion 1.5 algorithm featuring fine-tuned weight loading and lora loading for Text and Image generation | 7 | |
53 | Kuzco | Kuzco is a Swift package for integrating large language models (LLMs) directly into iOS, macOS, and Mac Catalyst apps. Built on `llama.cpp`, it offers customizable prompts, flexible tuning, and async/await-friendly APIs for on-device AI. | 7 | |
54 | lmdamdawbcn | 6 | ||
55 | NOVER | R1-Zero on any Data | 6 | |
56 | PPMC | 点点现聊,是一款功能丰富、现代化的 Web 聊天应用,它利用 WebRTC 进行用户间的直接媒体通信,支持文本、文件共享、语音消息以及实时音频/视频/屏幕共享通话。通过基于 WebSocket 的信令服务器(由Java Spring Boot实现)完成初始用户发现和连接协商。WebRTC 通讯减少了对中央服务器传输大量媒体数据的依赖(除信令与可选的 TURN 中继服务外),而AI聊天和TTS功能则通过后端代理与外部服务交互。该应用深度集成了具有文本转语音(TTS)功能的主题化AI助手联系人,AI角色拥有动态上下文(如每日随机事件和心情)并支持长对话智能摘要,带来更生动的交互体验。 | 6 | |
57 | Katz | [ATC'25] Katz is a high-performance serving system designed specifically for diffusion model workflows with multiple adapters. | 5 | |
58 | inferno | Run Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1, and other state-of-the-art language models locally with scorching-fast performance. Inferno provides an intuitive CLI and an OpenAI/Ollama-compatible API, putting the inferno of AI innovation directly in your hands. | 5 | |
59 | AI.Llama.Traing.Offline | This repo has specific easy steps for you to be able to train your Llama AI Model offline | 5 | |
60 | CoSo | Source code for "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning" | 5 | |
61 | HackSynth-GRPO | This is a repository for the AgentX project "Improving LLM Agents With Reinforcement Learning on Cryptographic CTF Challenges" | 5 | |
62 | t5sdxl | Experiments in giving SDXL a T5 text encoder front end | 4 | |
63 | Awesome-VLM-Synthetic-Data | 🔥 The first survey on bridging VLMs and synthetic data, for which I completed the entire process of reading 125 papers and writing the research paper in just 10 days. | 4 | |
64 | ComfyUI-MaxedOut | Custom ComfyUI nodes used in Maxed Out workflows (SDXL, Flux, etc.) | 4 | |
65 | ai-code-context-helper | 🤖 A lightweight desktop tool for developers working with AI assistants. 📊 Visualize project structure, 📋 selectively export file paths and code to clipboard, making collaboration with ChatGPT, Claude and other AI assistants more efficient. 🌐 Supports multi-language UI, 🔍 file filtering, and ⚙️ customizable output formats. | 4 | |
66 | erlang-lmdb | 4 | ||
67 | Awesome-AudioLM-Datasets | 4 | ||
68 | llmlogs | A blog dedicated to helping creators, developers, and curious minds understand the rapidly growing world of Large Language Models (LLMs). | 4 | |
69 | LLM_exam | 4 | ||
70 | emotion_ai | The Aura Emotion AI system has chroma with a local embedding model, memvid qr code mp4 infinite memory, brainwave and neurochemical simulations, sociobiological reasoning, autonomous subsystem processing with a Gemini flash model so the main model is less taxed, is a MCP client with adaptive tool learning and MCP server. | 4 | |
71 | AgentNull | AgentNull is a comprehensive catalog of attack vectors targeting autonomous AI agents, complete with proof-of-concepts for each method. Explore the structured threat information and replicate scenarios using the provided resources. 🐙👨💻 | 4 | |
72 | ovtinyautoencoders | SD/SDXL/Flux Tiny Autoencoder converter for OpenVINO | 3 | |
73 | smallvm_lms | 3 | ||
74 | sora | A simple, intuitive, Roblox admin script. | 3 | |
75 | sdxl-embedding-converter | Convert SD1.5 Textual Inversion to SDXL | 3 | |
76 | Awesome-LLM-reasoning-papers | This repository offers a well-organized collection of resources focused on reasoning in Large Language Models (LLMs). Explore foundational papers, evaluation benchmarks, and practical tools to enhance your understanding of LLM reasoning. 🐙🌐 | 3 | |
77 | Leap-Ai-Make-Your-Ai-Api-Chat-Music-Coding | An AI development platform that allows users to build custom AI models through an API, supporting chatbots, music generation, and coding tools to streamline creative and technical workflows. | 3 | |
78 | Reddit-Ribbit-Ribbit | 🐸 Reddit-Ribbit-Ribbit: A friendly AI Reddit agent that hops into discussions! Powered by Google Gemini 🚀, it crafts engaging, Gen Alpha-style comments with emojis ✨. Features include subreddit monitoring, keyword filtering, image/URL analysis, and a fully customizable "bot brain". | 3 | |
79 | frac-cot | An efficient sampling method for long-CoT LLM with fractured CoT. | 3 | |
80 | VeriReason | This is the Github Repo for the paper: VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation | 3 | |
81 | AICA-VLM | https://affective-ai.github.io/ | 3 | |
82 | browser-use-annotator | A web-based tool for annotating browser use trajectory for VLMs | 3 | |
83 | llama-nexus | 2 | ||
84 | GeminiCursorHelper | AI Assistant that helps you open your Desktop's Apps with voice command | 2 | |
85 | ChatGPT-Guide | ChatGPT 中文版:国内免费直连指南(支持GPT-4.1、4o画图,无需翻墙)【7月持续更新】 | 2 | |
86 | chatgpt-LESS-THAN-8GB-OF-RAM-claude-ASI-super-inteligence-DeepSeek-R1-0528-Qwen3-8B-GGUF-KoboldCPP | 2 | ||
87 | ChatGPT-China-CN | ChatGPT 中文版:国内直连教程(支持GPT-4、4o、o1、o3 和 DeepSeek R1,无需翻墙)【5月持续更新】 | 2 | |
88 | gpt-free | ChatGPT中文版镜像站点大全:国内直连免费使用GPT-4/GPT-4o/Claude(无需翻墙) | 2 | |
89 | SorachioChat-v2 | Sorachio - Web Chatbot AI v2 | 2 | |
90 | geo-lm | Geologic models from Llama 4 language model + GemPy! | 2 | |
91 | discord-chatbot | Discord chat bot using AI models from Nano-GPT.com. Supports popular AI models such as ChatGPT, Llama, Claude, Gemini, Grok, and many more! | 2 | |
92 | llama-hack | 2 | ||
93 | UI-TARS-desktop | A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language. | 2 | |
94 | MIRA-Multimodal-Intelligent-Robotic-Assistant | 基于Qwen Agent框架,融合JAKA机械臂、视觉检测、语音识别与合成、MCP数据库的多模态大模型 | 2 | |
95 | GeospatialVLM | VLM specially crafted for geospatial reasoning tasks | 2 | |
96 | vlmd | Variational Latent Mode Decomposition | 2 | |
97 | guide | 2025 ChatGPT 使用教程和最佳实践,涵盖注册设置、Prompt 模板、Explorer GPT、DALL·E/Sora 使用技巧,适合新手与进阶者 | 2 | |
98 | AllInApp | A modular Python app that generates podcast episodes from the "All-In" podcast using AI. Features audio transcription with Whisper.cpp, lesson extraction via spaCy, script generation with GPT-Neo, voice cloning using Coqui TTS, show art creation with Stable Diffusion, and RSS feed generation. Ideal for automation and AI-driven content creation. | 2 | |
99 | SDXL-LoRA-Fine-tuning-for-Ghibli-Style | 이 프로젝트는 Stable Diffusion XL (SDXL) 모델을 LoRA로 fine-tuning하여 지브리 스타일의 이미지를 생성하는 실험을 진행합니다. | 2 | |
100 | modal-sdxl | A powerful and flexible text-to-image generation system built with Modal and Stable Diffusion XL. This project provides both a web interface and API for generating high-quality images from text prompts. | 2 |