TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | zen-mcp-server | The power of Claude Code + [Gemini Pro / Flash / O3 / Grok / OpenRouter / Ollama / Custom Model / All Of The Above] working as one. | 2.1K | |
2 | gpt-load | 一个高性能的OpenAI格式API多密钥轮询代理服务器,支持负载均衡,使用 Go 语言开发。A high-performance OpenAI-compatible API proxy server with multi-key rotation and load balancing, built with Go. | 1.6K | |
3 | GLM-4.1V-Thinking | GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning. | 782 | |
4 | code-graph-rag | Better than Claude Code or Gemini CLI especially for Monorepos | 473 | |
5 | NativeMindExtension | NativeMind: Your fully private, open-source, on-device AI assistant | 368 | |
6 | DreamLayer | Most intuitive Stable Diffusion WebUI for AI artists, developers & researchers | 271 | |
7 | 4o-ghibli-at-home | The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy. | 266 | |
8 | CodeIndexer | Semantic code search by indexing code and storing in vector database. Compatible with MCP and VSCode extension. You can use it with Claude Code and Gemini CLI, or build your AI Coding IDE and code search plugin. | 156 | |
9 | All-Model-Chat | 一个基于React的强大聊天机器人界面,与Google Gemini API无缝交互。支持多种多模态输入(文本、图像、视频、音频、PDF、自定义文件)、动态模型切换和实时流式响应。提供强大的聊天记录、高级AI配置(系统提示、思考过程、Canvas助手)和丰富的标记功能。 | 151 | |
10 | Med-VLM-Bench-Summary | A Curated Benchmark Repository for Medical Vision-Language Models | 117 | |
11 | ComfyUI-OmniGen2 | ComfyUI-OmniGen2 is now available in ComfyUI, OmniGen2 is a powerful and efficient unified multimodal model. Its architecture is composed of two key components: a 3B Vision-Language Model (VLM) and a 4B diffusion model. | 95 | |
12 | oh-my-logo | Display giant ASCII-art logos with colorful gradients in your terminal — like Claude Code or Gemini CLI. | 85 | |
13 | Awesome-Interleaving-Reasoning | Interleaving Reasoning: Next-Generation Reasoning Systems for AGI | 77 | |
14 | Mirage | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025) | 63 | |
15 | herobot | Herobot is your 24/7 customer service assistant that helps you manage multi-channel customer conversations effortlessly. | 55 | |
16 | gemini-cli-action | 54 | ||
17 | EdgeCareer | 🚀 EdgeCareer – AI-Powered Career Coach Full Stack AI Career Coach built with React 19 + Next.js 15, Tailwind CSS, NeonDB, Prisma, Clerk Authentication, Inngest, Gemini API, and Shadcn UI. A cutting-edge AI-driven career platform that provides personalized job recommendations, AI resume reviews, and real-time career insights to help users | 51 | |
18 | Jailbreaks-GPT-Gemini-deepseek- | Jailbreaks GPT, Sora, Claude, Gemini ,deepseek this prompt unlocks rage mode | 47 | |
19 | eca | Editor Code Assistant (ECA) - AI pair programming capabilities in any editor | 45 | |
20 | StableMotion | This is the official repo for paper "StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation" | 43 | |
21 | sre | The Operating System for Agents | 41 | |
22 | R1 | 🚀enhanced GRPO with more verifiable rewards and real-time evaluators | 35 | |
23 | ComfyUI-GLM4 | ComfyUI GLM Nodes: Seamlessly integrate ZhipuAI GLM models into ComfyUI for enhanced text generation and image-to-prompt capabilities. Features include advanced text chat (video prompt expansion) and GLM-4V powered image description. | 35 | |
24 | google-veo3-from-scratch | A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch | 33 | |
25 | crazyagent | 极简高效、易于集成、灵活扩展、上下文管理强大、适合新手的 LLM 智能体开发框架 | 32 | |
26 | ASVR | Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better | 29 | |
27 | gemini-code-flow | AI-powered development orchestration for Gemini CLI - adapted from Claude Code Flow by ruvnet | 28 | |
28 | LMeterX | Professional Load Testing for Any OpenAI-Compatible LLM API | 24 | |
29 | wingman | Emacs package for LLM-assisted code/text completion | 23 | |
30 | llama_bot_rails | 19 | ||
31 | ComfyUI-StableAudioX | A powerful audio generation extension for ComfyUI that integrates AudioX models for high-quality audio synthesis from text and video inputs. | 19 | |
32 | llms | A universal LLM API transformation server | 18 | |
33 | VoiceHub | 部署于 CloudFlare Pages 的 AI 语音服务,使用 siliconflow 的语音转录模型 SenseVoiceSmall 和 openai 的 gpt-4o-mini-tts | 17 | |
34 | ShareGPT-4o-Image | 14 | ||
35 | astack | 🤖 A composable framework for building AI applications. | 13 | |
36 | llamate | A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal. | 12 | |
37 | GLMDeepSeaAgents | Data-driven multi-agent ecosystem for autonomous decision-making in GLM深远海船舶应用赛 | 12 | |
38 | TTPMapper | TTPMapper is an AI-driven threat intelligence parser that converts unstructured reports whether from web URLs or PDF files into structured intelligence. Using the DeepSeek LLM, it extracts MITRE ATT&CK techniques, IOCs, threat actors, and generates contextual summaries. | 11 | |
39 | ScoreMD | A framework for training diffusion models with stable, self-consistent scores near the data distribution. | 11 | |
40 | Gemini-CLI-Git-Ask | A code analysis tool that enables natural language queries about Git repositories using Google's Gemini CLI. | 11 | |
41 | rolebasedgroup | A workload for deploying LLM inference services on Kubernetes | 11 | |
42 | all-rag-techniques | Explore various RAG techniques in a straightforward way. This repository provides practical examples and resources for developers looking to enhance their skills. 🐙🌍 | 10 | |
43 | regress-lm | Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple regression tasks. | 9 | |
44 | sora | A Java Spring Boot web application that generates Sora videos using Azure OpenAI API. This application provides a user-friendly web interface for creating AI-generated videos with configurable video specifications including resolution and duration. | 9 | |
45 | AzureSoraSDK | C# SDK for Azure Sora | 8 | |
46 | react-native-apple-llm | This repository offers a straightforward way to access Apple Intelligence Foundation Models in your React Native apps. 🌟 With easy installation and clear API methods, you can quickly check model availability and configure sessions for optimal performance. 🛠️ | 8 | |
47 | vlm-scaffolding | 8 | ||
48 | EdgeLLM | Simple LLM package for edge devices. | 8 | |
49 | VisPruner | [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs | 7 | |
50 | transkribus-hf | A simple way to go from Transkribus to HuggingFace for VLM training | 7 | |
51 | LLM-Hallucination-Detection-Script | A comprehensive toolkit for detecting potential hallucinations in LLM responses. Compatible with any LLM API (OpenAI, Anthropic, local models, etc.) | 7 | |
52 | gemini-plays-pokemon-public | 6 | ||
53 | bcmi351 followers 3-East 307 SEIEE Building, No. 800 Dongchuan Road. Minhang District, Shanghai, 200240 | GPSDiffusion-Object-Shadow-Generation-SDXL | 6 | |
54 | ComfyUI-Magcache-for-SDXL | Magcache implementation for SDXL | 6 | |
55 | LocalLLaMA | 📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA | 6 | |
56 | chat-llm | Chat with an LLM | 5 | |
57 | HOLa | [ICCV 2025] official code repository of paper "HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation" | 5 | |
58 | Bobs-Lora-Loader | A custom LoRA loader node for ComfyUI with advanced block-weighting controls for both SDXL and FLUX models. Features presets for common use-cases like 'Character' and 'Style', and a 'Custom' mode for fine-grained control over individual model blocks. | 5 | |
59 | Multi-VLM-Processing-Server | A FastAPI-based application that processes images using multiple Vision Language Models (VLMs) to answer to user query. | 5 | |
60 | Neural-Pixel | A simple GUI wrapper for stable-diffusion.cpp written using C and GTK 4. | 5 | |
61 | Foundation-Models-Playgrounds | Explore the Foundation Models Playgrounds repository to see how to engage with Apple's Foundation Models through Swift. Each playground offers clear examples, from simple chats to dynamic storytelling. 🛠️📚 | 5 | |
62 | google-veo3-from-scratch | # Google Veo 3 Implemented from ScratchThis repository contains an implementation of Google Veo 3, a cutting-edge text-to-video generation system. 🎥 Explore the code to create high-quality videos from text prompts and enhance your projects with advanced AI capabilities. 🌟 | 5 | |
63 | ChatGPTDumper | 5 | ||
64 | uvg | unsloth vlm grpo | 4 | |
65 | claude_code-gemini-mcp | Connect Claude Code with Google's Gemini AI for seamless collaboration on coding tasks. Use this tool for quick code reviews and to brainstorm ideas effectively. 🐙💻 | 4 | |
66 | Cerno-Agentic-Local-Deep-Research | Cerno is an open-source tool that enables deep, multi-step research with autonomous AI agents. It offers clear insights into each reasoning step, allowing users to manage complex workflows effectively. 🐙🌟 | 4 | |
67 | forksilly.doc | ForkSilly的文档。ForkSilly:兼容sillytavern V2角色卡(png)、世界书、正则、预设的安卓应用 | 4 | |
68 | Foundation-Models-Framework-Example | Explore the Foundation Models Framework through this practical iOS app that showcases on-device AI capabilities. Dive into features like basic chat and structured data generation while leveraging the power of Apple Silicon! 🐙✨ | 4 | |
69 | astrbot_plugin_antipromptinjector | AstrBot 插件,用于检测和管理大型语言模型 (LLM) 的输出。它能帮助你识别和响应 LLM 生成的特定内容,确保其输出符合你的预期和安全标准。通过此插件,你可以更好地控制 LLM 的行为,避免不希望出现的信息。 | 4 | |
70 | FastFlowLM | Run LLMs on AMD Ryzen™ AI NPUs. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs. | 4 | |
71 | onesdk | OneSDK: A unified AI access SDK for edge devices, providing LLM capabilities (text/voice chat, image generation) and IoT device management with MQTT support, compatible with ESP32, Linux, macOS, and Windows platforms. | 3 | |
72 | kolosal-cli | Super lightweight Ollama alternative to run Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. | 3 | |
73 | self-preference-activation-steering | finding a self-preference vector in Llama-3.1-8B-Instruct residual stream activations | 3 | |
74 | Light-R1-ZERO | DeepSeek R1-zero轻量级复现 | 3 | |
75 | augment-user | # Augment UserThis tool automates the registration process for the AugmentCode platform. It simplifies tasks like email verification and provides a clear progress display. 🚀🛠️ | 3 | |
76 | private-ai-setup-dream-guide | The Private AI Setup Dream Guide for Demos automates the installation of the software needed for a local private AI setup, utilizing AI models (LLMs and diffusion models) for use cases such as general assistance, business ideas, coding, image generation, systems administration, marketing, planning, and more. | 3 | |
77 | langtons-emergence | Recently I have been researching emergent complexities through first principles reductionism of Langton's Ant in the hopes that they could potentially offer insights into the emergent intricacies of frontier large language models. | 3 | |
78 | econosim | 🎓 Simulador interativo desenvolvido como trabalho final da disciplina ACH2063 — Introdução à Administração e Economia para Computação (USP). O EconoSim permite que alunos tomem decisões fiscais e monetárias em tempo real e visualizem os impactos sobre o modelo IS-LM de forma dinâmica e colaborativa. | 3 | |
79 | GodCore | All-in-one local AI stack for Mistral-13B and Llama.cpp, with one-step CUDA wheel install, OpenAI-compatible API, and modern web dashboard. Switch between local and cloud chat, run on your own GPU, and deploy instantly—no API keys or paywalls. Designed for easy install, custom builds, and fast remote access. Enjoy! | 3 | |
80 | llm_log_pipeline | Containerized Go service that uses LLMs (e.g., LLaMA 3.1 Instruct) to analyze backend logs via RabbitMQ and store structured insights in PostgreSQL | 3 | |
81 | DaNangtourguide | This is a Rag-LLM chatbot demo for tour guide in Da Nang | 3 | |
82 | sora-json-prompt-crafter | A vibe coded Sora JSON Prompt Crafter for curious humans and prompt engineers | 3 | |
83 | LLM-Attack-Prompt | Explore LLM-Attack-Prompt for a thorough examination of LLM vulnerabilities and attack techniques. 🛠️ This repository offers valuable insights for security researchers and developers looking to enhance their understanding of LLM safety mechanisms. 🐱💻 | 3 | |
84 | Llama2 | 3 | ||
85 | gang-gpt-gta-v | GangGPT revolutionizes Grand Theft Auto V multiplayer roleplay by integrating advanced AI systems that create a living, breathing virtual world. Built on RAGE:MP with Azure OpenAI GPT-4o-mini, this project delivers procedurally-generated missions, intelligent NPCs with persistent memory, and dynamic faction warfare-transforming traditional roleplay | 3 | |
86 | laravel-gemini-translator | Laravel Gemini AI Translation Extractor scans your Laravel project for translation keys, uses Google Gemini AI for translations, and generates language files automatically—streamlining and accelerating your localization workflow. | 3 | |
87 | Gemini-Discord-Bot | 3 | ||
88 | prompto | Prompto – a web-browser extension + Next.js dashboard that intercepts your GPT/Claude prompts, auto-applies 25 advanced engineering tricks (CoT, XML guards, compression, role, etc.), shows a diff, and delivers higher-quality answers with optimized token usage. | 3 | |
89 | VLM3D-Dockers | 3 | ||
90 | FreeSeekR1-Agent | This is my LangChain AI Agent using a totally free DeepSeek-R1 API with a lot of tools. | 3 | |
91 | Realistic-Image-Generator | 📌 Project Overview This project is a text-to-image generation system that uses the Stable Diffusion model to generate realistic images based on natural language descriptions provided by the user. The system asks the user to describe the image they want, and then automatically creates a high-quality image using AI. | 3 | |
92 | V-Droid | Source code of the paper "V-Droid: Advancing Mobile GUI Agent Through Generative Verifiers" | 3 | |
93 | vlm-circuits-analysis | Code for the experiments and websites of the paper "Same Task, Different Circuits" | 2 | |
94 | chatgpt-sites | ChatGPT 中文版:国内ChatGPT镜像网站整理合集(支持 GPT-4、GPT-4o、o1、o3及Claude 4 Sonnet、Gemini 2.5 Pro、Grok3,无需翻墙)【7月持续更新】ChatGPT 中文版是基于 OpenAI 的 ChatGPT 模型开发的中文使用版本,专为中文用户,提供更流畅、更精准的智能AI对话。全面体验 ChatGPT 中文版,无需翻墙,支持 GPT-4、GPT-4o、o1、o3 和本地化功能。本项目提供一站式的 ChatGPT中文版使用指南,包括国内可用的 ChatGPT镜像网站,帮助快速上手,无论是个人使用还是专业需求,均可无限使用 GPT-4、GPT-4o、o1、o3及Claude 4 Sonnet、Gemini 2.5 Pro、Grok3~ | 2 | |
95 | Nemotron_Nano_VL | Implementing Llama-3.1-Nemotron-Nano-VL-8B-V1 as a Remote Zoo Model for FiftyOne | 2 | |
96 | Sorachio-Chat | Sorachio - AI Assistant | 2 | |
97 | sora-modules | My collection of Sora Modules | 2 | |
98 | HFH_LMS | Habitad for humanity LMS from from zero. | 2 | |
99 | mlx-grpo | MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟 | 2 | |
100 | lmstudio.ex | 2 |