TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
| Rankings | Developers | Related Project | Project intro | Star count |
|---|---|---|---|---|
1 | gemini-cli | An open-source AI agent that brings the power of Gemini directly into your terminal. | 71.8K | |
2 | dyad | Free, local, open-source AI app builder ✨ v0 / lovable / Bolt alternative 🌟 Star if you like it! | 13.9K | |
3 | adk-python | An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control. | 10.7K | |
4 | deepwiki-open | Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme | 9.0K | |
5 | awesome-gpt4o-images | Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capabilities. | 6.7K | |
6 | klavis | Klavis AI (YC X25): Open Source MCP integration for AI applications | 3.9K | |
7 | cactus | Cross-platform framework for deploying LLM/VLM/TTS models locally on smartphones. | 2.9K | |
8 | nexent | Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing and MCP tools. | 2.3K | |
9 | voltagent | Open Source TypeScript AI Agent Framework | 2.3K | |
10 | OmniSVG | OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to intricate anime characters. | 2.1K | |
11 | read-frog | 🐸 Read Frog - Open Source Immersive Translate | 🐸 陪读蛙 - 开源沉浸式翻译 | 2.1K | |
12 | ICEdit | Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run! | 1.9K | |
13 | Step1X-Edit | A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash. | 1.6K | |
14 | LLM-RL-Visualized | 🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps ) | 1.2K | |
15 | layra | LAYRA—an enterprise-ready, out-of-the-box solution—unlocks next-generation intelligent systems powered by visual RAG and limitless visual multi-step agent workflow orchestration. | 796 | |
16 | Lumina-mGPT-2.0 | Lumina-mGPT 2.0: Stand-alone Autoregressive Image Modeling | 738 | |
17 | terminator | Computer use SDK for building agents that learn from human screen recordings. Accessibility-first. Cross-platform (Windows/macOS/Linux), near-deterministic. | 686 | |
18 | SkyRL | SkyRL: A Modular Full-stack RL Library for LLMs | 609 | |
19 | autobe | AI Vibe Coding Agent of TS backend server, enhanced by compiler skills, generating 100% working code | 556 | |
20 | mLLMCelltype | An iterative multi-LLM consensus framework for accurate cell type annotation in single-cell RNA-seq data | 546 | |
21 | better-chatbot | Just a Better Chatbot. Powered by MCP Client & Workflows. | 444 | |
22 | atropos | Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments | 414 | |
23 | Ming | Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM. | 387 | |
24 | tome | a magical LLM desktop client that makes it easy for *anyone* to use LLMs and MCP | 373 | |
25 | ai-gateway | The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced. | 325 | |
26 | LiteRT-LM | 310 | ||
27 | GRPO-Zero | Implementing DeepSeek R1's GRPO algorithm from scratch | 281 | |
28 | laravel-mcp-server | A Laravel package for implementing secure Model Context Protocol servers using Streamable HTTP and SSE transport, providing real-time communication and a scalable tool system for enterprise environments. | 269 | |
29 | sokuji | Live speech translation application built with Electron 34 and React, using OpenAI's Realtime API. | 219 | |
30 | eShopLite | eShopLite is a set of reference .NET applications implementing an eCommerce site with features like Semantic Search, MCP, Reasoning models and more. | 101 | |
31 | ProxyAsLocalModel | Proxy remote LLM API as Ollama and LM Studio, for using them in JetBrains AI Assistant | 101 | |
32 | InteractVLM | [CVPR 2025] InteractVLM: 3D Interaction Reasoning from 2D Foundational Models | 92 | |
33 | pcc-groq-llama | Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio | 68 | |
34 | chapter-llama | Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs" | 59 | |
35 | smartpdfs | Summarize PDFs into beautiful sections with Llama 3.3 | 57 | |
36 | Cognito-AI_Sidekick | Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language. | 55 | |
37 | LocalLLMClient | Swift local LLM client for iOS, macOS, Linux | 53 | |
38 | llm-d-deployer | Helm charts for llm-d | 49 | |
39 | stopwatch | A tool for benchmarking LLMs on Modal | 41 | |
40 | reverse_vlm | 🔥 Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospective Resampling" | 39 | |
41 | Vocal-Agent | A cutting-edge voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. | 38 | |
42 | AI-NoteMaker | NoteCraft is a full-stack web app built with Django, Celery, and Next.js that lets users upload academic PDFs, generate AI-powered notes, and retrieve relevant content using RAG. It's optimized for long documents, supports async processing, and runs fully containerized with Docker. | 34 | |
43 | LMSA | Android app that connects to LM Studio running on your computer, allowing you to chat with your favorite AI models from your mobile device. | 33 | |
44 | MCPollinations | A Model Context Protocol (MCP) server that enables AI assistants to generate images, text, and audio through the Pollinations APIs. Supports customizable parameters, image saving, and multiple model options. | 31 | |
45 | mcp-velociraptor | VelociraptorMCP is a Model Context Protocol bridge for exposing LLMs to MCP clients. | 31 | |
46 | g4f.dev | g4f.dev – free and convenient AI endpoints you can use directly in your apps, scripts, and even right in your browser. chatgpt, deepseek, grok or gemini - https://discord.gg/qXA4Wf4Fsm | 31 | |
47 | langchain-mcp-client | This Streamlit application provides a user interface for connecting to MCP (Model Context Protocol) servers and interacting with them using different LLM providers (OpenAI, Anthropic, Google, Ollama). | 31 | |
48 | cursor-free-vip | [Support 0.48.x](Reset Cursor AI MachineID & Auto Sign Up / In & Bypass Higher Token Limit)自动注册 Cursor Ai ,自动重置机器ID , 免费升级使用Pro功能: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please let us know if you believe this is a mistake. | 30 | |
49 | GeminiOCR | 26 | ||
50 | AIstudioProxyAPI | Node.js+Playwright服务器,通过模拟 OpenAI API 的方式来访问 Google AI Studio 网页版,服务器无缝交互转发gemini模型对话。这使得兼容 OpenAI API 的客户端(如 Open WebUI, NextChat 等)可以使用 AI Studio 的无限额度及能力。经测试因无法绕过自动化检测故暂不支持无头模式启动实例-自用项目随缘维护 | 25 | |
51 | cora | ✨ PyTorch implementation of "Cora: Correspondence-aware Image Editing Using Few-Step Diffusion", accepted at SIGGRAPH 2025. | 24 | |
52 | adk-go | An open-source, code-first Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control. | 24 | |
53 | gpt4o-image-prompts | GPT-5(GPT5),GPT-4o(GPT4o) Image Prompts | 23 | |
54 | tooluser | Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.) | 20 | |
55 | circle-guard-bench | First-of-its-kind benchmark for evaluating the protection capabilities of large language model (LLM) guard systems | 19 | |
56 | Gen-AI-Virtual-Try-On-Clothes | Gen AI-Powered Virtual Try-On Clothes Platform Upload any model and garment image to preview realistic try-on results instantly. Built with Google Gemini, FastAPI, and React. Ideal for fashion, retail, and e-commerce. | 19 | |
57 | thinking-intervention | Used for thinking process intervention of reasoning models such as DeepSeek-R1, effectively controlling the reasoning thinking process. 用于DeepSeek-R1等推理模型的思维过程干预,有效控制推理思考过程 | 18 | |
58 | ramalama-stack | An external provider for Llama Stack allowing for the use of RamaLama for inference. | 18 | |
59 | gemini-book-generator | Let's generate books using LLM | 18 | |
60 | ngpt | 🤖 nGPT - A lightning-fast CLI tool that brings any OpenAI-compatible LLM (OpenAI, Ollama, Groq, Claude, Gemini) directly to your terminal. Generate code, craft git commits, execute shell commands, rewrite text, and chat interactively, all with seamless provider switching and real-time streaming. | 17 | |
61 | spice | SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow | 14 | |
62 | awsome_kali_MCPServers | awsome kali MCPServers is a set of MCP servers tailored for Kali Linux, designed to empower AI Agents in reverse engineering and security testing. It offers flexible network analysis, target sniffing, traffic analysis, binary understanding, and automation, enhancing AI-driven workflows. | 14 | |
63 | careyou | I'm an AI assistant with extensive knowledge in psychology, and my name is Care. | 13 | |
64 | Streaming-Avatar | 🌟 Full-stack app for real-time avatar streaming with HeyGen & Gemini AI. Built with React, TypeScript, Express, and Tailwind during a hackathon by Team "Bit by Bit" | 13 | |
65 | CAD-GPT | [AAAI2025] CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs | 12 | |
66 | open-coscientist-agents | Implementation of Google Deepmind's AI co-scientist with LangGraph and GPT Researcher | 11 | |
67 | BUILD25-LAB331 | This repository hosts the instructions and workshop materials for Lab 331 (Deep Research with Langchain and DeepSeek R1) for Microsoft Build | 10 | |
68 | CloudToLocalLLM | 10 | ||
69 | ai-git-commiter | 这是一个基于AI大模型的Git Commit自动生成的VSCode插件。它可以帮助您根据代码变更自动生成高质量的Commit消息,提高开发效率。 | 10 | |
70 | gemini-live | Google Gemini live voice to text realtime stream in the browser | 10 | |
71 | agentic-coding | Agentic Coding Rules, Templates etc... | 9 | |
72 | OmniMind | OmniMind: An open-source Python library for effortless MCP (Model Context Protocol) integration, AI Agents, AI workflows, and AI Automations. Plug & Play AI Tools for MCP Servers and Clients, powered by Google Gemini. | 9 | |
73 | VLMLight | Official implementation of VLMLight | 9 | |
74 | headlinesquare-home | The home of HeadlineSquare and Dr. Headline. HeadlineSquare is a public square for US news headlines, fully powered by Dr. Headline, an autonomous AI agent who applies academic neutrality and rigor to news curation. | 9 | |
75 | DiffPure-RobustVLM | ICCV 2025 official implementation for Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks | 8 | |
76 | LMBenchmark | 8 | ||
77 | hinata | ❄️ CLI AI agents + Unix philosophy | 8 | |
78 | simpleaichat | Lightweight secure AI chat client. | 8 | |
79 | LLM-RAG-Agent-Tutorial | LLM-RAG-Agent-Tutorial | 8 | |
80 | talk2dom | Locate web elements using natural language. Powered by LLM. | 8 | |
81 | gen-alt-text | This browser extension helps you generate detailed, accessible alt text for images and videos you add to posts on Bluesky, using Google Gemini AI. | 8 | |
82 | llama-server-cli.py | A simple, user-friendly CLI tool for managing, and running, llama.cpp's llama-server with multiple configuration profiles and providing OpenAI-compatible API access. | 7 | |
83 | prompt-booster | Prompt Booster: A comprehensive tool for optimizing LLM prompts with version control, A/B testing, and template management. Supports multiple AI providers (OpenAI, Gemini, DeepSeek, Qwen, etc.) across web and desktop platforms. Increase your AI prompt effectiveness with professional engineering tools. | 7 | |
84 | Faraday-Web-Researcher-Agent | Faraday: An Autonomous Web Research Agent (LangGraph/Streamlit). 🕵️♀️ Investigates queries using dynamic tools (Tavily, Google, NewsAPI, etc.), gathers multi-source info, and synthesizes structured reports in a Streamlit UI. Features agentic workflow & source tracking. | 7 | |
85 | gemini-live | This project enables real-time streaming of audio (and optionally video or screen captures) from your local device to Google Gemini using the Live API. It allows you to interact with Gemini through both text and voice, supporting conversational AI responses. | 7 | |
86 | SunoBot | SunoBot — a personalized, generative audio companion designed to boost user engagement through interactive storytelling and voice-based experiences. | 7 | |
87 | vega-ai | Vega AI is an intelligent job application tracking system that transforms how job seekers manage their search. Job seekers can track every opportunity, instantly see their match score for each role, and generate AI-powered resumes and cover letters tailored to specific job requirements. | 7 | |
88 | Vocal-Agent | A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. | 6 | |
89 | LoRACaptioner | Image Captioning and Prompt Optimization for LoRAs | 6 | |
90 | Open-Source-Prompt-Library | Here is where I store all my useful prompts | 6 | |
91 | recounter | 在Mac上记录您在任何应用打下的字与您剪切板的内容并在本地通过ollama DeepSeek-r1分析。Reminding all the words you have typed for Macbook and analysis with ollama Deepseek-r1 | 5 | |
92 | Custom_GPTs | ⭐ THE WORLD’S LARGEST Self-Created INDEX of Custom GPTs | 5 | |
93 | ai-scraping-defense | **Currently in testing phase** This system combats scraping by unauthorized AI bots targeting FOSS or documentation sites. It employs a multi-layered defense strategy including real-time detection, tarpitting, honeypots, and behavioral analysis with optional AI/LLM integration for sophisticated threat assessment | 5 | |
94 | gpt-4o-latency-comparison | Benchmarking toolkit for measuring real-world latency of GPT-4o audio implementations | 4 | |
95 | chatpdflocal | A chat pdf app on MacOS, keeping your data safe, handling massive documents, and connecting you with SOTA AI models | 4 | |
96 | Llama-360 | An AI-Powered Multi-Agent Solution for Retail Banking Products. | 4 | |
97 | llamaedit | code editor with completion powered by llamacpp | 4 | |
98 | llama.cr | 4 | ||
99 | ai-access | A flexible PHP library providing access to various AI models (Gemini, OpenAI, Anthropic, DeepSeek, Grok) via a consistent interface. | 4 | |
100 | GEN-AI-CAPSTONE-PROJECT | This project demonstrates a Generative AI-powered assistant that streamlines the job application process using Google Gemini Pro. It analyzes a user’s resume against a job description, calculates a match score, suggests tailored bullet points, and generates a personalized cover letter — all formatted in structured JSON for automation. | 4 |