TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | LatentSync | Taming Stable Diffusion for Lip Sync! | 2.8K | |
2 | paperless-ai | An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. | 2.6K | |
3 | open-deep-research | Open source alternative to Gemini Deep Research. Generate reports with AI based on search results. | 1.5K | |
4 | preswald | 🐵 Preswald is a full-stack platform for building, deploying, and managing interactive data applications. It brings ingestion, storage, transformation, and visualization into a simple SDK, minimizing complexity while maintaining flexibility for both prototyping and production-grade use cases. | 1.2K | |
5 | vlms-zero-to-hero | This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. | 1.0K | |
6 | gemini-teacher | English pronunciation correction teacher built with gemini | 983 | |
7 | meeting-minutes | A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS support is added. Will add windows and linux support soon) | 978 | |
8 | ZhiLight | A highly optimized LLM inference acceleration engine for Llama and its variants. | 872 | |
9 | starter-applets | Google AI Studio Starter Apps | 799 | |
10 | cursor-ai-downloads | All Cursor AI's official download links for both the latest and older versions, making it easy for you to update, downgrade, and choose any version. 🚀 | 608 | |
11 | PocketFlow | Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves. | 586 | |
12 | groq-appgen | Project showcasing Llama 3.3 70B HTML codegen abilities | 455 | |
13 | chipper | ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) | 422 | |
14 | OmniThink | OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking | 404 | |
15 | clickclickclick | A framework to enable autonomous android and computer use using any LLM (local or remote) | 382 | |
16 | gemini-2-live-api-demo | Vanilla JS web interface for Gemini 2.0 flash-exp Multimodal API with text, audio, camera, screen inputs and audio responses and function calling | 297 | |
17 | RoboVLMs | 285 | ||
18 | awesome-open-source-lms | Friends of OLMo and their links. | 266 | |
19 | airweave | Turn any app into agent knowledge | 226 | |
20 | notte | The agentic internet | 189 | |
21 | fabrice-ai | A lightweight, functional, and composable framework for building AI agents. No PhD required. | 181 | |
22 | X-Codec-2.0 | Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis | 153 | |
23 | VLABench | Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. | 148 | |
24 | FreeScale | Code for FreeScale, a tuning-free method for higher-resolution visual generation | 115 | |
25 | AI-book-maker-with-perplexity-search-grounding | uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API | 86 | |
26 | NaVid-VLN-CE | [RSS 2024] NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | 84 | |
27 | VLM-RL | VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving | 84 | |
28 | PiTrac | Launch monitor using low-cost raspberry pi and camera hardware to determine ball launch speed, angles and spin | 83 | |
29 | Sora | An iOS and macOS modular web scraping app | 81 | |
30 | MLX-Model-Manager | MLX Model Manager unifies loading and inferencing with LLMs and VLMs. | 81 | |
31 | TrustEval-toolkit | TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs) | 79 | |
32 | windsurf-vip-free | It's an automatic sign-up for a Windsurf account, so you only need an email address to get unlimited access to windsurf's premium features..这是一个自动注册 Windsurf 账号的工具,你只需要一个邮箱即可无限使用windsurf高级功能。 | 79 | |
33 | Awesome-RS-Temporal-VLM | Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey | 76 | |
34 | atlas-mcp-server | A Model Context Protocol (MCP) server providing path-based task management with dependency tracking & more for Large Language Models (LLM) Agents | 58 | |
35 | dingo | Dingo: A Comprehensive Data Quality Evaluation Tool | 57 | |
36 | microsoft-markitdown-streamlit-ui | A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with optional GPT-4o enhancement. | 47 | |
37 | SeeGround | [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | 46 | |
38 | gptree | A CLI tool to provide LLM context for coding projects by combining project files into a single text file (or clipboard text) with directory tree structure. | 41 | |
39 | FinGLM2 | 智谱AI 2024年金融行业大模型挑战赛仓库 | 40 | |
40 | chatgpt-cn | ChatGPT 中文版:国内免费使用指南及镜像网站推荐(支持 GPT-4o 和 o1)【2025年2月更新】 | 40 | |
41 | Emma-X | Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | 39 | |
42 | student-gpt-tools | AI tools collection for students and researchers. 学生和研究人员的AI工具合集。 | 35 | |
43 | hn-enhancer | Hacker News Companion - Browser extension | 29 | |
44 | SDXL-Training-Improvements | 📊 Research-focused SDXL training framework exploring novel optimization approaches. Goals include enhanced image quality, training stability & comprehensive monitoring. ⭐ Performance-focused research framework. | 19 | |
45 | sources | READ THE README | 19 | |
46 | sec-docs | An experimental project using LLM technology to generate security documentation for Open Source Software (OSS) projects | 18 | |
47 | sora-website | The official landing page for Sora Labs. | 18 | |
48 | Electron-Executor | Roblox Electron Executor is one of the most favorite Roblox Executors at the moment. Before I tell you how to download Electron Executor, let me tell you that it is currently available safely for Windows. But it is not officially available for Android users as of now but the update is coming and will be launched soon. | 16 | |
49 | GVA-Survey | Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms | 14 | |
50 | prompt_builder | A macOS tool to build long-context prompts for models like OpenAI o1 and Gemini 2.0. | 14 | |
51 | neurips2024 | Read and Listen to NeurIPS 2024 Papers | 12 | |
52 | llmling-agent | Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Human-in-the-loop integration. | 12 | |
53 | Trading-GPT | TradeGPT is an intelligent trading bot built with ChatGPT and AI to automate and optimize trading strategies. It analyzes market data, predicts trends, and executes trades in real-time, providing traders with tools to enhance efficiency and profitability. | 12 | |
54 | agno-ai-news-slack-bot | 🤖 AI News Slack Bot | Daily AI tech updates in Slack Channel | Uses GPT-4o, DuckDuckGo & Phidata | Dockerized & Portainer-ready | 12 | |
55 | ComfyUI-Nuke-a-Text-Encoder | For SDXL, SD1.5, Flux. Nuke T5 and let CLIP guide Flux.1 on its own! Or let let random guide Flux.1! Or load a CLIP crazy opinion embedding about your image and let that guide the AI! | 11 | |
56 | octoprompt | OctoPrompt is a local based prompt engineer for your codebase. Craft the perfect prompt for modern LLMs. | 11 | |
57 | zwai-lab | comfyUI+vscode,两大神器首次跨界融合。集成式AGI大模型开发+AIGC创意平台,苹果OOTB模式,解压即用。comfyUI+vscode, The first cross-border fusion of two major artifacts. Integrated AGI Large Model Development +AIGC Creative Platform, Apple OOTB mode, ready to use for decompression. | 10 | |
58 | llm-arxiv-daily | Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions. | 10 | |
59 | FreeScoutGPT | FreeScout module for requests to ChatGPT using latest models | 10 | |
60 | DDG2API | DuckDuckGo AI to API. Chatting with DuckDuckGo AI through API,Free to use gpt-4o-mini, claude-3-haiku, llama-3.1-70b, mixtral-8x7b, etc. Supports continuous dialogue, compatible with OpenAI API format. | 10 | |
61 | codellm-releases | AI code editor that enhances developer productivity, bundled with the AI super assistant, ChatLLM. | 10 | |
62 | MathUtils | A tool for evaluating LLMs on the MATH and GSM8K dataset. | 9 | |
63 | video-analysis-with-gpt-4o | This repository showcases how to leverage the capabilities of LLMs to analyze and extract insights from video files or video URLs, including their audio content, offering several configurable parameters, such as the duration for splitting the video, the number of frames to extract per second, frame resizing, and prompts. | 9 | |
64 | VLArena | Closed-loop evaluation for end-to-end VLM autonomous driving agent | 9 | |
65 | SDXL-Buildings-with-a-cozy-New-Year-s-atmosphere | 触发关键词 Newyear,把物体绒意年味化!!!一起看看怎么装扮家乡的地标迎接新年吧~ | 9 | |
66 | FloTorch | FloTorch is an open-source tool for optimizing Generative AI workloads on AWS. It automates RAG proof-of-concept development with features like hyperparameter tuning, vector database optimization, and LLM integration. FloTorch streamlines experimentation, ensures security, and accelerates production with cost-efficient, validated workflows. | 9 | |
67 | Promptus | Official impl. of "Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion". A general semantic video communication framework. | 9 | |
68 | AktivaAI | Local LLM Discord Bot | 8 | |
69 | Video-Bench | Video Generation Benchmark | 7 | |
70 | TheReader | Document Reader Powered by Gemini AI | 7 | |
71 | vlmrun-python-sdk | Official Python SDK for VLM Run | 6 | |
72 | papercheckai | PaperCheckAI is a cutting-edge, open-source platform-as-a-service leveraging state-of-the-art multimodal large language models (LLMs) to autonomously digitise, interpret, and evaluate handwritten long-form answer sheets in compliance with custom rubrics, ensuring scalable, secure, and explainable AI-driven assessments for educational institutions. | 6 | |
73 | llm-agents-evaluation | 6 | ||
74 | coc-ai | AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim, built on top of coc.nvim's extension system, with support for DeepSeek-R1(deepseek-reasoner) | 6 | |
75 | ChatGPT-CN-Guide | 【ChatGPT 中文版】国内镜像网站免费推荐(支持 4o、o1 和 GPT-4)【2025年3月更新】 | 5 | |
76 | axon | ML platform for training, versioning, and experimenting with VLM and VLA models at scale | 5 | |
77 | CB-LLMs | 5 | ||
78 | graphrag_webui | A web interface for GraphRAG. 🚀 | 5 | |
79 | LLM-dialog-box | Mading a LLM chat compoent | 5 | |
80 | dcit201-classroom-lms-LMS | dcit201-classroom-lms-LMS created by GitHub Classroom | 5 | |
81 | GLM_vision | GLM_vision 是一款适用于 chatgpt-on-wechat 的图像和视频分析插件,基于智谱GLM-4V视觉模型,支持通过URL链接分析图片和视频内容。 | 4 | |
82 | Local-NotebookLM | Googles NotebookLM but local | 4 | |
83 | RAG-Diffusion-xl | RAG-Diffusion re-implemented in sdxl | 3 | |
84 | cultural_evolution | implements the methodology outlined in the paper *Cultural Evolution of Cooperation among LLM Agents*. The paper explores whether a society of large language model (LLM) agents can develop cooperative norms through cultural evolution, using the classic *Donor Game*. The goal is to evaluate multi-agent interaction dynamics | 3 | |
85 | LLM-paper-daily | Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily | 3 | |
86 | sora-editor-with-androlua | 3 | ||
87 | NextBench | NextBench is a collection of wide variety of benchmarks for accessing the performance of LLMs and VLMs and more. | 2 | |
88 | vla-gender-bias | [ICLR 2025] Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs) | 2 | |
89 | sdxs-controlnet-sketch | Running Stable Diffusion in under 400MB in the browser in under half a second | 2 | |
90 | openGemini-trino-plugin | This project is a plug-in for the distributed SQL engine Trino. With this plug-in, you can use Trino directly to access openGemini. | 2 | |
91 | sora | Sora是什么?如何使用Sora?Sora入口在哪?Sora订阅保姆级教程! | 2 | |
92 | ChatGPT-CN-Guide | 【ChatGPT 中文版】国内镜像网站免费使用(支持 GPT-4o、o1 和 GPT-4)【2025年3月更新】 | 2 | |
93 | LLMs_API | 2 | ||
94 | OSIR-LMTS | 2 | ||
95 | LMS-Karagasthalawa-School-BackEnd | 📚 This Library Management System for Karagastalawa Maha Vidyalaya simplifies book management and borrowing processes. 📖✨ It ensures efficient organization of library resources, 📂 tracks borrowed books, and enhances the overall library experience for students and staff. 🏫💡 | 2 | |
96 | llm_eval | Simple project to enable rapid evaluation of different prompts & models | 2 | |
97 | rocswap | llama.cpp + ROCm + llama-swap | 2 | |
98 | rewsury | It helps users interact with multiple AI models directly through Telegram, particularly large language models like DeepSeek-R1, Claude, GPT, Grok, Cohere, DALL.E, SDXL, Llama 3.3, and Mistral. | 2 | |
99 | ComfyUI-Open-Sora-I2V | Another comfy implementation for the short video generation project hpcaitech/Open-Sora, supporting latest V2 and V3 models as well as image to video functions, etc. | 2 | |
100 | Sora | A Discord music bot built with ForgeScript and ForgeMusic. | 2 |