TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | KrillinAI | A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,TikTok, and Shorts. 基于AI大模型的视频翻译和配音工具,专业级翻译,一键部署全流程,可以生成适配抖音,小红书,哔哩哔哩,视频号,TikTok,Youtube Shorts等形态的内容 | 7.5K | |
2 | LatentSync | Taming Stable Diffusion for Lip Sync! | 4.1K | |
3 | paperless-ai | An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. | 3.4K | |
4 | cursor-ai-downloads | All Cursor AI's official download links for both the latest and older versions, making it easy for you to update, downgrade, and choose any version. 🚀 | 1.9K | |
5 | open-deep-research | Open source alternative to Gemini Deep Research. Generate reports with AI based on search results. | 1.5K | |
6 | preswald | 🐵 Preswald is a full-stack platform for building, deploying, and managing interactive data applications. It brings ingestion, storage, transformation, and visualization into a simple SDK, minimizing complexity while maintaining flexibility for both prototyping and production-grade use cases. | 1.2K | |
7 | vlms-zero-to-hero | This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. | 1.0K | |
8 | gemini-teacher | English pronunciation correction teacher built with gemini | 983 | |
9 | meeting-minutes | A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS support is added. Will add windows and linux support soon) | 978 | |
10 | ZhiLight | A highly optimized LLM inference acceleration engine for Llama and its variants. | 884 | |
11 | starter-applets | Google AI Studio Starter Apps | 799 | |
12 | groq-appgen | Project showcasing Llama 3.3 70B HTML codegen abilities | 607 | |
13 | PocketFlow | Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves. | 586 | |
14 | daydreams | Daydreams is a generative agent framework for executing anything onchain | 457 | |
15 | OmniThink | OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking | 445 | |
16 | chipper | ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) | 437 | |
17 | WorkflowAI | WorkflowAI is an open-source platform where product and engineering teams
collaborate to build and iterate on AI features. | 393 | |
18 | clickclickclick | A framework to enable autonomous android and computer use using any LLM (local or remote) | 382 | |
19 | gemini-2-live-api-demo | Vanilla JS web interface for Gemini 2.0 flash-exp Multimodal API with text, audio, camera, screen inputs and audio responses and function calling | 297 | |
20 | RoboVLMs | 285 | ||
21 | awesome-open-source-lms | Friends of OLMo and their links. | 266 | |
22 | airweave | Turn any app into agent knowledge | 226 | |
23 | VLABench | Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. | 225 | |
24 | X-Codec-2.0 | Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis | 224 | |
25 | notte | The agentic internet | 189 | |
26 | fabrice-ai | A lightweight, functional, and composable framework for building AI agents. No PhD required. | 181 | |
27 | Sora | An iOS and macOS modular web scraping app | 176 | |
28 | PiTrac | Launch monitor using low-cost raspberry pi and camera hardware to determine ball launch speed, angles and spin | 154 | |
29 | Local-NotebookLM | Googles NotebookLM but local | 153 | |
30 | FreeScale | Code for FreeScale, a tuning-free method for higher-resolution visual generation | 115 | |
31 | LLM-FuzzX | LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutation strategies, fine-grained evaluations, and jailbreak detection to uncover potential security vulnerabilities and enhance model robustness. | 111 | |
32 | Awesome-RS-SpatioTemporal-VLMs | Remote Sensing Spatio-Temporal Vision-Language Models: A Comprehensive Survey | 108 | |
33 | AI-book-maker-with-perplexity-search-grounding | uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API | 86 | |
34 | NaVid-VLN-CE | [RSS 2024] NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | 84 | |
35 | VLM-RL | VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving | 84 | |
36 | MLX-Model-Manager | MLX Model Manager unifies loading and inferencing with LLMs and VLMs. | 81 | |
37 | TrustEval-toolkit | TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs) | 79 | |
38 | windsurf-vip-free | It's an automatic sign-up for a Windsurf account, so you only need an email address to get unlimited access to windsurf's premium features..这是一个自动注册 Windsurf 账号的工具,你只需要一个邮箱即可无限使用windsurf高级功能。 | 79 | |
39 | atlas-mcp-server | A Model Context Protocol (MCP) server providing path-based task management with dependency tracking & more for Large Language Models (LLM) Agents | 58 | |
40 | dingo | Dingo: A Comprehensive Data Quality Evaluation Tool | 57 | |
41 | sources | READ THE README | 48 | |
42 | microsoft-markitdown-streamlit-ui | A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with optional GPT-4o enhancement. | 47 | |
43 | SeeGround | [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | 46 | |
44 | OSWorld-G | Scaling Computer-Use Grounding via UI Decomposition and Synthesis | 43 | |
45 | mcp-server-llamacloud | A MCP server connecting to a managed index on LlamaCloud | 41 | |
46 | gptree | A CLI tool to provide LLM context for coding projects by combining project files into a single text file (or clipboard text) with directory tree structure. | 41 | |
47 | FinGLM2 | 智谱AI 2024年金融行业大模型挑战赛仓库 | 40 | |
48 | chatgpt-cn | ChatGPT 中文版:国内免费使用指南及镜像网站推荐(支持 GPT-4o 和 o1)【2025年5月更新】 | 40 | |
49 | Emma-X | Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | 39 | |
50 | student-gpt-tools | AI tools collection for students and researchers. 学生和研究人员的AI工具合集。 | 35 | |
51 | divergent | LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each other or to 50 initial random words. | 30 | |
52 | hn-enhancer | Hacker News Companion - Browser extension | 29 | |
53 | Analysis | 一些较新逆向demo,目前有爱奇艺、QQ音乐(sign、cookie获取及刷新)、网易云(param、encSecKey及cookie获取)、酷狗(signature)、优酷、剧看看、抖音(a_bogus 192)、小红书(x-s、x-s-common),自备cookie | [可封装成大模型API]大模型kimi、deepseek(自备Bearer Token,支持菜单、(是否开启R1深度思考,默认v3)k1.5\kimi、流式输出、是否开启联网搜索)、讯飞星火(自备cookie,流式输出联网搜索)、豆包(自备cookie、是否启用深度搜索)、unlimited ai(上代理即用,无限制,上下文推理,能干什么,如其名) | 26 | |
54 | autosrt_page | AutoSRT is an macOS app that automatically generates dual language subtitles from video files. | 25 | |
55 | alith | Simple, Composable, High-Performance, Safe and Web3 Friendly AI Agents and LazAI Gateway for Everyone | 24 | |
56 | SDXL-Training-Improvements | 📊 Research-focused SDXL training framework exploring novel optimization approaches. Goals include enhanced image quality, training stability & comprehensive monitoring. ⭐ Performance-focused research framework. | 19 | |
57 | sec-docs | An experimental project using LLM technology to generate security documentation for Open Source Software (OSS) projects | 18 | |
58 | sora-website | The official landing page for Sora Labs. | 18 | |
59 | Trading-GPT | TradeGPT is an intelligent trading bot built with ChatGPT and AI to automate and optimize trading strategies. It analyzes market data, predicts trends, and executes trades in real-time, providing traders with tools to enhance efficiency and profitability. | 18 | |
60 | bubbaloop | 🦄 Serving Platform for Spatial AI and Robotics. | 17 | |
61 | Electron-Executor | Roblox Electron Executor is one of the most favorite Roblox Executors at the moment. Before I tell you how to download Electron Executor, let me tell you that it is currently available safely for Windows. But it is not officially available for Android users as of now but the update is coming and will be launched soon. | 16 | |
62 | awesome-llm-app | 这是一个汇集了众多优秀大型语言模型应用的合集,这些应用采用了检索增强生成(RAG)技术,并使用了 OpenAI、Anthropic、Gemini 以及开源模型。Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models. | 14 | |
63 | GVA-Survey | Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms | 14 | |
64 | prompt_builder | A macOS tool to build long-context prompts for models like OpenAI o1 and Gemini 2.0. | 14 | |
65 | LMPC_qpOASES_diff | 14 | ||
66 | neurips2024 | Read and Listen to NeurIPS 2024 Papers | 12 | |
67 | llmling-agent | Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Human-in-the-loop integration. | 12 | |
68 | agno-ai-news-slack-bot | 🤖 AI News Slack Bot | Daily AI tech updates in Slack Channel | Uses GPT-4o, DuckDuckGo & Phidata | Dockerized & Portainer-ready | 12 | |
69 | ComfyUI-Nuke-a-Text-Encoder | For SDXL, SD1.5, Flux. Nuke T5 and let CLIP guide Flux.1 on its own! Or let let random guide Flux.1! Or load a CLIP crazy opinion embedding about your image and let that guide the AI! | 11 | |
70 | octoprompt | OctoPrompt is a local based prompt engineer for your codebase. Craft the perfect prompt for modern LLMs. | 11 | |
71 | zwai-lab | comfyUI+vscode,两大神器首次跨界融合。集成式AGI大模型开发+AIGC创意平台,苹果OOTB模式,解压即用。comfyUI+vscode, The first cross-border fusion of two major artifacts. Integrated AGI Large Model Development +AIGC Creative Platform, Apple OOTB mode, ready to use for decompression. | 10 | |
72 | llm-arxiv-daily | Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions. | 10 | |
73 | notebook-llama | An open source implementation of NotebookLM that runs on Union | 10 | |
74 | FreeScoutGPT | FreeScout module for requests to ChatGPT using latest models | 10 | |
75 | DDG2API | DuckDuckGo AI to API. Chatting with DuckDuckGo AI through API,Free to use gpt-4o-mini, claude-3-haiku, llama-3.1-70b, mixtral-8x7b, etc. Supports continuous dialogue, compatible with OpenAI API format. | 10 | |
76 | codellm-releases | AI code editor that enhances developer productivity, bundled with the AI super assistant, ChatLLM. | 10 | |
77 | MathUtils | A tool for evaluating LLMs on the MATH and GSM8K dataset. | 9 | |
78 | video-analysis-with-gpt-4o | This repository showcases how to leverage the capabilities of LLMs to analyze and extract insights from video files or video URLs, including their audio content, offering several configurable parameters, such as the duration for splitting the video, the number of frames to extract per second, frame resizing, and prompts. | 9 | |
79 | VLArena | Closed-loop evaluation for end-to-end VLM autonomous driving agent | 9 | |
80 | SDXL-Buildings-with-a-cozy-New-Year-s-atmosphere | 触发关键词 Newyear,把物体绒意年味化!!!一起看看怎么装扮家乡的地标迎接新年吧~ | 9 | |
81 | FloTorch | FloTorch is an open-source tool for optimizing Generative AI workloads on AWS. It automates RAG proof-of-concept development with features like hyperparameter tuning, vector database optimization, and LLM integration. FloTorch streamlines experimentation, ensures security, and accelerates production with cost-efficient, validated workflows. | 9 | |
82 | Promptus | Official impl. of "Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion". A general semantic video communication framework. | 9 | |
83 | shiny.ollama | Chat offline with open-source LLMs like deepseek-r1, nemotron, qwen, llama and more all through a simple R package powered by Shiny and Ollama. 🚀 | 9 | |
84 | AktivaAI | Local LLM Discord Bot | 8 | |
85 | deepseek-n8n-automate-workflow | Self-hosted AI Starter Kit is an open, docker compose template that quickly bootstraps a fully featured Local AI and Low Code development environment including Open WebUI for an interface to chat with your N8N agents. Now updated for running Deepseek-r1 locally. | 8 | |
86 | Video-Bench | Video Generation Benchmark | 7 | |
87 | TheReader | Document Reader Powered by Gemini AI | 7 | |
88 | vlmrun-python-sdk | Official Python SDK for VLM Run | 6 | |
89 | papercheckai | PaperCheckAI is a cutting-edge, open-source platform-as-a-service leveraging state-of-the-art multimodal large language models (LLMs) to autonomously digitise, interpret, and evaluate handwritten long-form answer sheets in compliance with custom rubrics, ensuring scalable, secure, and explainable AI-driven assessments for educational institutions. | 6 | |
90 | lmb | Language Model Board, a better way to read the LM Arena results | 6 | |
91 | llm-agents-evaluation | 6 | ||
92 | dcit201-classroom-lms-LMS | dcit201-classroom-lms-LMS created by GitHub Classroom | 6 | |
93 | reasoning-models | Experiments with reasoning models, training techniques, papers | 6 | |
94 | coc-ai | AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim, built on top of coc.nvim's extension system, with support for DeepSeek-R1(deepseek-reasoner) | 6 | |
95 | multiscale-byte-lm | A hierarchical LM that scales to training on +5M context windows | 6 | |
96 | ChatGPT-CN-Guide | 【ChatGPT 中文版】国内镜像网站免费推荐(支持 4o、o1 和 GPT-4)【2025年3月更新】 | 5 | |
97 | axon | ML platform for training, versioning, and experimenting with VLM and VLA models at scale | 5 | |
98 | CB-LLMs | 5 | ||
99 | graphrag_webui | A web interface for GraphRAG. 🚀 | 5 | |
100 | LLM-dialog-box | Mading a LLM chat compoent | 5 |