TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | paperless-ai | An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. | 2.9K | |
2 | LatentSync | Taming Stable Diffusion for Lip Sync! | 2.8K | |
3 | open-deep-research | Open source alternative to Gemini Deep Research. Generate reports with AI based on search results. | 1.5K | |
4 | cursor-ai-downloads | All Cursor AI's official download links for both the latest and older versions, making it easy for you to update, downgrade, and choose any version. 🚀 | 1.5K | |
5 | preswald | 🐵 Preswald is a full-stack platform for building, deploying, and managing interactive data applications. It brings ingestion, storage, transformation, and visualization into a simple SDK, minimizing complexity while maintaining flexibility for both prototyping and production-grade use cases. | 1.2K | |
6 | vlms-zero-to-hero | This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. | 1.0K | |
7 | gemini-teacher | English pronunciation correction teacher built with gemini | 983 | |
8 | meeting-minutes | A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS support is added. Will add windows and linux support soon) | 978 | |
9 | ZhiLight | A highly optimized LLM inference acceleration engine for Llama and its variants. | 884 | |
10 | starter-applets | Google AI Studio Starter Apps | 799 | |
11 | groq-appgen | Project showcasing Llama 3.3 70B HTML codegen abilities | 605 | |
12 | PocketFlow | Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves. | 586 | |
13 | OmniThink | OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking | 438 | |
14 | chipper | ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) | 437 | |
15 | clickclickclick | A framework to enable autonomous android and computer use using any LLM (local or remote) | 382 | |
16 | gemini-2-live-api-demo | Vanilla JS web interface for Gemini 2.0 flash-exp Multimodal API with text, audio, camera, screen inputs and audio responses and function calling | 297 | |
17 | RoboVLMs | 285 | ||
18 | awesome-open-source-lms | Friends of OLMo and their links. | 266 | |
19 | airweave | Turn any app into agent knowledge | 226 | |
20 | X-Codec-2.0 | Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis | 224 | |
21 | notte | The agentic internet | 189 | |
22 | fabrice-ai | A lightweight, functional, and composable framework for building AI agents. No PhD required. | 181 | |
23 | Local-NotebookLM | Googles NotebookLM but local | 153 | |
24 | VLABench | Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. | 148 | |
25 | FreeScale | Code for FreeScale, a tuning-free method for higher-resolution visual generation | 115 | |
26 | PiTrac | Launch monitor using low-cost raspberry pi and camera hardware to determine ball launch speed, angles and spin | 105 | |
27 | AI-book-maker-with-perplexity-search-grounding | uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API | 86 | |
28 | NaVid-VLN-CE | [RSS 2024] NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | 84 | |
29 | VLM-RL | VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving | 84 | |
30 | Sora | An iOS and macOS modular web scraping app | 81 | |
31 | MLX-Model-Manager | MLX Model Manager unifies loading and inferencing with LLMs and VLMs. | 81 | |
32 | TrustEval-toolkit | TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs) | 79 | |
33 | windsurf-vip-free | It's an automatic sign-up for a Windsurf account, so you only need an email address to get unlimited access to windsurf's premium features..这是一个自动注册 Windsurf 账号的工具,你只需要一个邮箱即可无限使用windsurf高级功能。 | 79 | |
34 | Awesome-RS-Temporal-VLM | Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey | 76 | |
35 | atlas-mcp-server | A Model Context Protocol (MCP) server providing path-based task management with dependency tracking & more for Large Language Models (LLM) Agents | 58 | |
36 | dingo | Dingo: A Comprehensive Data Quality Evaluation Tool | 57 | |
37 | microsoft-markitdown-streamlit-ui | A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with optional GPT-4o enhancement. | 47 | |
38 | SeeGround | [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | 46 | |
39 | mcp-server-llamacloud | A MCP server connecting to a managed index on LlamaCloud | 41 | |
40 | gptree | A CLI tool to provide LLM context for coding projects by combining project files into a single text file (or clipboard text) with directory tree structure. | 41 | |
41 | FinGLM2 | 智谱AI 2024年金融行业大模型挑战赛仓库 | 40 | |
42 | chatgpt-cn | ChatGPT 中文版:国内免费使用指南及镜像网站推荐(支持 GPT-4o 和 o1)【2025年2月更新】 | 40 | |
43 | Emma-X | Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | 39 | |
44 | student-gpt-tools | AI tools collection for students and researchers. 学生和研究人员的AI工具合集。 | 35 | |
45 | divergent | LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each other or to 50 initial random words. | 30 | |
46 | hn-enhancer | Hacker News Companion - Browser extension | 29 | |
47 | WorkflowAI | WorkflowAI is an open-source platform where product and engineering teams
collaborate to build and iterate on AI features. | 25 | |
48 | SDXL-Training-Improvements | 📊 Research-focused SDXL training framework exploring novel optimization approaches. Goals include enhanced image quality, training stability & comprehensive monitoring. ⭐ Performance-focused research framework. | 19 | |
49 | sources | READ THE README | 19 | |
50 | sec-docs | An experimental project using LLM technology to generate security documentation for Open Source Software (OSS) projects | 18 | |
51 | Analysis | 一些最新逆向demo,目前有爱奇艺、QQ音乐(sign、cookie获取及刷新)、网易云(param、encSecKey及cookie获取)、酷狗(signature)、优酷、剧看看、抖音(a_bogus 192)、小红书(x-s、x-s-common),自备cookie,大模型kimi、deepseek(自备Bearer Token,支持菜单、(是否开启R1深度思考)k1.5\kimi、流式输出、是否开启联网搜索)、讯飞星火(自备cookie,流式输出联网搜索) | 18 | |
52 | sora-website | The official landing page for Sora Labs. | 18 | |
53 | Electron-Executor | Roblox Electron Executor is one of the most favorite Roblox Executors at the moment. Before I tell you how to download Electron Executor, let me tell you that it is currently available safely for Windows. But it is not officially available for Android users as of now but the update is coming and will be launched soon. | 16 | |
54 | GVA-Survey | Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms | 14 | |
55 | prompt_builder | A macOS tool to build long-context prompts for models like OpenAI o1 and Gemini 2.0. | 14 | |
56 | LMPC_qpOASES_diff | 14 | ||
57 | neurips2024 | Read and Listen to NeurIPS 2024 Papers | 12 | |
58 | llmling-agent | Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Human-in-the-loop integration. | 12 | |
59 | Trading-GPT | TradeGPT is an intelligent trading bot built with ChatGPT and AI to automate and optimize trading strategies. It analyzes market data, predicts trends, and executes trades in real-time, providing traders with tools to enhance efficiency and profitability. | 12 | |
60 | agno-ai-news-slack-bot | 🤖 AI News Slack Bot | Daily AI tech updates in Slack Channel | Uses GPT-4o, DuckDuckGo & Phidata | Dockerized & Portainer-ready | 12 | |
61 | ComfyUI-Nuke-a-Text-Encoder | For SDXL, SD1.5, Flux. Nuke T5 and let CLIP guide Flux.1 on its own! Or let let random guide Flux.1! Or load a CLIP crazy opinion embedding about your image and let that guide the AI! | 11 | |
62 | octoprompt | OctoPrompt is a local based prompt engineer for your codebase. Craft the perfect prompt for modern LLMs. | 11 | |
63 | zwai-lab | comfyUI+vscode,两大神器首次跨界融合。集成式AGI大模型开发+AIGC创意平台,苹果OOTB模式,解压即用。comfyUI+vscode, The first cross-border fusion of two major artifacts. Integrated AGI Large Model Development +AIGC Creative Platform, Apple OOTB mode, ready to use for decompression. | 10 | |
64 | llm-arxiv-daily | Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions. | 10 | |
65 | notebook-llama | An open source implementation of NotebookLM that runs on Union | 10 | |
66 | FreeScoutGPT | FreeScout module for requests to ChatGPT using latest models | 10 | |
67 | DDG2API | DuckDuckGo AI to API. Chatting with DuckDuckGo AI through API,Free to use gpt-4o-mini, claude-3-haiku, llama-3.1-70b, mixtral-8x7b, etc. Supports continuous dialogue, compatible with OpenAI API format. | 10 | |
68 | codellm-releases | AI code editor that enhances developer productivity, bundled with the AI super assistant, ChatLLM. | 10 | |
69 | MathUtils | A tool for evaluating LLMs on the MATH and GSM8K dataset. | 9 | |
70 | video-analysis-with-gpt-4o | This repository showcases how to leverage the capabilities of LLMs to analyze and extract insights from video files or video URLs, including their audio content, offering several configurable parameters, such as the duration for splitting the video, the number of frames to extract per second, frame resizing, and prompts. | 9 | |
71 | VLArena | Closed-loop evaluation for end-to-end VLM autonomous driving agent | 9 | |
72 | SDXL-Buildings-with-a-cozy-New-Year-s-atmosphere | 触发关键词 Newyear,把物体绒意年味化!!!一起看看怎么装扮家乡的地标迎接新年吧~ | 9 | |
73 | FloTorch | FloTorch is an open-source tool for optimizing Generative AI workloads on AWS. It automates RAG proof-of-concept development with features like hyperparameter tuning, vector database optimization, and LLM integration. FloTorch streamlines experimentation, ensures security, and accelerates production with cost-efficient, validated workflows. | 9 | |
74 | Promptus | Official impl. of "Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion". A general semantic video communication framework. | 9 | |
75 | shiny.ollama | Chat offline with open-source LLMs like deepseek-r1, nemotron, qwen, llama and more all through a simple R package powered by Shiny and Ollama. 🚀 | 9 | |
76 | AktivaAI | Local LLM Discord Bot | 8 | |
77 | Video-Bench | Video Generation Benchmark | 7 | |
78 | TheReader | Document Reader Powered by Gemini AI | 7 | |
79 | vlmrun-python-sdk | Official Python SDK for VLM Run | 6 | |
80 | papercheckai | PaperCheckAI is a cutting-edge, open-source platform-as-a-service leveraging state-of-the-art multimodal large language models (LLMs) to autonomously digitise, interpret, and evaluate handwritten long-form answer sheets in compliance with custom rubrics, ensuring scalable, secure, and explainable AI-driven assessments for educational institutions. | 6 | |
81 | llm-agents-evaluation | 6 | ||
82 | coc-ai | AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim, built on top of coc.nvim's extension system, with support for DeepSeek-R1(deepseek-reasoner) | 6 | |
83 | ChatGPT-CN-Guide | 【ChatGPT 中文版】国内镜像网站免费推荐(支持 4o、o1 和 GPT-4)【2025年3月更新】 | 5 | |
84 | axon | ML platform for training, versioning, and experimenting with VLM and VLA models at scale | 5 | |
85 | CB-LLMs | 5 | ||
86 | graphrag_webui | A web interface for GraphRAG. 🚀 | 5 | |
87 | lmb | Language Model Board, a better way to read the LMSYS results | 5 | |
88 | LLM-dialog-box | Mading a LLM chat compoent | 5 | |
89 | dcit201-classroom-lms-LMS | dcit201-classroom-lms-LMS created by GitHub Classroom | 5 | |
90 | GLM_vision | GLM_vision 是一款适用于 chatgpt-on-wechat 的图像和视频分析插件,基于智谱GLM-4V视觉模型,支持通过URL链接分析图片和视频内容。 | 4 | |
91 | reasoning-models | Experiments with reasoning models, training techniques, papers | 4 | |
92 | Commify | Commify: You Should Commit Yourself. Commify is a CLI tool that generates meaningful, structured commit messages for Git repositories using AI. | 4 | |
93 | hass_lmair | Home Assistant custom integration for the Light Manager Air by jb media. Control lights, blinds and other actuators, receive radio signals , and manage scenes through HASS. | 4 | |
94 | mru-lm | An LM forked from my transformer-train-script repo that replaces attention with a novel idea called "matrix recurrent units." | 4 | |
95 | RAG-Diffusion-xl | RAG-Diffusion re-implemented in sdxl | 3 | |
96 | cultural_evolution | implements the methodology outlined in the paper *Cultural Evolution of Cooperation among LLM Agents*. The paper explores whether a society of large language model (LLM) agents can develop cooperative norms through cultural evolution, using the classic *Donor Game*. The goal is to evaluate multi-agent interaction dynamics | 3 | |
97 | LLM-paper-daily | Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily | 3 | |
98 | LlamaCppAndroidTest | 3 | ||
99 | genai-toolbox-llamaindex-python | LlamaIndex SDK for interacting with the Gen AI Toolbox for Databases. | 3 | |
100 | sora-editor-with-androlua | 3 |