Top AI developers by monthly star count
Top AI organization accounts by AI repo star count
Top AI projects by category star count
Fastest-growing list, ranked by rate of gaining stars
Top list of little-known developers who create influential repos
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | | zerox | PDF to Markdown with vision models | 6.1K |
2 | | MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 5.1K |
3 | | repomix | 📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini. | 3.8K |
4 | | fragments | Open-source Next.js template for building apps that are fully generated by AI. By E2B. | 3.4K |
5 | | minimind | [Large model] Train a small 26M-parameter GPT completely from scratch in 3 hours; inference and training run on a personal GPU! | 2.7K |
6 | | cake | Distributed LLM and StableDiffusion inference for mobile, desktop and server. | 2.6K |
7 | | postgres-new | In-browser Postgres sandbox with AI assistance | 2.4K |
8 | | aide | Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 💪 | 2.3K |
9 | | XcodeLLMEligible | | 1.9K |
10 | | ichigo | Llama3.1 learns to Listen | 1.8K |
11 | | GraphRAG-Local-UI | GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app. | 1.7K |
12 | | ai-renamer | A Node.js CLI that uses Ollama and LM Studio models (Llava, Gemma, Llama etc.) to intelligently rename files by their contents | 1.6K |
13 | | nano-graphrag | A simple, easy-to-hack GraphRAG implementation | 1.4K |
14 | | llamatutor | An AI personal tutor built with Llama 3.1 | 1.4K |
15 | | docetl | A system for agentic LLM-powered data processing and ETL | 1.3K |
16 | | OmAgent | A Multimodal Native Agent Framework for Smart Hardware and More | 1.2K |
17 | | MobileLLM | MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. | 1.1K |
18 | | lm.rs | Minimal LLM inference in Rust | 918 |
19 | | prompt-poet | Streamlines and simplifies prompt design for both developers and non-technical users with a low-code approach. | 911 |
20 | | gpt4-captcha-bypass | Captcha Bypass using GPT-4o | 717 |
21 | | arch | Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with APIs - all outside business logic. Built by the core contributors of Envoy proxy, on Envoy. | 597 |
22 | | huggingface-llama-recipes | | 529 |
23 | | free-llm-api-resources | A list of free LLM inference resources accessible via API. | 521 |
24 | | harbor | Effortlessly run LLM backends, APIs, frontends, and services with one command. | 504 |
25 | | tensorzero | TensorZero creates a feedback loop for optimizing LLM applications, turning production data into smarter, faster, and cheaper models. | 480 |
26 | | midscene | An AI-powered automation SDK that can control the page, perform assertions, and extract data in JSON format using natural language. | 477 |
27 | | lotus | LOTUS: A semantic query engine - process data with LLMs as easily as writing pandas code | 421 |
28 | | ell | A command-line interface for LLMs written in Bash. | 420 |
29 | | Stable-Hair | Stable-Hair: Real-World Hair Transfer via Diffusion Model | 365 |
30 | | ttt-lm-jax | Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States | 363 |
31 | | PromptChains | Prompt chains maximize intelligence and results when using LLMs | 340 |
32 | | uni-api | This is a project that unifies the management of LLM APIs. It can call multiple backend services through a unified API interface, convert them to the OpenAI format uniformly, and support load balancing. Currently supported backend services include: OpenAI, Anthropic, DeepBricks, OpenRouter, Gemini, Vertex, etc. | 337 |
33 | | stable-diffusion-from-scratch | Implementation of Stable Diffusion with PyTorch | 296 |
34 | | FunAudioLLM-APP | | 287 |
35 | | fastagency | The fastest way to bring multi-agent workflows to production. | 280 |
36 | | Awesome-Attention-Heads | An awesome repository & A comprehensive survey on interpretability of LLM attention heads. | 264 |
37 | | llmdocparser | A package for parsing PDFs and analyzing their content using LLMs. | 233 |
38 | | LlamaVoice | LlamaVoice is a llama-based large voice generation model, providing inference and training ability. | 221 |
39 | | TokenPacker | The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM". | 213 |
40 | | claude-artifact-runner | A React-based web app project that enables running Claude AI’s Artifacts either locally or on your own server. | 202 |
41 | | GraphRag.Net | A dotnet version modeled on GraphRag and implemented with Semantic Kernel; it can be integrated into projects out of the box via NuGet. | 184 |
42 | | lmms-finetune | A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v etc. | 174 |
43 | | MambaInLlama | Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models | 170 |
44 | | trustgraph | Connect Data Silos with Reliable AI⚡🚀 | 166 |
45 | | chatwiki | An out-of-the-box intelligent customer-service chatbot Q&A system built on LLMs and enterprise private knowledge bases. It supports private deployment, and the code is free, open source, and available for commercial use. Officially released by 芝麻小客服 (Zhima Xiaokefu). | 150 |
46 | | AUITestAgent | AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification. | 148 |
47 | | LLMEPET | [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval | 139 |
48 | | premsql | End-to-End Local-First Text-to-SQL Pipelines | 139 |
49 | | style-reference | List of Stable Diffusion style prompts, optimized for RobMix Zenith. | 130 |
50 | | DenseFusion | DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | 117 |
51 | | marly | Contextualized Structured Outputs. Search your documents or the web for specific data and get it back in JSON or Markdown. | 112 |
52 | | ChatGPT-code-preview | Artifacts-like Chrome extension for ChatGPT, inspired by Claude 3.5 Sonnet. Requires CSP unblocker for JS to function. | 106 |
53 | | llama_extract | | 105 |
54 | | Awesome-LLM-KV-Cache | Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes. | 99 |
55 | | finetuning | Finetune Llama-3-8b on the MathInstruct dataset | 97 |
56 | | minuet-ai.nvim | 💃 Dance with Intelligence in Your Code. Minuet AI integrates with nvim-cmp, offering AI completion from popular LLMs including OpenAI, Gemini, Claude, and more. | 96 |
57 | | GPT-Autoagent-Multimodal-Task-Project | This project shows how to use GPT to automatically retrieve files in a repository (such as PDF, XLS, and Word) and complete multimodal tasks. Video frames from a home camera can be fed into the repository to automatically judge whether something dangerous is happening at home (drawing on the large model's understanding of the world). | 85 |
58 | | LLaVA-MORE | LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 | 84 |
59 | | eureka-ml-insights | A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. | 83 |
60 | | extensionOS | Imagine a world where everyone can access powerful AI models (LLMs, generative image models, and speech recognition) directly in their web browser. Integrating AI into daily browsing will revolutionise online interactions, offering instant, intelligent assistance tailored to individual needs. | 79 |
61 | | transformer-ranker | Efficiently find the best-suited language model (LM) for your NLP task | 77 |
62 | | bookmarksAI | GPT automatically organizes your browser bookmarks | 75 |
63 | | LLavaImageTagger | Creates an index of images, queries a local LLM and adds tags to the image metadata | 63 |
64 | | Awesome-OOD-VLM | Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024] | 62 |
65 | | jd_scripts | jd_scripts, https://jdc.52hym666.top scripts. Keywords: N100, NAS, awesome, Double 11, NAS, deal hunting, docker, automation, nodejs, reservations, iMoutai, damai, flash sales, pxq, ticket grabbing, 12306, tools, autojs, panel, qinglong, handy tools, go, data, PDF, programs, tv, getting started, java, fun, next, blog, halo, site building, API, collection, python, bots, AI, one-click, admin, generate, PT, bot, JD, Synology, check-in, WPS, personal use, tg, Bilibili, ele, beginners, proxy, cloud drive, VPN, deployment, lucky, Ele.me, frp, basics, CDN, simple, push, Piaoxingqiu, TikTok, Fenwandao, fwd, gpt, Pinduoduo, Xiaohongshu, Meituan, technology, chatgpt, go, jd, jd_scripts, jd_scripts JD.com scripts, nginx, php, python, rust, JD.com, hosted running, free | 57 |
66 | | duckduckgo-ai-chat | Providing Duckduckgo AI Chat API, which can use gpt-4o-mini for free. | 53 |
67 | | ai-shifu | LLM-powered AI guide that leads and drives intelligent conversations | 53 |
68 | | gLM2 | | 52 |
69 | | AIEntries | WordPress plugin to automate creation of quality WordPress standard posts using NEWS API, GEMINI AI, and Stable Diffusion API for free | 51 |
70 | | ai-mv-generator | A multi-agent system designed for generating music videos with scrolling subtitles based on lyrics. This project analyzes the lyrics and creates detailed prompts based on the analysis results to generate story-like images, ultimately producing an image-to-image music video, using the OpenAI API with the gpt-4o and dall-e 3 models. | 50 |
71 | | Llama3-8B_Emotion_Text_Classification_LoRA | Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory. | 49 |
72 | | bookmark-summary | Summaries generated with an LLM and jina reader | 49 |
73 | | chatgpt-vue3-light-mvp | 💭 A customizable MVP prototype template for a chatbot web front end, built with mainstream technologies such as Vue3, Vite 5, TypeScript, Naive UI, and UnoCSS. 🧤 Simple integration with large-model APIs; uses a single-turn AI Q&A mode in which each question gets an independent response with no context required; supports streaming output with a typewriter effect; integrates markdown-it preview. 💼 Easy to customize for quickly building chat-style LLM products (example screenshots included) | 48 |
74 | | Awesome-Cluade-Artifacts | Share your Claude artifacts ❤️ | 47 |
75 | | KDPL | [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | 46 |
76 | | lobe-chat-pro | Based on lobe-chat, with an added drawing panel; supports midjourney, dall-e-3, flux, suno, luma, runway, kling (Kuaishou Kling), and user registration and login; support for stable-diffusion, top-up and spending, etc. is planned. | 46 |
77 | | large-model-proxy | Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with a limited amount of VRAM/other resources. It listens on a dedicated port for each proxied LM, making them always available to the clients connecting to these ports. | 46 |
78 | | compose-would-you-rather-game | 📱 Compose Multiplatform, 100% UI shared by Compose, with content generated by Gemini | 43 |
79 | | AUTOSAR-MCAL-Embedded-Upskilling-Bootcamp | AUTOSAR MCAL Embedded Upskilling Bootcamp by Modular MX. | 41 |
80 | | marqo-llama3_1 | Demo of a RAG Question and Answering System with Llama 3.1 and Marqo | 39 |
81 | | Python-Voice-Assistant-Suryanshsk | A Python-based virtual assistant using Gemini AI. Features include voice recognition, text-to-speech, weather updates, news retrieval, jokes, Wikipedia info, and music management. Comes with an interactive web interface. Easily extendable and customizable. | 39 |
82 | | Awesome-MLLM-LLM-Colab | Happy experimenting with MLLM and LLM models! | 39 |
83 | | ros2_nanollm | ROS2 nodes for LLM, VLM, VLA | 38 |
84 | | barq | Dabarqus is a standalone application that implements a complete RAG solution. | 37 |
85 | | ThinkRAG | An LLM RAG system that runs on your laptop. | 33 |
86 | | control-lora-v3 | ControlLoRA Version 3: LoRA Is All You Need to Control the Spatial Information of Stable Diffusion. | 33 |
87 | | MMInstruct | The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity". The MMInstruct dataset includes 973K instructions from 24 domains and four instruction types. | 33 |
88 | | localwriter | A LibreOffice Writer extension that adds local-inference generative AI features. | 32 |
89 | | TheoremLlama | This is the official repository for all the code of TheoremLlama | 31 |
90 | | CompBench | CompBench evaluates the comparative reasoning of multimodal large language models (MLLMs) with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scenes. | 31 |
91 | | LiteWebAgent | The Library for LLM-based web-agent applications | 31 |
92 | | SynthVLM | | 30 |
93 | | LLM-Engines | | 29 |
94 | | Awesome-Multimodal-LLM-for-Math-STEM | Paper collections of multi-modal LLM for Math/STEM/Code. | 28 |
95 | | PodGPT | PodGPT: A multilingual audio-augmented large language model for research and education | 27 |
96 | | codestral-mamba-for-vscode | Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot. | 26 |
97 | | hello-llama | A simple chat bot to play with Llama 3.1 | 26 |
98 | dtinth (2.0K followers; Krungthepmahanakhonamonrattanakosinmahintharayutthayamahadilokphopnoppharatratchathaniburiromudomratchaniwetmahasathanamonphimanawatansathitsakkathattiyawitsanukamprasit (Bangkok), Thailand) | autosub | Automated generation of subtitles for tech talks in Thai language using Speechmatics, Gemini, GPT-4o and Claude. | 26 |
99 | | Knowledge-Graph-for-RAG-using-Neo4j | In this project I designed a knowledge graph focused on Napoleon's history. I built a RAG application using this data and improved the LLM's output using the relationships between nodes | 26 |
100 | | GPT-Talker | [ACMMM'2024] Generative Expressive Conversational Speech Synthesis | 26 |