Top AI developers by monthly star count
Top AI organization accounts by AI repo star count
Top AI projects by category star count
Fastest-growing list, ranked by the speed of gaining stars
Top list of little-known developers who created influential repos
Rank | Project | Project intro | Star count |
---|---|---|---|
1 | zerox | PDF to Markdown with vision models | 5.9K | |
2 | MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 5.1K | |
3 | repomix | 📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini. | 3.5K | |
4 | fragments | Open-source Next.js template for building apps that are fully generated by AI. By E2B. | 3.4K | |
5 | minimind | [Large model] Train a small 26M-parameter GPT completely from scratch in 3 hours; inference and training both run on a personal GPU! | 2.6K | |
6 | cake | Distributed LLM and StableDiffusion inference for mobile, desktop and server. | 2.6K | |
7 | postgres-new | In-browser Postgres sandbox with AI assistance | 2.4K | |
8 | aide | Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 💪 | 2.3K | |
9 | XcodeLLMEligible | | 1.7K | |
10 | ichigo | Llama3.1 learns to Listen | 1.7K | |
11 | GraphRAG-Local-UI | GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app. | 1.7K | |
12 | ai-renamer | A Node.js CLI that uses Ollama and LM Studio models (Llava, Gemma, Llama etc.) to intelligently rename files by their contents | 1.6K | |
13 | llamatutor | An AI personal tutor built with Llama 3.1 | 1.4K | |
14 | nano-graphrag | A simple, easy-to-hack GraphRAG implementation | 1.3K | |
15 | docetl | A system for agentic LLM-powered data processing and ETL | 1.2K | |
16 | MobileLLM | MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. | 1.1K | |
17 | prompt-poet | Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach. | 910 | |
18 | lm.rs | Minimal LLM inference in Rust | 909 | |
19 | gpt4-captcha-bypass | Captcha Bypass using GPT4-o | 717 | |
20 | arch | Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with APIs - all outside business logic. Built by the core contributors of Envoy proxy, on Envoy. | 537 | |
21 | huggingface-llama-recipes | | 522 | |
22 | harbor | Effortlessly run LLM backends, APIs, frontends, and services with one command. | 483 | |
23 | free-llm-api-resources | A list of free LLM inference resources accessible via API. | 476 | |
24 | midscene | An AI-powered automation SDK that can control the page, perform assertions, and extract data in JSON format using natural language. | 461 | |
25 | ell | A command-line interface for LLMs written in Bash. | 420 | |
26 | tensorzero | make LLMs improve through experience | 413 | |
27 | lotus | LOTUS: A semantic query engine - process data with LLMs as easily as writing pandas code | 411 | |
28 | ttt-lm-jax | Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States | 362 | |
29 | Stable-Hair | Stable-Hair: Real-World Hair Transfer via Diffusion Model | 360 | |
30 | uni-api | This is a project that unifies the management of LLM APIs. It can call multiple backend services through a unified API interface, convert them to the OpenAI format uniformly, and support load balancing. Currently supported backend services include: OpenAI, Anthropic, DeepBricks, OpenRouter, Gemini, Vertex, etc. | 325 | |
31 | PromptChains | Prompt chains maximize intelligence and results when using LLMs | 319 | |
32 | stable-diffusion-from-scratch | Implementation of Stable Diffusion with PyTorch | 296 | |
33 | FunAudioLLM-APP | | 283 | |
34 | fastagency | The fastest way to bring multi-agent workflows to production. | 261 | |
35 | Awesome-Attention-Heads | An awesome repository & A comprehensive survey on interpretability of LLM attention heads. | 257 | |
36 | llmdocparser | A package for parsing PDFs and analyzing their content using LLMs. | 229 | |
37 | LlamaVoice | LlamaVoice is a llama-based large voice generation model, providing inference and training ability. | 219 | |
38 | TokenPacker | The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM". | 210 | |
39 | claude-artifact-runner | A React-based web app project that enables running Claude AI’s Artifacts either locally or on your own server. | 192 | |
40 | GraphRag.Net | A dotnet implementation modeled on GraphRag and built with Semantic Kernel; it can be integrated into a project out of the box via NuGet. | 182 | |
41 | lmms-finetune | A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v etc. | 170 | |
42 | MambaInLlama | Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models | 169 | |
43 | trustgraph | Connect Data Silos with Reliable AI⚡🚀 | 150 | |
44 | AUITestAgent | AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of fully automating the entire process of GUI interaction and function verification. | 148 | |
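45 | chatwiki | An out-of-the-box LLM-based intelligent customer-service chatbot Q&A system built on an enterprise's private knowledge base. Supports private deployment; the code is free, open source, and available for commercial use. Officially released by 芝麻小客服. | 146 | |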
46 | LLMEPET | [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval | 139 | |
47 | premsql | End-to-End Local-First Text-to-SQL Pipelines | 136 | |
48 | style-reference | List of Stable Diffusion style prompts, optimized for RobMix Zenith. | 129 | |
49 | DenseFusion | DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | 115 | |
50 | marly | The open-source, model-agnostic alternative to OpenAI's structured outputs for your own documents or the web. | 108 | |
51 | ChatGPT-code-preview | Artifacts-like chrome extension for ChatGPT, inspired by Claude 3.5 Sonnet. Requires CSP unblocker for JS to function. | 105 | |
52 | llama_extract | | 105 | |
53 | finetuning | Finetune Llama-3-8b on the MathInstruct dataset | 97 | |
54 | Awesome-LLM-KV-Cache | Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes. | 88 | |
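55 | GPT-Autoagent-Multimodal-Task-Project | This project shows how to use GPT to automatically retrieve files in a repository (such as PDF, XLS, and Word) and complete multimodal tasks. Video frames from a home camera can be fed into the repository so that dangerous situations at home are detected automatically (relying on the large model's understanding of the world). | 85 | |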
56 | minuet-ai.nvim | 💃 Dance with Intelligence in Your Code. Minuet AI integrates with nvim-cmp, offering AI completion from popular LLMs including OpenAI, Gemini, Claude, and more. | 85 | |
57 | LLaVA-MORE | LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1 | 84 | |
58 | eureka-ml-insights | A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. | 82 | |
59 | extensionOS | Imagine a world where everyone can access powerful AI models—LLMs, generative image models, and speech recognition—directly in their web browser. Integrating AI into daily browsing will revolutionise online interactions, offering instant, intelligent assistance tailored to individual needs. | 78 | |
60 | bookmarksAI | GPT automatically organizes your browser bookmarks | 73 | |
61 | LLavaImageTagger | Creates an index of images, queries a local LLM and adds tags to the image metadata | 63 | |
62 | Awesome-OOD-VLM | Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024] | 62 | |
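63 | jd_scripts | jd_scripts, https://jdc.52hym666.top. Keywords: scripts, N100, NAS, awesome, Double 11, deals, docker, automation, nodejs, i茅台 reservations, damai, flash sales, pxq, ticket grabbing, 12306, tools, autojs, panel, qinglong, go, data, PDF, programs, tv, getting started, java, next, blog, halo, site building, API, collection, python, bots, AI, one-click generation, admin, PT, bot, deployment, JD, Synology, check-in, WPS, personal use, tg, Bilibili, ele, beginner, proxy, cloud drive, VPN, lucky, Ele.me, frp, basics, CDN, push, 票星球, TikTok, 纷玩岛, fwd, gpt, Pinduoduo, Xiaohongshu, Meituan, technology, chatgpt, jd, JD.com scripts, nginx, php, rust, JD.com, hosted runs, free | 54 | |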
64 | gLM2 | | 52 | |
65 | duckduckgo-ai-chat | Providing Duckduckgo AI Chat API, which can use gpt-4o-mini for free. | 51 | |
66 | AIEntries | WordPress plugin to automate the creation of quality, WordPress-standard posts using NEWS API, GEMINI AI, and Stable Diffusion API for free | 50 | |
67 | ai-mv-generator | A multi-agent system designed for generating music videos with scrolling subtitles based on lyrics. This project analyzes the lyrics and creates detailed prompts based on the analysis results to generate story-like images, ultimately producing an image-to-image music video, using openai API, gpt-4o and dall-e 3 models. | 50 | |
68 | Llama3-8B_Emotion_Text_Classification_LoRA | Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory. | 49 | |
69 | bookmark-summary | Summaries generated with an LLM and jina reader | 49 | |
70 | Awesome-Cluade-Artifacts | Share your claude artifacts❤️ | 47 | |
71 | ai-shifu | LLM-powered AI guide that leads and drives intelligent conversations | 47 | |
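72 | lobe-chat-pro | Based on lobe-chat, with an added drawing panel; supports midjourney, dall-e-3, flux, suno, luma, runway, and user registration and login. Support for stable-diffusion, account top-ups and spending, and more is planned. | 45 | |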
73 | large-model-proxy | Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources. It listens on a dedicated port for each proxied LM, making them always available to the clients connecting to these ports. | 45 | |
74 | KDPL | [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | 44 | |
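75 | chatgpt-vue3-light-mvp | 💭 A customizable MVP prototype template for a chat bot web front end, built with mainstream technologies such as Vue3, Vite 5, TypeScript, Naive UI, and UnoCSS. 🧤 Simple integration of large-model APIs; uses a single-turn AI Q&A mode in which each question gets an independent response with no context; supports typewriter-style streaming output; integrates markdown-it preview. 💼 Easy to customize for quickly building chat-style LLM products (with example screenshots) | 44 | |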
76 | compose-would-you-rather-game | 📱 Compose Multiplatform, 100% UI shared by Compose, generates contents by Gemini | 43 | |
77 | AUTOSAR-MCAL-Embedded-Upskilling-Bootcamp | AUTOSAR MCAL Embedded Upskilling Bootcamp by Modular MX. | 41 | |
78 | Python-Voice-Assistant-Suryanshsk | A Python-based virtual assistant using Gemini AI. Features include voice recognition, text-to-speech, weather updates, news retrieval, jokes, Wikipedia info, and music management. Comes with an interactive web interface. Easily extendable and customizable. | 39 | |
79 | marqo-llama3_1 | Demo of a RAG Question and Answering System with Llama 3.1 and Marqo | 38 | |
80 | Awesome-MLLM-LLM-Colab | Happy experimenting with MLLM and LLM models! | 38 | |
81 | ros2_nanollm | ROS2 nodes for LLM, VLM, VLA | 37 | |
82 | barq | Dabarqus is a standalone application that implements a complete RAG solution. | 37 | |
83 | ThinkRAG | An LLM RAG system that runs on your laptop. | 32 | |
84 | control-lora-v3 | ControlLoRA Version 3: LoRA Is All You Need to Control the Spatial Information of Stable Diffusion. | 32 | |
85 | CompBench | CompBench evaluates the comparative reasoning of multimodal large language models (MLLMs) with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scenes. | 31 | |
86 | MMInstruct | The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity". The MMInstruct dataset includes 973K instructions from 24 domains and four instruction types. | 31 | |
87 | TheoremLlama | This is the official repository for all the code of TheoremLlama | 30 | |
88 | LiteWebAgent | The Library for LLM-based web-agent applications | 30 | |
89 | SynthVLM | | 29 | |
90 | localwriter | A LibreOffice Writer extension that adds local-inference generative AI features. | 29 | |
91 | LLM-Engines | | 29 | |
92 | Awesome-Multimodal-LLM-for-Math-STEM | Paper collections of multi-modal LLM for Math/STEM/Code. | 27 | |
93 | codestral-mamba-for-vscode | Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot. | 26 | |
94 | hello-llama | A simple chat bot to play with Llama 3.1 | 26 | |
95 | autosub (by dtinth) | Automated generation of subtitles for tech talks in Thai language using Speechmatics, Gemini, GPT-4o and Claude. | 26 | |
96 | PodGPT | PodGPT: A multilingual audio-augmented large language model for research and education | 26 | |
97 | jPaste | jPaste : Avoid LLM content placeholders with one click! | 25 | |
98 | RawRAG | Let's RAG it RAW without fancy frameworks | 25 | |
99 | Knowledge-Graph-for-RAG-using-Neo4j | In this project I designed a knowledge graph focused on Napoleon's history. I built a RAG application using this data and improved the LLM's output using the relationships between nodes. | 25 | |
100 | GPT-Talker | [ACMMM'2024] Generative Expressive Conversational Speech Synthesis | 25 |