TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | browser-use | Make websites accessible for AI agents | 27.7K | |
2 | LightRAG | "LightRAG: Simple and Fast Retrieval-Augmented Generation" | 11.6K | |
3 | open-canvas | 📃 A better UX for chat, writing content, and coding with LLMs. | 3.6K | |
4 | verl | veRL: Volcano Engine Reinforcement Learning for LLM | 2.7K | |
5 | GLM-4-Voice | GLM-4-Voice | 端到端中英语音对话模型 | 2.6K | |
6 | text-extract-api | Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown | 2.1K | |
7 | mini-omni2 | Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。 | 1.6K | |
8 | WritingTools | The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more. | 1.2K | |
9 | llama.vim | Vim plugin for LLM-assisted code/text completion | 1.1K | |
10 | FreeChatGpt-4o-2024 | Use ChatGpt 4o forFree - No API Key Need. also ChatGpt 3 and ChatGpt 3.5 Experience the power of ChatGPT with a user-friendly interface, enhanced jailbreaks, and completely free. | 1.0K | |
11 | open-notebook | An Open Source implementation of Notebook LM with more flexibility and features | 975 | |
12 | LLaMA-O1 | Large Reasoning Models | 801 | |
13 | aoai-realtime-audio-sdk | Azure OpenAI code resources for using gpt-4o-realtime capabilities. | 757 | |
14 | VideoChat | 实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s. | 659 | |
15 | VisRAG | Parsing-free RAG supported by VLMs | 592 | |
16 | localGPT-Vision | Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs | 503 | |
17 | simplemind | Python API client for AI providers that intends to replace LangChain and LangGraph for most common use cases. | 461 | |
18 | pearai-master | VSCode for the new age of AI. | 433 | |
19 | RIME-LMDG | Rime输入法语法模型全流程构建教程,全局带声调词库,最全带读音单字表词典:LMDG - Language, Model, Dictionary, Grammar | 430 | |
20 | CoI-Agent | Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents | 427 | |
21 | ghostwriter | Use the reMarkable2 as an interface to vision-LLMs (ChatGPT, Claude, Gemini). Ghost in the machine! | 417 | |
22 | PodCastLM | PDF 生成中文播客 | 400 | |
23 | SDXL_EcomID_ComfyUI | 380 | ||
24 | promptwright | Generate large synthetic data using an LLM | 377 | |
25 | agent-as-a-judge | 🤠 Agent-as-a-Judge and DevAI dataset | 319 | |
26 | joycaption | JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models. | 309 | |
27 | aider.el | aider emacs plugin for https://github.com/paul-gauthier/aider | 297 | |
28 | nGPT-pytorch | Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI | 271 | |
29 | codai | Codai is an AI code assistant that helps developers through a session-based CLI, providing intelligent code suggestions, refactoring, and code reviews based on the full context of the project. It supports multiple LLM providers, such as OpenAI, DeepSeek, Azure OpenAI, Ollama, Anthropic, and OpenRouter, to streamline daily development tasks. | 246 | |
30 | engy | Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process from idea to working prototype. | 221 | |
31 | mediachain | AI toolkit for making Shorts/Tiktoks | 214 | |
32 | Mirror | An LLM-powered programming-by-example programming language. | 193 | |
33 | MagicPIG | [ICLR2025] MagicPIG: LSH Sampling for Efficient LLM Generation | 184 | |
34 | chat-ui | Chat UI components for LLM apps | 184 | |
35 | llama-swap | transparent proxy server for llama.cpp's server to provide automatic model swapping | 183 | |
36 | MPLSandbox | MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs. | 169 | |
37 | oat | 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. | 167 | |
38 | phantasm | Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time. | 156 | |
39 | PixelLlama | Ollama client written in Python | 155 | |
40 | AgentSquare | The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space"" | 151 | |
41 | llm-jq | Write and execute jq programs with the help of LLM | 149 | |
42 | FreeChatGpt-4o-2024 | 136 | ||
43 | Hands-On-LLM-Fine-Tuning | Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques | 130 | |
44 | VLM2Vec | This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25] | 129 | |
45 | backtrack_sampler | An easy-to-understand framework for LLM samplers that rewind and revise generated tokens | 129 | |
46 | gemini-ai-code-reviewer | A GitHub Action that automatically reviews pull requests using Google's Gemini AI. | 129 | |
47 | Fast-LLM | Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research | 129 | |
48 | meta-prompt | For LLMs to better code with Jina API | 129 | |
49 | LearnGo | LearnGo, a ios versatile learning tool with RAG and Prompting under LLM like GPT-4o-mini, enabling subject Q&A, smart note generation from uploaded materials, offline photo(or AR) object recognition with translations, and multilingual voice input (Cantonese, English, Mandarin). It also features customizable themes for a personalized experience | 119 | |
50 | jarvis | A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life. | 115 | |
51 | Sidekick | A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. | 108 | |
52 | comanda | Execute agentic workflows defined in simple YAML files | 108 | |
53 | ai-analyst | Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts. | 106 | |
54 | StructRAG | StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization | 103 | |
55 | SG-Nav | [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | 101 | |
56 | qapyq | An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and LoRA. | 100 | |
57 | UniversalBackrooms | A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o, and o1-preview | 97 | |
58 | graph-constrained-reasoning | Official Implementation of "Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models". | 96 | |
59 | intelligence | A Ruby gem for seamlessly and uniformly interacting with large language and vision model (LLM) API's served by numerous services, including those of OpenAI, Anthropic, Google and others. | 95 | |
60 | Modality-Integration-Rate | The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate". | 93 | |
61 | Fira | Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | 92 | |
62 | superagentx | Lightweight Multi Agent AI Orchestrator Framework with AGI Capabilities. | 91 | |
63 | FlipAttack | [arXiv 2024] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping". | 89 | |
64 | ai-chat-android | 💬 AI Chat Bot demo app showcasing the integration of Gemini SDK with Firebase Realtime Database for real-time chat functionality. | 89 | |
65 | TimeMarker | A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability | 86 | |
66 | VLM-Grounder | [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | 82 | |
67 | pal | LLM assistants for R | 80 | |
68 | lite_llama | A light llama-like llm inference framework based on the triton kernel. | 79 | |
69 | Grounded-Video-LLM | Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | 79 | |
70 | From_News_to_Forecast | This repository is for the paper entitled: From News to Forecast: Integrating Event Analysis in LLM-based Time Series Forecasting with Reflection (NeurIPS 2024) | 75 | |
71 | chat_with_pdf | Chat with PDF using Llama 3.3 | 75 | |
72 | Quest-Best-Tokens | An introduction to LLM Sampling | 75 | |
73 | SparseVLMs | Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Peking University and UC Berkeley. | 73 | |
74 | STRING | [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" | 69 | |
75 | NotebookLlama-Groq | NotebookLlama powered by Groq - Create podcasts on any topic lightning fast | 67 | |
76 | Build-a-Large-Language-Model-from-Scratch | Building a GPT-like LLM from scratch with PyTorch. | 60 | |
77 | sd-webui-ux | Frontend Engine Extension for Stable Diffusion Web UI and Stable Diffusion Web UI Forge. | 53 | |
78 | Differential-Transformer-PyTorch | PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incorporates a novel Differential Attention mechanism, Multi-Head structure, RMSNorm, and SwiGLU. | 52 | |
79 | podcast-llm | Automatically generate engaging AI podcasts from nothing but an episode title. | 49 | |
80 | ReachQA | Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" | 48 | |
81 | chat2note | This is a tool that can automatically translate the chat log into a note using LLM API | 47 | |
82 | robot-3dlotus | Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." | 47 | |
83 | Graphy-v1 | Graphy v1: A Realtime GraphRAG App using Langchain, Neo4j, GPT-4o, and Streamlit. | 46 | |
84 | sdxl-unbox | 46 | ||
85 | WololoGPT | 46 | ||
86 | gpt-bert | Official implementation of "GPT or BERT: why not both?" | 46 | |
87 | LMforImageGeneration | Codebase for the paper-Elucidating the design space of language models for image generation | 45 | |
88 | AutoGLM | 44 | ||
89 | google-gemini-app-clone | 43 | ||
90 | SubgraphRAG | [ICLR 2025] Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation | 43 | |
91 | SyllableLM | Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models | 42 | |
92 | SAE-based-representation-engineering | [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering | 42 | |
93 | BugGPT | OpenAI o1 advanced reasoning powered vulnerable web page generator for testing and educational purposes | 42 | |
94 | nouv | Free AI & Community powered Learning Experience | 42 | |
95 | Montessori-Instruct | Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] | 41 | |
96 | SeeDo | Human Demo Videos to Robot Action Plans | 41 | |
97 | clientai | A unified client for AI providers with built-in agent support. | 41 | |
98 | vscode-copilot-vision | Exploration into leveraging vision capabilities of an LLM | 40 | |
99 | contextual-doc-retrieval-opneai-reranker | Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with BM25 for accurate document retrieval. It parses PDFs, chunks content contextually, and enhances search precision with AI-powered contextual understanding and re-ranking. | 39 | |
100 | vlm-knowledge-conflict | Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." | 39 |