TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | LightRAG | "LightRAG: Simple and Fast Retrieval-Augmented Generation" | 9.5K | |
2 | open-canvas | 📃 A better UX for chat, writing content, and coding with LLMs. | 2.7K | |
3 | GLM-4-Voice | GLM-4-Voice | 端到端中英语音对话模型 | 2.3K | |
4 | mini-omni2 | Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。 | 1.6K | |
5 | pdf-extract-api | Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown | 1.4K | |
6 | FreeChatGpt-4o-2024 | Use ChatGpt 4o forFree - No API Key Need. also ChatGpt 3 and ChatGpt 3.5 Experience the power of ChatGPT with a user-friendly interface, enhanced jailbreaks, and completely free. | 1.0K | |
7 | aoai-realtime-audio-sdk | Azure OpenAI code resources for using gpt-4o-realtime capabilities. | 652 | |
8 | LLaMA-O1 | Large Reasoning Models | 626 | |
9 | WritingTools | The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works with the free Gemini API, local LLMs, and other cloud providers. | 570 | |
10 | VideoChat | 实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s. | 443 | |
11 | localGPT-Vision | Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs | 431 | |
12 | VisRAG | Parsing-free RAG supported by VLMs | 421 | |
13 | CoI-Agent | Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents | 371 | |
14 | PodCastLM | PDF 生成中文播客 | 362 | |
15 | SDXL_EcomID_ComfyUI | 304 | ||
16 | simplemind | Python API client for AI providers that intends to replace LangChain and LangGraph for most common use cases. | 304 | |
17 | pearai-master | PearAI acts as an inventory that curates the leading, cutting-edge AI tools in one place. We build a centralized user interface for these solutions, allowing you to effortlessly use the best AI tools without having to waste effort looking for alternatives. This repository lists all the other repos that make up the entire PearAI project. | 293 | |
18 | nGPT-pytorch | Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI | 259 | |
19 | engy | Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process from idea to working prototype. | 257 | |
20 | promptwright | Generate large synthetic data using an LLM | 214 | |
21 | agent-as-a-judge | 🤠 Agent-as-a-Judge and DevAI dataset | 204 | |
22 | codai | Codai is an AI code assistant that helps developers through a session-based CLI, providing intelligent code suggestions, refactoring, and code reviews based on the full context of the project. It supports multiple LLMs, including GPT-4o, GPT-4, and Ollama, to streamline daily development tasks. | 158 | |
23 | Mirror | An LLM-powered programming-by-example programming language. | 150 | |
24 | PixelLlama | Ollama client written in Python | 146 | |
25 | joycaption | JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models. | 144 | |
26 | FreeChatGpt-4o-2024 | 136 | ||
27 | phantasm | Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time. | 134 | |
28 | AgentSquare | The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space"" | 132 | |
29 | aider.el | aider emacs plugin for https://github.com/paul-gauthier/aider | 128 | |
30 | open-notebook | An Open Source implementation of Notebook LM with more flexibility and features | 127 | |
31 | backtrack_sampler | An easy-to-understand framework for LLM samplers that rewind and revise generated tokens | 113 | |
32 | llm-jq | Write and execute jq programs with the help of LLM | 112 | |
33 | meta-prompt | For LLMs to better code with Jina API | 111 | |
34 | jarvis | A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life. | 108 | |
35 | chat-ui | Chat UI components for LLM apps | 108 | |
36 | MPLSandbox | MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs. | 106 | |
37 | mediachain | AI toolkit for making Shorts/Tiktoks | 101 | |
38 | llama.vim | Vim plugin for LLM-assisted code/text completion | 97 | |
39 | gemini-ai-code-reviewer | A GitHub Action that automatically reviews pull requests using Google's Gemini AI. | 95 | |
40 | FlipAttack | [arXiv 2024] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping". | 92 | |
41 | Hands-On-LLM-Fine-Tuning | Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques | 88 | |
42 | LearnGo | LearnGo, a ios versatile learning tool with RAG and Prompting under LLM like GPT-4o-mini, enabling subject Q&A, smart note generation from uploaded materials, offline photo(or AR) object recognition with translations, and multilingual voice input (Cantonese, English, Mandarin). It also features customizable themes for a personalized experience | 87 | |
43 | Modality-Integration-Rate | The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate". | 85 | |
44 | Fira | Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | 84 | |
45 | ai-chat-android | 💬 AI Chat Bot demo app showcasing the integration of Gemini SDK with Firebase Realtime Database for real-time chat functionality. | 83 | |
46 | VLM2Vec | This repo contains the code and data for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" | 78 | |
47 | UniversalBackrooms | A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o, and o1-preview | 73 | |
48 | StructRAG | StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization | 72 | |
49 | intelligence | A Ruby gem for seamlessly and uniformly interacting with large language and vision model (LLM) API's served by numerous services, including those of OpenAI, Anthropic, Google and others. | 69 | |
50 | VLM-Grounder | [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | 69 | |
51 | Quest-Best-Tokens | An introduction to LLM Sampling | 66 | |
52 | STRING | Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" | 64 | |
53 | Grounded-Video-LLM | Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | 64 | |
54 | chat2note | This is a tool that can automatically translate the chat log into a note using LLM API | 63 | |
55 | graph-constrained-reasoning | Official Implementation of "Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models". | 61 | |
56 | MagicPIG | MagicPIG: LSH Sampling for Efficient LLM Generation | 59 | |
57 | SG-Nav | [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | 58 | |
58 | NotebookLlama-Groq | NotebookLlama powered by Groq - Create podcasts on any topic lightning fast | 58 | |
59 | SparseVLMs | Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Peking University and UC Berkeley. | 56 | |
60 | ai-analyst | Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts. | 46 | |
61 | WololoGPT | 46 | ||
62 | Differential-Transformer-PyTorch | PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incorporates a novel Differential Attention mechanism, Multi-Head structure, RMSNorm, and SwiGLU. | 45 | |
63 | From_News_to_Forecast | This repository is for the paper entitled: From News to Forecast: Integrating Event Analysis in LLM-based Time Series Forecasting with Reflection (NeurIPS 2024) | 43 | |
64 | BugGPT | OpenAI o1 advanced reasoning powered vulnerable web page generator for testing and educational purposes | 42 | |
65 | llama-swap | HTTP proxy for on-demand model loading with llama.cpp (or other OpenAI compatible backends) | 41 | |
66 | nouv | Free AI & Community powered Learning Experience | 39 | |
67 | Fast-LLM | Accelerating your LLM training to full speed | 38 | |
68 | TimeMarker | A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability | 38 | |
69 | docspedia | Chat with your pdf using your local LLM, OLLAMA client. | 37 | |
70 | google-gemini-app-clone | 37 | ||
71 | gpt-bert | Official implementation of "GPT or BERT: why not both?" | 37 | |
72 | contextual-doc-retrieval-opneai-reranker | Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with BM25 for accurate document retrieval. It parses PDFs, chunks content contextually, and enhances search precision with AI-powered contextual understanding and re-ranking. | 36 | |
73 | SyllableLM | Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models | 35 | |
74 | chat_with_pdf | Chat with PDF using Llama 3.2 | 35 | |
75 | sdxl-unbox | 35 | ||
76 | vlm-knowledge-conflict | Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." | 34 | |
77 | sd-webui-ux | Frontend Engine Extension for Stable Diffusion Web UI and Stable Diffusion Web UI Forge. | 34 | |
78 | Sidekick | A native macOS app that allows users to chat with a local LLM with context of your files, folders and websites on your Mac without installing any other software. | 33 | |
79 | oat | 🌾 OAT: Online AlignmenT for LLMs | 33 | |
80 | Graphy-v1 | Graphy v1: A Realtime GraphRAG App using Langchain, Neo4j, GPT-4o, and Streamlit. | 32 | |
81 | Montessori-Instruct | Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning | 32 | |
82 | ReachQA | Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" | 32 | |
83 | SeeDo | Human Demo Videos to Robot Action Plans | 31 | |
84 | LMforImageGeneration | Codebase for the paper-Elucidating the design space of language models for image generation | 31 | |
85 | Screenshot_LLM | Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interface and integrating with various AI models (including Ollama), it provides insightful information directly from images. | 29 | |
86 | AutoGLM | 29 | ||
87 | SAE-based-representation-engineering | Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering | 28 | |
88 | comanda | Execute agentic workflows defined in simple YAML files | 27 | |
89 | LongPrompt-LLamaGen | This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompts. And it's also powered by additional prompt refining features for improved performance. | 27 | |
90 | lite_llama | The llama model inference lite framework by tirton. | 27 | |
91 | webcam-audio-description-ai | Capture webcam via browser, generate text descriptions with Google Gemini, generate speech with ElevenLabs | 26 | |
92 | ai-generated-fake-podcasts | A growing list of fake podcasts generated by Notebook LM | 25 | |
93 | LeReT | Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval | 25 | |
94 | groundLMM | Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision | 24 | |
95 | LLM-from-scratch | 23 | ||
96 | duckdb-extension-openprompt | DuckDB Community Extension to prompt LLMs from SQL | 23 | |
97 | splicing | Splicing: Gen-AI Copilot for Data Engineering | 22 | |
98 | FailureLLMUnlearning | 22 | ||
99 | VertexAI-KMP-Sample | Compose Multiplatform sample that uses the Firebase Vertex AI SDKs | 22 | |
100 | pal | LLM assistants for R | 22 |