TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | LightRAG | "LightRAG: Simple and Fast Retrieval-Augmented Generation" | 8.9K | |
2 | open-canvas | 📃 A better UX for chat, writing content, and coding with LLMs. | 2.6K | |
3 | GLM-4-Voice | GLM-4-Voice | 端到端中英语音对话模型 | 2.3K | |
4 | mini-omni2 | Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。 | 1.6K | |
5 | pdf-extract-api | Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown | 1.3K | |
6 | FreeChatGpt-4o-2024 | Use ChatGpt 4o forFree - No API Key Need. also ChatGpt 3 and ChatGpt 3.5 Experience the power of ChatGPT with a user-friendly interface, enhanced jailbreaks, and completely free. | 1.0K | |
7 | aoai-realtime-audio-sdk | Azure OpenAI code resources for using gpt-4o-realtime capabilities. | 640 | |
8 | LLaMA-O1 | Large Reasoning Models | 585 | |
9 | WritingTools | The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works with the free Gemini API, local LLMs, and other cloud providers. | 506 | |
10 | VideoChat | 实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s. | 423 | |
11 | localGPT-Vision | Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs | 416 | |
12 | VisRAG | Parsing-free RAG supported by VLMs | 399 | |
13 | CoI-Agent | Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents | 365 | |
14 | PodCastLM | PDF 生成中文播客 | 357 | |
15 | SDXL_EcomID_ComfyUI | 298 | ||
16 | simplemind | Python API client for AI providers that intends to replace LangChain and LangGraph for most common use cases. | 292 | |
17 | pearai-master | PearAI acts as an inventory that curates the leading, cutting-edge AI tools in one place. We build a centralized user interface for these solutions, allowing you to effortlessly use the best AI tools without having to waste effort looking for alternatives. This repository lists all the other repos that make up the entire PearAI project. | 282 | |
18 | engy | Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process from idea to working prototype. | 256 | |
19 | nGPT-pytorch | Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI | 256 | |
20 | promptwright | Generate large synthetic data using a local LLM | 211 | |
21 | agent-as-a-judge | 🤠 Agent-as-a-Judge and DevAI dataset | 194 | |
22 | codai | Codai is an AI code assistant that helps developers through a session-based CLI, providing intelligent code suggestions, refactoring, and code reviews based on the full context of the project. It supports multiple LLMs, including GPT-4o, GPT-4, and Ollama, to streamline daily development tasks. | 149 | |
23 | FreeChatGpt-4o-2024 | 136 | ||
24 | joycaption | JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models. | 135 | |
25 | AgentSquare | The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space"" | 129 | |
26 | phantasm | Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time. | 128 | |
27 | aider.el | aider emacs plugin for https://github.com/paul-gauthier/aider | 126 | |
28 | backtrack_sampler | An easy-to-understand framework for LLM samplers that rewind and revise generated tokens | 113 | |
29 | llm-jq | Write and execute jq programs with the help of LLM | 112 | |
30 | meta-prompt | For LLMs to better code with Jina API | 108 | |
31 | jarvis | A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life. | 105 | |
32 | chat-ui | Chat UI components for LLM apps | 101 | |
33 | llama.vim | Vim plugin for LLM-assisted code/text completion | 96 | |
34 | open-notebook | An Open Source implementation of Notebook LM with more flexibility and features | 95 | |
35 | TurboReel_studio | Text to Shorts/Tiktoks, AI Video Engine | 94 | |
36 | gemini-ai-code-reviewer | A GitHub Action that automatically reviews pull requests using Google's Gemini AI. | 92 | |
37 | FlipAttack | [arXiv 2024] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping". | 90 | |
38 | Hands-On-LLM-Fine-Tuning | Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques | 87 | |
39 | Fira | Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | 84 | |
40 | LearnGo | LearnGo, a ios versatile learning tool with RAG and Prompting under LLM like GPT-4o-mini, enabling subject Q&A, smart note generation from uploaded materials, offline photo(or AR) object recognition with translations, and multilingual voice input (Cantonese, English, Mandarin). It also features customizable themes for a personalized experience | 84 | |
41 | ai-chat-android | 💬 AI Chat Bot demo app showcasing the integration of Gemini SDK with Firebase Realtime Database for real-time chat functionality. | 81 | |
42 | Modality-Integration-Rate | The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate". | 80 | |
43 | UniversalBackrooms | A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o, and o1-preview | 72 | |
44 | StructRAG | StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization | 71 | |
45 | VLM2Vec | This repo contains the code and data for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" | 70 | |
46 | VLM-Grounder | [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | 67 | |
47 | PixelLlama | Ollama client written in Python | 65 | |
48 | intelligence | A Ruby gem for seamlessly and uniformly interacting with large language and vision model (LLM) API's served by numerous services, including those of OpenAI, Anthropic, Google and others. | 64 | |
49 | STRING | Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" | 64 | |
50 | Quest-Best-Tokens | An introduction to LLM Sampling | 64 | |
51 | chat2note | This is a tool that can automatically translate the chat log into a note using LLM API | 63 | |
52 | Grounded-Video-LLM | Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | 63 | |
53 | graph-constrained-reasoning | Official Implementation of "Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models". | 58 | |
54 | MagicPIG | MagicPIG: LSH Sampling for Efficient LLM Generation | 58 | |
55 | SparseVLMs | Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Peking University and UC Berkeley. | 55 | |
56 | SG-Nav | [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | 55 | |
57 | NotebookLlama-Groq | NotebookLlama powered by Groq - Create podcasts on any topic lightning fast | 55 | |
58 | WololoGPT | 44 | ||
59 | ai-analyst | Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts. | 42 | |
60 | llama-swap | HTTP proxy for on-demand model loading with llama.cpp (or other OpenAI compatible backends) | 41 | |
61 | BugGPT | OpenAI o1 advanced reasoning powered vulnerable web page generator for testing and educational purposes | 41 | |
62 | Differential-Transformer-PyTorch | PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incorporates a novel Differential Attention mechanism, Multi-Head structure, RMSNorm, and SwiGLU. | 41 | |
63 | nouv | Free AI & Community powered Learning Experience | 39 | |
64 | From_News_to_Forecast | This repository is for the paper entitled: From News to Forecast: Integrating Event Analysis in LLM-based Time Series Forecasting with Reflection (NeurIPS 2024) | 38 | |
65 | docspedia | Chat with your pdf using your local LLM, OLLAMA client. | 37 | |
66 | google-gemini-app-clone | 37 | ||
67 | Fast-LLM | Accelerating your LLM training to full speed | 36 | |
68 | contextual-doc-retrieval-opneai-reranker | Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with BM25 for accurate document retrieval. It parses PDFs, chunks content contextually, and enhances search precision with AI-powered contextual understanding and re-ranking. | 35 | |
69 | SyllableLM | Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models | 35 | |
70 | gpt-bert | Official implementation of "GPT or BERT: why not both?" | 35 | |
71 | chat_with_pdf | Chat with PDF using Llama 3.2 | 34 | |
72 | Sidekick | A native macOS app that allows users to chat with a local LLM with context of your files, folders and websites on your Mac without installing any other software. | 33 | |
73 | TimeMarker | A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability | 33 | |
74 | vlm-knowledge-conflict | Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." | 32 | |
75 | Montessori-Instruct | Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning | 32 | |
76 | sd-webui-ux | Frontend Engine Extension for Stable Diffusion Web UI and Stable Diffusion Web UI Forge. | 32 | |
77 | sdxl-unbox | 32 | ||
78 | oat | 🌾 OAT: Online AlignmenT for LLMs | 32 | |
79 | SeeDo | Human Demo Videos to Robot Action Plans | 31 | |
80 | LMforImageGeneration | Codebase for the paper-Elucidating the design space of language models for image generation | 31 | |
81 | Graphy-v1 | Graphy v1: A Realtime GraphRAG App using Langchain, Neo4j, GPT-4o, and Streamlit. | 30 | |
82 | Screenshot_LLM | Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interface and integrating with various AI models (including Ollama), it provides insightful information directly from images. | 29 | |
83 | ReachQA | Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" | 29 | |
84 | LongPrompt-LLamaGen | This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompts. And it's also powered by additional prompt refining features for improved performance. | 27 | |
85 | SAE-based-representation-engineering | Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering | 26 | |
86 | AutoGLM | 26 | ||
87 | Mirror | An LLM-powered programming-by-example programming language. | 26 | |
88 | comanda | Execute chains of LLM models and actions | 25 | |
89 | ai-generated-fake-podcasts | A growing list of fake podcasts generated by Notebook LM | 24 | |
90 | groundLMM | Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision | 24 | |
91 | webcam-audio-description-ai | Capture webcam via browser, generate text descriptions with Google Gemini, generate speech with ElevenLabs | 24 | |
92 | LeReT | Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval | 24 | |
93 | VertexAI-KMP-Sample | Compose Multiplatform sample that uses the Firebase Vertex AI SDKs | 22 | |
94 | duckdb-extension-openprompt | DuckDB Community Extension to prompt LLMs from SQL | 22 | |
95 | splicing | Splicing: Gen-AI Copilot for Data Engineering | 21 | |
96 | clientai | A unified client for seamless interaction with multiple AI providers. | 21 | |
97 | MPLSandbox | MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs. | 21 | |
98 | vscode-copilot-vision | Exploration into leveraging vision capabilities of an LLM | 21 | |
99 | files-to-claude-xml | Use XML tags for long context prompting using Claude's multi-document structure. | 20 | |
100 | FailureLLMUnlearning | 20 |