TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Projects and developers that are thriving yet have not been updated for a long time.
Rankings | Organization Account | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | LightRAG | "LightRAG: Simple and Fast Retrieval-Augmented Generation" | 12.4K | |
2 | deep-searcher | Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. | 5.9K | |
3 | pyspur | AI Agent Builder in Python | 3.1K | |
4 | Sidekick | A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp. | 2.8K | |
5 | nano-graphrag | A simple, easy-to-hack GraphRAG implementation | 2.5K | |
6 | lmnr | Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Labels. YC S24. | 2.0K | |
7 | sage | Chat with any codebase in under two minutes | Fully local or via third-party APIs | 1.2K | |
8 | dynamiq | Dynamiq is an orchestration framework for agentic AI and LLM applications | 715 | |
9 | agentica | TypeScript AI Framework specialized AI Function Calling enhanced by compiler skills. | 636 | |
10 | VisRAG | Parsing-free RAG supported by VLMs | 611 | |
11 | PocketFlow | Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves. | 586 | |
12 | cloudflare-rag | Fullstack "Chat with your PDFs" RAG (Retrieval Augmented Generation) app built fully on Cloudflare | 516 | |
13 | sanic-web | 一个轻量级、支持全链路且易于二次开发的大模型应用项目(Large Model Data Assistant) 支持DeepSeek/Qwen2.5等大模型 基于 Dify 、Ollama&Vllm、Sanic 和 Text2SQL 📊 等技术构建的一站式大模型应用开发项目,采用 Vue3、TypeScript 和 Vite 5 打造现代UI。它支持通过 ECharts 📈 实现基于大模型的数据图形化问答,具备处理 CSV 文件 📂 表格问答的能力。同时,能方便对接第三方开源 RAG 系统 检索系统 🌐等,以支持广泛的通用知识问答。 | 468 | |
14 | chipper | ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) | 437 | |
15 | aisearch-openai-rag-audio | A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model. | 377 | |
16 | graphrag-visualizer | A web-based tool for visualizing and exploring artifacts from Microsoft's GraphRAG. | 281 | |
17 | premsql | End-to-End Local-First Text-to-SQL Pipelines | 281 | |
18 | llmdocparser | A package for parsing PDFs and analyzing their content using LLMs. | 256 | |
19 | airweave | Turn any app into agent knowledge | 226 | |
20 | flock | Flock is a workflow-based low-code platform for rapidly building chatbots, RAG, and coordinating multi-agent teams.(Flock 是一个基于workflow工作流的低代码平台,用于快速构建聊天机器人、RAG、Agent和Muti-Agent应用。) | 204 | |
21 | GraphRag.Net | 参考GraphRag使用 Semantic Kernel 来实现的dotnet版本,可以使用NuGet开箱即用集成到项目中 | 203 | |
22 | VLM2Vec | This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25] | 152 | |
23 | confabulations | Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers. | 142 | |
24 | ThinkRAG | A LLM RAG system runs on your laptop. 大模型检索增强生成系统,可以轻松部署在笔记本电脑上,实现本地知识库智能问答。 | 140 | |
25 | NanoSage | Local LLM Powered Recursive Search & Smart Knowledge Explorer | 96 | |
26 | bookmarksAI | GPT automatically organizes your browser bookmarks | 87 | |
27 | asktube | AskTube - An AI-powered YouTube video summarizer and QA assistant powered by Retrieval Augmented Generation (RAG) 🤖. Run it entirely on your local machine with Ollama, or cloud-based models like Claude, OpenAI, Gemini, Mistral, and more. | 78 | |
28 | SubgraphRAG | [ICLR 2025] Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation | 51 | |
29 | gfm-rag | Graph Foundation Model for Retrieval Augmented Generation | 46 | |
30 | ai-agent-flight-scanner | AI agent to search Google Flights data | 44 | |
31 | deploy-langfuse-on-ecs-with-fargate | Self-hosting Langfuse on Amazon ECS with Fargate using CDK Python | 34 | |
32 | simple-rag | Too many docs? Quickly search over any PDF or Markdown documents | 30 | |
33 | advanced-rag | Jupyter Notebooks for Mastering LLM with Advanced RAG Course | 28 | |
34 | ragChatbot | This project is a dynamic AI Agent chatbot that can be trained from various sources, such as PDFs, documents, websites, and YouTube videos. | 27 | |
35 | pa | A Personal Assistant leveraging Retrieval-Augmented Generation (RAG) and the LLaMA-3.1-8B-Instant Large Language Model (LLM). This tool is designed to revolutionize PDF document analysis tasks by combining machine learning with retrieval-based systems. | 27 | |
36 | ragbits | Building blocks for rapid development of GenAI applications | 25 | |
37 | AI-Sales-agent | Sales AI agent that talks with your customers, recommend products, book consultations, and process Stripe payments | 23 | |
38 | Public_QiDiHui | QiDiHui: RAG, appbuilder, ErnieBot, multi-model, 十万个为什么 | 21 | |
39 | PixlieAI | graph + ai in your products; reduce costs and get correct answers from your data | 18 | |
40 | Awesome-RAG | An up-to-date curated list of Retrieval-Augmented Generation (RAG) for Large Language Models (LLMs). | 17 | |
41 | needle-python | Needle simplifies building RAG pipelines. | 17 | |
42 | coffee-chat-voice-assistant | Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API, simulating the experience of ordering coffee with a café barista. It supports natural conversations, live order updates, and real-time transcription, showcasing the power of AI for seamless customer interactions. | 17 | |
43 | video-search-and-summarization | Blueprint for Ingesting massive volumes of live or archived videos and extract insights for summarization and interactive Q&A | 17 | |
44 | nuclia-eval | Library for evaluating RAG using Nuclia's models | 16 | |
45 | Chatchat-Lite | 从零开始基于 LangGraph 和 Streamlit 实现基于本地模型的 RAG、Agent 应用 | 13 | |
46 | minRAG | minRAG is a RAG system that starts from scratch, pursuing the ultimate simplicity and power. It consists of no more than 10,000 lines of code, requires no installation, and can be launched with a double-click | 13 | |
47 | DSPy-Multi-Hop-Chain-of-Thought-RAG | Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) using DSPy and Indexify. Enhance complex problem-solving with multi-step reasoning and external knowledge integration. Perfect for AI enthusiasts and researchers. | 12 | |
48 | SpeakingAI | SpeakingAI is a demo of privately deployable 'GPT-4o like AI + RAG', a fully functional web AI server with audio query/answer in streaming, using LLM and RAG for backend knowledge. | 12 | |
49 | arXivRAG | A comprehensive tool designed to enhance the retrieval and generation of academic content from the arXiv database, leveraging advanced Retrieval-Augmented Generation (RAG) techniques. | 11 | |
50 | lecca-io | Lecca.io | AI Agents & Automations | 11 | |
51 | notebooks | GenAI Playground | 10 | |
52 | rag-chatbot | An open-source template for building a RAG chatbot for your company 🤖 | 10 | |
53 | remembear | An app for people with short-term-memory 🧠 | 9 | |
54 | RAGify | Chat with your documents using Generative AI & Retrieval-Augmented Generation (RAG) | 8 | |
55 | WovenSnips | WovenSnips: A Lightweight, Free, and Open-source Implementation of Retrieval-Augmented Generation (RAG) using Straico API | 7 | |
56 | vlm-api | REST API for computing cross-modal similarity between images and text using the ColPaLI vision-language model | 7 | |
57 | ACLReady | ACLReady, a retrieval-augmented language model application that can be used to empower authors to reflect on their work and assist authors with the ACL checklist. | 6 | |
58 | check | Automated fact-check | 6 | |
59 | Agentic-RAG-using-Crew-AI | Agentic RAG using Crew AI | 6 | |
60 | RAGLink | 一个开源的RAG框架,旨在为用户提供了一个强大、灵活且简单的开发环境。 | 5 | |
61 | Graph_RAG | A Flask app running GraphRAG for healthcare, made with Vertex AI and Neo4j, to be deployed in a container (Cloud Run or ECS). | 5 | |
62 | pikobrain | Function-calling API for LLM from multiple providers | 5 | |
63 | Azure-AI-Search-Vector-Store-LangChain-RAG-Pattern-with-Jira | This project combines Azure AI Search, Azure OpenAI Service, LangChain, React.JS, and Python FastAPI to create an intelligent system for managing Jira issues. It features advanced AI search for seamless document retrieval, a user-friendly React.JS front-end, and a robust Python FastAPI back-end. | 5 | |
64 | AgentBlueprint | An Data-Orientated Structure Framework for Next Generation AI Application Development | 5 | |
65 | RAG_in_CPU | This repo is for advanced RAG systems, each branch will represent a project based on RAG. | 5 | |
66 | Llama_RAG_System | Llama_RAG_System is a local Retrieval-Augmented Generation (RAG) system that leverages the LLaMA model to provide intelligent answers to user queries by processing uploaded PDFs and fetching relevant web information while ensuring privacy. | 5 | |
67 | nextjs-langchain-gemini-rag-pdf-chatbot | Next.js LangChain Gemini RAG Chatbot: An advanced Retrieval-Augmented Generation (RAG) chatbot that enables users to upload multiple PDFs and receive precise answers to their questions based on the content. Powered by LangChain and Gemini, this chatbot is built with Next.js and offers efficient PDF-based query handling for enhanced user engagement. | 5 | |
68 | RAG-Ollama-Chat-with-PDF | This application allows users to upload PDF files, process them, and ask questions about the content using a locally hosted language model. The system uses Retrieval-Augmented Generation (RAG) to provide accurate answers based on the uploaded PDFs. | 5 | |
69 | graphrag_webui | A web interface for GraphRAG. 🚀 | 5 | |
70 | chatbot | AI chatbot with multiple trending LLMs and a RAG option to chat with your PDF documents | 5 | |
71 | ragtitles | Optimize Subtitles for RAG Ingestion | 4 | |
72 | AI-Companion-Builder | AI-Companion is a cool software that lets you create your own custom AI models of people you admire, like actors or celebrities. It's a tool to make personalized artificial intelligence companions based on your favorite individuals. | 4 | |
73 | awesome-rag | A curated list of awesome RAG. | 4 | |
74 | fhb-assistant | A RAG based LLM assistant for australian first home buyers | 4 | |
75 | Multimodal-VideoRAG | Multimodal-VideoRAG: Using BridgeTower Embeddings and Large Vision Language Models | 4 | |
76 | LLMOps | This project(RAG) focuses on operationalizing LLMs by integrating OpenAI, MLflow, FastAPI, and RAGAS for evaluation. It allows users to deploy and manage LLMs, track model runs, and log evaluation metrics in MLflow. The project also features MLflow traces that logs all the user inputs ,responses ,retrieved contexts ,and other essential metrices. | 4 | |
77 | dbt_unified_rag | Fivetran dbt package designed to generate an end model and Cortex Search Service (for Snowflake destinations only) which contains unstructured document data to be used for Retrieval Augmented Generation (RAG) applications leveraging Large Language Models (LLMs) | 4 | |
78 | Chat-with-PDF-Locally | Chat with PDF locally: An advanced chatbot using Ollama/Openrouter LLMs to interactively extract information from PDFs, Using Streamlit & Ollama/Openrouter API and langchain | 4 | |
79 | ragit | A RAG back and front end application | 4 | |
80 | pgvector_pgsql_windows | pgvector extension, binary compiled in Microsoft Windows with PostgreSQL | 3 | |
81 | RAG-using-Llama-3.1-WebUi-on-Streamlit | Run Llama on serverless with multimodal RAG, with focus on reading files (But with limited token ofc) | 3 | |
82 | AI-Customer-Support | an AI-powered customer support chatbot using Next.js and the OpenAI API. | 3 | |
83 | Hello.AI.World | Just some initial learning for usage of AI models within .NET platform with Semantic Kernel APIs | 3 | |
84 | jchunk | JChunk is a lightweight and flexible library designed to provide multiple strategies for text chunking within Spring Boot applications | 3 | |
85 | rag_engine | Python package for implementing Retrieval-Augmented Generation (RAG) using OpenAI's embeddings and a SQLite database with vector search capabilities | 3 | |
86 | genai | Collection of GenAI resources & tutorials ... | 3 | |
87 | rag-chatbot-app-with-fastapi | RAG chatbot applilcation with fastapi and docker | 3 | |
88 | doc-talk | Chat with your documents. RAG implementation leveraging OpenAI's GPT-4o and Text-Embedding-3-Small models. | 3 | |
89 | Langchain-Chatchat | 基于 ChatGLM 等大语言模型与 Langchain 等应用框架实现,开源、可离线部署的 RAG 与 Agent 应用项目 | 3 | |
90 | molina | A Rust and Python Synthetic Integration for an agentic-LLM approach to build a research agent for local knowledge representation. | 3 | |
91 | PartSelect-LLM-Assistant | GPT-4o Agent - PartSelect Website - Information on 2 Million Parts | 3 | |
92 | hexabot-plugin-gemini | The Google Gemini Plugin for Hexabot Chatbot / Agent Builder to enable the LLM RAG Capability | 3 | |
93 | IntelliAnswer | A RAG-based question-answering system that processes user queries using local documents. It extracts relevant information to answer questions, falling back to a large language model when local sources are insufficient, ensuring accurate and contextual responses. | 3 | |
94 | Multi-RAG-File-System | Multi RAG File System using Groq, Huggingface, Llama Index, Langchain | 3 | |
95 | reliable-agentic-rag | Project exploring RAG LLM agents | 3 | |
96 | Chat-with-PDF | A RAG Application powered by Llama 3.2b and HuggingFace Embeddings | 3 | |
97 | DocGPT | DocGPT (Doctor GPT) is an advanced medical diagnosis system that combines Vision Transformer (ViT) based deep learning models with LangChain agents to provide comprehensive medical image analysis and detailed diagnostic reports. The system leverages the power of PyTorch for deep learning and Groq's LLM for generating human-like medical insights. | 3 | |
98 | RAG-using-DeepSeek-R1 | This repository highlights my learning journey in building Retrieval-Augmented Generation (RAG) pipelines using DeepSeek on Lightning AI, covering document ingestion, retrieval, and integration with generative AI. It showcases fine-tuning, evaluation, and optimization for accurate open-domain QA and knowledge management. | 3 | |
99 | docsifer | Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance. | 3 | |
100 | llamaindex-docs-agent | A simple ReAct agent that has access to LlamaIndex docs and to the internet to provide you with insights on LlamaIndex itself. | 3 |