Top AI developers by monthly star count
Top AI organization accounts by AI repo star count
Top AI projects by category star count
Fastest-growing list, ranked by the rate of gaining stars
Top list of little-known developers who create influential repos
Rank | Project | Project intro | Star count |
---|---|---|---|
1 | browser-use | Make websites accessible for AI agents | 27.7K | |
2 | open-canvas | 📃 A better UX for chat, writing content, and coding with LLMs. | 3.6K | |
3 | verl | veRL: Volcano Engine Reinforcement Learning for LLM | 2.7K | |
4 | GLM-4-Voice | GLM-4-Voice: an end-to-end Chinese-English spoken dialogue model | 2.6K | |
5 | text-extract-api | Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown | 2.1K | |
6 | mini-omni2 | Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities. | 1.6K | |
7 | preswald | 🐵 Preswald is a full-stack platform for building, deploying, and managing interactive data applications. It brings ingestion, storage, transformation, and visualization into a simple SDK, minimizing complexity while maintaining flexibility for both prototyping and production-grade use cases. | 1.2K | |
8 | llama.vim | Vim plugin for LLM-assisted code/text completion | 1.1K | |
9 | LLaMA-Mesh | Unifying 3D Mesh Generation with Language Models | 910 | |
10 | LLaMA-O1 | Large Reasoning Models | 801 | |
11 | VisRAG | Parsing-free RAG supported by VLMs | 592 | |
12 | pearai-master | VSCode for the new age of AI. | 433 | |
13 | CoI-Agent | Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents | 427 | |
14 | SDXL_EcomID_ComfyUI | | 380 | |
15 | promptwright | Generate large synthetic data using an LLM | 377 | |
16 | ZhiLight | A highly optimized inference acceleration engine for Llama and its variants. | 363 | |
17 | clickclickclick | A framework to enable autonomous android and computer use using any LLM (local or remote) | 349 | |
18 | codegate | CodeGate: CodeGen Privacy and Security | 326 | |
19 | agent-as-a-judge | 🤠 Agent-as-a-Judge and DevAI dataset | 319 | |
20 | airweave | Turn any app into agent knowledge | 226 | |
21 | mediachain | AI toolkit for making Shorts/Tiktoks | 214 | |
22 | notte | The agentic internet | 189 | |
23 | chat-ui | Chat UI components for LLM apps | 184 | |
24 | MagicPIG | [ICLR2025] MagicPIG: LSH Sampling for Efficient LLM Generation | 184 | |
25 | fabrice-ai | A lightweight, functional, and composable framework for building AI agents. No PhD required. | 181 | |
26 | oat | 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. | 167 | |
27 | phantasm | Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time. | 156 | |
28 | AgentSquare | The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space" | 151 | |
29 | llm4ad | LLM4AD: A Platform for Algorithm Design with Large Language Model | 131 | |
30 | meta-prompt | For LLMs to better code with Jina API | 129 | |
31 | GLM-Edge | GLM Series Edge Models | 129 | |
32 | VLM2Vec | This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25] | 129 | |
33 | Fast-LLM | Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research | 129 | |
34 | BALROG | Benchmarking Agentic LLM and VLM Reasoning On Games | 114 | |
35 | FreeScale | Code for FreeScale, a tuning-free method for higher-resolution visual generation | 114 | |
36 | ai-analyst | Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts. | 106 | |
37 | intelligence | A Ruby gem for seamlessly and uniformly interacting with large language and vision model (LLM) APIs served by numerous services, including those of OpenAI, Anthropic, Google and others. | 95 | |
38 | superagentx | Lightweight Multi Agent AI Orchestrator Framework with AGI Capabilities. | 91 | |
39 | ai-chat-android | 💬 AI Chat Bot demo app showcasing the integration of Gemini SDK with Firebase Realtime Database for real-time chat functionality. | 89 | |
40 | VLM-Grounder | [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | 82 | |
41 | Quest-Best-Tokens | An introduction to LLM Sampling | 75 | |
42 | STRING | [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" | 69 | |
43 | DiGIT | [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | 59 | |
44 | 3d-conditioning | Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset. | 59 | |
45 | TrustEval-toolkit | TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs) | 49 | |
46 | VLMnav | End-to-End Navigation with VLMs | 48 | |
47 | OLA-VLM | OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024 | 48 | |
48 | chat2note | This is a tool that can automatically translate the chat log into a note using LLM API | 47 | |
49 | robot-3dlotus | Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." | 47 | |
50 | gpt-bert | Official implementation of "GPT or BERT: why not both?" | 46 | |
51 | SubgraphRAG | [ICLR 2025] Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation | 43 | |
52 | SeeDo | Human Demo Videos to Robot Action Plans | 41 | |
53 | Montessori-Instruct | Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] | 41 | |
54 | vscode-copilot-vision | Exploration into leveraging vision capabilities of an LLM | 40 | |
55 | FinGLM2 | Repository for the Zhipu AI 2024 Financial Industry Large Model Challenge | 40 | |
56 | Emma-X | Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | 39 | |
57 | vlm-knowledge-conflict | Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." | 39 | |
58 | LLM4SR | LLM for Scientific Research Survey | 39 | |
59 | ApolloMoE | ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | 38 | |
60 | docspedia | Chat with your PDF using your local LLM, Ollama client. (incomplete) | 36 | |
61 | DPO_pLM | | 36 | |
62 | dingo | Dingo: A Comprehensive Data Quality Evaluation Tool | 35 | |
63 | duckdb-extension-openprompt | DuckDB Community Extension to prompt LLMs from SQL | 34 | |
64 | ai-generated-fake-podcasts | A growing list of fake podcasts generated by Notebook LM | 31 | |
65 | llgtrt | TensorRT-LLM server with Structured Outputs (JSON) built with Rust | 31 | |
66 | ProSA | [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs | 24 | |
67 | open-sesame | Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionality. | 24 | |
68 | splicing | Splicing: Gen-AI Copilot for Data Engineering | 23 | |
69 | PULSE | The code, data, and models for "Teach Multimodal LLMs to Comprehend Electrocardiographic Images". | 23 | |
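70 | siliconflow-plugin | An AIGC plugin based on Yunzai that offers free access to image generation with FLUX, SD, MJ and others, LLM inference, LLMs with real-time Google search, Vits speech synthesis, and more. Supports multi-model access, multi-key load balancing, image-to-image generation, direct link retrieval, Markdown image output, and other features. | 23 | |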
71 | filament-translations-gpt | Translations Manager extension that uses OpenAI's ChatGPT to auto-translate your `__()` and `trans()` function calls | 22 | |
72 | vlmrun-hub | A hub for various industry-specific schemas to be used with VLMs. | 22 | |
73 | gptparse | Document parser for RAG | 20 | |
74 | netaivideoanalyzer | This repository contains a series of samples on how to analyse a video using multimodal Large Language Models, like GPT-4o or GPT-4o-mini. | 19 | |
75 | llm-comparison-backend | An open-source project that lets you compare two LLMs head to head on a given prompt. This repository covers the backend, which allows LLM APIs to be incorporated and used in the front end. | 19 | |
76 | M5Module-LLM | Arduino library for M5Stack LLM Module | 18 | |
77 | sora-website | The official landing page for Sora Labs. | 18 | |
78 | BehAV | BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes | 17 | |
79 | entropix | Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral | 17 | |
80 | llm-honeypot | | 16 | |
81 | Sketch2Code | Code for the paper: Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping | 15 | |
82 | llm-arithmetic-heuristics | | 15 | |
83 | llm-landscape | NeurIPS'24 - LLM Safety Landscape | 14 | |
84 | gpt_sovits_rs | | 14 | |
85 | netaiTrafficJamAnalyzer | TrafficJamAnalyzer is an advanced tool designed to help monitor and analyze traffic conditions by processing images from CCTV cameras around the roads of Tenerife. By utilizing artificial intelligence (AI) with Semantic Kernel and OpenAI, the application accurately assesses traffic density and identifies locations with potential traffic jams. | 13 | |
86 | fine-tune-qwen2-vl-with-llama-factory | | 12 | |
87 | GeminiLite-Laravel | | 12 | |
88 | hands-on-llama | | 11 | |
89 | computer-agent-arena-hub | Computer Agent Arena Hub: Compare & Test AI Agents on Crowdsourced Real-World Computer Use Tasks | 11 | |
90 | lecca-io | Lecca.io \| AI Agents & Automations | 11 | |
91 | jira-ticket-classification | A Python-based AWS solution for automated Jira ticket classification using Amazon Bedrock. This project helps Jira users automate ticket categorization featuring S3 integration, AWS Glue deduplication, LLMs, and Terraform deployment. | 9 | |
92 | LLM-ReasoningTest | Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions | 9 | |
93 | Conversation_Reconstruction_Attack | This is the public code repository for the paper 'Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models' | 9 | |
94 | FloTorch | FloTorch is an open-source tool for optimizing Generative AI workloads on AWS. It automates RAG proof-of-concept development with features like hyperparameter tuning, vector database optimization, and LLM integration. FloTorch streamlines experimentation, ensures security, and accelerates production with cost-efficient, validated workflows. | 9 | |
95 | ai-calorie-counter | An LLM-based app to easily track calories and exercise by taking a photo of your meal or describing your physical activity | 8 | |
96 | aibook | (WIP) 🦀 An Insanely Fast 🚀 Full Stack Content Generation SaaS Platform Powered by Dioxus, Dioxus Server Functions, Axum, Unsplash, Gemini AI & MongoDB. | 8 | |
97 | DrawEduMath | Can VLMs understand students' hand-drawn math work? | 7 | |
98 | vlm-api | REST API for computing cross-modal similarity between images and text using the ColPaLI vision-language model | 7 | |
99 | llama-pruning | This project provides tools to load and prune large language models using a structured pruning method. | 7 | |
100 | DecorateLM | | 6 | |