TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | DictionaryByGPT4 | 一本 GPT4 生成的单词书📚,超过 8000 个单词分析,涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事 | 2.8K | |
2 | llama-models | Utilities intended for use with Llama models. | 2.5K | |
3 | gptpdf | Using GPT to parse PDF | 2.2K | |
4 | cambrian | Cambrian-1 is a family of multimodal LLMs with a vision-centric design. | 1.5K | |
5 | RouteLLM | A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! | 1.5K | |
6 | ShareGPT4Video | An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions | 994 | |
7 | korvus | Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python, JavaScript, Rust and C. | 913 | |
8 | LlamaGen | Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation | 865 | |
9 | screen-pipe | Library to build personalized AI powered by what you've seen, said, or heard. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust. | 809 | |
10 | SwarmUI | SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility. | 680 | |
11 | ttt-lm-pytorch | Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States | 637 | |
12 | Index-1.9B | A lightweight multilingual SOTA LLM | 631 | |
13 | datachain | DataChain 🔗 AI-dataframe to enrich, transform and analyze data from cloud storages for ML training and LLM apps | 520 | |
14 | ChatTTS-Forge | 🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI. | 473 | |
15 | ComfyUI-Florence2 | Inference Microsoft Florence2 VLM | 469 | |
16 | PyramidKV | The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling | 458 | |
17 | flash-diffusion | Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | 422 | |
18 | LARS | An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. | 358 | |
19 | buffer-of-thought-llm | Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models | 352 | |
20 | LLM-workshop-2024 | A 4-hour coding workshop to understand how LLMs are implemented and used | 351 | |
21 | chatgpt-artifacts | Bring Claude's Artifacts feature to ChatGPT | 331 | |
22 | mlx-gpt2 | gpt-2 from scratch in mlx | 302 | |
23 | EAGLE | EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders | 281 | |
24 | awesome-production-llm | A curated list of awesome open-source libraries for production LLM | 273 | |
25 | llama3.cuda | llama3.cuda is a pure C/CUDA implementation for Llama 3 model. | 269 | |
26 | dingllm.nvim | Yacine's LLM nvim scripts | 267 | |
27 | ASTRA.ai | A lightning-fast workflow builder, it supports multimodal interaction, highly customizable extensions, and is intuitive to use even without any coding knowledge. | 261 | |
28 | voicechat2 | Local SRT/LLM/TTS Voicechat | 258 | |
29 | chrome-ai | Vercel AI provider for Chrome built-in model (Gemini Nano) | 256 | |
30 | llamanet | Replace OpenAI with Llama.cpp Automagically. | 247 | |
31 | magpie | Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline! | 241 | |
32 | meme_search | Index your memes by their content and text, making them easily retrievable for your meme warfare pleasures. Find funny fast. | 235 | |
33 | llama-zip | LLM-powered lossless compression tool | 221 | |
34 | ChatGPT-Mirror | 🚀 一键部署自己的 ChatGPT 镜像站 | 207 | |
35 | colpali | The code used to train and run inference with the ColPali architecture. | 196 | |
36 | llama.ttf | A font for writing tiny stories | 186 | |
37 | EVE | EVE: Encoder-Free Vision-Language Models from BAAI | 168 | |
38 | VideoGPT-plus | Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | 161 | |
39 | StableNormal | StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal | 157 | |
40 | esp-ai | The simplest and most cost-effective AI integration solution, enabling any device to achieve intelligent conversation functionality (based on ESP development boards). If you like this project, please give it a Star! | 最简单、最低成本的AI接入方案,让任何物品都能实现智能对话功能(基于ESP开发板)。喜欢本项目的话点个 Star 吧,您的一个 Star 对目前的仓库发展非常重要 | 145 | |
41 | flute | Fast Matrix Multiplications for Lookup Table-Quantized LLMs | 144 | |
42 | Train-llm-from-scratch | 使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力 | 136 | |
43 | TypeGPT | Integrate LLM's into your OS. For any issues or ideas, message us in the discord server below! | 135 | |
44 | HuatuoGPT-Vision | Medical Multimodal LLMs | 134 | |
45 | rust-genai | Rust multiprovider generative AI client (Ollama, OpenAi, Anthropic, Groq, Gemini, Cohere, ...) | 134 | |
46 | llm-router | Tutorial for building LLM router | 133 | |
47 | StableFace | Build your own Face App with Stable Diffusion 2.1 | 132 | |
48 | LazyLLM | Easyest and lazyest way for building multi-agent LLMs applications. | 131 | |
49 | ceLLama | Cell type annotation with local Large Language Models (LLMs) - Ensuring privacy and speed with extensive customized reports | 126 | |
50 | GPT-SoVITS2 | GPT-SoVITS2 | 125 | |
51 | LLM-Finetune | 大语言模型微调,Qwen2、GLM4指令微调 | 117 | |
52 | LLaRA | LLaRA: Large Language and Robotics Assistant | 110 | |
53 | poixe | Platform of Open Intelligence eXperiences for Everyone. 面向所有人的开放智能体验平台,一站式AI对话工具聚合 | 107 | |
54 | ragbuilder | A toolkit to create optimal Production-ready RAG setup for your data | 103 | |
55 | tiny-ai-client | Tiny client for LLMs with vision and tool calling. As simple as it gets. | 73 | |
56 | rai | RAI is a multi-vendor agent framework for robotics, utilizing Langchain and ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and more. | 71 | |
57 | laravel-ai-translator | Automatic translate your language files into many languages using AI like Claude, GPT and etc. | 69 | |
58 | MM-NIAH | This is the official implementation of the paper "Needle In A Multimodal Haystack" | 66 | |
59 | awesome-ai-repositories | A curated list of open source repositories for AI Engineers | 64 | |
60 | MMTrustEval | A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust) | 61 | |
61 | A3VLM | Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` | 58 | |
62 | AIReceiptScanner | Swift library that utilize GPT-4o for scanning receipt and its items | 57 | |
63 | lmw3BLACKl | mw3 skin-swapper skin-changer skinchanger skinswapper inventory-changer mw3-inventory-changer mw3-skinswapper mw3-skinchanger mw3-skin-changer mw3-skin-swapper modern warfare 3 skin-swapper skin-changer skinchanger skinswapper inventory-changer modern warfare 3-skinswapper modern warfare 3-skinchanger modern warfare 3-skin-changer | 56 | |
64 | VoCo-LLaMA | VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models". | 55 | |
65 | Ovis | A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings. | 52 | |
66 | coir | A Comprehensive Benchmark for Code Information Retrieval. | 51 | |
67 | chromegemini | Chrome AI Test Page, running Gemini Nano locally in your browser. | 51 | |
68 | glm4v-assistant | Sample GLM4V + ChatTTS AI assistant | 49 | |
69 | ik_llama.cpp | llama.cpp clone with additional SOTA quants and improved CPU performance | 45 | |
70 | sd_embed | Generate long weighted prompt embeddings for Stable Diffusion | 43 | |
71 | Uncond-Zero-for-ComfyUI | Allows to sample without generating any uncond with Stable Diffusion! | 42 | |
72 | Claude-React-Jumpstart | 📖 A step-by-step guide for beginners to running Claude-generated React code locally. | 37 | |
73 | swiftide | Blazing fast data pipelines for Retrieval Augmented Generation written in Rust | 35 | |
74 | llm-interface | A simple NPM interface for seamlessly interacting with 36 Large Language Model (LLM) providers, including OpenAI, Anthropic, Google Gemini, Cohere, Hugging Face Inference, NVIDIA AI, Mistral AI, AI21 Studio, LLaMA.CPP, and Ollama, and hundreds of models. | 34 | |
75 | easy-llms | Easy "1-line" calling of all LLMs from OpenAI, MS Azure, AWS Bedrock, GCP Vertex, and Ollama | 34 | |
76 | YoLLaVA | 🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant | 34 | |
77 | Belullama | CasaOS + Ollama + Open WebUI = Belullama | 33 | |
78 | vecto | Hybrid Search with Postgres and Ecto | 33 | |
79 | CARES | [arXiv'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models | 32 | |
80 | SpeechLLM | This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingface. | 31 | |
81 | Retrochat-v2 | RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for engaging with different chat providers while offering robust features for managing and customizing your conversations. Also I am planning to reach 3000 lines in a single .py with agent support. I will not hear anyone | 31 | |
82 | voice-chat-ai | 🎙️ Speak with AI - Run locally using ollama or OpenAI - XTTS or OpenAI Speech or ElevenLabs | 30 | |
83 | The-Creator-AI | Choose code files for AI Chat though UI | 30 | |
84 | LLaMA-Factory-Doc | LLaMA Factory Document | 29 | |
85 | simply-simplify-language | Use machine learning to make your institutional communication more understandable and inclusive. | 29 | |
86 | WCA | [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models" | 29 | |
87 | openshield | OpenShield is a new generation security layer for AI models | 27 | |
88 | Llama3-8b-Naija_v1 | 25 | ||
89 | Simple_Llama3_from_scratch | 24 | ||
90 | omniai | OmniAI standardizes the APIs for multiple AI providers like OpenAI's Chat GPT, Mistral's LeChat, Claude's Anthropic and Google's Gemini. | 24 | |
91 | LLaVA-UHD-Better | A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo | 24 | |
92 | Kling-AI-Webui | Kling AI, Make Imagination Alive. This is a revolutionary text-to-video model like Sora. Kling AI WebUI is the open source project to integrate Kling AI Video Generation Model. | 24 | |
93 | SDXL-Dynamic-Image-Generator | 24 | ||
94 | midistral | LLM finetuned for generating symbolic music | 23 | |
95 | LLM4VPR | Can multimodal LLM help visual place recognition? | 23 | |
96 | StableDiffusionHelper | Advanced automated image processing tool for selection, cropping, and standardization. (Helper for stable diffusion), now updated with GUI 🎉 | 23 | |
97 | SoccerGPT | Small POC to predict game outcomes of the 2024 European Championship using GPT-4o and sportmonks football API. | 23 | |
98 | LangGraph-learn | learning resource of langgraph for dummy | 21 | |
99 | EngAce | Personalize the way Vietnamese learn English using generative AI | 21 | |
100 | Llama3-Med | 19 |