Top AI developers by monthly star count
Top AI organization accounts by AI repo star count
Top AI projects by category star count
Fastest-growing projects by rate of star gain
Top little-known developers who create influential repos
Rankings | Related Project | Project intro | Star count |
---|---|---|---|
1 | open-canvas | 📃 A better UX for chat, writing content, and coding with LLMs. | 2.6K | |
2 | GLM-4-Voice | GLM-4-Voice: an end-to-end Chinese-English spoken dialogue model | 2.3K | |
3 | mini-omni2 | Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities. | 1.6K | |
4 | pdf-extract-api | Document (PDF) extraction and parsing API using state-of-the-art OCRs and Ollama-supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown. | 1.3K | |
5 | LLaMA-O1 | Large Reasoning Models | 585 | |
6 | LLaMA-Mesh | Unifying 3D Mesh Generation with Language Models | 485 | |
7 | VisRAG | Parsing-free RAG supported by VLMs | 399 | |
8 | CoI-Agent | Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents | 365 | |
9 | SDXL_EcomID_ComfyUI | | 298 | |
10 | pearai-master | PearAI acts as an inventory that curates the leading, cutting-edge AI tools in one place. We build a centralized user interface for these solutions, allowing you to effortlessly use the best AI tools without having to waste effort looking for alternatives. This repository lists all the other repos that make up the entire PearAI project. | 282 | |
11 | promptwright | Generate large synthetic data using a local LLM | 211 | |
12 | agent-as-a-judge | 🤠 Agent-as-a-Judge and DevAI dataset | 194 | |
13 | AgentSquare | The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space" | 129 | |
14 | phantasm | Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents' workflows in real time. | 128 | |
15 | meta-prompt | For LLMs to better code with Jina API | 108 | |
16 | chat-ui | Chat UI components for LLM apps | 101 | |
17 | llama.vim | Vim plugin for LLM-assisted code/text completion | 96 | |
18 | TurboReel_studio | Text to Shorts/Tiktoks, AI Video Engine | 94 | |
19 | ai-chat-android | 💬 AI Chat Bot demo app showcasing the integration of Gemini SDK with Firebase Realtime Database for real-time chat functionality. | 81 | |
20 | VLM2Vec | This repo contains the code and data for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" | 70 | |
21 | VLM-Grounder | [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | 67 | |
22 | STRING | Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" | 64 | |
23 | intelligence | A Ruby gem for seamlessly and uniformly interacting with large language and vision model (LLM) APIs served by numerous services, including those of OpenAI, Anthropic, Google and others. | 64 | |
24 | Quest-Best-Tokens | An introduction to LLM Sampling | 64 | |
25 | chat2note | This is a tool that can automatically translate the chat log into a note using LLM API | 63 | |
26 | MagicPIG | MagicPIG: LSH Sampling for Efficient LLM Generation | 58 | |
27 | ai-analyst | Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts. | 42 | |
28 | DiGIT | [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | 41 | |
29 | docspedia | Chat with your PDF using your local LLM and an Ollama client. | 37 | |
30 | Fast-LLM | Accelerating your LLM training to full speed | 36 | |
31 | gpt-bert | Official implementation of "GPT or BERT: why not both?" | 35 | |
32 | ApolloMoE | ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | 34 | |
33 | vlm-knowledge-conflict | Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." | 32 | |
34 | Montessori-Instruct | Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning | 32 | |
35 | oat | 🌾 OAT: Online AlignmenT for LLMs | 32 | |
36 | SeeDo | Human Demo Videos to Robot Action Plans | 31 | |
37 | VLMnav | End-to-End Navigation with VLMs | 26 | |
38 | ai-generated-fake-podcasts | A growing list of fake podcasts generated by Notebook LM | 24 | |
39 | duckdb-extension-openprompt | DuckDB Community Extension to prompt LLMs from SQL | 22 | |
40 | vscode-copilot-vision | Exploration into leveraging vision capabilities of an LLM | 21 | |
41 | splicing | Splicing: Gen-AI Copilot for Data Engineering | 21 | |
42 | ProSA | [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs | 17 | |
43 | gptparse | Document parser for RAG | 16 | |
44 | Conversation_Reconstruction_Attack | This is the public code repository for the paper 'Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models' | 15 | |
45 | entropix | Unofficial entropix implementation for Gemma2, Llama, Qwen2, and Mistral | 15 | |
46 | llm-comparison-backend | An open-source project that lets you compare two LLMs head to head on a given prompt. This repository covers the backend, which allows LLM APIs to be incorporated and used in the front end. | 13 | |
47 | open-sesame | Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionality. | 13 | |
48 | llgtrt | TensorRT-LLM server with Structured Outputs (JSON) built with Rust | 13 | |
49 | Sketch2Code | Code for the paper: Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping | 10 | |
50 | netaivideoanalyzer | This repository contains a series of samples on how to analyse a video using multimodal Large Language Models, like GPT-4o or GPT-4o-mini. | 9 | |
51 | BehAV | BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes | 9 | |
52 | PULSE | The code, data, and models for "Teach Multimodal LLMs to Comprehend Electrocardiographic Images". | 9 | |
53 | duckai | DuckDuckGo AI to OpenAI API | 8 | |
54 | jira-ticket-classification | A Python-based AWS solution for automated Jira ticket classification using Amazon Bedrock. This project helps Jira users automate ticket categorization featuring S3 integration, AWS Glue deduplication, LLMs, and Terraform deployment. | 7 | |
55 | llm-landscape | NeurIPS'24 - LLM Safety Landscape | 7 | |
56 | M5Module-LLM | Arduino library for M5Stack LLM Module | 7 | |
57 | SubgraphRAG | Code for the paper "Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation" | 7 | |
58 | fine-tune-qwen2-vl-with-llama-factory | | 6 | |
59 | LLM-ReasoningTest | Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions | 6 | |
60 | siliconflow-plugin | An AIGC plugin based on Yunzai that offers free use of FLUX, SD, MJ and other image generation, LLM inference, Vits speech synthesis, and more. | 6 | |
61 | LLAMAINDEX-RAGATHON-WORKSHOP24 | The repository for the LlamaIndex RAG-A-THON 2024 Workshop by AI Makerspace | 5 | |
62 | netaiTrafficJamAnalyzer | TrafficJamAnalyzer is an advanced tool designed to help monitor and analyze traffic conditions by processing images from CCTV cameras around the roads of Tenerife. By utilizing artificial intelligence (AI) with Semantic Kernel and OpenAI, the application accurately assesses traffic density and identifies locations with potential traffic jams. | 5 | |
63 | llm-arithmetic-heuristics | | 5 | |
64 | aibook | (WIP) 🦀 An Insanely Fast 🚀 Full Stack Content Generation SaaS Platform Powered by Dioxus, Dioxus Server Functions, Axum, Unsplash, Gemini AI & MongoDB. | 5 | |
65 | llama-pruning | This project provides tools to load and prune large language models using a structured pruning method. | 5 | |
66 | 3d-conditioning | Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset. | 5 | |
67 | ai-calorie-counter | An LLM-based app to easily track calories and exercise by taking a photo of your meal or describing your physical activity | 4 | |
68 | LLM4SR | LLM for Scientific Research Survey | 4 | |
69 | GreatIterator | Iterates on problem via LLM until it finishes | 4 | |
70 | ComfyUI-Gemini | Nodes for Google Gemini API | 3 | |
71 | hexabot-wordpress-live-chat-widget | WordPress Chatbot Plugin for Hexabot.ai Live Chat Widget. | 3 | |
72 | llm-honeypot | | 3 | |
73 | parseltongue | A library to compress codebases so much that LLMs can reason through them 100x faster | 3 | |
74 | LMS-Development-Repo | Repository for the LMS (Last Man Standing) web application developed by Team 1 in CS4337 Big Data Management Module | 3 | |
75 | gpt_sovits_rs | | 3 | |
76 | AceGPT-v2 | Paper: Alignment at Pre-training! Towards Native Alignment for Arabic LLMs | 2 | |
77 | Team14_Peckish | 2024-oct-4-responsible-llm-hackathon-team-cockatiel created by GitHub Classroom | 2 | |
78 | openkbs | Open source platform for building AI agents | 2 | |
79 | 2024fall-crosslingual-vlm-block-seminar | Materials of 2024 Fall cross-lingual visual language models block seminar at LMU Munich. | 2 | |
80 | reflex-llamaindex-template | | 2 | |
81 | tts | Python project to convert PDFs/images into an audio file using OpenAI's GPT-4o and TTS-1 models. | 2 | |
82 | caltech-llmops | Class at Caltech on LLMOps by Brian Ray | 2 | |
83 | hexabot-plugin-ollama | The Ollama Plugin Extension for Hexabot Chatbot / Agent Builder that provides a custom block for Generative AI + RAG | 2 | |
84 | FlashCardGPT | | 2 | |
85 | finance-gpt | | 2 | |
86 | researcher | Multi agent LLM in-depth researcher | 2 | |
87 | llmops-production-rag | | 2 | |
88 | chatgpt-bot-example | Backend to a custom GPT that can be used to answer queries about your AWS account | 2 | |
89 | LLAMotion | Focus on the generation of demonstration animations for mathematics, statistics, etc., based on the Llama large model. | 2 | |
90 | pycasts | A text to Podcast inference API | 2 | |
91 | DecorateLM | | 1 | |
92 | llama-3-prompt-injection-fine-tuning | | 1 | |
93 | ai-chat-vision-quickstart-csharp | A C# sample of chatting with uploaded images using OpenAI vision models like gpt-4o. | 1 | |
94 | CSharpToJsonSchema | Helpers and Source generator to define OpenAI/Ollama/Anthropic/Gemini/LangChain tools natively through C# interfaces and without Reflection | 1 | |
95 | gemini-prompting-workshop | Prompting workshop with Gemini AI | 1 | |
96 | Synthia-Unity | Unity - Google-Gemini | 1 | |
97 | WaColor | Templates of Japanese colors (Macha, Azuki, Kinako, Sora) | 1 | |
98 | PrefectLLMOrchestration | | 1 | |
99 | LambdaLM | LLM Tooling | 1 | |
100 | example-stable-diffusion-inpainting | | 1 | |