Top AI Developers by monthly star count
Top AI Organization Accounts by AI repo star count
Top AI Projects by category star count
Fastest-Growing list, by the speed of gaining stars
Top list of little-known developers who create influential repos
Rank | Project | Project intro | Star count | |
---|---|---|---|---|
1 | LLaMA-Mesh | Unifying 3D Mesh Generation with Language Models | 620 | |
2 | WavChat | A Survey of Spoken Dialogue Models (60 pages) | 99 | |
3 | SDXL-Training-Improvements | SDXL Training Improvements | 43 | |
4 | BALROG | Benchmarking Agentic LLM and VLM Reasoning On Games | 34 | |
5 | VLMnav | End-to-End Navigation with VLMs | 31 | |
6 | maux-calories-tracker | 🤖 AI-powered food analysis tool that instantly calculates calories and nutrients from images. Built with Next.js 15, Vercel AI SDK, and GPT-4o. | 28 | |
7 | ama | Ask Me Anything for any website, powered by Firecrawl and OpenAI GPT-4o-mini | 25 | |
8 | Build-An-LLM-RAG-Chatbot-With-LangChain-Python | Build-An-LLM-RAG-Chatbot-With-LangChain-Python | 20 | |
9 | gpt-resolve | Can GPT solve Brazilian university entrance exams? | 17 | |
10 | AI-Frontiers-Digest | AI Frontiers Digest leverages LLMs to intelligently curate and summarize the latest developments in AI, while also generating engaging Podcasts. | 17 | |
11 | Thinking-GPT4o | Prompts that could enable GPT-4o to think and act like GPT-o1-mini | 15 | |
12 | GLMix | [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone". | 12 | |
13 | Pycon-mini-Tokai-2024-VLM-Colaboratory-Sample | Samples introduced in the PyCon mini Tokai 2024 talk "Trying VLMs with Google Colaboratory" | 11 | |
14 | Free-Unoffical-OpenAI-API | A powerful, unofficial OpenAI-compatible API service offering free access to GPT-4o, GPT-4-turbo, and audio preview models like gpt-4o-audio-preview & Realtime Models like gpt-4o-realtime. Features streaming responses, voice synthesis, TTS with no authentication requirements. Hosted on Hugging Face & Railway (Free tier) | 10 | |
15 | netaivideoanalyzer | This repository contains a series of samples on how to analyse a video using multimodal Large Language Models, like GPT-4o or GPT-4o-mini. | 10 | |
16 | PhoneLM | | 10 | |
17 | chat-with-image | AI Gemini | 9 | |
18 | IPLoc | Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples | 8 | |
19 | 3d-conditioning | Enhance and modify high-quality compositions using real-time rendering and generative AI output without affecting a hero product asset. | 8 | |
20 | Korean-SAT-LLM-Leaderboard | Korean SAT leaderboard | 7 | |
21 | llama-pruning | This project provides tools to load and prune large language models using a structured pruning method. | 6 | |
22 | ObsiAI | An AI chatbot plugin for Obsidian using the Gemini API for note summarization, content generation, and more. Enhance your workflow with AI assistance like the Notion AI bot. | 6 | |
23 | gemini-flashcards | Revolutionizing learning through AI-powered flashcards and adaptive study methods. | 5 | |
24 | aibook | (WIP) 🦀 An Insanely Fast 🚀 Full Stack Content Generation SaaS Platform Powered by Dioxus, Dioxus Server Functions, Axum, Unsplash, Gemini AI & MongoDB. | 5 | |
25 | build-n-roll-tg-bot | Telegram bot for fast and efficient creation of D&D characters | 4 | |
26 | Llama_impact-3.2 | "GovEase" is a simple platform that connects citizens with essential government information and services. | 4 | |
27 | Search-GPT | ChatGPT with real-time internet search capabilities. Built using Python, LangChain, Streamlit, OpenAI, and Google Search Results. | 4 | |
28 | par_gpt | CLI LLM tool | 3 | |
29 | ThinkGPT | | 3 | |
30 | opencoder-llm.github.io | | 3 | |
31 | XmodelLM-1.5 | | 3 | |
32 | FitnessLM | | 3 | |
33 | Jailbreak_VLM | | 2 | |
34 | MacOS-Menu-Apps | Name your screenshots using VLMs | 2 | |
35 | VLM_TriTraining | Construct a Tri-Training framework using VLMs as base estimators, and evaluate its accuracy on multiple semi-supervised learning benchmarks. | 2 | |
36 | SDXL_Anime_Arena | | 2 | |
37 | UltraEval-Audio | An easy-to-use, fast, and easily integrable tool for evaluating audio LLMs | 2 | |
38 | ROS2-NIM-API-Robotics-Intelligence-meta-llama3-70b- | | 2 | |
39 | vlm-api | REST API for computing cross-modal similarity between images and text using the ColPaLI vision-language model | 2 | |
40 | LLAMotion | Focus on the generation of demonstration animations for mathematics, statistics, etc., based on the Llama large model. | 2 | |
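41 | Funasr-Qwen-GPTSovits | <Integrated> An AI program that uses Funasr for speech recognition, calls the Qwen large model for answers, and outputs speech through GPTSovits. Model calls are currently online; offline large models will be added later. | 2 | |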
42 | lms-notifications | | 2 | |
43 | MycodeGpt | MycodeGpt | 2 | |
44 | SmolVLM | | 2 | |
45 | Manga-Image-Colorization | Colorization of manga images using multiple DL models | 2 | |
46 | Ai-Course-Generator | Full Stack AI Course Generator App With NextJs, React, TailwindCss, Gemini Api, Drizzle | 2 | |
47 | DecorateLM | | 2 | |
48 | open_search_llama | | 2 | |
49 | feedforge | A feedback app for mipyme bits in Peru using Gemini, OpenAI, and many different and useful integrations (AYNI HACKATON REGIONAL WINNER PROJECT) | 2 | |
50 | streamlit-rag-gemini | | 2 | |
51 | AI-Content-Generator | A scalable AI content generator app using Next.js, React, Tailwind CSS, TypeScript, Gemini AI, and Clerk. Features include dynamic templates, searchable filters, adaptive forms, and rich text editing. Drizzle ORM and PostgreSQL enable secure data management, with Vercel deployment for fast, reliable updates. | 2 | |
52 | LLM_ESG_POS | LLM-based ESG-Focused Portfolio Optimization Support service (LEPOS) - KWU Industry-Academic Cooperation SW Project 2024 by KWargs | 2 | |
53 | PoseMaster-Dynamic-Person-Repose-ChangeClothes-and-Face-Transformation | PoseMorphAI is a comprehensive pipeline built using ComfyUI and Stable Diffusion, designed to reposition people in images, modify their facial features, and change their clothes seamlessly. This solution leverages advanced pose estimation, facial conditioning, image generation, and detail refinement modules for high-quality output. | 2 | |
54 | ed-ut5-SorayaGarces | Repository for UT5 | 2 | |
55 | finnance-ai | A finance SaaS integrated with GPT chat | 2 | |
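56 | chatgpt-4o | [ChatGPT Chinese Edition] Usage guide for mainland China (supports GPT-4, unlimited use of GPT-4o and o1, no VPN required) [continuously updated in November]. A Chinese edition of ChatGPT that can be used directly within China! No VPN required, supports GPT-4, unlimited use of GPT-4o and o1-preview. | 1 | |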
57 | moondream-local-vlm-nextjs-starter | Moondream Web Interface: A modern web interface for the Moondream vision language model, built with Next.js and FastAPI. This project provides a user-friendly way to interact with images using Moondream's vision-language capabilities. | 1 | |
58 | ImageCraft | Stable Diffusion + ImageBind = ImageCraft (nothing to do with Minecraft unfortunately) | 1 | |
59 | analyze-documents-batch-aoai | Analyze pdf documents using Azure OpenAI GPT-4o vision capabilities. | 1 | |
60 | Gemini-project-1.0 | Version 1.0 (development at its initial stage) | 1 | |
61 | GeminiHackathon | | 1 | |
62 | Beam | A real-time graphics engine built with OpenGL, GLFW, and GLM. | 1 | |
63 | youtube-trascribe-summarize-gpt | | 1 | |
64 | GetFreeChat | Automatic collection of free instances of AI text models (ChatGPT, Claude, llama and others) | 1 | |
65 | niewinise | The Clean-UI project offers an intuitive interface for interacting with Llama-3.2 models, enabling users to generate detailed image descriptions and engage in AI-driven conversations. It includes easy setup instructions and a logging system for tracking interactions, making it ideal for developers and researchers. | 1 | |
66 | VLM4KOMO | | 1 | |
67 | ComfyUI_QAIC | A custom extension for ComfyUI that optimizes and deploys Stable Diffusion workflows on Qualcomm Cloud AI 100 accelerators. | 1 | |
68 | Gemini-Tutoring-Bot | An AI Tutor that tailors lessons based on the user's learning style and progress | 1 | |
69 | Llama3-Finetune | Llama3 Finetuning for SQL Code Generation | 1 | |
70 | SUDOKO-SOLVER | An AI Chatbox developed using the Gemini API, offering interactive, real-time responses for an engaging conversational experience. Built with HTML, CSS, and JavaScript, it features a responsive design, intuitive interface, and smooth animations, showcasing advanced API integration and a user-focused chat experience. | 1 | |
71 | HearMeGLM | | 1 | |
72 | GLMTest | | 1 | |
73 | Xmake-OpenGl | Minimal repository template for projects using xmake and OpenGL | 1 | |
74 | Playable-Plane-Render | Playable plane render with OpenGL, using glfw, glew, glm. | 1 | |
75 | haiku_LM_dashboard | Haiku Generation and Visualization Dashboard RNN LM NLP Final Project - Spring 2023 | 1 | |
76 | LMC | Langevin Monte Carlo | 1 | |
77 | llama-3-2-ollama | Local Chatbot with Llama 3.2 Model and Ollama | 1 | |
78 | Stable-Diffusion-Model | | 1 | |
79 | sd-forge-supir | SUPIR upscaling wrapper for Forge Webui | 1 | |
80 | OpenResearch | Open-Research.ai is an AI-driven search engine that leverages OpenAI and Serper.dev to deliver a powerful search experience. 1-Click deploy to Vercel. | 1 | |
81 | diff-summarizer | A Git commit message generator powered by OpenAI GPT-4o and Anthropic Claude 3.5 | 1 | |
82 | SumTube | SumTube is a tool that extracts subtitles from YouTube videos and creates concise summaries with Google Gemini. The tool can display the transcribed content and export the summary as a PDF, offering a practical and efficient way to get a quick overview of long videos. | 1 | |
83 | Sketch-Solve | Sketch&Solve is an iOS18 Math notes inspired app, allowing users to draw, write, and calculate math expressions using the GEMINI API. Built with React, it uses a canvas component where drawings are sent to the API as images for analysis. The API returns results in expression form, which the app renders on the frontend using LaTeX. | 1 | |
84 | PDDL__LLM_Task_planning | | 1 | |
85 | GLMS | A gamified learning management system for our capstone project | 1 | |
86 | ShellGPT | ShellGPT is an AI powered command-line tool that suggests and explains shell commands based on natural language queries. Perfect for those cases where you forget how to exit vim! | 1 | |
87 | netflix-gpt | | 1 | |
88 | pineapple-fastapi | This is the pineapple app for calling the ChatGPT endpoint. The backend API is FastAPI. | 1 | |
89 | lms-react-laravel | | 1 | |
90 | Gen-AI-Project-Using-Llama3.1- | This is an end-to-end LLM and gen AI project that uses the Llama3.1 open-source LLM, chromadb (vector store), Langchain, and streamlit to build a tool called cold email generator. | 1 | |
91 | Lawyer-Llamaa | Lawyer Llama is an intelligent legal assistant powered by LLaMA specifically designed to provide expert guidance on the Indian Constitution as of 2023. This application provides precise, context-aware legal information by leveraging advanced language models and vector search technology. | 1 | |
92 | VLM-based-Remote-Sensing | | 1 | |
93 | Ishansourav | | 1 | |
94 | SoraDBlite | SoraDBlite is a Python class designed to simplify interactions with MongoDB databases. Its operations are similar to MongoDB's and easy to understand. It is a lite version of pymongo. | 1 | |
95 | stable-diffusion-v1-5-finetuned-logos-white-background | Dataset: https://www.kaggle.com/datasets/juliancamilovelandia/logos-with-white-background?select=dataset_annotations_descriptions.json | 1 | |
96 | discord-bot | Discord Bot + Replicate + Stable Diffusion | 1 | |
97 | ai-artist | An AI Artist application utilizing LLM and Stable Diffusion. | 1 | |
98 | sd-from-scratch | Stable diffusion from scratch in PyTorch with full training script and web ui for inference. | 1 | |
99 | GRID-6X | Layout for Seamless Image Assembly | 1 | |
100 | TabooGPT-4o | | 1 | |