TOP AI Developers by monthly star count
TOP AI Organization Account by AI repo star count
Top AI Project by Category star count
Top Growing Speed list by the speed of gaining stars
Top List of who create influential repos with little people known
Rankings | Developers | Related Project | Project intro | Star count |
---|---|---|---|---|
1 | SDXL-Training-Improvements | 33 | ||
2 | maux-calories-tracker | 🤖 AI-powered food analysis tool that instantly calculates calories and nutrients from images. Built with Next.js 15, Vercel AI SDK, and GPT-4o. | 24 | |
3 | gpt-resolve | Can GPT solve Brazilian university entrance exams? | 12 | |
4 | Build-An-LLM-RAG-Chatbot-With-LangChain-Python | 11 | ||
5 | WavChat | A Survey of Spoken Dialogue Models (60 pages) | 11 | |
6 | Free-Unoffical-OpenAI-API | A powerful, unofficial OpenAI-compatible API service offering free access to GPT-4o, GPT-4-turbo, and audio preview models like gpt-4o-audio-preview & Realtime Models like gpt-4o-realtime. Features streaming responses, voice synthesis, TTS with no authentication requirements. Hosted on Hugging Face & Railway (Free tier) | 9 | |
7 | netaivideoanalyzer | This repository contains a series of samples on how to analyse a video using multimodal Large Language Models, like GPT-4o or GPT-4o-mini. | 5 | |
8 | VLMnav | End-to-End Navigation with VLMs | 4 | |
9 | GLMix | [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone". | 3 | |
10 | par_gpt | CLI LLM tool | 3 | |
11 | build-n-roll-tg-bot | Telegram bot for fast and efficient creation of D&D character | 3 | |
12 | ThinkGPT | 3 | ||
13 | PhoneLM | 3 | ||
14 | SSNamer | Name your screenshots using VLMs | 2 | |
15 | VLM_TriTraining | Construct a Tri-Training framework using VLMs as base estimators, and evaluate its accuracy on multiple semi-supervised learning benchmarks. | 2 | |
16 | SDXL_Anime_Arena | 2 | ||
17 | Funasr-Qwen-GPTSovits | <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型 | 2 | |
18 | lms-notifications | 2 | ||
19 | MycodeGpt | MycodeGpt | 2 | |
20 | SmolVLM | 2 | ||
21 | Korean-SAT-LLM-Leaderboard | Korean SAT leader board | 1 | |
22 | Jailbreak_VLM | 1 | ||
23 | moondream-local-vlm-nextjs-starter | Moondream Web Interface: A modern web interface for the Moondream vision language model, built with Next.js and FastAPI. This project provides a user-friendly way to interact with images using Moondream's vision-language capabilities. | 1 | |
24 | ImageCraft | Stable Diffusion + ImageBind = ImageCraft (nothing to do with Minecraft unfortunately) | 1 | |
25 | Gemini-project-1.0 | its 1.0 version ( development at its initial stage) | 1 | |
26 | UltraEval-Audio | An easy-to-use, fast, and easily integrable tool for evaluating audio LLM | 1 | |
27 | youtube-trascribe-summarize-gpt | 1 | ||
28 | ROS2-NIM-API-Robotics-Intelligence-meta-llama3-70b- | 1 | ||
29 | niewinise | The Clean-UI project offers an intuitive interface for interacting with Llama-3.2 models, enabling users to generate detailed image descriptions and engage in AI-driven conversations. It includes easy setup instructions and a logging system for tracking interactions, making it ideal for developers and researchers. | 1 | |
30 | VLM4KOMO | 1 | ||
31 | ComfyUI_QAIC | A custom extension for ComfyUI that optimizes and deploys Stable Diffusion workflows on Qualcomm Cloud AI 100 accelerators. | 1 | |
32 | Gemini-Tutoring-Bot | An AI Tutor that tailors lessons based on the user's learning style and progress | 1 | |
33 | Llama3-Finetune | Llama3 Finetuning for SQL Code Generation | 1 | |
34 | SUDOKO-SOLVER | An AI Chatbox developed using the Gemini API, offering interactive, real-time responses for an engaging conversational experience. Built with HTML, CSS, and JavaScript, it features a responsive design, intuitive interface, and smooth animations, showcasing advanced API integration and a user-focused chat experience. | 1 | |
35 | HearMeGLM | 1 | ||
36 | Xmake-OpenGl | Minimal repository template for project using xmake and opengl | 1 | |
37 | Playable-Plane-Render | Playable plane render with OpenGL, using glfw, glew, glm. | 1 | |
38 | LMC | Langevin Monte Carlo | 1 | |
39 | llama-3-2-ollama | Local Chatbot with Llama 3.2 Model and Ollama | 1 | |
40 | Stable-Diffusion-Model | 1 | ||
41 | sd-forge-supir | SUPIR upscaling wrapper for Forge Webui | 1 | |
42 | OpenResearch | Open-Research.ai is an AI-driven search engine that leverages OpenAI and Serper.dev to deliver a powerful search experience. 1-Click deploy to Vercel. | 1 | |
43 | diff-summarizer | A Git commit message generator powered by OpenAI GPT-4o and Anthropic Claude 3.5 | 1 | |
44 | SumTube | SumTube é uma ferramenta que extrai legendas de vídeos do YouTube e cria resumos concisos com o Google Gemini. A ferramenta permite exibir o conteúdo transcrito e exportar o resumo em PDF, oferecendo uma solução prática e eficiente para obter uma visão geral rápida de vídeos longos. | 1 | |
45 | PDDL__LLM_Task_planning | 1 | ||
46 | netflix-gpt | 1 | ||
47 | pineapple-fastapi | This is pineapple app to call Chat GPT endpoint. Backend api is fastapi | 1 | |
48 | Gen-AI-Project-Using-Llama3.1- | This is end to end LLM and gen ai project that will use Llama3.1 open source LLM, chromadb (vector store), Langchain and streamlit to build a tool called cold email generator. | 1 | |
49 | Lawyer-Llamaa | Lawyer Llama is an intelligent legal assistant powered by LLaMA specifically designed to provide expert guidance on the Indian Constitution as of 2023. This application provides precise, context-aware legal information by leveraging advanced language models and vector search technology. | 1 | |
50 | sorav-ishan | 1 | ||
51 | stable-diffusion-v1-5-finetuned-logos-white-background | Dataset: https://www.kaggle.com/datasets/juliancamilovelandia/logos-with-white-background?select=dataset_annotations_descriptions.json | 1 | |
52 | ai-artist | An AI Artist application utilizing LLM and Stable Diffusion. | 1 | |
53 | sd-from-scratch | Stable diffusion from scratch in PyTorch with full training script and web ui for inference. | 1 | |
54 | GRID-6X | Layout for Seamless Image Assembly | 1 | |
55 | TabooGPT-4o | 1 | ||
56 | Cyberdeck001 | RPI + Dual Camera + ChatGPT + Claude + Gemini + Grok + Perplexity + 5" Screen + Speaker&Mic | 1 | |
57 | Ai-Course-Generator | Full Stack AI Course Generator App With NextJs, React, TailwindCss, Gemini Api, Drizzle | 1 | |
58 | ChatGLM_ChatRobot | ChatGLM私有化部署聊天机器人 | 1 | |
59 | areaone-LMS | areaone lms is a system where its ctudents can access course materials available in the system. | 1 | |
60 | LMStudioWrapper | 1 | ||
61 | EduLlama | 1 | ||
62 | Coding-Assistant-using-Llama-3.1 | 1 | ||
63 | streamlit-rag-gemini | 1 | ||
64 | gemini-api | 1 | ||
65 | LLM-Fuzz | A novel Fuzzer utilizing Large Language Model for Ethereum Smart Contract Vulnerability Detection. | 1 | |
66 | unchained | A framework for developing applications powered by large language models (LLMs) in Gleam | 1 | |
67 | xnano | exteremely nano llm workflows | 1 | |
68 | Stinson_Kelly_LMS | 0 | ||
69 | GLMCyp-Predictor | 0 | ||
70 | LMS | 0 | ||
71 | bookIPs-Solvook-LLM | Developing a LLM based on LLaMA 3.1 for tasks related to korean - english educational contents using LLaMA factory | 0 | |
72 | Graduation-Project | 0 | ||
73 | ChatGPT-apiYouTube | 0 | ||
74 | gpt_wn | gpt_wn | 0 | |
75 | Full-Stack-LMS | We will | 0 | |
76 | k4lm3d.github.io | Gamer Profile Website | 0 | |
77 | llama | 0 | ||
78 | PrescriptionAnalyzer_llama | 0 | ||
79 | netflix-gpt | 0 | ||
80 | netflix-gpt | 0 | ||
81 | LMS | LMS project | 0 | |
82 | HandsOnVLM.github.io | 0 | ||
83 | LLM-VLM-Tutorial-4 | 0 | ||
84 | Efficient-Road-Repairs-System.VLM | 0 | ||
85 | VLM-self-correction | 0 | ||
86 | learning_vlm | 0 | ||
87 | VLMAgent | 0 | ||
88 | cashback_filler_bot | Automates filling cashback category information from banks using a Telegram Bot integrated with Vision Language Models (VLM) for image processing and the Notion API for organized data storage | 0 | |
89 | vlm_finetuning | 0 | ||
90 | vlm | 0 | ||
91 | VLM_PHI | Evaluating Vision-Language Models for Detecting and De-identifying Protected Health Information Burn-in on Medical Images | 0 | |
92 | vlense | A Python package to extract text from images and PDFs using Vision Language Models (VLM). | 0 | |
93 | frontend_sorapat | 0 | ||
94 | Video_Parody | 🎥 Create parody videos from uploaded files. Created in preparation for Sora. | 0 | |
95 | Frame-work-soraya | 0 | ||
96 | sorayandeh | 0 | ||
97 | AndreaSoranzo.github.io | 0 | ||
98 | Open-Sora | 0 | ||
99 | Soransharifi | Config files for my GitHub profile. | 0 | |
100 | SoraHub_RustPlus | 0 |