This project integrates Vision-Language Models (VLMs) with autonomous driving systems to enhance decision-making through scene understanding and reasoning. Techniques such as YOLOv8 detection, SAM2 segmentation, and optical flow are used for robust object tracking and motion estimation.
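As a rough illustration of how two of these components can interact, the sketch below pairs YOLOv8 detections with dense Farneback optical flow to estimate per-object image-plane motion. This is a minimal sketch, not the project's actual pipeline: the video path, the `yolov8n.pt` weight file, and the flow-averaging heuristic are illustrative assumptions. It relies only on the `ultralytics` and `opencv-python` packages.

```python
import cv2
from ultralytics import YOLO

model = YOLO("yolov8n.pt")  # any pretrained YOLOv8 weights work here

cap = cv2.VideoCapture("driving_clip.mp4")  # hypothetical input video
ok, prev = cap.read()
prev_gray = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)

while True:
    ok, frame = cap.read()
    if not ok:
        break
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)

    # Dense Farneback optical flow between consecutive frames:
    # a (H, W, 2) array of per-pixel (dx, dy) displacements.
    flow = cv2.calcOpticalFlowFarneback(
        prev_gray, gray, None, 0.5, 3, 15, 3, 5, 1.2, 0
    )

    # Detect objects in the current frame with YOLOv8.
    for box in model(frame, verbose=False)[0].boxes:
        x1, y1, x2, y2 = map(int, box.xyxy[0])
        if x2 <= x1 or y2 <= y1:
            continue
        # Average flow inside the box approximates the object's
        # image-plane motion (a crude but simple heuristic).
        dx, dy = flow[y1:y2, x1:x2].reshape(-1, 2).mean(axis=0)
        print(f"class={int(box.cls)} motion=({dx:.1f}, {dy:.1f}) px/frame")

    prev_gray = gray

cap.release()
```

A real pipeline would instead feed segmentation masks (e.g., from SAM2) rather than raw boxes, and track identities across frames; this sketch only shows the detection-plus-flow coupling in isolation.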