microsoft

Open source projects and samples from Microsoft

Redmond, WA

GitHub Data

Followers 93773

Following 0

Links

https://x.com/OpenAtMicrosoft https://opensource.microsoft.com

AI Project

Public repos: 6880

Public gists: 0

BUILD25-LAB331

This repository hosts the instructions and workshop materials for Lab 331 (Deep Research with LangChain and DeepSeek R1) at Microsoft Build.

language: Python

created at: 2025-04-24

updated at: 2025-05-07

DKI_LLM

This repository hosts the DKI group's LLM-related papers along with their code.

language: Python

created at: 2025-02-25

updated at: 2025-05-27

peoplejoin

Code and data for LM Agents for Coordinating Multi-User Information Gathering

language: Python

created at: 2025-02-06

updated at: 2025-02-17

lmm-graphical-perception

Evaluating Graphical Perception of Large Multimodal Models

language: Jupyter Notebook

created at: 2024-10-21

updated at: 2025-03-12

vscode-copilot-vision

Exploration into leveraging vision capabilities of an LLM

star: 40

fork: 3

language: TypeScript

created at: 2024-10-07

updated at: 2025-02-07

VLM-Video-Action-Localization

language: Python

created at: 2024-09-03

updated at: 2025-01-23

multilspy

multilspy is an LSP client library in Python, intended for building applications around language servers.

star: 66

fork: 24

language: Python

created at: 2024-08-07

updated at: 2024-12-16

BitNet

Official inference framework for 1-bit LLMs

star: 12.8K

fork: 898

language: C++

created at: 2024-08-05

updated at: 2025-03-04

LLM2CLIP

LLM2CLIP makes a SOTA pretrained CLIP model more SOTA than ever.

star: 455

fork: 19

language: Python

created at: 2024-07-22

updated at: 2025-02-06

eureka-ml-insights

A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.

star: 106

fork: 15

language: Python

created at: 2024-07-18

updated at: 2025-02-07

toktrie

LLM token utility library

language: Rust

created at: 2024-07-05

updated at: 2025-01-25

semantic-kernel-java

Semantic Kernel for Java. Integrate cutting-edge LLM technology quickly and easily into your Java based apps. See https://aka.ms/semantic-kernel.

star: 128

fork: 25

language: Java

created at: 2024-06-12

updated at: 2025-02-08

LLM-Rubric

This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts, accepted at ACL 2024.

language: Python

created at: 2024-05-21

updated at: 2025-02-18

prompty

Prompty makes it easy to create, manage, debug, and evaluate LLM prompts for your AI applications. Prompty is an asset class and format for LLM prompts designed to enhance observability, understandability, and portability for developers.

star: 715

fork: 63

language: Python

created at: 2024-04-22

updated at: 2025-02-10

RD-Agent

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, which lets AI drive data-driven AI. 🔗https://aka.ms/RD-Agent-Tech-Report

star: 4.8K

fork: 433

language: Python

created at: 2024-04-03

updated at: 2025-05-27

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

star: 23.1K

fork: 2.3K

language: Python

created at: 2024-03-27

updated at: 2025-03-04

BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

star: 517

fork: 39

language: Python

created at: 2024-02-23

updated at: 2025-02-10

T-MAC

Low-bit LLM inference on CPU with lookup table

star: 666

fork: 50

language: C++

created at: 2024-02-02

updated at: 2025-02-10

WizardLM2

star: 60

fork: 7

created at: 2024-01-18

updated at: 2025-01-15

UFO

A UI-Focused Agent for Windows OS Interaction.

star: 6.6K

fork: 844

language: Python

created at: 2024-01-08

updated at: 2025-03-04

llmops-workshop

Learn how to build solutions with Large Language Models.

star: 141

fork: 49

language: Jupyter Notebook

created at: 2024-01-04

updated at: 2025-01-31

sarathi-serve

A low-latency & high-throughput serving engine for LLMs

star: 305

fork: 39

language: Python

created at: 2023-11-02

updated at: 2025-02-09

only_train_once

OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM

star: 30

fork: 6

language: Python

created at: 2023-10-19

updated at: 2025-02-06

llm-data-creation

Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"

star: 124

fork: 15

language: Python

created at: 2023-10-13

updated at: 2025-02-03

genaiops-promptflow-template

GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range of features including Centralized Code Hosting, Lifecycle Management, Variant and Hyperparameter Experimentation, A/B Deployment, reporting for all runs and experiments and so on.

star: 296

fork: 256

language: Python

created at: 2023-10-12

updated at: 2025-02-07

rag-experiment-accelerator

The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate experiments and evaluations using Azure Cognitive Search and the RAG pattern.

star: 223

fork: 79

language: Python

created at: 2023-09-25

updated at: 2025-02-07

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

star: 5.5K

fork: 705

language: Python

created at: 2023-09-11

updated at: 2025-02-09

RecAI

Bridging LLM and Recommender System.

star: 695

fork: 63

language: Jupyter Notebook

created at: 2023-09-07

updated at: 2025-02-09

autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

star: 39.1K

fork: 5.7K

language: Python

created at: 2023-08-18

updated at: 2025-02-10

genaiscript

Automatable GenAI Scripting

star: 2.6K

fork: 182

language: TypeScript

created at: 2023-08-17

updated at: 2025-05-31

Llama-2-Onnx

star: 1.0K

fork: 95

language: Python

created at: 2023-07-17

updated at: 2025-01-25

kernel-memory

RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.

star: 1.8K

fork: 334

created at: 2023-07-13

updated at: 2025-02-10

LLMLingua

[EMNLP'23, ACL'24] To speed up LLM inference and improve LLMs' perception of key information, LLMLingua compresses the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

star: 4.9K

fork: 271

language: Python

created at: 2023-07-07

updated at: 2025-02-10

promptflow

Build high-quality LLM apps, from prototyping and testing to production deployment and monitoring.

star: 10.0K

fork: 942

language: Python

created at: 2023-06-30

updated at: 2025-03-03

TypeChat

TypeChat is a library that makes it easy to build natural language interfaces using types.

star: 8.4K

fork: 394

language: TypeScript

created at: 2023-06-20

updated at: 2025-03-04

generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

star: 84.0K

fork: 43.7K

language: Jupyter Notebook

created at: 2023-06-19

updated at: 2025-05-27

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

star: 1.7K

fork: 214

language: Python

created at: 2023-05-20

updated at: 2025-02-10

gpt-review

star: 268

fork: 49

language: Python

created at: 2023-04-21

updated at: 2025-01-19

sample-app-aoai-chatGPT

Sample code for a simple web chat experience through Azure OpenAI, including Azure OpenAI On Your Data.

star: 1.8K

fork: 2.9K

language: Python

created at: 2023-04-06

updated at: 2025-03-02

ChatGPT-Robot-Manipulation-Prompts

star: 369

fork: 38

created at: 2023-04-06

updated at: 2025-01-26

JARVIS

JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

star: 24.0K

fork: 2.0K

language: Python

created at: 2023-03-30

updated at: 2025-03-04

semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps

star: 23.3K

fork: 3.6K

created at: 2023-02-27

updated at: 2025-03-04

PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

star: 1.9K

fork: 209

language: Python

created at: 2023-02-08

updated at: 2025-02-10

automated-explanations

Generating and validating natural-language explanations.

star: 47

fork: 6

language: HTML

created at: 2023-01-30

updated at: 2025-02-03

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

star: 4.0K

fork: 310

language: Python

created at: 2022-12-13

updated at: 2025-06-03

torchscale

Foundation Architecture for (M)LLMs

star: 3.0K

fork: 213

language: Python

created at: 2022-11-17

updated at: 2025-02-06

BioGPT

star: 4.4K

fork: 455

language: Python

created at: 2022-08-15

updated at: 2025-02-09

PyCodeGPT

A pre-trained GPT model for Python code completion and generation

star: 271

fork: 43

language: Python

created at: 2022-03-09

updated at: 2025-02-04

DialogLM

Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."

star: 137

fork: 9

language: Python

created at: 2021-12-03

updated at: 2024-09-12

COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

star: 118

fork: 13

language: Python

created at: 2021-10-21

updated at: 2024-10-23

semantic_parsing_with_constrained_lm

Code to reproduce experiments in the paper "Constrained Language Models Yield Few-Shot Semantic Parsers" (EMNLP 2021).

star: 61

fork: 7

language: Python

created at: 2021-09-08

updated at: 2024-09-28

Tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

star: 791

fork: 95

language: Python

created at: 2021-08-06

updated at: 2025-04-02

DialoGPT

Large-scale pretraining for dialogue

star: 2.4K

fork: 345

language: Python

created at: 2019-08-29

updated at: 2025-01-26

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

star: 20.8K

fork: 2.6K

language: Python

created at: 2019-07-23

updated at: 2025-03-04

LMChallenge

A library & tools to evaluate predictive language models.

star: 63

fork: 13

language: Python

created at: 2017-09-12

updated at: 2025-01-17

Cognitive-WebLM-Windows

Windows SDK for the Microsoft Web Language Model API, part of Cognitive Services

star: 15

fork: 15

created at: 2016-06-02

updated at: 2023-01-28

Sora

The Microsoft Research Software Radio (Sora) is a programmable software radio platform based on the commodity multicore CPU in a host PC. The SDK provides the drivers, user mode 802.11a/b/n samples, and a debug plot tool.

star: 358

fork: 125

created at: 2015-03-11

updated at: 2025-03-20