openai

Github Data

Followers 91785
Following 0

AI Project

Public repos: 189Public gists: 0

SWELancer-Benchmark

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
star: 484fork: 35
language: Python
created at: 2025-02-18
updated at: 2025-02-20

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems
star: 1.9Kfork: 111
language: Python
created at: 2023-04-13
updated at: 2025-02-10

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
star: 15.5Kfork: 2.7K
language: Python
created at: 2023-01-23
updated at: 2025-02-10

gpt-discord-bot

Example Discord bot written in Python that uses the completions API to have conversations with the `text-davinci-003` model, and the moderations API to filter the messages.
star: 1.8Kfork: 669
language: Python
created at: 2022-12-21
updated at: 2025-02-09

gpt-3

GPT-3: Language Models are Few-Shot Learners
star: 15.7Kfork: 2.3K
language:
created at: 2020-05-18
updated at: 2025-02-10

image-gpt

star: 2.0Kfork: 387
language: Python
created at: 2020-05-07
updated at: 2025-02-07

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences
star: 1.3Kfork: 166
language: Python
created at: 2019-09-14
updated at: 2025-02-09

gpt-2-output-dataset

Dataset of GPT-2 outputs for research in detection, biases, and more
star: 2.0Kfork: 550
language: Python
created at: 2019-05-03
updated at: 2025-02-09

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"
star: 23.0Kfork: 5.6K
language: Python
created at: 2019-02-11
updated at: 2025-02-10

finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
star: 2.2Kfork: 505
language: Python
created at: 2018-06-11
updated at: 2025-02-08