mihirp1998

researcher at CMU MLD

Pittsburgh

Github Data

Followers 43

Following 0

Links

AI Project

Public repos: 103Public gists: 2

AlignProp

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion

star: 267fork: 9

language: Python

created at: 2023-10-06

updated at: 2025-02-10