RLHFlow

Code for the Workflow of Reinforcement Learning from Human Feedback (RLHF)
United States of America

GitHub Data

Followers 107
Following 0