instructGOOSE

Implementation of Reinforcement Learning from Human Feedback (RLHF)

Stars

158

Forks

20

Language

Jupyter Notebook

Last Updated

Dec 22, 2023

Similar Repos