trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Stars: 4353

Forks: 464

Language: Python

Last Updated: May 26, 2024
