LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Stars

295

Forks

11

Language

Python

Last Updated

Feb 29, 2024

Similar Repos