Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with …

Stars

196

Forks

18

Language

Python

Last Updated

May 14, 2024

Similar Repos