LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Stars

295

Forks

11

Language

Python

Last Updated

Feb 29, 2024

Similar Repos

Repo	Language	Stars	Description	Updated At
LLaMA-Efficient-Tuning	Python	4	Fine-tuning LLaMA with PEFT (SFT+RLHF)	May 28, 2023
llama-trl	None	3	LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA	Apr 06, 2023
LLM-Finetuning	Jupyter Notebook	750	LLM Finetuning with peft	Jan 18, 2024
ChatGLM-Peft-Tuning	Python	12	ChatGLM-Peft-Tuning	Apr 04, 2023
ChatGLM-Efficient-Tuning	Python	326	Fine-tuning ChatGLM-6B with PEFT \| 基于 PEFT 的高效 ChatGLM 微调	Apr 24, 2023
ChatGLM-Efficient-Tuning	None	2	Fine-tuning ChatGLM-6B with PEFT \| 基于 PEFT 的高效 ChatGLM 微调	Jun 09, 2023
ChatGLM-Efficient-Tuning	Python	2	Fine-tuning ChatGLM-6B with PEFT \| 基于 PEFT 的高效 ChatGLM 微调	Jun 27, 2023
llama-lora-fine-tuning	Python	45	llama fine-tuning with lora	Jul 18, 2023
haven	TypeScript	312	LLM fine-tuning and eval	Jan 17, 2024
peft	None	5	🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.	Apr 11, 2023
peft	Python	3693	🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.	Apr 12, 2023
peft	None	2	🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.	Apr 19, 2023
my_pefty_llama	Python	8	Minimal implementation of multiple PEFT methods for LLaMA fine-tuning	Apr 26, 2023
my_pefty_llama	None	2	Minimal implementation of multiple PEFT methods for LLaMA fine-tuning	Apr 20, 2023
peft	Python	3	🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.	Jun 11, 2023
peft	Python	2	🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.	Dec 03, 2023
multi-lora-fine-tune	Python	111	Provide efficient LLM LoRA fine tune	Jan 18, 2024
baichuan_sft_lora	Python	51	baichuan LLM surpervised finetune by lora	Jan 15, 2024
LongLM	Python	287	LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning	Jan 19, 2024
punica	Python	629	Serving multiple LoRA finetuned LLM as one	Jan 18, 2024
chatgpt-alternatives	None	5	Collection of ChatGPT alternatives & LLM tuning methods	Mar 30, 2023
reference-project-llm-fine-tuning	Jupyter Notebook	2	Fine-tuning an LLM for sentiment analysis	Nov 29, 2023
doppelganger	Python	146	Fine-tuning LLM on my Telegram chats	Jan 18, 2024
LLM-Tuning	Python	825	Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.	Jan 18, 2024
llama.mmengine	Python	35	Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!	Apr 09, 2023
Instruction_Tuning	Python	4	Tuning ChatGLM with Lora to follow instructions and solve downstream tasks.	Jul 31, 2023
LLaMa-EasyFT	Python	3	A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed	Apr 14, 2023
ImageBind-LoRA	Python	18	Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA	May 17, 2023
demo-llm-tuning	Python	2	Demo MLRun project for LLM Tuning and serving pipelines	Jun 15, 2023
FinetuneGLMWithPeft	Python	74	Simple implementation of using lora form the peft library to fine-tune the chatglm-6b	Apr 20, 2023
Platypus	Python	610	Code for fine-tuning Platypus fam LLMs using LoRA	Jan 17, 2024
LLaMA-LoRA-Tuner	Python	5	Tools for testing and fine-tuning LLaMA models using LoRA, based on Alpaca-LoRA (https://github.com/tloen/alpaca-lora).	Apr 09, 2023
LLM-Finetuning-Hub	Python	552	Repository that contains LLM fine-tuning and deployment scripts along with our research findings.	Jan 17, 2024
gpt-j-fine-tuning-example	Jupyter Notebook	33	Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression	Mar 21, 2023
LLM-finetuning	Jupyter Notebook	10	This repository provides code and resources for Parameter Efficient Fine-Tuning (PEFT), a technique for improving …	Oct 13, 2023
tiger	Jupyter Notebook	358	Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), …	Jan 18, 2024
llama-peft-tuner	Python	11	Tune LLaMa-7B on Alpaca Dataset using PEFT / LORA Based on @zphang's https://github.com/zphang/minimal-llama scripts.	Apr 07, 2023
PPO-Algorithms	Python	7	Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman …	Jun 22, 2023
h2o-llmstudio	Python	702	H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs	Apr 24, 2023
drola	CSS	61	Drone with Lora	Apr 12, 2022
LLM-Adapters	Python	374	LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models	Apr 12, 2023
awesome-llm-human-preference-datasets	None	27	A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.	May 09, 2023
LoRa_1W_APRS_Tracker	C++	3	LoRa APRS Tracker with 1 Watt LoRa Module	Apr 18, 2023
Colab_for_Alpaca_Lora	Jupyter Notebook	12	Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a …	Apr 16, 2023
LoRaExperiments	Arduino	12	Miscellaneous Experiments with LoRa	Aug 03, 2021
flan-t5-tweet-quality-predictor	Jupyter Notebook	4	Flan T5 LLM fine-tuning, by attaching a regression model last hidden layers activations. Runs on …	Apr 29, 2023
ppo	Python	2	Implementation of PPO with TF 2.0 and Pyoneer.	Nov 04, 2022
42dot_LLM	Python	91	42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot …	Jan 18, 2024
Hyperparameter-Tuning-with-Python	Jupyter Notebook	85	Hyperparameter Tuning with Python	Oct 07, 2022
PTR	Python	99	Prompt Tuning with Rules	Aug 15, 2022