instructGOOSE

Implementation of Reinforcement Learning from Human Feedback (RLHF)

Stars: 158
Forks: 20
Language: Jupyter Notebook
Last Updated: Dec 22, 2023

Similar Repos

Repo	Language	Stars	Description	Updated At
rewardmodeling	Python	4	Train reward models for reinforcement learning from human feedback (RLHF).	Aug 28, 2023
alpaca-rlhf	Python	5	Finetuning alpaca with RLHF (Reinforcement Learning with Human Feedback)	Apr 25, 2023
safe-rlhf	Python	49	Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback	May 16, 2023
safe-rlhf	Python	2	Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback	Oct 22, 2023
trlx	Python	3061	A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)	Apr 25, 2023
PaLM-rlhf-pytorch	Python	74	Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically …	Dec 12, 2022
PaLM-rlhf-pytorch	Python	2	Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically …	May 21, 2023
PaLM-rlhf-pytorch	None	2	Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically …	Nov 30, 2023
PaLM-rlhf-jax	Python	6	Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically …	Apr 21, 2023
hh-rlhf	None	774	Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human …	Apr 25, 2023
awesome-RLHF-language-models	None	26	Curated list of resources for Reinforcement Learning from Human Feedback and Language Models	Apr 24, 2023
Alpaca-LoRA-RLHF-PyTorch	Python	7	A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation …	Apr 23, 2023
ChatGLM-LoRA-RLHF-PyTorch	Python	15	A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation …	Apr 24, 2023
Vicuna-LoRA-RLHF-PyTorch	Python	9	A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation …	Apr 24, 2023
awesome-RLHF	None	972	A curated list of reinforcement learning with human feedback resources (continually updated)	Apr 24, 2023
summarize-from-feedback	Python	398	Code for "Learning to summarize from human feedback"	Aug 12, 2022
my-alpaca	Jupyter Notebook	27	Try original alpaca. The multi-turn version is at [multi-turn-alpaca](https://github.com/l294265421/multi-turn-alpaca) and the version further trained with …	Apr 25, 2023
DQN-tensorflow	Python	2341	Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning	Aug 03, 2022
DQN-tensorflow	Python	3	Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning	Dec 13, 2018
DQN-tensorflow	Python	3	Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning	May 23, 2020
cogment-verse	Python	45	Library of Environments, Human Actor UIs and Agent implementation for Human In the Loop Learning …	May 01, 2023
DensePacker	Python	2	Reinforcement Learning Implementation	Sep 07, 2022
minichatgpt	Jupyter Notebook	11	annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations …	Mar 28, 2023
learning-from-human-preferences	Python	6	Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"	Jul 29, 2022
learning-from-human-preferences	Python	230	Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"	Apr 06, 2023
reinforcement-learning-code	Python	3	Reinforcement learning algorithm implementation	Mar 16, 2023
CircuitBreakerAgent-reinforcement-learning	Scala	3	Reinforcement learning Circuit Breaker implementation	Dec 15, 2021
DRL-ExampleCode	Python	5	Implementation code when learning deep reinforcement learning	Dec 01, 2023
human-level-control	HTML	13	Presentation on Human-Level Control Through Deep Reinforcement Learning	Feb 13, 2020
neurips2018_rl_challenge	Python	2	Reinforcement learning for human walking motion with prosthetic leg	Sep 26, 2019
ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO	Jupyter Notebook	30	A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback …	Mar 13, 2023
Reward-Modeling	Python	2	Reward Modeling from Human Preferences and Advantage Actor-Critic Reinforcement Learning: A Reproducibility Study	Jul 28, 2021
reinforcement-learning-implementation	Jupyter Notebook	225	Reinforcement Learning examples implementation and explanation	Aug 08, 2022
robo_rl	Python	2	Pytorch implementation of reinforcement learning algorithms	Oct 01, 2021
RL_intro_code	Python	2	Implementation for Reinforcement Learning: An Introduction	Apr 05, 2021
UDRL	Python	2	Implementation of upside down Reinforcement Learning	Jan 14, 2020
RL-An-Introduction_example_code	Python	6	reinforcement learning: an introduction python implementation	Feb 10, 2020
Human-level-control-through-deep-reinforcement-learning	Python	18	📖 Paper: Human-level control through deep reinforcement learning 🕹️	Jul 26, 2022
safe_rl_manipulators	C++	6	Verifiably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments	Apr 04, 2023
human_marl	Python	6	Cooperative Multi Agent Reinforcement Learning with Human in the Loop	Apr 24, 2023
Scanpath_Prediction	Python	76	Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning (CVPR2020)	Apr 19, 2023
Fanfare	Python	8	Addon: Gamification, feedback, and reinforcement	Mar 10, 2022
TAMER	Python	13	Implementation of the TAMER algorithm from "Interactively Shaping Agents via Human Reinforcement" (Knox, Stone - …	Apr 20, 2023
superhf	Jupyter Notebook	2	Open-source Human Feedback Library	Apr 12, 2023
alpaca_farm	Python	31	A Simulation Framework for Methods that Learn from Human Feedback	May 24, 2023
rurel	Rust	50	Flexible, reusable reinforcement learning (Q learning) implementation in Rust	Aug 11, 2022
Reinforcement_Learning_Assignment	Python	4	Implementation of Reinforcement Learning in Fall 2018	May 11, 2020
reinforcement-learning-an-introduction	None	5	Python Implementation of Reinforcement Learning: An Introduction	Sep 01, 2021
deep-rl-tensorflow	Python	1566	TensorFlow implementation of Deep Reinforcement Learning papers	Sep 19, 2022
reinforcement-learning-an-introduction	Python	11667	Python Implementation of Reinforcement Learning: An Introduction	Aug 18, 2022