outer-value-function-meta-rl

Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function

Stars

10

Forks

2

Language

Jupyter Notebook

Last Updated

Jan 04, 2024

Similar Repos

Repo	Language	Stars	Description	Updated At
svg	Jupyter Notebook	41	On the model-based stochastic value gradient for continuous reinforcement learning	Jun 22, 2022
MRA	Python	2	Code for paper "Learning Meta Representation for Agents in Multi-Agent Reinforcement Learning".	Nov 27, 2023
Reinforcement_learning_tutorial_with_demo	Jupyter Notebook	619	Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, …	Apr 17, 2023
Reinforcement_learning_tutorial_with_demo	Jupyter Notebook	3	Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, …	Feb 20, 2022
GradLRE	Python	21	The source code of paper "Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction"	Mar 30, 2023
Meta-Reinforcement-Learning	Jupyter Notebook	17	Code snippets of Meta Reinforcement Learning algorithms	Oct 31, 2022
AdMRL	Python	31	Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)	Oct 02, 2022
mlsh	Python	573	Code for the paper "Meta-Learning Shared Hierarchies"	Aug 12, 2022
MetaGym	Python	211	Collection of Reinforcement Learning / Meta Reinforcement Learning Environments.	Aug 08, 2022
meta-q-learning	Python	68	Code for the paper "Meta-Q-Learning"( ICLR 2020)	Aug 12, 2022
coinrun	C++	338	Code for the paper "Quantifying Transfer in Reinforcement Learning"	Jun 20, 2022
apdac	Python	2	Code for the paper: "Attention-based Partial Decoupling of Policy and Value for Generalization in Reinforcement …	Nov 01, 2022
non_conventional_value_function_approximation	Jupyter Notebook	2	Comparative Evaluation of Non-Conventional Value Function Approximation Methods in Reinforcement Learning	Nov 13, 2023
learnable-init	Jupyter Notebook	5	Code for the paper: Discovering Weight Initializers with Meta-Learning	Mar 21, 2022
supervised-reptile	JavaScript	928	Code for the paper "On First-Order Meta-Learning Algorithms"	Aug 10, 2022
DMML	Python	105	code for ICCV19 paper "Deep Meta Metric Learning"	Mar 08, 2023
DeterministicPolicyGradient.jl	Julia	8	Reinforcement learning with Deterministic Policy Gradient methods	May 27, 2021
reinforcement-learning-tic-tac-toe	JavaScript	14	A reinforcement learning algorithm for agents to learn the tic-tac-toe, using the value function.	Apr 19, 2023
DCPG	Python	12	Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)	Oct 24, 2023
DIRL-bidding_preference	Python	8	The code of the algorithm proposed in the paper "Deep Inverse Reinforcement Learning for Objective …	May 28, 2023
auxiliary-tasks-rl	Python	19	Code for the paper Adaptive Auxiliary Task Weighting for Reinforcement Learning	Apr 28, 2022
train-procgen	Python	143	Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"	Jul 31, 2022
boolean_composition	Python	9	Code for the paper "A Boolean Task Algebra For Reinforcement Learning"	Jan 28, 2023
phasic-policy-gradient	Python	193	Code for the paper "Phasic Policy Gradient"	Jul 28, 2022
macaw	Python	30	Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]	Mar 09, 2023
taming-maml	Python	27	Taming MAML: efficient unbiased meta-reinforcement learning	May 08, 2023
pytorch-maml-rl	Python	2	Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch	Jun 23, 2022
pytorch-maml-rl	Python	736	Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch	May 07, 2023
paper-reviews	None	31	weekly reinforcement learning paper reviews	Aug 05, 2022
rl-paper-study	None	183	Reinforcement Learning paper review study	Aug 06, 2022
hyperclip	Python	22	Code repository for the paper "Meta-Learning via Classifier(-free) Diffusion Guidance"	Apr 17, 2023
mushroom_hierarchical	Python	2	Code for the paper: Graph-Based Design of Hierarchical Reinforcement Learning Agents	Jan 27, 2021
reinforcement_learning_augmentation	Python	2	Modified code and experiments from the "Feature augmentation with reinforcement learning" paper	Jun 21, 2023
L2F	Python	29	Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"	May 22, 2022
linear-rl	Python	5	Reinforcement Learning with linear function approximation	May 11, 2023
SAVE	Python	2	Implementation of "Statistical Inference of the Value Function for Reinforcement Learning" in Infinite Horizon Settings …	Jan 22, 2024
MetaIRL	Python	52	Meta-Inverse Reinforcement Learning with Probabilistic Context Variables	Aug 11, 2022
evolution-strategies-starter	Python	1451	Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"	Aug 11, 2022
trajectory-transformer	Python	203	Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"	Aug 12, 2022
deep-rl-humanoid-motions-masters	Python	3	Repository for code from the Master's Thesis "Imitation Learning and Meta-Reinforcement Learning for Optimizing Humanoid …	Sep 08, 2021
gpt3-jobadvert-bias	Jupyter Notebook	2	Code for the paper "Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements".	Jul 16, 2023
ALFA	Python	42	Source code for NeurIPS 2020 paper "Meta-Learning with Adaptive Hyperparameters"	Oct 02, 2022
VEM	Python	7	Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796)	Jun 19, 2022
multi-armed-bandit-with-policy-gradient	Jupyter Notebook	8	A multi armed bandit Reinforcement learning problem using Policy Gradient.	May 09, 2020
openai-gym-policy-gradient	Python	105	Reinforcement Learning using Policy Gradient to solve OpenAI Gym games	Apr 11, 2023
robosumo	Python	255	Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"	Aug 08, 2022
Meta_Control_Variates	Jupyter Notebook	2	Code released for the paper "Meta-learning Control Variates: Variance Reduction with Limited Data"	Apr 09, 2023
dm_alchemy	Python	186	DeepMind Alchemy task environment: a meta-reinforcement learning benchmark	Jul 24, 2022
or-rl-benchmarks	Python	59	The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems'	Jun 25, 2022
rl-plasticity-experiments	Python	3	Source code and data for the paper "Testing the Plasticity of Reinforcement Learning Based Systems"	Mar 07, 2023