Stars
10
Forks
2
Language
Jupyter Notebook
Last Updated
Jan 04, 2024
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Jupyter Notebook | 41 | On the model-based stochastic value gradient for continuous reinforcement learning | Jun 22, 2022 | |
Python | 2 | Code for paper "Learning Meta Representation for Agents in Multi-Agent Reinforcement Learning". | Nov 27, 2023 | |
Jupyter Notebook | 619 | Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, … | Apr 17, 2023 | |
Jupyter Notebook | 3 | Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, … | Feb 20, 2022 | |
Python | 21 | The source code of paper "Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction" | Mar 30, 2023 | |
Jupyter Notebook | 17 | Code snippets of Meta Reinforcement Learning algorithms | Oct 31, 2022 | |
Python | 31 | Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875) | Oct 02, 2022 | |
Python | 573 | Code for the paper "Meta-Learning Shared Hierarchies" | Aug 12, 2022 | |
Python | 211 | Collection of Reinforcement Learning / Meta Reinforcement Learning Environments. | Aug 08, 2022 | |
Python | 68 | Code for the paper "Meta-Q-Learning"( ICLR 2020) | Aug 12, 2022 | |
C++ | 338 | Code for the paper "Quantifying Transfer in Reinforcement Learning" | Jun 20, 2022 | |
Python | 2 | Code for the paper: "Attention-based Partial Decoupling of Policy and Value for Generalization in Reinforcement … | Nov 01, 2022 | |
Jupyter Notebook | 2 | Comparative Evaluation of Non-Conventional Value Function Approximation Methods in Reinforcement Learning | Nov 13, 2023 | |
Jupyter Notebook | 5 | Code for the paper: Discovering Weight Initializers with Meta-Learning | Mar 21, 2022 | |
JavaScript | 928 | Code for the paper "On First-Order Meta-Learning Algorithms" | Aug 10, 2022 | |
Python | 105 | code for ICCV19 paper "Deep Meta Metric Learning" | Mar 08, 2023 | |
Julia | 8 | Reinforcement learning with Deterministic Policy Gradient methods | May 27, 2021 | |
JavaScript | 14 | A reinforcement learning algorithm for agents to learn the tic-tac-toe, using the value function. | Apr 19, 2023 | |
Python | 12 | Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022) | Oct 24, 2023 | |
Python | 8 | The code of the algorithm proposed in the paper "Deep Inverse Reinforcement Learning for Objective … | May 28, 2023 | |
Python | 19 | Code for the paper Adaptive Auxiliary Task Weighting for Reinforcement Learning | Apr 28, 2022 | |
Python | 143 | Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning" | Jul 31, 2022 | |
Python | 9 | Code for the paper "A Boolean Task Algebra For Reinforcement Learning" | Jan 28, 2023 | |
Python | 193 | Code for the paper "Phasic Policy Gradient" | Jul 28, 2022 | |
Python | 30 | Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] | Mar 09, 2023 | |
Python | 27 | Taming MAML: efficient unbiased meta-reinforcement learning | May 08, 2023 | |
Python | 2 | Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch | Jun 23, 2022 | |
Python | 736 | Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch | May 07, 2023 | |
None | 31 | weekly reinforcement learning paper reviews | Aug 05, 2022 | |
None | 183 | Reinforcement Learning paper review study | Aug 06, 2022 | |
Python | 22 | Code repository for the paper "Meta-Learning via Classifier(-free) Diffusion Guidance" | Apr 17, 2023 | |
Python | 2 | Code for the paper: Graph-Based Design of Hierarchical Reinforcement Learning Agents | Jan 27, 2021 | |
Python | 2 | Modified code and experiments from the "Feature augmentation with reinforcement learning" paper | Jun 21, 2023 | |
Python | 29 | Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning" | May 22, 2022 | |
Python | 5 | Reinforcement Learning with linear function approximation | May 11, 2023 | |
Python | 2 | Implementation of "Statistical Inference of the Value Function for Reinforcement Learning" in Infinite Horizon Settings … | Jan 22, 2024 | |
Python | 52 | Meta-Inverse Reinforcement Learning with Probabilistic Context Variables | Aug 11, 2022 | |
Python | 1451 | Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" | Aug 11, 2022 | |
Python | 203 | Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem" | Aug 12, 2022 | |
Python | 3 | Repository for code from the Master's Thesis "Imitation Learning and Meta-Reinforcement Learning for Optimizing Humanoid … | Sep 08, 2021 | |
Jupyter Notebook | 2 | Code for the paper "Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements". | Jul 16, 2023 | |
Python | 42 | Source code for NeurIPS 2020 paper "Meta-Learning with Adaptive Hyperparameters" | Oct 02, 2022 | |
Python | 7 | Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796) | Jun 19, 2022 | |
Jupyter Notebook | 8 | A multi armed bandit Reinforcement learning problem using Policy Gradient. | May 09, 2020 | |
Python | 105 | Reinforcement Learning using Policy Gradient to solve OpenAI Gym games | Apr 11, 2023 | |
Python | 255 | Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" | Aug 08, 2022 | |
Jupyter Notebook | 2 | Code released for the paper "Meta-learning Control Variates: Variance Reduction with Limited Data" | Apr 09, 2023 | |
Python | 186 | DeepMind Alchemy task environment: a meta-reinforcement learning benchmark | Jul 24, 2022 | |
Python | 59 | The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems' | Jun 25, 2022 | |
Python | 3 | Source code and data for the paper "Testing the Plasticity of Reinforcement Learning Based Systems" | Mar 07, 2023 |