Stars: 18
Forks: 0
Language: Python
Last Updated: Oct 20, 2023
Similar Repos
Language | Stars | Description | Updated At |
---|---|---|---|
Python | 6 | Reinforcement learning with Proximal Policy Optimization (https://arxiv.org/abs/1707.06347) | Mar 16, 2023 | |
None | 2 | PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940 | Nov 22, 2020 | |
Python | 439 | PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940 | Oct 07, 2022 | |
Python | 11 | CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455) | Jan 01, 2023 | |
Python | 67 | Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729) | Sep 27, 2022 | |
Python | 163 | PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 | Sep 24, 2022 | |
Python | 69 | Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) | May 06, 2023 | |
Python | 31 | Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875) | Oct 02, 2022 | |
Python | 528 | Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174 | Oct 13, 2022 | |
Python | 18 | MetaBox: A Benchmark Platform for Meta-Black-Box Optimization with Reinforcement Learning (https://arxiv.org/abs/2310.08252) | Oct 14, 2023 | |
Python | 8 | "Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning" (https://arxiv.org/abs/2103.12726) | Jun 20, 2022 | |
Python | 11 | PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781 | May 30, 2023 | |
Python | 7 | Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796) | Jun 19, 2022 | |
Python | 7 | Public code release for "Deep Reinforcement Learning for Closed-Loop Blood Glucose Control" (Ian Fox et … | Nov 23, 2021 | |
Python | 6 | Official code release of TUSK (NeurIPS'22, https://arxiv.org/abs/2206.08460). | Mar 28, 2023 | |
Python | 337 | Code for https://arxiv.org/abs/1611.10080 | Oct 17, 2022 | |
Shell | 139 | Code for https://arxiv.org/abs/1703.05693 | May 29, 2022 | |
Python | 141 | Code for https://arxiv.org/abs/1810.04622 | Aug 07, 2022 | |
Python | 16 | Code for https://arxiv.org/abs/1712.00254 | Dec 08, 2021 | |
Python | 24 | Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548 | May 08, 2023 | |
Python | 42 | Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) | Apr 26, 2023 | |
Jupyter Notebook | 52 | Explore the optimization landscape for direct policy learning reinforcement learning. | Jul 26, 2022 | |
Python | 400 | Basic implementation of [Neural Architecture Search with Reinforcement Learning](https://arxiv.org/abs/1611.01578). | Oct 13, 2022 | |
Python | 26 | [AAAI 2020] Code release for "Domain Conditioned Adaptation Network" https://arxiv.org/abs/2005.06717 | Jul 12, 2022 | |
Python | 13 | Rider Reinforcement Learning Environment with Proximal Policy Optimization | Aug 29, 2022 | |
Python | 22 | Code for paper https://arxiv.org/abs/2102.13186 | Sep 23, 2022 | |
Python | 10 | Reference code for https://arxiv.org/abs/1906.08879 | Oct 13, 2022 | |
Jupyter Notebook | 30 | Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.09184 | Aug 29, 2022 | |
Python | 5 | Code accompanying https://arxiv.org/abs/1802.02219 | Aug 02, 2022 | |
Python | 169 | An implementation of FeUdal Networks for Hierarchical Reinforcement Learning, as published at https://arxiv.org/abs/1703.01161 | Jul 19, 2022 | |
Python | 69 | A TensorFlow-based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783 | Sep 04, 2021 | |
Python | 5 | Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783) | Oct 08, 2019 | |
Python | 395 | Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783) | Oct 04, 2022 | |
Python | 50 | MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957 | Sep 11, 2022 | |
Python | 31 | [TPAMI 2021] Code release for "Generalized Domain Conditioned Adaptation Network" https://arxiv.org/abs/2103.12339 | Aug 13, 2022 | |
Python | 3 | Code for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates: https://arxiv.org/abs/2109.14727 | Jun 17, 2022 | |
Python | 6 | Code for UAI 2017 paper: https://arxiv.org/abs/1704.06131 | Apr 06, 2021 | |
OpenEdge ABL | 9 | Code for ICML paper: https://arxiv.org/abs/1711.03243v3 | Jul 20, 2020 | |
Jupyter Notebook | 50 | Code for the paper https://arxiv.org/abs/2003.00827 | Aug 30, 2022 | |
Python | 6 | Demo code for PVANet https://arxiv.org/abs/1611.08588 | Jan 17, 2019 | |
Jupyter Notebook | 72 | PyTorch notebook with One Cycle Policy implementation (https://arxiv.org/abs/1803.09820) | Oct 02, 2022 | |
Python | 24 | "Generalized Decision Transformer for Offline Hindsight Information Matching" (https://arxiv.org/abs/2111.10364) | Aug 10, 2022 | |
Python | 201 | Code for "KBGAN: Adversarial Learning for Knowledge Graph Embeddings" https://arxiv.org/abs/1711.04071 | Oct 10, 2022 | |
Python | 3 | Pseudo code from https://arxiv.org/abs/2102.04152 | May 25, 2022 | |
Python | 81 | Hybrid Code Networks https://arxiv.org/abs/1702.03274 | Feb 08, 2022 | |
Python | 11 | Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization | Dec 12, 2023 | |
Python | 18 | CoLa - Decentralized Linear Learning: https://arxiv.org/abs/1808.04883 | Feb 19, 2022 | |
Python | 34 | Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement … | Jun 20, 2022 | |
Python | 95 | Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727 | Oct 16, 2022 | |
CSS | 2 | Code for our paper "CliqueCNN: Deep Unsupervised Exemplar Learning" https://arxiv.org/abs/1608.08792 | Sep 03, 2022 |