SPOT

Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239

Stars: 18
Forks: 0
Language: Python
Last Updated: Oct 20, 2023

Similar Repos

Repo	Language	Stars	Description	Updated At
Reinforcement_Learning_PPO	Python	6	Reinforcement learning with Proximal Policy Optimization (https://arxiv.org/abs/1707.06347)	Mar 16, 2023
neural-combinatorial-rl-pytorch	None	2	PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940	Nov 22, 2020
neural-combinatorial-rl-pytorch	Python	439	PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940	Oct 07, 2022
RecurrentDPG	Python	11	CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)	Jan 01, 2023
NLRL	Python	67	Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)	Sep 27, 2022
Pytorch-DPPO	Python	163	Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286	Sep 24, 2022
Deep-Incubation	Python	69	Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)	May 06, 2023
AdMRL	Python	31	Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)	Oct 02, 2022
pytorch-memonger	Python	528	Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174	Oct 13, 2022
MetaBox	Python	18	MetaBox: A Benchmark Platform for Meta-Black-Box Optimization with Reinforcement Learning (https://arxiv.org/abs/2310.08252)	Oct 14, 2023
pic	Python	8	"Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning" (https://arxiv.org/abs/2103.12726)	Jun 20, 2022
vanillaKD	Python	11	PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781	May 30, 2023
VEM	Python	7	Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796)	Jun 19, 2022
RL4BG	Python	7	Public code release for "Deep Reinforcement Learning for Closed-Loop Blood Glucose Control" (Ian Fox et …	Nov 23, 2021
TUSK	Python	6	Official code release of TUSK (NeurIPS'22, https://arxiv.org/abs/2206.08460).	Mar 28, 2023
ademxapp	Python	337	Code for https://arxiv.org/abs/1611.10080	Oct 17, 2022
SVDNet-for-Pedestrian-Retrieval	Shell	139	Code for https://arxiv.org/abs/1703.05693	May 29, 2022
pytorch-prunes	Python	141	Code for https://arxiv.org/abs/1810.04622	Aug 07, 2022
MSTmodel	Python	16	Code for https://arxiv.org/abs/1712.00254	Dec 08, 2021
SfBC	Python	24	Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548	May 08, 2023
cvpo-safe-rl	Python	42	Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)	Apr 26, 2023
policy-learning-landscape	Jupyter Notebook	52	Explore the optimization landscape for direct policy learning reinforcement learning.	Jul 26, 2022
neural-architecture-search	Python	400	Basic implementation of Neural Architecture Search with Reinforcement Learning (https://arxiv.org/abs/1611.01578).	Oct 13, 2022
DCAN	Python	26	[AAAI 2020] Code release for "Domain Conditioned Adaptation Network" https://arxiv.org/abs/2005.06717	Jul 12, 2022
Rider-PPO	Python	13	Rider Reinforcement Learning Environment with Proximal Policy Optimization	Aug 29, 2022
nifty	Python	22	Code for paper https://arxiv.org/abs/2102.13186	Sep 23, 2022
generalizable-device-placement	Python	10	Reference code for https://arxiv.org/abs/1906.08879	Oct 13, 2022
ActionRobustRL	Jupyter Notebook	30	Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.09184	Aug 29, 2022
transfer-hpo-framework	Python	5	Code accompanying https://arxiv.org/abs/1802.02219	Aug 02, 2022
feudal_networks	Python	169	An implementation of FeUdal Networks for Hierarchical Reinforcement Learning, as published at https://arxiv.org/abs/1703.01161	Jul 19, 2022
async-deep-rl	Python	69	A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783	Sep 04, 2021
async-rl	Python	5	Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)	Oct 08, 2019
async-rl	Python	395	Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)	Oct 04, 2022
meld	Python	50	MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957	Sep 11, 2022
GDCAN	Python	31	[TPAMI 2021] Code release for "Generalized Domain Conditioned Adaptation Network" https://arxiv.org/abs/2103.12339	Aug 13, 2022
Dr-Jekyll-and-Mr-Hyde-The-Strange-Case-of-Off-Policy-Policy-Updates	Python	3	Code for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates: https://arxiv.org/abs/2109.14727	Jun 17, 2022
uai2017_learning_to_acquire_information	Python	6	Code for UAI 2017 paper: https://arxiv.org/abs/1704.06131	Apr 06, 2021
icml2018_selecting_representative_examples	OpenEdge ABL	9	Code for ICML paper: https://arxiv.org/abs/1711.03243v3	Jul 20, 2020
CheXclusion	Jupyter Notebook	50	Code for the paper https://arxiv.org/abs/2003.00827	Aug 30, 2022
pva-faster-rcnn	Python	6	Demo code for PVANet https://arxiv.org/abs/1611.08588	Jan 17, 2019
One_Cycle_Policy	Jupyter Notebook	72	Pytorch notebook with One Cycle Policy implementation (https://arxiv.org/abs/1803.09820)	Oct 02, 2022
generalized_dt	Python	24	"Generalized Decision Transformer for Offline Hindsight Information Matching" (https://arxiv.org/abs/2111.10364)	Aug 10, 2022
KBGAN	Python	201	Code for "KBGAN: Adversarial Learning for Knowledge Graph Embeddings" https://arxiv.org/abs/1711.04071	Oct 10, 2022
eigengame	Python	3	Pseudo code from https://arxiv.org/abs/2102.04152	May 25, 2022
hcn	Python	81	Hybrid Code Networks https://arxiv.org/abs/1702.03274	Feb 08, 2022
mobile	Python	11	Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization	Dec 12, 2023
cola	Python	18	CoLa - Decentralized Linear Learning: https://arxiv.org/abs/1808.04883	Feb 19, 2022
ICQ	Python	34	Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement …	Jun 20, 2022
powersgd	Python	95	Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727	Oct 16, 2022
cliquecnn	CSS	2	Code for our paper "CliqueCNN: Deep Unsupervised Exemplar Learning" https://arxiv.org/abs/1608.08792	Sep 03, 2022