ppo

A stable version of ppo and a2c

Stars

4

Forks

0

Language

Python

Last Updated

Nov 20, 2023

Similar Repos

Repo	Language	Stars	Description	Updated At
torch-ppo	Python	6	PyTorch implementation of PPO, A2C, ACKTR, and GAIL	Mar 14, 2022
a2c_is_a_special_case_of_ppo	Python	12	A2C is a special case of PPO!	Aug 17, 2022
obstacle-tower-pytorch-a2c-ppo	Python	2	PPO/A2C in PyTorch for the Obstacle Tower Challenge	Mar 09, 2022
torch-ac	Python	159	Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO	Apr 23, 2023
ChatGPT_trading_bot-1	None	2	Creating an AI Based Trading bot using RNN, LSTM, PPO, A2C, DDPG	Feb 27, 2024
pytorch-rl-minimal-implementations	Python	2	Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, …	Feb 24, 2022
Deep-reinforcement-learning-with-pytorch	Python	2823	PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and …	Apr 24, 2023
Deep-reinforcement-learning-with-pytorch	None	2	PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and …	Apr 20, 2023
Deep-reinforcement-learning-with-pytorch	None	2	PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and …	Feb 08, 2023
SB3_PPO_to_ONNX	Python	2	Test deploying Stable-Baselines3 PPO model to ONNX	Apr 25, 2022
DRL-code-pytorch	Python	398	Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, …	Apr 26, 2023
mujoco-benchmark	None	101	Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, …	Feb 27, 2023
openwrt	C	583	OpenWrt Stable Version	Aug 12, 2022
openwrt-jdcloud	C	3	OpenWrt Stable Version	Oct 16, 2022
deep_rl	None	2	PyTorch implementations of Deep Reinforcement Learning algorithms (DQN, DDQN, A2C, VPG, TRPO, PPO, DDPG, TD3, …	Jul 08, 2022
PyTorch-RL	Python	856	PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial …	Aug 10, 2022
PyTorch-RL	Python	2	PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial …	Jun 23, 2022
Advanced-Actor-Critic-Methods	Python	2	PPO	Mar 28, 2022
deep-rl-toolkit	Python	9	RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC, ACER, …	Apr 01, 2023
PPO-clip-and-PPO-penalty-on-Atari-Domain	Python	57	Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty	Mar 05, 2023
pytorch-a2c-ppo-acktr	Python	2	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method …	Dec 03, 2019
pytorch-a2c-ppo-acktr	Python	2	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method …	Sep 24, 2018
pytorch-a2c-ppo-acktr	Python	4	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method …	Nov 10, 2022
pytorch-a2c-ppo-acktr	Python	3	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method …	Jan 21, 2020
pytorch-baselines-micropolis	Python	3	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method …	Mar 18, 2022
iso6.9-2.6stable	Python	2	the stable version of iso6.9	Jun 10, 2022
a2c	Python	3	A2C implementation with TensorFlow	Nov 29, 2021
opera-opensuse	None	2	Opera 2x stable and developer version packaged for opensue	Jan 19, 2017
semicolon.js	JavaScript	227	A more secure. stable and reliable version of vapor.js	Nov 11, 2022
semicolon.js	None	2	A more secure. stable and reliable version of vapor.js	Jan 08, 2013
semicolon.js	JavaScript	2	A more secure. stable and reliable version of vapor.js	Oct 04, 2022
global-store	TypeScript	5	provides version stable stores for library.	Aug 17, 2022
s3st14	PHP	15	SLiMS version 3 stable 14 (Seulanga)	Mar 31, 2022
s3st15_matoa	PHP	14	SLiMS version 3 stable 15 (Matoa)	Mar 31, 2022
NWWbot	None	2	The stable version of the bot	Aug 15, 2021
classic-dark	ReScript	27	Darklang stable version - currently on darklang.com	May 02, 2023
NWWbot	TypeScript	3	The stable version of the bot	May 09, 2022
python-tesseract-alpine	Dockerfile	2	🐋 Alpine Linux, Python (latest stable version) and Tesseract (latest version from Git).	Jul 06, 2023
PPO-Algorithms	Python	7	Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman …	Jun 22, 2023
FlappyBird	C	12	FlappyBird C++ version, powered by Cocos2d-x 3.0 stable version.	Jan 13, 2022
nick-stable-diffusion	Jupyter Notebook	53	Nick's Docker-based version of Stable Diffusion	May 07, 2023
cuckoo-reporting-module	Python	9	Cuckoo reporting module for version 1.2 stable	Jan 25, 2020
OpenDSA-stable	JavaScript	6	Stable release version for the OpenDSA project	Mar 02, 2017
Deep-Reinforcement-Learning-Algorithms	Jupyter Notebook	457	32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, …	Apr 27, 2023
Airplane_2.0.1	C++	9	Wechat Airplane C++ version, powered by Cocos2d-x 2.0.1 stable version.	Dec 11, 2019
Airplane_2.2.0	C++	54	Wechat Airplane C++ version, powered by Cocos2d-x 2.2.0 stable version.	May 23, 2020
Airplane_3.0	C++	52	Wechat Airplane C++ version, powered by Cocos2d-x 3.0 stable version.	Apr 23, 2023
pytorch-a2c-ppo-acktr-gail	Python	2901	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for …	Aug 29, 2022
pytorch-a2c-ppo-acktr	Python	2	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for …	Jul 09, 2021
pytorch-a2c-ppo-acktr-gail	None	2	PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for …	Oct 18, 2021