Stars
4
Forks
0
Language
Python
Last Updated
Nov 20, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 6 | PyTorch implementation of PPO, A2C, ACKTR, and GAIL | Mar 14, 2022 | |
Python | 12 | A2C is a special case of PPO! | Aug 17, 2022 | |
Python | 2 | PPO/A2C in PyTorch for the Obstacle Tower Challenge | Mar 09, 2022 | |
Python | 159 | Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO | Apr 23, 2023 | |
None | 2 | Creating an AI Based Trading bot using RNN, LSTM, PPO, A2C, DDPG | Feb 27, 2024 | |
Python | 2 | Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, … | Feb 24, 2022 | |
Python | 2823 | PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and … | Apr 24, 2023 | |
None | 2 | PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and … | Apr 20, 2023 | |
None | 2 | PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and … | Feb 08, 2023 | |
Python | 2 | Test deploying Stable-Baselines3 PPO model to ONNX | Apr 25, 2022 | |
Python | 398 | Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, … | Apr 26, 2023 | |
None | 101 | Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, … | Feb 27, 2023 | |
C | 583 | OpenWrt Stable Version | Aug 12, 2022 | |
C | 3 | OpenWrt Stable Version | Oct 16, 2022 | |
None | 2 | PyTorch implementations of Deep Reinforcement Learning algorithms (DQN, DDQN, A2C, VPG, TRPO, PPO, DDPG, TD3, … | Jul 08, 2022 | |
Python | 856 | PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial … | Aug 10, 2022 | |
Python | 2 | PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial … | Jun 23, 2022 | |
Python | 2 | PPO | Mar 28, 2022 | |
Python | 9 | RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC, ACER, … | Apr 01, 2023 | |
Python | 57 | Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty | Mar 05, 2023 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Dec 03, 2019 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Sep 24, 2018 | |
Python | 4 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Nov 10, 2022 | |
Python | 3 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Jan 21, 2020 | |
Python | 3 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Mar 18, 2022 | |
Python | 2 | the stable version of iso6.9 | Jun 10, 2022 | |
Python | 3 | A2C implementation with TensorFlow | Nov 29, 2021 | |
None | 2 | Opera 2x stable and developer version packaged for opensue | Jan 19, 2017 | |
JavaScript | 227 | A more secure. stable and reliable version of vapor.js | Nov 11, 2022 | |
None | 2 | A more secure. stable and reliable version of vapor.js | Jan 08, 2013 | |
JavaScript | 2 | A more secure. stable and reliable version of vapor.js | Oct 04, 2022 | |
TypeScript | 5 | provides version stable stores for library. | Aug 17, 2022 | |
PHP | 15 | SLiMS version 3 stable 14 (Seulanga) | Mar 31, 2022 | |
PHP | 14 | SLiMS version 3 stable 15 (Matoa) | Mar 31, 2022 | |
None | 2 | The stable version of the bot | Aug 15, 2021 | |
ReScript | 27 | Darklang stable version - currently on darklang.com | May 02, 2023 | |
TypeScript | 3 | The stable version of the bot | May 09, 2022 | |
Dockerfile | 2 | 🐋 Alpine Linux, Python (latest stable version) and Tesseract (latest version from Git). | Jul 06, 2023 | |
Python | 7 | Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman … | Jun 22, 2023 | |
C | 12 | FlappyBird C++ version, powered by Cocos2d-x 3.0 stable version. | Jan 13, 2022 | |
Jupyter Notebook | 53 | Nick's Docker-based version of Stable Diffusion | May 07, 2023 | |
Python | 9 | Cuckoo reporting module for version 1.2 stable | Jan 25, 2020 | |
JavaScript | 6 | Stable release version for the OpenDSA project | Mar 02, 2017 | |
Jupyter Notebook | 457 | 32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, … | Apr 27, 2023 | |
C++ | 9 | Wechat Airplane C++ version, powered by Cocos2d-x 2.0.1 stable version. | Dec 11, 2019 | |
C++ | 54 | Wechat Airplane C++ version, powered by Cocos2d-x 2.2.0 stable version. | May 23, 2020 | |
C++ | 52 | Wechat Airplane C++ version, powered by Cocos2d-x 3.0 stable version. | Apr 23, 2023 | |
Python | 2901 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Aug 29, 2022 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Jul 09, 2021 | |
None | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Oct 18, 2021 |