Stars
1200
Forks
280
Language
Python
Last Updated
May 27, 2024
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 25 | Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". | May 25, 2022 | |
Python | 112 | PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch | Mar 30, 2023 | |
Python | 862 | Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros | Aug 23, 2022 | |
None | 2 | Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros | Oct 28, 2019 | |
Python | 3 | Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch) | Dec 01, 2021 | |
Python | 2 | asynchronous advantage actor critic | May 22, 2018 | |
Python | 61 | advantage actor-critic reinforcement learning for openai gym cartpole | Apr 08, 2023 | |
Python | 170 | Reinforcement learning baseline agent trained with the Actor-critic (A3C) algorithm. | Jul 27, 2022 | |
Jupyter Notebook | 2 | This repo contains the Deep Reinforcement Learning algorithm Soft Actor Critic (SAC) implementation in PyTorch | Jan 23, 2024 | |
Python | 14 | Implementation of Deep Reinforcement Learning Benchmark Algorithms, including DQN, Double DQN, Dueling DQN, Reinforce, Actor-Critic, … | Mar 28, 2023 | |
ASP.NET | 2 | Path Planning Using Deep Reinforcement Learning: Soft Actor–Critic | May 01, 2022 | |
Python | 2 | Reward Modeling from Human Preferences and Advantage Actor-Critic Reinforcement Learning: A Reproducibility Study | Jul 28, 2021 | |
Python | 2 | Reinforcement Learning using Soft Actor Critic | May 20, 2021 | |
Python | 27 | Model Predictive Actor-Critic Reinforcement Learning | Aug 09, 2022 | |
Python | 9 | Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. | Apr 17, 2023 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Dec 03, 2019 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Sep 24, 2018 | |
Python | 4 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Nov 10, 2022 | |
Python | 3 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Jan 21, 2020 | |
Python | 3 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Mar 18, 2022 | |
Python | 159 | Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO | Apr 23, 2023 | |
R | 24 | Deep Reinforcement Learning in R (Deep Q Learning, Policy Gradient, Actor-Critic Method, etc) | Mar 14, 2023 | |
Python | 5 | Off-Policy Correction for Actor-Critic Algorithms in Deep Reinforcement Learning | Feb 18, 2023 | |
Python | 31 | Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning | Mar 23, 2023 | |
None | 2 | Trading with recurrent actor-critic reinforcement learning | Nov 19, 2023 | |
Jupyter Notebook | 51 | [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation | Aug 23, 2022 | |
Python | 628 | PyTorch implementation of soft actor critic | May 14, 2023 | |
Jupyter Notebook | 70 | This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for … | Aug 12, 2022 | |
Python | 2901 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Aug 29, 2022 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Jul 09, 2021 | |
None | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Oct 18, 2021 | |
Python | 5 | PyTorch Implementation of Soft Actor-Critic Algorithm | Jul 22, 2022 | |
Jupyter Notebook | 384 | PyTorch implementation of Soft Actor-Critic (SAC) | Apr 22, 2023 | |
Python | 23 | Deep Reinforcement Learning with pytorch & visdom (the branch for A3C continuous control) | Sep 17, 2022 | |
Python | 3 | Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for … | May 06, 2019 | |
Python | 1006 | Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for … | Mar 02, 2023 | |
Python | 2 | Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for … | Mar 31, 2017 | |
Python | 5 | Hierarchical Actor-Critic in Pytorch | May 01, 2023 | |
Python | 458 | PyTorch implementation of deep reinforcement learning algorithms | Apr 08, 2023 | |
Python | 67 | Pytorch implementation of distributed deep reinforcement learning | Aug 19, 2022 | |
Jupyter Notebook | 27 | Deep Reinforcement Learning Algorithms Implementation in PyTorch | Feb 02, 2023 | |
Python | 628 | Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning. | Jul 28, 2022 | |
Python | 177 | PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE) | Apr 25, 2023 | |
Python | 17 | PyTorch implementation of the discrete Soft-Actor-Critic algorithm. | May 04, 2023 | |
Python | 20 | PyTorch implementation of discrete version of Soft Actor-Critic. | May 09, 2023 | |
Python | 3 | Deep Reinforcement Learning in TensorFlow (with A3C, ACER, etc.) | Oct 22, 2021 | |
Python | 27 | Online repo for deep reinforcement learning (A3C) on generals.io | Apr 26, 2023 | |
Jupyter Notebook | 2 | 2D self-driving car model using Advantage Actor Critic | Jun 20, 2022 | |
Python | 5 | Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783) | Oct 08, 2019 | |
Python | 395 | Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783) | Oct 04, 2022 |