Stars
2
Forks
2
Language
Python
Last Updated
May 22, 2018
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 3 | Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch) | Dec 01, 2021 | |
Python | 862 | Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros | Aug 23, 2022 | |
None | 2 | Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros | Oct 28, 2019 | |
Python | 1019 | PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". | Aug 26, 2022 | |
Python | 25 | Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". | May 25, 2022 | |
Python | 61 | advantage actor-critic reinforcement learning for openai gym cartpole | Apr 08, 2023 | |
Jupyter Notebook | 2 | 2D self-driving car model using Advantage Actor Critic | Jun 20, 2022 | |
Python | 112 | PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch | Mar 30, 2023 | |
Python | 2 | Vanilla Actor Critic | Mar 15, 2023 | |
Jupyter Notebook | 51 | [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation | Aug 23, 2022 | |
Python | 2 | Reward Modeling from Human Preferences and Advantage Actor-Critic Reinforcement Learning: A Reproducibility Study | Jul 28, 2021 | |
Python | 241 | Actor-critic with experience replay | Apr 20, 2023 | |
Python | 5 | Hierarchical Actor-Critic in Pytorch | May 01, 2023 | |
Scala | 3 | Actor Critic Temporal Difference Learning | Oct 25, 2018 | |
Python | 2 | Reinforcement Learning using Soft Actor Critic | May 20, 2021 | |
None | 2 | Soft Actor-Critic with advanced features | Dec 12, 2021 | |
Jupyter Notebook | 2 | Soft Actor-Critic implementation in JAX. | Apr 01, 2022 | |
Python | 27 | Model Predictive Actor-Critic Reinforcement Learning | Aug 09, 2022 | |
Python | 2 | Soft Actor-Critic algorithm in MXNet | Apr 15, 2022 | |
Jupyter Notebook | 58 | Implementation of Tsallis Actor Critic method | Apr 04, 2023 | |
Jupyter Notebook | 12 | Recommendation system with actor and critic | Jan 23, 2023 | |
Python | 35 | Soft Actor-Critic with advanced features | May 15, 2023 | |
Python | 628 | PyTorch implementation of soft actor critic | May 14, 2023 | |
Python | 5 | PyTorch Implementation of Soft Actor-Critic Algorithm | Jul 22, 2022 | |
Jupyter Notebook | 384 | PyTorch implementation of Soft Actor-Critic (SAC) | Apr 22, 2023 | |
Python | 2 | TensorFlow implementation of the Actor-Critic model | Jun 10, 2021 | |
Python | 20 | ICLR Reproducibility Challenge for Discriminator-Actor-Critic | Dec 19, 2022 | |
None | 2 | Trading with recurrent actor-critic reinforcement learning | Nov 19, 2023 | |
Python | 14 | Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. | Oct 10, 2022 | |
Python | 4 | Believer-Skeptic/Actor-Critic Cortico-BG Neural Network | Sep 26, 2018 | |
None | 2 | Actor-Critic Sequence Generation for Relative Difference Captioning | Dec 26, 2021 | |
Python | 48 | Policy Gradient Actor-Critic PyTorch | Lunar Lander v2 | Jul 24, 2022 | |
Python | 5 | Soft actor-critic on Pendulum-v0 with PyTorch | Feb 11, 2023 | |
Python | 177 | PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE) | Apr 25, 2023 | |
ASP.NET | 2 | Path Planning Using Deep Reinforcement Learning: Soft Actor–Critic | May 01, 2022 | |
Python | 17 | PyTorch implementation of the discrete Soft-Actor-Critic algorithm. | May 04, 2023 | |
Python | 20 | PyTorch implementation of discrete version of Soft Actor-Critic. | May 09, 2023 | |
Jupyter Notebook | 2 | A JAX Implementation of the Soft Actor Critic Algorithm | Aug 20, 2023 | |
Jupyter Notebook | 2 | A simple A2C made from scratch in PyTorch. Accompanying comic at https://hackernoon.com/intuitive-rl-intro-to-advantage-actor-critic-a2c-4ff545978752 | Jul 23, 2021 | |
Jupyter Notebook | 2824 | PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin … | Oct 17, 2022 | |
Python | 7 | The official code base of Shared Experience Actor-Critic (NeurIPS2020) | May 30, 2022 | |
Python | 22 | The official code base of Shared Experience Actor-Critic (NeurIPS2020) | Mar 21, 2023 | |
Python | 3 | The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for … | Apr 17, 2023 | |
Python | 9 | Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. | Apr 17, 2023 | |
Python | 10 | A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data … | Apr 25, 2022 | |
Python | 2 | A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data … | Aug 27, 2022 | |
Python | 170 | Reinforcement learning baseline agent trained with the Actor-critic (A3C) algorithm. | Jul 27, 2022 | |
Python | 205 | PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments | Aug 17, 2022 | |
Python | 29 | implementation of our self-guided and self-regularized actor-critic algorithm | Mar 15, 2023 | |
Python | 5 | Off-Policy Correction for Actor-Critic Algorithms in Deep Reinforcement Learning | Feb 18, 2023 |