Stars
17
Forks
5
Language
Python
Last Updated
May 12, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 2 | A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data … | Aug 27, 2022 | |
Python | 17 | PyTorch implementation of the discrete Soft-Actor-Critic algorithm. | May 04, 2023 | |
Python | 20 | PyTorch implementation of discrete version of Soft Actor-Critic. | May 09, 2023 | |
Python | 2 | Advantage-Filtered Behavioral Cloning for Offline Continuous Control | Jun 08, 2022 | |
Python | 2 | Reinforcement Learning using Soft Actor Critic | May 20, 2021 | |
None | 2 | Soft Actor-Critic with advanced features | Dec 12, 2021 | |
Jupyter Notebook | 2 | Soft Actor-Critic implementation in JAX. | Apr 01, 2022 | |
Python | 2 | Soft Actor-Critic algorithm in MXNet | Apr 15, 2022 | |
Python | 35 | Soft Actor-Critic with advanced features | May 15, 2023 | |
Python | 628 | PyTorch implementation of soft actor critic | May 14, 2023 | |
Python | 9 | Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. | Apr 17, 2023 | |
Python | 5 | PyTorch Implementation of Soft Actor-Critic Algorithm | Jul 22, 2022 | |
Jupyter Notebook | 384 | PyTorch implementation of Soft Actor-Critic (SAC) | Apr 22, 2023 | |
Python | 14 | Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. | Oct 10, 2022 | |
Jupyter Notebook | 2824 | PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin … | Oct 17, 2022 | |
Python | 2 | Source code for paper "Combining Soft-Actor Critic with Cross-Entropy Method for Policy Search in Continuous … | Apr 29, 2022 | |
Python | 5 | Soft actor-critic on Pendulum-v0 with PyTorch | Feb 11, 2023 | |
Jupyter Notebook | 20 | Continuous-time Markov model with discrete observations | Dec 27, 2022 | |
Python | 11 | Continuous-time Markov model with discrete observations | Jan 18, 2022 | |
Python | 48 | Policy Gradient Actor-Critic PyTorch | Lunar Lander v2 | Jul 24, 2022 | |
Python | 177 | PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE) | Apr 25, 2023 | |
ASP.NET | 2 | Path Planning Using Deep Reinforcement Learning: Soft Actor–Critic | May 01, 2022 | |
Jupyter Notebook | 2 | A JAX Implementation of the Soft Actor Critic Algorithm | Aug 20, 2023 | |
Python | 5 | Off-Policy Correction for Actor-Critic Algorithms in Deep Reinforcement Learning | Feb 18, 2023 | |
Python | 14 | A simple and easy to use implementation of the soft actor-critic algorithm. | Sep 02, 2022 | |
Jupyter Notebook | 2 | This repo contains the Deep Reinforcement Learning algorithm Soft Actor Critic (SAC) implementation in PyTorch | Jan 23, 2024 | |
R | 24 | Deep Reinforcement Learning in R (Deep Q Learning, Policy Gradient, Actor-Critic Method, etc) | Mar 14, 2023 | |
Python | 948 | Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes … | Aug 29, 2022 | |
Python | 10 | Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO). | Nov 22, 2022 | |
Python | 2 | Simple implementations of vanilla reinforce (policy gradient) and actor critic methods with numpy and different … | Apr 20, 2021 | |
Python | 31 | Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline … | May 12, 2023 | |
Python | 5 | Solving the CarRacingv0 OpenAI Gym environment with an Actor-Critic Network and Proximal Policy Optimization using … | Feb 12, 2023 | |
Python | 9 | Framework for developing Actor-Critic deep RL algorithms (A3C, A2C, PPO, GAE, etc..) in different environments … | Mar 01, 2022 | |
Python | 201 | PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + … | Apr 25, 2023 | |
Python | 3 | A Soft Actor Policy based model free off-policy network to control the steering and throttle … | Sep 17, 2022 | |
Python | 4 | 深度强化学习路径规划, SAC路径规划, Soft Actor-Critic算法, SAC-pytorch | Jul 07, 2023 | |
Python | 18 | Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with … | Aug 15, 2022 | |
Python | 21 | Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, … | Aug 01, 2022 | |
Jupyter Notebook | 2 | Implementation of basic RL steps and algorithms - Dynamic Programming approach, Monte-Carlo approach, DQN on … | Aug 10, 2022 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Dec 03, 2019 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Sep 24, 2018 | |
Python | 4 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Nov 10, 2022 | |
Python | 3 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Jan 21, 2020 | |
Python | 3 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method … | Mar 18, 2022 | |
Python | 58 | PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL … | Apr 22, 2023 | |
Python | 2901 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Aug 29, 2022 | |
Python | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Jul 09, 2021 | |
None | 2 | PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for … | Oct 18, 2021 | |
Python | 3 | SimplySAC replicates Soft-Actor-Critic with minimum (~200) lines of code in clean, readable PyTorch style, while … | May 18, 2021 | |
Jupyter Notebook | 5 | Autonomous atom manipulation - in this project, we use deep reinforcement learning algorithms including soft … | Mar 08, 2023 |