Stars
26
Forks
0
Language
Python
Last Updated
Jan 04, 2024
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 11 | Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization | Dec 12, 2023 | |
Python | 44 | Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021) | Jun 23, 2022 | |
Python | 28 | Learning Action-Value Gradients in Model-based Policy Optimization | Jul 29, 2022 | |
Python | 21 | Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization> | Apr 28, 2023 | |
Python | 6 | The Official Code for Offline Model-based Adaptable Policy Learning | May 07, 2023 | |
Python | 100 | Proximal Policy Optimization implementation with TensorFlow | Mar 15, 2023 | |
Python | 5 | Benchmark for "Offline Policy Comparison with Confidence" | Sep 28, 2022 | |
Python | 2 | Baselines for "Offline Policy Comparison with Confidence" | Jul 04, 2022 | |
Python | 362 | Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" | Aug 11, 2022 | |
Python | 4 | Code release for "Supported Policy Optimization for Offline Reinforcement Learning", https://arxiv.org/abs/2202.06239 | Aug 04, 2022 | |
Jupyter Notebook | 2 | A Sample-Efficient Variance Reduction based Experience Replay Method for Policy Optimization Algorithms | Jan 18, 2023 | |
Python | 13 | Rider Reinforcement Learning Environment with Proximal Policy Optimization | Aug 29, 2022 | |
Jupyter Notebook | 12 | Proximal Policy Optimization with TensorFlow and OpenAI Gym | Apr 26, 2023 | |
Python | 3 | Proximal Policy Optimization in PyTorch | Oct 23, 2022 | |
Python | 2 | Proximal Policy Optimization in PyTorch | Nov 02, 2020 | |
Jupyter Notebook | 2 | Efficient Explaining CSPs with Unsatisfiable Subset Optimization | Aug 09, 2021 | |
Python | 56 | Training efficient drone controllers with Analytic Policy Gradient | May 17, 2023 | |
Python | 2 | model of recall policy with Cody | Mar 05, 2023 | |
Python | 428 | Generative image model with learned similarity measures | May 03, 2023 | |
TeX | 14 | Offline optimization-based video stabilization | Apr 04, 2023 | |
Jupyter Notebook | 353 | Trust Region Policy Optimization with TensorFlow and OpenAI Gym | Feb 09, 2023 | |
Jupyter Notebook | 6 | Trust Region Policy Optimization with TensorFlow and OpenAI Gym | Aug 20, 2020 | |
MATLAB | 8 | Efficient Multiscale Topology Optimization | Jul 30, 2022 | |
Python | 11 | Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>. | Mar 06, 2023 | |
Python | 3 | Implementation of PILCO: A Model-Based and Data-Efficient Approach to Policy Search | Apr 21, 2021 | |
Python | 5 | Model Optimization | Apr 14, 2022 | |
Python | 41 | Unofficial pytorch implementation of CVPR2021 paper "Checkerboard Context Model for Efficient Learned Image Compression". | Apr 19, 2023 | |
Python | 8 | Efficient joint input optimization and inference with DEQ | Nov 28, 2022 | |
Jupyter Notebook | 2 | My reasearch about Proximal Policy Optimization | Nov 24, 2023 | |
Python | 42 | Pytorch implementation of intrinsic curiosity module with proximal policy optimization | Jul 31, 2022 | |
Python | 2 | Code for Policy Optimization as Online Learning with Mediator Feedback | Apr 02, 2023 | |
Python | 6 | Reinforcement learning with Proximal Policy Optimization (https://arxiv.org/abs/1707.06347) | Mar 16, 2023 | |
PowerShell | 71 | Optimize-Offline is a Windows 10 offline image optimization framework. | May 01, 2023 | |
None | 4 | The official code for paper “Residual Policy Learning Facilitates Efficient Model-Free Autonomous Racing” | Mar 15, 2023 | |
Python | 35 | [ICLR 2022] Official implementation of paper: Efficient Learning of Safe Driving Policy via Human-AI Copilot … | Apr 06, 2023 | |
Python | 5 | [ICLR 22] Official implementation of paper: "Efficient Learning of Safe Driving Policy via Human-AI Copilot … | Feb 20, 2023 | |
Jupyter Notebook | 5 | Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits | Feb 08, 2023 | |
Jupyter Notebook | 10 | Multifidelity Kriging, Efficient Global Optimization | Jun 22, 2022 | |
C++ | 12 | Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018) | Jul 12, 2022 | |
Python | 5 | Optimization seq2seq model. | May 11, 2020 | |
Python | 4 | Code accompanying paper "Coordinated Proximal Policy Optimization" | Apr 29, 2022 | |
Python | 356 | PyTorch implementation of Trust Region Policy Optimization | Aug 17, 2022 | |
Python | 22 | Pytorch Implementation of Proximal Policy Optimization Algorithm | Dec 07, 2021 | |
MATLAB | 27 | Differentiable predictive control (DPC) policy optimization examples. | Sep 20, 2022 | |
Python | 120 | Proximal Policy Optimization (PPO) algorithm for Contra | Jul 22, 2022 | |
Jupyter Notebook | 2 | Trust Region Policy Optimization, Qishi Journal Club | Jan 02, 2023 | |
Python | 17 | Proximal Policy Optimization (Continuous Version) in PyTorch. | May 07, 2023 | |
Python | 7 | Policy-based optimization : single-step policy gradient seen as an evolution strategy | Jan 24, 2023 | |
C++ | 67 | An offline tool for pose-graph-optimization. | May 23, 2023 | |
Python | 5 | TensorFlow implementation of the IJCAI 2021 paper MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks | Mar 11, 2023 |