Stars
6
Forks
2
Language
Julia
Last Updated
Mar 20, 2023
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
Python | 222 | Code for the paper "Evolved Policy Gradients" | Aug 03, 2022 | |
Python | 4 | Official repository for: Continuous Control With Ensemble DeepDeterministic Policy Gradients | Dec 06, 2021 | |
Jupyter Notebook | 22 | Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control | Nov 02, 2022 | |
Jupyter Notebook | 10 | Pytorch Implementation of Twin Delayed Deep Deterministic Policy Gradients for Continuous Control | Nov 09, 2022 | |
Python | 105 | Code for paper "EasyDGL: Encode, Train and Interpret for Continuous-time Dynamic Graph Learning" | Apr 22, 2023 | |
Python | 170 | Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" … | May 18, 2023 | |
Python | 10 | Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential … | Mar 19, 2023 | |
Python | 11 | Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning" | Jun 21, 2022 | |
Jupyter Notebook | 2 | Pytorch implementation for Policy Gradients | Jun 14, 2020 | |
Python | 193 | Code for the paper "Phasic Policy Gradient" | Jul 28, 2022 | |
Python | 28 | Learning Action-Value Gradients in Model-based Policy Optimization | Jul 29, 2022 | |
Python | 25 | Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022) | Apr 15, 2023 | |
Python | 20 | Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" | Sep 14, 2022 | |
Jupyter Notebook | 2 | Code repository for experiments in the paper "Invariant Policy Learning: A Causal Perspective" | Feb 04, 2022 | |
C++ | 2 | Code for the paper "Learning Humanoid Robot Running Skills through Proximal Policy Optimization" | Sep 18, 2022 | |
Python | 4 | Repository for MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy paper, introducing … | Dec 24, 2023 | |
Python | 44 | Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) | Aug 09, 2022 | |
Python | 95 | codes for the paper "POMO: Policy Optimization with Multiple Optima for Reinforcement Learning" | May 24, 2023 | |
None | 4 | The official code for paper “Residual Policy Learning Facilitates Efficient Model-Free Autonomous Racing” | Mar 15, 2023 | |
Jupyter Notebook | 4 | Code for the paper "Continuous Attractors for Dynamic Memories" | Apr 07, 2023 | |
Python | 255 | Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" | Aug 08, 2022 | |
Python | 13 | Source code for the ICLR'22 paper on "Half-Inverse Gradients" | Jun 13, 2022 | |
Python | 12 | The code of paper *Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy … | Apr 04, 2023 | |
Python | 2 | Code associated to the paper "Safe and Smooth: Certified Continuous-Time Range-Only Localization" | Feb 13, 2023 | |
Jupyter Notebook | 31 | Code for the paper "Batch size invariance for policy optimization" | Jul 11, 2022 | |
Python | 29 | The source code for NeurIPS 2020 paper "Graph Policy Network for Transferable Active Learning on … | Jun 28, 2022 | |
Python | 2 | Source code for paper "Combining Soft-Actor Critic with Cross-Entropy Method for Policy Search in Continuous … | Apr 29, 2022 | |
Jupyter Notebook | 4 | This repository contains the code for the paper "Local policy search with Bayesian optimization". | May 18, 2022 | |
Python | 10 | Code for the paper "Real Time Speech Emotion Recognition using Machine Learning" | May 24, 2023 | |
Python | 18 | Code for R:SS 2021 paper RMP2: A Structured Composable Policy Class for Robot Learning. | Jun 09, 2022 | |
Jupyter Notebook | 3 | Data and code for Vogel et al., Transcriptomic Gradients paper | Mar 18, 2023 | |
Python | 57 | Code of Empirical Bayes Transductive Meta-Learning with Synthetic Gradients | Apr 06, 2023 | |
Python | 2 | Code for the paper: "Attention-based Partial Decoupling of Policy and Value for Generalization in Reinforcement … | Nov 01, 2022 | |
Python | 69 | The paper "Learning Representations for Time Series Clustering" | May 23, 2023 | |
Python | 2 | Code for the paper "Approximating Continuous Convolutions for Deep Network Compression" | Apr 18, 2023 | |
Python | 5 | [ICCV2021] Code for the paper "Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable … | May 21, 2022 | |
Python | 3 | Source code for paper: Importance Adaptive Policy Distillation | Sep 22, 2022 | |
Python | 2 | [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations" | Mar 18, 2023 | |
Jupyter Notebook | 2 | Implementation of Deep Deterministic Policy Gradients using TensorFlow, compatible with the OpenAI Gym | Sep 18, 2019 | |
Jupyter Notebook | 5 | Source code for "Optimizing for Generalization in Machine Learning with Cross-Validation Gradients" | Jul 31, 2018 | |
Python | 10 | The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and … | Nov 05, 2021 | |
Python | 8 | "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE … | May 10, 2023 | |
HTML | 1219 | Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients) | Aug 20, 2022 | |
TypeScript | 4 | Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients) | Jan 04, 2023 | |
Python | 2 | Code for Policy Optimization as Online Learning with Mediator Feedback | Apr 02, 2023 | |
Jupyter Notebook | 5 | Code for the paper: Discovering Weight Initializers with Meta-Learning | Mar 21, 2022 | |
Python | 2 | Source code for the paper "Fourier learning with cyclical data", | Aug 11, 2022 | |
Python | 24 | Source code for the paper "Policy Architectures for Compositional Generalization in Control" | May 28, 2022 | |
Matlab | 7 | Code for the ECCV 2018 paper "3D Scene Flow from 4D Light Field Gradients" | Jan 20, 2023 | |
Python | 4 | Code accompanying paper "Coordinated Proximal Policy Optimization" | Apr 29, 2022 |