Deep reinforcement learning implemented in PyTorch.
This project tries to mimic the structure of the rllab code and occasionally borrows code from rllab and inverse_rl.
- Implemented Algorithms
  - Online
    - DDPG (Deep Deterministic Policy Gradient)
    - NAF (Normalized Advantage Function)
    - Soft Q-Learning (Deep Energy-Based Policy)
  - Batch
    - TRPO (Trust Region Policy Optimization)
  - Inverse Reinforcement Learning & Imitation Learning
    - Generative Adversarial Imitation Learning
    - Adversarial Inverse Reinforcement Learning