Skip to content

Latest commit

 

History

History
15 lines (10 loc) · 379 Bytes

README.md

File metadata and controls

15 lines (10 loc) · 379 Bytes

PolicyGradient

Policy Gradient methods implemented using Torch

This project implements some Reinforcement Learning algorithms for continuous control tasks with torch

  • REINFORCE
  • GPOMDP
  • Stochastic Policy Gradient Theorem (SPG)
  • Deterministic Policy Gradient Theorem (DPG)
  • DGP with Deep Learning (DDPG)
  • Deep Continuous Q-Learning

Comments and advice are welcomed!