Skip to content

Implementation of Meta-RL A3C algorithm

License

Notifications You must be signed in to change notification settings

AnySomebody1/Meta-RL

 
 

Repository files navigation

Meta-RL

Tensorflow implementation of Meta-RL A3C algorithm taken from Learning to Reinforcement Learn. For more information, as well as explainations of each of the experiments, see my corresponding Medium post. A3C is built from previous implementation available here.

Contains iPython notebooks for:

  • A3C-Meta-Bandit - Set of bandit tasks described in paper. Including: Independent, Dependent, and Restless bandits.
  • A3C-Meta-Context - Rainbow bandit task using randomized colors to indicate reward-giving arm in each episode.
  • A3C-Meta-Grid - Rainbow Gridworld task; a variation of gridworld in which goal colors are randomzied each episode and must be learned "on the fly."

About

Implementation of Meta-RL A3C algorithm

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 84.9%
  • Python 15.1%