deep-reinforcement-learning-algorithm

深層強化学習アルゴリズムの実装

実装済み(中)アルゴリズム

DQN: Human-level control through deep reinforcement learning
DoubleDQN: Deep Reinforcement Learning with Double Q-learning
PrioritizedExperienceReplay: Prioritized Experience Replay
DuelingNetwork: Dueling Network Architectures for Deep Reinforcement Learning
CategoricalDQN(C51): A Distributional Perspective on Reinforcement Learning
NoisyNetwork: Noisy Networks for Exploration
SimplePolicyGradient: Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
REINFORCE: Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
Actor-Critic: Witten(1977); Barto, Sutton, Anderson(1983); Sutton(1984)
RandomNetworkDistillation(RND): Exploration by Random Network Distillation
GORILA: Massively Parallel Methods for Deep Reinforcement Learning

使用方法

pip install -r requirements.txt
python main.py

アルゴリズム選択

simulator.pyのself.policyに使用したいアルゴリズムのクラスを定義．
ただし，PolicyGradientとREINFORCEはPGSimulationクラスを使用し，ActorCriticはACSimulationクラスを使用してください．(main.pyも適宜変更してください)

def __init__(self, sim, epi, env):
    ...
    self.policy = DQN()

注意事項

ハイパーパラメータやGym環境，指標，実験設定の保存等，そこら辺はかなりいい加減に書いているので注意が必要．
実装の練習にもなると思うので余力がある人は自分でカスタマイズしてみてください．

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
policy		policy
.gitignore		.gitignore
README.md		README.md
collector.py		collector.py
main.py		main.py
requirements.txt		requirements.txt
simulator.py		simulator.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

deep-reinforcement-learning-algorithm

実装済み(中)アルゴリズム

使用方法

アルゴリズム選択

注意事項

About

Releases

Packages

Languages

astrfo/deep-reinforcement-learning-algorithm

Folders and files

Latest commit

History

Repository files navigation

deep-reinforcement-learning-algorithm

実装済み(中)アルゴリズム

使用方法

アルゴリズム選択

注意事項

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages