mcts-tic-tac-toe

https://github.com/hayoung-kim/mcts-tic-tac-toe

solve tic tac toe (3x3) problem using vanila mcts. It was made for the purpose of studying the concept of mcts.

youtube demo: https://youtu.be/9BEcOWviCjA

Requirement

pygame

How to play

python play.py

Implement detail

selection: UCT algorithm (https://en.wikipedia.org/wiki/Monte_Carlo_tree_search)

expansion: create all possible node from state

simulation: randomly play simulation(=random roll-out policy). This simulation method lower the performance.

backprop: back propagate simulation result.

References

http://tim.hibal.org/blog/alpha-zero-how-and-why-it-works/

https://medium.com/swlh/tic-tac-toe-at-the-monte-carlo-a5e0394c7bc2

https://github.com/reinforcement-learning-kr/alpha_omok

https://github.com/JuliaPOMDP/MCTS.jl

Acknowledgement

Kyushik Min helped a lot for implementing algorithm.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
env		env
img		img
.gitignore		.gitignore
README.md		README.md
VanilaMCTS.py		VanilaMCTS.py
play.py		play.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mcts-tic-tac-toe

Requirement

How to play

Implement detail

References

Acknowledgement

About

Releases

Packages

Languages

hayoung-kim/mcts-tic-tac-toe

Folders and files

Latest commit

History

Repository files navigation

mcts-tic-tac-toe

Requirement

How to play

Implement detail

References

Acknowledgement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages