Play with Mab

This project is built to explain the popular Reinforcement Learning framework of Multi Armed Bandit (MAB) with a easy UI.

You can either play the "MAB envinroment" yourself, by playing with the available arms, or simulate one of the available algorithm which are:

Thompson Sampling
Upper Confidence Bound 1 (tuned)
Epsilon-Greedy
Random Policy

Parameter updates of the algorithm will be shown at each time steps until convergence is reached. Currently there's a fixed time horizon of 100 steps.

Setup

if you want you can setup a python virtual environment: pip -m venv play_with_mab_venv

in case you did you should activate it: . play_with_mab_venv/bin/activate

then

pip install -r requirements.txt

and just run the main

python3 game_core/__main__.py

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
game_core		game_core
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Play with Mab

Setup

About

Releases

Packages

Languages

License

LorenzoCiampiconi/play-with-mab

Folders and files

Latest commit

History

Repository files navigation

Play with Mab

Setup

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages