RL Pendulum Physics Environment

This is a released package of a mathematically simulated pendulum that can be flexibly adapted for applying it as an RL environment. An advantage of this pendulum simulation is the flexibility of the parameter definition, including the number of rods, the rod masses, rod inertias, and more.

It is aimed to enable more flexible RL research on the difficult control problem of the cart-pendulum environment.

Best view 📒this notebook📒 (.html file; 281 MB) and review the overall clean code to get all the information you need. Besides matplotlib animations and LaTeX math summaries, this package also supports rendering Plotly Animations, which are a lot faster to render for RL research, and easier to embed into Panel UIs. Checkout ✨this interactive animation✨ (.html file; 56 MB).

I also provide an unfinished Panel UI prototype of a planned RL experiment platform that could help others build flexible UIs for experiment configurations and applications. It is out of my scope to finish this properly, however, this could provide a lot of value for other researchers. Here is a 🎞video demonstration🎞 (.mkv file; watchable w.o download) of the current Panel UI implementation.

Credit: The very good math derivations of this notebook for the triple cart-pendulum were reused for this project and abstracted to an n-rodded cart-pendulum class.

Example Gallery

Matplotlib + Plotly Animations

Pendulum.Animation.Matplotlib.v5.mp4

Pendulum.Animation.Plotly.v6.mp4

The Current UI

Setup

You generally have to follow the setup_commands.txt to set up and launch the Panel UI.

If you only want to run the RL pendulum in a notebook, it is recommended to only copy the self-contained directory rl_pendulum/envs/cart_pendulum, while leaving the other files. The other files won't be needed, and just give functionality to run the Panel UI.

🚧 Note that the rl_pendulum/envs/rl_cart_pendulum is not fully functional, but provides an extensive feature engineering pipeline. It is planned to reimplement the environment in JAX later and to use Stoix agents to solve it without much feature engineering.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
rl_pendulum		rl_pendulum
README.md		README.md
UI.ipynb		UI.ipynb
setup.py		setup.py
setup_commands.txt		setup_commands.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL Pendulum Physics Environment

Example Gallery

Matplotlib + Plotly Animations

The Current UI

Setup

About

Releases

Packages

Languages

HenningBeyer/RL_Pendulum_Physics_Environment

Folders and files

Latest commit

History

Repository files navigation

RL Pendulum Physics Environment

Example Gallery

Matplotlib + Plotly Animations

The Current UI

Setup

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages