
Improving Reinforcement Learning-based Autonomous Agents with Causal Models

Paper: Improving Reinforcement Learning-based Autonomous Agents with Causal Models

Abstract: Autonomous Agents trained with Reinforcement Learning (RL) must explore the effects of their available actions in different environment states to successfully learn optimal control policies or build a model of the environment. Exploration efforts may be impractical in complex environments, hence ways to prune the exploration space must be found. In this paper, we propose to augment an autonomous agent with a causal model of the core dynamics of its environment, learnt on a simplified version and then used as a driving assistant for larger or more complex environments. Our experiments with different RL algorithms, in a set of increasingly complex environments, and with different baseline exploration strategies, show that learning such a model suffices to improve the agent's behaviour.

Maintainer: Giovanni Briglia
Affiliation: Distributed and Pervasive Intelligence Group at University of Modena and Reggio Emilia
Contact: [email protected] and [email protected]

(Figure: overview flowchart of the approach; see flowchart_resume.png.)

Project Structure

additional_assessments
   |__init__.py
   |__1_evaluation_for_test1_and_test2.py
   |__1_test1_and_test2
   |__2_evaluation_sensitive_analysis_batch_episodes.py
   |__2_launcher_sensitive_analysis_batch_episodes_for_online_cd.py
   |__3_evaluation_test3.py
   |__3_test3.py
   |__4.1_launcher_offline_CD_analysis.py
   |__4.2_launcher_offline_CD_analysis_multienv.py
Plots_and_Tables
   |__Comparison123
      |__results.pkl
      |__results.xlsx
   |__Comparison4
      |__results.pkl
      |__results.xlsx
scripts
   |__algorithms
      |__init__.py
      |__causal_discovery.py
      |__dqn_agent.py
      |__q_learning_agent.py
      |__random_agent.py
   |__launchers
      |__init__.py
      |__launcher_comparison4_paper.py
      |__launcher_comparison123_paper.py
   |__utils
      |__images_for_render
         |__bowser.png
         |__goal.png
         |__supermario.png
         |__wall.png
      |__init__.py
      |__batch_episodes_for_online_cd_values.pkl
      |__dqn_class_and_memory.py
      |__environment.py
      |__exploration_strategies.py
      |__ground_truth_causal_graph.json
      |__ground_truth_causal_graph.png
      |__ground_truth_causal_table.pkl
      |__merge_causal_graphs.py
      |__others.py
      |__plot_and_tables.py
      |__seed_values.npy
      |__test_causal_table
      |__train_models.py
   __init__.py
   example.py
__init__.py
experiments_parameters.png
flowchart_resume.png
global_variables.py
LICENSE
README
requirements.txt
setup.py
values_reasons_parameters.png

Installation

  1. Create a new Python 3.10 virtual environment and activate it
    python3.10 -m venv venv
    source venv/bin/activate

  2. Install the requirements
    pip install -r requirements.txt

  3. Install the package
    python setup.py install

  4. Run the test example
    python3.10 -m scripts.example

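To check that the installation succeeded, the package modules should be importable. A minimal sanity check, with module names taken from the project tree above (adjust them if your layout differs):

    # Sanity check: import the top-level modules listed in the project tree.
    # Module names are assumptions based on the tree above, not a fixed API.
    import importlib

    for module_name in ("global_variables", "scripts"):
        importlib.import_module(module_name)
        print(f"imported ok: {module_name}")
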

How to Reproduce Paper Results

For the comparison Vanilla vs Causal Offline vs Causal Online in grid-like environments:

python setup.py install
python3.10 -m scripts.launchers.launcher_comparison123_paper

For the comparison with and without transfer learning in maze-like environments:

python setup.py install
python3.10 -m scripts.launchers.launcher_comparison4_paper

Parameters

(Tables: experiments_parameters.png lists the experiment parameters; values_reasons_parameters.png gives the rationale behind their values.)

Results

Vanilla vs Causal Offline vs Causal Online

(Figures res_comp123_1 and res_comp123_2: results of the Vanilla vs Causal Offline vs Causal Online comparison.)

Transfer Learning

(Figure res_transfer_learning: transfer learning results.)

Develop your Own Extension

Your extension can take several directions:

  1. One direction involves modifying the causal discovery algorithms.
  2. Another direction involves adding new kinds of agents (Q-Learning and DQN are currently implemented). To stay compatible with the training class, your agent must implement the "update_Q_or_memory", "update_exp_fact", "select_action" and "return_q_table" functions, and you must register a custom label for it in the "global_variables.py" script; a sketch is given after this list.
  3. The third direction involves testing new environments.
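As a starting point for the second direction, below is a minimal sketch of a tabular agent exposing the four functions the training class expects. Only the function names come from this README; the constructor signature, the epsilon-greedy selection and the decay schedule are illustrative assumptions, not the project's actual API.

    # Minimal sketch of a custom agent. Only the four method names are taken
    # from the README; everything else (constructor arguments, epsilon-greedy
    # logic, decay schedule) is an illustrative assumption.
    import random
    from collections import defaultdict


    class MyCustomAgent:
        def __init__(self, n_actions, alpha=0.1, gamma=0.99, epsilon=1.0):
            self.n_actions = n_actions
            self.alpha = alpha      # learning rate
            self.gamma = gamma      # discount factor
            self.epsilon = epsilon  # exploration factor
            # Tabular Q-values: hashable state -> one value per action.
            self.q_table = defaultdict(lambda: [0.0] * n_actions)

        def update_Q_or_memory(self, state, action, reward, next_state):
            # One-step Q-learning update; a DQN-style agent would instead
            # store the transition in replay memory and train its network.
            best_next = max(self.q_table[next_state])
            td_error = reward + self.gamma * best_next - self.q_table[state][action]
            self.q_table[state][action] += self.alpha * td_error

        def update_exp_fact(self, episode):
            # Decay the exploration factor between episodes (illustrative schedule).
            self.epsilon = max(0.05, self.epsilon * 0.99)

        def select_action(self, state):
            # Epsilon-greedy action selection over the current Q-values.
            if random.random() < self.epsilon:
                return random.randrange(self.n_actions)
            values = self.q_table[state]
            return values.index(max(values))

        def return_q_table(self):
            # Return a plain dict so it can be pickled and inspected.
            return dict(self.q_table)

Remember to register a label for the new agent in "global_variables.py" so the launchers can select it.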

Citation

@inproceedings{}
