Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies

This repository contains the code for the paper "Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies".

Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies

🎉 News

We have successfully implemented all commonly used Action Designs in traffic signal control. These include Choose Next Phase, Next or Not, Set Current Phase Duration, Adjust Single Phase, and Adjust All Phases. The weights for each model have been uploaded to the save_models directory corresponding to each method.
We have migrated the simulation platform used in this project from Aiolos to TransSimHub (TSHub). We would like to express our sincere gratitude to our colleagues at SenseTime, @KanYuheng (阚宇衡), @MaZian (马子安), and @XuChengcheng (徐承成) (in alphabetical order) for their valuable contributions. The development of TransSimHub (TSHub) is a continuation of the work done on Aiolos.

🔑 Key Points

Adaptation to Varying Intervention Frequencies: The intervention frequency significantly impacts the effectiveness of traffic signal control systems, influenced by factors such as resource limitations, safety, traffic flow disruption, and system stability. This research introduces varying intervention frequencies to the TSC system, defining it as the rate at which traffic signals are adjusted in response to changing traffic conditions. This approach is particularly beneficial for scenarios requiring manual verification, where lower frequencies may be preferable.

An example of applying the intervention frequency based on cycle-based control action design in a four-phase traffic signal system

Enhanced Action Utilization: To accommodate varying intervention frequencies, particularly lower frequencies, it is crucial for the control agent to maximize the impact of each action taken. Our research introduces a novel action strategy named adjust all phases, which allows for the simultaneous adjustment of all traffic phases within a single cycle, thereby increasing the effectiveness of each intervention.

An example of adjust all phases in a four phases traffic signal system

Efficient Management of Large Action Spaces: This research employs a Centralized Critic and Decentralized Actors (CCDA) architecture to effectively manage large action spaces. Decentralized actors are responsible for adjusting individual signal phases, which reduces the complexity of the action space. Simultaneously, a centralized critic evaluates the overall traffic scenario, ensuring coordinated actions among the decentralized actors, thus enhancing overall system performance.

The framework of our method with the intervention frequency

📥 Installation

Before using, make sure TSHub is installed.

git clone https://github.com/Traffic-Alpha/TransSimHub.git
cd TransSimHub
pip install -e ".[rl]"

🏃‍♂️ Training

Once TSHub is installed, you can start training models. We provide five different agent designs, each in its own folder:

Choose Next Phase: Chooses a phase from all possible phases at each time step.
Next or Not: Decides whether to switch to the next phase at each time step.
Set Phase duration: Sets the phase duration at the beginning of each phase.
Adjust Single Phase: Modifies a single phase in the entire cycle.
Adjust All Phases: A method proposed in this project that adjusts all phases.

Each folder has the following structure:

- METHOD_NAME
   - train_utils/ # Contains training-related code, such as network structure
   - env_utils/ # Environment-related code, state, action, reward
   - train_METHOD_NAME.py # Training script
   - eval_METHOD_NAME.py # Testing script
   - train_deltas.sh # Script for training at different delta times

The image below illustrates the four commonly adopted agent designs for TSC tasks:

Four Commonly Used Agent Designs

To train a model, run the train_METHOD_NAME.py script. For instance, to train the Adjust_All_Phases_Cycle model with 20 simultaneous simulations and an action interval of 60 seconds, use the following command:

python train_adjust_all_phases.py --delta_time 60 --num_envs 20

You can also use the train_deltas.sh script to train models at different delta times more conveniently:

nohup bash train_deltas.sh > train_deltas.log &

After training, two folders, log and save_models, will be created. The log folder contains the cumulative rewards of each simulation during training, useful for plotting reward curves. The save_models folder stores the model weights saved during training for testing. Use the plot_rewards.py script to plot the reward curve:

python plot_rewards.py --action_type Adjust_All_Phases_Cycle --delta_time 60

The image below illustrates the reward curve for Adjust_All_Phases_Cycle when $\Delta t =60$.

The Reward Curve for Adjust All Phases (delta t=60)

🧪 Evaluation

After training, you can test the models by loading the saved weights from the save_models folder. Use the eval_METHOD_NAME.py script for testing. For instance, to test the Adjust All Phases model with a delta time of 60, use the following command:

python eval_adjust_all_phases.py --delta_time 60

The test script generates an exp_output file, which includes vehicle information (tripinfo.out_1.xml) and traffic light phase information (./add/tls_programs.out_1.xml). Use the analysis_tripinfo.py script to analyze the tripinfo.out_1.xml file:

python analysis_tripinfo.py --action_type Adjust_All_Phases_Cycle --delta_time 60

Upon running the above command, a statistical analysis of various attributes such as travel time, waiting time, and waiting count will be displayed. Here's a sample output:

Statistics for travelTime:
  Mean: 95.78
  Variance: 1970.98
  Max: 363.00
  Min: 42.00
  Percentile_25: 65.00
  Percentile_50: 85.00
  Percentile_75: 108.00

Statistics for waitingTime:
  Mean: 25.09
  Variance: 1009.32
  Max: 242.00
  Min: 0.00
  Percentile_25: 0.00
  Percentile_50: 17.00
  Percentile_75: 35.00

Statistics for waitingCount:
  Mean: 1.30
  Variance: 2.17
  Max: 10.00
  Min: 0.00
  Percentile_25: 0.00
  Percentile_50: 1.00
  Percentile_75: 2.00

Statistics for stopTime:
  Mean: 0.00
  Variance: 0.00
  Max: 0.00
  Min: 0.00
  Percentile_25: 0.00
  Percentile_50: 0.00
  Percentile_75: 0.00

Statistics for timeLoss:
  Mean: 50.47
  Variance: 2048.42
  Max: 318.27
  Min: 6.42
  Percentile_25: 19.31
  Percentile_50: 38.38
  Percentile_75: 62.55

Statistics for CO_abs:
  Mean: 11344.70
  Variance: 43907549.64
  Max: 53831.92
  Min: 2969.10
  Percentile_25: 6936.84
  Percentile_50: 9646.80
  Percentile_75: 13351.00

...

You can also use the plot_tls_program.py script to visualize the phase changes of the traffic signal light. Note that this is not applicable for the Choose Next Phase model as it doesn't have a concept of a signal light cycle. For instance, to plot the phase change of Adjust All Phase with an interval of 60 seconds, use:

python plot_tls_program.py --action_type Adjust_All_Phases_Cycle --delta_time 60

The figures below illustrate how the duration of various traffic light phases changes under different control intervals. It shows that as the control interval, denoted as $\Delta t$, increases, the changes in phase durations stabilize, while the general pattern remains similar:

$\Delta t = 60$	$\Delta t = 120$	$\Delta t = 300$

📚 Citation

If you find this work useful, please cite our papers:

@article{wang2024traffic,
  title={Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies},
  author={Wang, Maonan and Chen, Yirong and Kan, Yuheng and Xu, Chengcheng and Lepech, Michael and Pun, Man-On and Xiong, Xi},
  journal={arXiv preprint arXiv:2406.08248},
  year={2024}
}

📫 Contact

If you have any questions, please open an issue in this repository. We will respond as soon as possible.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
Adjust_All_Phases_Cycle		Adjust_All_Phases_Cycle
Adjust_Single_Phase_Cycle		Adjust_Single_Phase_Cycle
Choose_Next_Phase		Choose_Next_Phase
Next_or_Not		Next_or_Not
Set_Current_Phase_Duration		Set_Current_Phase_Duration
_assets		_assets
sumo_envs		sumo_envs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analysis_tripinfo.py		analysis_tripinfo.py
plot_rewards.py		plot_rewards.py
plot_tls_program.py		plot_tls_program.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies

🎉 News

🔑 Key Points

📥 Installation

🏃‍♂️ Training

🧪 Evaluation

📚 Citation

📫 Contact

About

Languages

License

Traffic-Alpha/CCDA-Light

Folders and files

Latest commit

History

Repository files navigation

Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies

🎉 News

🔑 Key Points

📥 Installation

🏃‍♂️ Training

🧪 Evaluation

📚 Citation

📫 Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Languages