The 6th semester project for course on Reinforcement Learning.
Simulated an aircraft Traffic Control agent which maneuvers around other planes to reach a common destination. Had to modify a PyGame based game to make it a Markov Decision Process (MDP) and formualted the state-space, action-space and reward setup. Next, we trained the agent using the SARSA algorithm.
A detailed report is present in the "RL_Project_report.pdf" file.
Abhinav Sethi
Hritik Bana
A demo run is shown below. The demo shows episodes that end either when all 50 planes reach the destination or any two planes collide. (A lot of parameters, such as number of planes, destinations, obstacles etc can be modified as per convenience):
To replicate run type python game2rl.py -p 50
on the command line.