
Flatland Challenge

The Flatland challenge addresses the problem of train scheduling and rescheduling by providing a simple grid-world environment that allows for diverse experimental approaches.

For more information visit the official page.

This work is related to the second edition of the challenge, which started in Summer 2020 and was selected as one of the accepted competitions at NeurIPS 2020. In the first edition, held in 2019, most participants delivered solutions from the operations research field; the second edition instead encourages Reinforcement Learning solutions. In this work we propose and describe the implementation of several Deep Reinforcement Learning solutions, leveraging recent results in this vibrant and exciting field of study. A detailed explanation and analysis of our strategies can be found in the report.

Project structure

.
├── models (contains PyTorch neural network saved parameters of our final solution)
├── modules (contains Git submodules)
│   ├── MARL-Papers (list of MARL papers)
│   └── neurips2020-flatland-starter-kit (contains files useful for submitting solutions to the Challenge)
├── report (contains resources and source files to generate the LaTeX report)
├── single (contains preliminary work on the single-agent setting, performed on previous versions of Flatland)
└── src
    ├── common (contains source code shared by all the approaches)
    ├── curriculum (contains files related to the curriculum approach)
    ├── d3qn (contains files related to the D3QN approach)
    │   ├── hyperparameters
    │   │   └── server.py (code to run Sweeps)
    │   ├── d3qn_flatland.py (main loop)
    │   ├── d3qn_main.py (hyperparameter definition and starting point)
    │   ├── eval_d3qn.py (code to evaluate the policy)
    │   ├── memory.py (experience replay)
    │   ├── model.py (the network architecture; see the sketch after this tree)
    │   └── policy.py (the D3QN policy)
    └── psppo (contains files related to the parameter-sharing PPO approach)
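
The D3QN approach combines Double DQN with a dueling network architecture. As a rough illustration of the kind of model defined in src/d3qn/model.py (this is a minimal sketch, not the actual implementation; the class name and layer sizes are illustrative), a dueling Q-network in PyTorch looks like this:

import torch
import torch.nn as nn

class DuelingQNetwork(nn.Module):
    """Dueling head: a shared trunk feeds separate value and advantage
    streams, which are recombined into per-action Q-values."""

    def __init__(self, state_size: int, action_size: int, hidden_size: int = 128):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(state_size, hidden_size), nn.ReLU())
        self.value_stream = nn.Sequential(
            nn.Linear(hidden_size, hidden_size), nn.ReLU(), nn.Linear(hidden_size, 1))
        self.advantage_stream = nn.Sequential(
            nn.Linear(hidden_size, hidden_size), nn.ReLU(), nn.Linear(hidden_size, action_size))

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        x = self.trunk(state)
        value = self.value_stream(x)          # V(s), shape (batch, 1)
        advantage = self.advantage_stream(x)  # A(s, a), shape (batch, n_actions)
        # Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)
        return value + advantage - advantage.mean(dim=-1, keepdim=True)

The "double" part of D3QN lives in the training update rather than in the network: the online network selects the greedy next action while the target network evaluates it, which mitigates the Q-value overestimation of plain Q-learning.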

Getting started

To run the project locally, install the dependencies listed in environment.yml or create an Anaconda virtual environment by executing:

conda env create -f environment.yml
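
Once created, activate the environment before running any script. The environment name is defined by the name field in environment.yml; flatland-challenge below is only a placeholder:

conda activate flatland-challenge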

Training and evaluation of the different approaches can be performed by running src/d3qn/d3qn_main.py, src/psppo/ps_ppo_main.py, and src/curriculum/curriculum_learning.py. For PS-PPO and D3QN, evaluation is selected by setting evaluation_mode to True.
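
For example, the three approaches can be launched from the repository root with the commands below (hyperparameters are defined inside the respective main files, e.g. d3qn_main.py):

python src/d3qn/d3qn_main.py
python src/psppo/ps_ppo_main.py
python src/curriculum/curriculum_learning.py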

Team members

Alessandro Lombardi - University of Bologna - [email protected] - https://github.com/AlessandroLombardi

Fiorenzo Parascandolo - University of Bologna - [email protected] - https://github.com/FiorenzoParascandolo1
