This repository contains both the application and workload datasets as well as the crawler and processing scripts needed for replicating our paper "Microservice Applications and Their Workloads on GitHub".
Instructions on how to run the GitHub API Crawler can be found here.
Instructions on how to run the processing pipeline on the raw data can be found here.
The datasets can be found in datasets
.