Feel free to join my Discord Server to discuss this tool!
A simple tool for reproducible dataset gathering for machine learning tasks.
macOS/Linux
curl https://raw.githubusercontent.com/ex3ndr/datasets/main/install.sh | sh

The currently available datasets are listed here: https://korshakov.com/datasets
The entry point is the datasets.yaml file in your project, which looks like this:
datasets:
  - cifar-100 # This downloads from the central repository
  - name: some_private_dataset
    source: https://not-so-real-url.org

Then you can sync the datasets:
datasets sync

That's all!
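
The datasets.yaml example uses two entry shapes: a bare name (fetched from the central repository) and a mapping with name and source. As a rough illustration of how such entries could be told apart, here is a minimal Python sketch. It is not the tool's actual implementation: the helper normalize_entry and the convention that a source of None means "central repository" are assumptions made for this example, and the config is written as the already-parsed Python structure rather than read from the YAML file.

```python
from typing import Optional, Tuple, Union


def normalize_entry(entry: Union[str, dict]) -> Tuple[str, Optional[str]]:
    """Return (name, source) for one item of the `datasets` list.

    Hypothetical helper, not part of the tool: a bare string is treated
    as a dataset from the central repository (source is None); a mapping
    must carry `name` and may carry `source`.
    """
    if isinstance(entry, str):
        return entry, None
    if isinstance(entry, dict) and "name" in entry:
        return entry["name"], entry.get("source")
    raise ValueError(f"unsupported dataset entry: {entry!r}")


# The example config from datasets.yaml, as it would look after YAML parsing.
config = {
    "datasets": [
        "cifar-100",
        {"name": "some_private_dataset", "source": "https://not-so-real-url.org"},
    ]
}

# One (name, source) pair per configured dataset.
entries = [normalize_entry(e) for e in config["datasets"]]
```

Entries of any other shape raise a ValueError, so a malformed config would fail early in this sketch instead of being silently skipped.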
MIT