cleanml-resultscontains the model scores from every experimental condition.datacontains the raw datasets.demodqis the code module written and used to run analysis.
Jupyter Notebooks:
- (RQ1)
detect_errors-combinedinvestigates error types for each dataset broken down by sensitive attributes. - (RQ1)
deep-dive-data-errors-mislabelsis an exploration of potentially mislabeled samples in the raw datasets. - (RQ2)
compute-result-tablecomputes fairness metrics and statistical significance on the results incleanml-resultsand converts the raw result data structure into the result table incleanml.csv. - (RQ2)
cleanml-analysisgenerates the table in our paper that describes total case counts with negative, insignificant, and positive impact on fairness and on accuracy. It also examines how many experimental conditions had non-negative impact on fairness for each dataset and error type. - (RQ2)
cleanml-analysis-per-modelgroups all experiments by model type and error type and tallies the impact on fairness and on accuracy. - (RQ2)
cleanml-analysis-cleaning-typecounts cases with positive impact on fairness for each data cleaning method. - (RQ2)
cleanml-accuraciesidentifies the best model types for each data error type with respect to model accuracy.
# Set up virtual env
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
# Start Jupyter server
jupyter notebook