This repo contains the Python and MATLAB scripts for the paper submitted to Interspeech 2025.
The scripts are numbered in the order they were executed. The Python packages and environments can be found under ./conda_env
.
Due to file size and copyright protection, the .model/
, ./data
, ./STM_output
, ./melspectrogram_norm_output
, ./yamnet_output
and ./vggish_output
folder are gitignored.