Distilling a Pretrained Language Model to a Multilingual ASR Model

Official implementation of the paper: https://arxiv.org/abs/2206.12638
Accepted to Interspeech 2022.

Oral presentation @ Interspeech

How to run experiments (Table 1)

Environments

I used Python 3.8.12.
Check requirements.txt for additional requirements.

Supported datasets

Check configs for supported datasets.
For example, if you want CommonVoice Czech, set $dataset as common_voice_czech.

From scratch

# If you change the # of GPUs, you have to fix per_device_train_batch_size in training config.
CUDA_VISIBLE_DEVICES=0,1 python3 train.py \
    +distill=random_init \
    +dataset=$dataset \
    +train=v1 \
    +xlsr=w2v2_xlsr

Fine-tuning

CUDA_VISIBLE_DEVICES=0,1 python3 train.py \
    +distill=vanilla \
    +dataset=$dataset \
    +train=v1 \
    +xlsr=w2v2_xlsr

Fine-tuning + Distill-L2S

# You have to set $lambda as the trade-off hyperparameter, i.e., 0.25, 0.5 or 1.0.
CUDA_VISIBLE_DEVICES=0,1 python3 train.py \
    +distill=shrink \
    +dataset=$dataset \
    +train=v1 \
    +xlsr=w2v2_xlsr \
    distill.feat_loss=$lambda

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
config		config
resources		resources
LICENSE		LICENSE
README.md		README.md
create_mls_csv.py		create_mls_csv.py
data_utils.py		data_utils.py
model_utils.py		model_utils.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Distilling a Pretrained Language Model to a Multilingual ASR Model

Oral presentation @ Interspeech

How to run experiments (Table 1)

About

Uh oh!

Releases

Packages

Languages

License

juice500ml/xlm_to_xlsr

Folders and files

Latest commit

History

Repository files navigation

Distilling a Pretrained Language Model to a Multilingual ASR Model

Oral presentation @ Interspeech

How to run experiments (Table 1)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages