The code in this repository can be used to fine-tune Transformer models such as WavLM, Wav2Vec 2.0, Wav2Vec 2.0 Conformer, or HuBERT on downstream classification tasks. A classification head is added on top of each model so it can be used for acoustic classification.
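Such a head is typically a pooling step followed by a linear projection over the encoder's hidden states. The sketch below shows one common way to attach it with HuggingFace transformers; it is an illustration only, and the class name, the checkpoint, and num_labels are placeholders rather than this repository's exact implementation:

# Minimal sketch of a classification head on top of a pretrained speech encoder.
# AudioClassifier, the checkpoint name, and num_labels are illustrative placeholders.
import torch.nn as nn
from transformers import AutoModel

class AudioClassifier(nn.Module):
    def __init__(self, model_name="facebook/wav2vec2-base", num_labels=10):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        self.head = nn.Linear(self.encoder.config.hidden_size, num_labels)

    def forward(self, input_values):
        # Encode the raw waveform and mean-pool the frame-level hidden states.
        hidden_states = self.encoder(input_values).last_hidden_state
        return self.head(hidden_states.mean(dim=1))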
The following pretrained models (or any others from the same families: https://huggingface.co/collections/facebook/xlsr-651e8a5bb947065cccb62c6c or https://huggingface.co/collections/facebook/wav2vec-20-651e865258e3dee2586c89f5) can be passed to the trainer; a quick loading check is sketched after the list:
microsoft/wavlm-large
facebook/wav2vec2-conformer-rope-large-960h-ft
facebook/wav2vec2-base
facebook/wav2vec2-large-xlsr-53
facebook/wav2vec2-xls-r-300m
facebook/wav2vec2-xls-r-1b
facebook/wav2vec2-xls-r-2b
facebook/hubert-large-ll60k
facebook/hubert-xlarge-ls960-ft
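Before launching a full run, it can help to confirm that the chosen checkpoint resolves on the Hugging Face Hub. This is a generic transformers snippet rather than part of this repository, and the checkpoint name is just an example:

# Sanity check: load a checkpoint and its feature extractor from the Hub.
from transformers import AutoFeatureExtractor, AutoModel

checkpoint = "facebook/wav2vec2-base"  # any checkpoint from the list above
feature_extractor = AutoFeatureExtractor.from_pretrained(checkpoint)
model = AutoModel.from_pretrained(checkpoint)
print(model.config.hidden_size)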
You can also use your own pretrained model by passing its name or path via --model_name.
Before training, clone the repository and install the requirements:
pip install -r requirements.txt
The following command trains and tests the model:
python main.py --model_name 'either_pretrained_model_from_the_above_list_or_your_own_model' \
    --batch_size 16 --num_epochs 100 \
    --data_dir 'path_to_dataset_directory' \
    --n_gpus 4 --n_nodes 1 --strategy='ddp'
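The --n_gpus, --n_nodes, and --strategy flags mirror the arguments of a PyTorch Lightning Trainer. The snippet below is only a hedged sketch of how such flags are typically wired up; main.py may configure its Trainer differently, and LightningClassifier and data_module are placeholders:

# Assumed mapping from the CLI flags above to a PyTorch Lightning Trainer.
import pytorch_lightning as pl

trainer = pl.Trainer(
    accelerator="gpu",
    devices=4,        # --n_gpus
    num_nodes=1,      # --n_nodes
    strategy="ddp",   # --strategy
    max_epochs=100,   # --num_epochs
)
# trainer.fit(LightningClassifier(...), datamodule=data_module)  # placeholder names
# trainer.test(datamodule=data_module)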
You can enable DeepSpeed optimizations by installing it:
pip install deepspeed
and then passing any of the DeepSpeed strategies to --strategy:
python main.py --model_name 'either_pretrained_model_from_the_above_list_or_your_own_model' \
    --batch_size 16 --num_epochs 100 \
    --data_dir 'path_to_dataset_directory' \
    --n_gpus 4 --n_nodes 1 --strategy='deepspeed_stage_2_offload'
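The strategy string is presumably handed straight to PyTorch Lightning's DeepSpeed integration, where deepspeed_stage_2_offload applies ZeRO stage 2 sharding and offloads optimizer states to CPU memory. A hedged sketch of the equivalent Trainer configuration, again assuming Lightning is used under the hood:

# Assumed equivalent of --strategy='deepspeed_stage_2_offload' in a Lightning Trainer.
import pytorch_lightning as pl

trainer = pl.Trainer(
    accelerator="gpu",
    devices=4,
    num_nodes=1,
    strategy="deepspeed_stage_2_offload",  # ZeRO stage 2 with CPU offload of optimizer states
    max_epochs=100,
)

Other registered strategy names such as deepspeed_stage_2, deepspeed_stage_3, and deepspeed_stage_3_offload can be passed the same way.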