Code for the ICASSP 2024 paper "WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection"
Updated Nov 12, 2024 - Python
Master's thesis project adapting the WhisperSeg model to ring-tailed lemurs, including all code needed to reproduce the experiments.