BioLingual

Contrastive language-audio pretraining for bioacoustics

Paper: Transferable Models for Bioacoustics with Human Language Supervision (arXiv)

Use the model on Hugging Face: davidrrobinson/BioLingual

BioLingual is a language-audio model for bioacoustics, useful for zero-shot audio classification and sound detection, text-to-audio search, or for fine-tuning on new bioacoustic tasks.
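As a rough illustration of how zero-shot classification generally works with a contrastive language-audio model: the audio clip and each candidate text label are embedded in a shared space, and the label whose embedding is most similar to the audio embedding wins. The sketch below uses hand-written toy vectors in place of real encoder outputs. The function names, the 3-dimensional embeddings, and the `temperature` value are all hypothetical and are not taken from the BioLingual code.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_classify(audio_emb, text_embs, labels, temperature=0.07):
    """Rank candidate labels by similarity to one audio embedding.

    `temperature` is an assumed illustrative value; a trained model
    would use its own learned scaling.
    """
    logits = [cosine(audio_emb, t) / temperature for t in text_embs]
    probs = softmax(logits)
    return sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)


# Toy embeddings standing in for the outputs of the text and audio encoders.
labels = ["a bird singing", "a frog calling", "a whale vocalizing"]
text_embs = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.1], [0.0, 0.2, 0.9]]
audio_emb = [0.8, 0.2, 0.1]

ranking = zero_shot_classify(audio_emb, text_embs, labels)
for label, prob in ranking:
    print(f"{label}: {prob:.3f}")
```

Text-to-audio search is the same comparison in the other direction: embed one text query and rank a collection of audio embeddings by their similarity to it.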

AnimalSpeak is a large-scale language-audio dataset used to train BioLingual, created by captioning bioacoustic archives including Xeno-canto and iNaturalist.

To recreate the BEANS benchmarking results from the paper:

pip install -r requirements.txt
cd beans

Follow the instructions in beans/README.MD to download the datasets, then run:

python run_benchmark.py

The AnimalSpeak dataset is released at https://huggingface.co/datasets/davidrrobinson/AnimalSpeak

Acknowledgements

We are grateful to the authors of CLAP and BEANS, on which much of this repository is based, for making their code open source.
