Audio-Language Datasets of Scenes and Events: A Survey

G Wijngaard, E Formisano, M Esposito… - arXiv preprint arXiv …, 2024 - arxiv.org
Audio-language models (ALMs) process sounds to provide a linguistic description of sound-
producing events and scenes. Recent advances in computing power and dataset creation …

Soundbay: Deep Learning Framework for Marine Mammals and Bioacoustic Research

N Bressler, M Faran, A Galor, MM Michelashvili… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper presents Soundbay, an open-source Python framework that allows bio-acoustics
and machine learning researchers to implement and utilize deep learning-based algorithms …