Parameter-efficient transfer learning of audio spectrogram transformers

U Cappellazzo, D Falavigna, A Brutti… - 2024 IEEE 34th …, 2024 - ieeexplore.ieee.org
Parameter-efficient transfer learning (PETL) methods have emerged as a solid alternative to
the standard full fine-tuning approach. They only train a few extra parameters for each …

Characterizing continual learning scenarios and strategies for audio analysis

R Bhatt, P Kumari, D Mahapatra, AE Saddik… - arXiv preprint arXiv …, 2024 - arxiv.org
Audio analysis is useful in many application scenarios. The state-of-the-art audio analysis
approaches assume the data distribution at training and deployment time will be the same …

[PDF][PDF] Shared-Adapters: A Novel Transformer-based Parameter Efficient Transfer Learning Approach For Children's Automatic Speech Recognition

T Rolland, A Abad - Procs. of Interspeech, Kos Island, Greece, 2024 - isca-archive.org
Abstract Automatic Speech Recognition (ASR) often faces challenges in processing
children's speech due to data scarcity. Training large ASR models becomes particularly …

[PDF][PDF] Towards improved Automatic Speech Recognition for children

T Rolland, A Abad - Transfer, 2024 - isca-archive.org
Abstract Children's Automatic Speech Recognition (ASR) represents a considerable
challenge, with a considerable performance decline of state-of-the-art systems when …

Audio Contrastive based Fine-tuning

Y Wang, Q Liang, C Xiao, Y Li, NA Moubayed… - arXiv preprint arXiv …, 2023 - arxiv.org
Audio classification plays a crucial role in speech and sound processing tasks with a wide
range of applications. There still remains a challenge of striking the right balance between …

Low Resource Language Adaptation using Two-stage Regularization for Multilingual ASR

CY Kwok, JQ Yip, ES Chng - 2024 International Conference on …, 2024 - ieeexplore.ieee.org
A significant portion of the global population speaks multiple languages, including many low-
resource languages on which current multilingual ASR models perform poorly. To improve …

[PDF][PDF] Low-resource Language Adaptation with Ensemble of PEFT Approaches

CY Kwok, S Li, JQ Yip, ES Chng - researchgate.net
Despite recent advances in automatic speech recognition (ASR) performance on common
languages, a large fraction of the world's languages remain unsupported. Parameter …