Parameter-efficient transfer learning of audio spectrogram transformers
Parameter-efficient transfer learning (PETL) methods have emerged as a solid alternative to
the standard full fine-tuning approach. They only train a few extra parameters for each …
Characterizing continual learning scenarios and strategies for audio analysis
Audio analysis is useful in many application scenarios. The state-of-the-art audio analysis
approaches assume the data distribution at training and deployment time will be the same …
Shared-Adapters: A Novel Transformer-based Parameter Efficient Transfer Learning Approach For Children's Automatic Speech Recognition
Automatic Speech Recognition (ASR) often faces challenges in processing
children's speech due to data scarcity. Training large ASR models becomes particularly …
Towards improved Automatic Speech Recognition for children
Children's Automatic Speech Recognition (ASR) represents a considerable
challenge, with a considerable performance decline of state-of-the-art systems when …
Audio Contrastive based Fine-tuning
Audio classification plays a crucial role in speech and sound processing tasks with a wide
range of applications. There still remains a challenge of striking the right balance between …
Low Resource Language Adaptation using Two-stage Regularization for Multilingual ASR
A significant portion of the global population speaks multiple languages, including many low-
resource languages on which current multilingual ASR models perform poorly. To improve …
Low-resource Language Adaptation with Ensemble of PEFT Approaches
Despite recent advances in automatic speech recognition (ASR) performance on common
languages, a large fraction of the world's languages remain unsupported. Parameter …