Whisper Activity Detection Using CNN-LSTM Based Attention Pooling Network Trained for a Speaker Identification Task.

AR Naini, M Satyapriya, PK Ghosh - INTERSPEECH, 2020 - isca-archive.org
In this work, we proposed a method to detect the whispered speech region in a noisy audio
file called whisper activity detection (WAD). Due to the lack of pitch and noisy nature of …

Whisper40: A Multi-Person Chinese Whisper Speaker Recognition Dataset Containing Same-Text Neutral Speech

J Yang, R Zhou - Information, 2024 - mdpi.com
Whisper speaker recognition (WSR) has received extensive attention from researchers in
recent years, and it plays an important role in medical, judicial, and other fields. Among …

Dual Attention Pooling Network for Recording Device Classification Using Neutral and Whispered Speech

AR Naini, B Singhal, PK Ghosh - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
In this work, we proposed a method for recording device classification using the recorded
speech signal. With the rapid increase in different mobile and professional recording …

Shouted and whispered speech compensation for speaker verification systems

S Prieto, A Ortega, I López-Espejo, E Lleida - Digital Signal Processing, 2022 - Elsevier
Nowadays, speaker verification systems begin to perform very well under normal speech
conditions due to the plethora of neutrally-phonated speech data available, which are used …

wSPIRE: A parallel multi-device corpus in neutral and whispered speech

B Singhal, AR Naini, PK Ghosh - 2021 24th Conference of the …, 2021 - ieeexplore.ieee.org
Most of the speech technologies for whispered speech are lagging behind due to the
scarcity of data. Hence, in this paper, we present and open source a multi-device parallel …

Whisper to Neutral Mapping Using I-Vector Space Likelihood and a Cosine Similarity Based Iterative Optimization for Whispered Speaker Verification

AR Naini, A Rao, PK Ghosh - 2022 National Conference on …, 2022 - ieeexplore.ieee.org
In this work, we propose an iterative optimization algorithm to learn a feature mapping (FM)
from the whispered to neutral speech features. Such an FM can be used to improve the …