End-to-end model for named entity recognition from speech without paired training data

S Mdhaffar, J Duret, T Parcollet, Y Estève - arXiv preprint arXiv …, 2022 - arxiv.org
Recent works showed that end-to-end neural approaches tend to become very popular for
spoken language understanding (SLU). Through the term end-to-end, one considers the use …

Towards end-to-end private automatic speaker recognition

F Teixeira, A Abad, B Raj, I Trancoso - arXiv preprint arXiv:2206.11750, 2022 - arxiv.org
The development of privacy-preserving automatic speaker verification systems has been the
focus of a number of studies with the intent of allowing users to authenticate themselves …

Acoustic-based fluency classification using LSTM-Attention with computationally-cheap data augmentation for an adaptive voicebot

PS Wade, M Andries, I Kanellos, T Moudenc - 2023 - imt-atlantique.hal.science
Most voicebots still ignore, nowadays, user fluency level, although recognizing it would
allow to give answers to adaptation issues, according to the language level of the …

[PDF][PDF] An efficient approach for the automated segmentation and transcription of the People's Speech corpus

A Biswas, A Boumadane, S Peillon, G Bleas - isca-archive.org
Advancements in speech technology have led to the integration of modern ASR systems into
various applications such as chatbots, medical dictation, video transcription etc …

[PDF][PDF] 声質変換技術によるデータ拡張を利用した話者認証モデルの学習

平塚喬, ヒラツカキョウ, 小高知宏… - 福井大学学術研究院 …, 2024 - u-fukui.repo.nii.ac.jp
In this paper, we tried to improve the authentication performance by increasing the training
data of the speaker authentication model using voice conversion technology. We used one …