Time-domain joint training strategies of speech enhancement and intent classification neural models

MN Ali, D Falavigna, A Brutti - Sensors, 2022 - mdpi.com
Robustness against background noise and reverberation is essential for many real-world
speech-based applications. One way to achieve this robustness is to employ a speech …

Direct enhancement of pre-trained speech embeddings for speech processing in noisy conditions

MN Ali, A Brutti, D Falavigna - Computer Speech & Language, 2023 - Elsevier
Lately, the development of deep learning algorithms has marked milestones in the field of
speech processing. In particular, the release of pre-trained feature extraction models has …

[PDF][PDF] Enhancing Embeddings for Speech Classification in Noisy Conditions.

MN Ali, A Brutti, D Falavigna - Interspeech, 2022 - isca-archive.org
Robustness against noise is critical for several speech applications in real-world
environments. In general, to improve the robustness, a speech enhancement front-end is …

Robust speech command recognition in challenging industrial environments

S Bini, V Carletti, A Saggese, M Vento - Computer Communications, 2024 - Elsevier
Speech is among the main forms of communication between humans and robots in
industrial settings, being the most natural way for a human worker to issue commands …

Self-Attention-Based Convolutional GRU for Enhancement of Adversarial Speech Examples

C Jannu, SD Vanambathina - International Journal of Image and …, 2024 - World Scientific
Recent research has identified adversarial examples which are the challenges to DNN-
based ASR systems. In this paper, we propose a new model based on Convolutional GRU …

Intent based multimodal speech and gesture fusion for human-robot communication in assembly situation

S Paul, M Sintek, V Këpuska, M Silaghi… - 2022 21st IEEE …, 2022 - ieeexplore.ieee.org
Understanding the intent is an essential step for maintaining effective communications. This
essential feature is used in communications for assembling, patrolling, and surveillance. A …

Analysis of Natural Language Understanding Systems with L2 Learner Specific Synthetic Grammatical Errors Based on Parts-of-Speech

S Ranjan, SK Nanduri, P Virdi, C Yarra - International Conference on …, 2023 - Springer
Second language learners often make grammatical mistakes, which can impact the
performance of Spoken Language Understanding (SLU) systems. SLU systems consist of …

Neural Enhancement Strategies for Robust Speech Processing

MNAM Nawar - 2023 - iris.unitn.it
In real-world scenarios, speech signals are often contaminated with environmental noises,
and reverberation, which degrades speech quality and intelligibility. Lately, the development …

[PDF][PDF] Understanding non-native Speech using SLU systems

S RANJAN - 2024 - web2py.iiit.ac.in
Spoken language understanding (SLU) systems are a critical component of modern dialog
systems, enabling natural language interactions between humans and machines. However …