Time-domain joint training strategies of speech enhancement and intent classification neural models
Robustness against background noise and reverberation is essential for many real-world
speech-based applications. One way to achieve this robustness is to employ a speech …
speech-based applications. One way to achieve this robustness is to employ a speech …
Direct enhancement of pre-trained speech embeddings for speech processing in noisy conditions
Lately, the development of deep learning algorithms has marked milestones in the field of
speech processing. In particular, the release of pre-trained feature extraction models has …
speech processing. In particular, the release of pre-trained feature extraction models has …
[PDF][PDF] Enhancing Embeddings for Speech Classification in Noisy Conditions.
Robustness against noise is critical for several speech applications in real-world
environments. In general, to improve the robustness, a speech enhancement front-end is …
environments. In general, to improve the robustness, a speech enhancement front-end is …
Robust speech command recognition in challenging industrial environments
Speech is among the main forms of communication between humans and robots in
industrial settings, being the most natural way for a human worker to issue commands …
industrial settings, being the most natural way for a human worker to issue commands …
Self-Attention-Based Convolutional GRU for Enhancement of Adversarial Speech Examples
C Jannu, SD Vanambathina - International Journal of Image and …, 2024 - World Scientific
Recent research has identified adversarial examples which are the challenges to DNN-
based ASR systems. In this paper, we propose a new model based on Convolutional GRU …
based ASR systems. In this paper, we propose a new model based on Convolutional GRU …
Intent based multimodal speech and gesture fusion for human-robot communication in assembly situation
Understanding the intent is an essential step for maintaining effective communications. This
essential feature is used in communications for assembling, patrolling, and surveillance. A …
essential feature is used in communications for assembling, patrolling, and surveillance. A …
Analysis of Natural Language Understanding Systems with L2 Learner Specific Synthetic Grammatical Errors Based on Parts-of-Speech
S Ranjan, SK Nanduri, P Virdi, C Yarra - International Conference on …, 2023 - Springer
Second language learners often make grammatical mistakes, which can impact the
performance of Spoken Language Understanding (SLU) systems. SLU systems consist of …
performance of Spoken Language Understanding (SLU) systems. SLU systems consist of …
Neural Enhancement Strategies for Robust Speech Processing
MNAM Nawar - 2023 - iris.unitn.it
In real-world scenarios, speech signals are often contaminated with environmental noises,
and reverberation, which degrades speech quality and intelligibility. Lately, the development …
and reverberation, which degrades speech quality and intelligibility. Lately, the development …
[PDF][PDF] Understanding non-native Speech using SLU systems
S RANJAN - 2024 - web2py.iiit.ac.in
Spoken language understanding (SLU) systems are a critical component of modern dialog
systems, enabling natural language interactions between humans and machines. However …
systems, enabling natural language interactions between humans and machines. However …