Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

An overview of noise-robust automatic speech recognition

J Li, L Deng, Y Gong… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …

Audio-visual speech modeling for continuous speech recognition

S Dupont, J Luettin - IEEE Transactions on Multimedia, 2000 - ieeexplore.ieee.org
This paper describes a speech recognition system that uses both acoustic and visual
speech information to improve recognition performance in noisy environments. The system …

Multimodal interfaces

S Oviatt - The human-computer interaction handbook, 2007 - taylorfrancis.com
More recent multimodal systems have moved away from processing simple mouse or
touchpad pointing, and have begun designing systems based on two parallel input streams …

[BOOK][B] Speech synthesis and recognition

W Holmes - 2002 - taylorfrancis.com
With the growing impact of information technology on daily life, speech is becoming
increasingly important for providing a natural means of communication between humans …

Single channel speech enhancement based on masking properties of the human auditory system

N Virag - IEEE Transactions on Speech and Audio Processing, 1999 - ieeexplore.ieee.org
This paper addresses the problem of single channel speech enhancement at very low signal-
to-noise ratios (SNRs) (< 10 dB). The proposed approach is based on the introduction of an …

Robust automatic speech recognition with missing and unreliable acoustic data

M Cooke, P Green, L Josifovski, A Vizinho - Speech communication, 2001 - Elsevier
Human speech perception is robust in the face of a wide variety of distortions, both
experimentally applied and naturally occurring. In these conditions, state-of-the-art …

Confidence measures for speech recognition: A survey

H Jiang - Speech communication, 2005 - Elsevier
In speech recognition, confidence measures (CM) are used to evaluate reliability of
recognition results. A good confidence measure can largely benefit speech recognition …

[HTML] Human-technology integration with industrial conversational agents: A conceptual architecture and a taxonomy for manufacturing

S Colabianchi, A Tedeschi, F Costantino - Journal of Industrial Information …, 2023 - Elsevier
Conversational agents are systems with great potential to enhance human-computer
interaction in industrial settings. Although the number of applications of conversational …

Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions

S Oviatt, P Cohen, L Wu, L Duncan… - Human-computer …, 2000 - Taylor & Francis
The growing interest in multimodal interface design is inspired in large part by the goals of
supporting more transparent, flexible, efficient, and powerfully expressive means of human …