Deep learning for environmentally robust speech recognition: An overview of recent developments
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …
An overview of noise-robust automatic speech recognition
New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …
Audio-visual speech modeling for continuous speech recognition
This paper describes a speech recognition system that uses both acoustic and visual
speech information to improve recognition performance in noisy environments. The system …
Multimodal interfaces
S Oviatt - The human-computer interaction handbook, 2007 - taylorfrancis.com
More recent multimodal systems have moved away from processing simple mouse or
touchpad pointing, and have begun designing systems based on two parallel input streams …
[BOOK][B] Speech synthesis and recognition
W Holmes - 2002 - taylorfrancis.com
With the growing impact of information technology on daily life, speech is becoming
increasingly important for providing a natural means of communication between humans …
Single channel speech enhancement based on masking properties of the human auditory system
N Virag - IEEE Transactions on speech and audio processing, 1999 - ieeexplore.ieee.org
This paper addresses the problem of single channel speech enhancement at very low signal-
to-noise ratios (SNRs)(< 10 dB). The proposed approach is based on the introduction of an …
Robust automatic speech recognition with missing and unreliable acoustic data
Human speech perception is robust in the face of a wide variety of distortions, both
experimentally applied and naturally occurring. In these conditions, state-of-the-art …
Confidence measures for speech recognition: A survey
H Jiang - Speech communication, 2005 - Elsevier
In speech recognition, confidence measures (CM) are used to evaluate reliability of
recognition results. A good confidence measure can largely benefit speech recognition …
[HTML] Human-technology integration with industrial conversational agents: A conceptual architecture and a taxonomy for manufacturing
S Colabianchi, A Tedeschi, F Costantino - Journal of Industrial Information …, 2023 - Elsevier
Conversational agents are systems with great potential to enhance human-computer
interaction in industrial settings. Although the number of applications of conversational …
Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions
The growing interest in multimodal interface design is inspired in large part by the goals of
supporting more transparent, flexible, efficient, and powerfully expressive means of human …