Speech recognition in noisy environments: A survey

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org

Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

被引用次数：392 相关文章所有 10 个版本

[PDF] kresttechnology.com

An overview of noise-robust automatic speech recognition

J Li, L Deng, Y Gong… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org

New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …

被引用次数：666 相关文章所有 9 个版本

[PDF] academia.edu

Audio-visual speech modeling for continuous speech recognition

S Dupont, J Luettin - IEEE transactions on multimedia, 2000 - ieeexplore.ieee.org

This paper describes a speech recognition system that uses both acoustic and visual
speech information to improve recognition performance in noisy environments. The system …

被引用次数：799 相关文章所有 11 个版本

[PDF] wisc.edu

Multimodal interfaces

S Oviatt - The human-computer interaction handbook, 2007 - taylorfrancis.com

More recent multimodal systems have moved away from processing simple mouse or
touchpad pointing, and have begun designing systems based on two parallel input streams …

被引用次数：813 相关文章所有 9 个版本

[图书][B] Speech synthesis and recognition

W Holmes - 2002 - taylorfrancis.com

With the growing impact of information technology on daily life, speech is becoming
increasingly important for providing a natural means of communication between humans …

被引用次数：936 相关文章所有 6 个版本

Single channel speech enhancement based on masking properties of the human auditory system

N Virag - IEEE Transactions on speech and audio processing, 1999 - ieeexplore.ieee.org

This paper addresses the problem of single channel speech enhancement at very low signal-
to-noise ratios (SNRs)(< 10 dB). The proposed approach is based on the introduction of an …

被引用次数：954 相关文章所有 7 个版本

[PDF] laslab.org

Robust automatic speech recognition with missing and unreliable acoustic data

M Cooke, P Green, L Josifovski, A Vizinho - Speech communication, 2001 - Elsevier

Human speech perception is robust in the face of a wide variety of distortions, both
experimentally applied and naturally occurring. In these conditions, state-of-the-art …

被引用次数：838 相关文章所有 16 个版本

[PDF] upm.es

Confidence measures for speech recognition: A survey

H Jiang - Speech communication, 2005 - Elsevier

In speech recognition, confidence measures (CM) are used to evaluate reliability of
recognition results. A good confidence measure can largely benefit speech recognition …

被引用次数：529 相关文章所有 9 个版本

[HTML] sciencedirect.com

[HTML][HTML] Human-technology integration with industrial conversational agents: A conceptual architecture and a taxonomy for manufacturing

S Colabianchi, A Tedeschi, F Costantino - Journal of Industrial Information …, 2023 - Elsevier

Conversational agents are systems with great potential to enhance human-computer
interaction in industrial settings. Although the number of applications of conversational …

被引用次数：10 相关文章所有 4 个版本

[HTML] acm.org

Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions

S Oviatt, P Cohen, L Wu, L Duncan… - Human-computer …, 2000 - Taylor & Francis

The growing interest in multimodal interface design is inspired in large part by the goals of
supporting more transparent, flexible, efficient, and powerfully expressive means of human …

被引用次数：578 相关文章所有 13 个版本