Speech recognition by machine, a review

MA Anusuya, SK Katti - arXiv preprint arXiv:1001.2267, 2010 - arxiv.org
This paper presents a brief survey on Automatic Speech Recognition and discusses the
major themes and advances made in the past 60 years of research, so as to provide a …

TLEFuzzyNet: fuzzy rank-based ensemble of transfer learning models for emotion recognition from human speeches

KK Sahoo, I Dutta, MF Ijaz, M Woźniak… - IEEE Access, 2021 - ieeexplore.ieee.org
Human speech is not only a verbose medium of communication but it also conveys
emotions. The past decade has seen a lot of research going on with speech data which …

[PDF][PDF] Speech recognition technology: a survey on Indian languages

G Hemakumar, P Punitha - International Journal of Information …, 2013 - academia.edu
This paper presents a brief survey of Automatic Speech Recognition (ASR) and discusses
the major themes and advances made in the past 70 years of research, so as to provide a …

Sparse auditory reproducing kernel (SPARK) features for noise-robust speech recognition

A Fazel, S Chakrabartty - IEEE transactions on audio, speech …, 2011 - ieeexplore.ieee.org
In this paper, we present a novel speech feature extraction algorithm based on a
hierarchical combination of auditory similarity and pooling functions. The computationally …

A Fast and Low-Distortion Capacity Adaptive Synchronized Acoustic-to-Acoustic Steganography Scheme

X Huang, Y Abe, I Echizen - Recent Advances in Information Hiding and …, 2013 - Springer
Data transmissions in public communications systems are not secure because of the chance
of their being intercepted, and tampered with by eavesdroppers. The security of acoustic …

Controlling tradeoff between approximation accuracy and complexity of a smooth function in a reproducing kernel Hilbert space for noise reduction

X Lu, M Unoki, S Matsuda, C Hori… - IEEE transactions on …, 2012 - ieeexplore.ieee.org
Noise reduction algorithms are widely used to mitigate noise effects on speech to improve
the robustness of speech technology applications. However, they inevitably cause speech …

Audio-visual speech processing for human computer interaction

SW Chin, KP Seng, LM Ang - Advances in robotics and virtual reality, 2012 - Springer
This chapter presents an audio-visual speech recognition (AVSR) for Human Computer
Interaction (HCI) that mainly focuses on 3 modules:(i) the radial basis function neural …

Analog auditory perception model for robust speech recognition

Y Deng, S Chakrabartty… - 2004 IEEE International …, 2004 - ieeexplore.ieee.org
An auditory perception model for noise-robust speech feature extraction is presented. The
model assumes continuous-time filtering and rectification, amenable to real-time, low-power …

[PDF][PDF] Continuous feature adaptation for non-native speech recognition

Y Deng, X Li, C Kwan, B Raj, R Stern - International Journal of Computer …, 2007 - Citeseer
The current speech interfaces in many military applications may be adequate for native
speakers. However, the recognition rate drops quite a lot for non-native speakers (people …

A study of spoken audio processing using machine learning for libraries, archives and museums (LAM)

W Xu, M Esteva, P Cui, E Castillo… - … Conference on Big …, 2020 - ieeexplore.ieee.org
As the need to provide access to spoken word audio collections in libraries, archives, and
museums (LAM) increases, so does the need to process them efficiently and consistently …