Speech recognition by machine, a review
MA Anusuya, SK Katti - arXiv preprint arXiv:1001.2267, 2010 - arxiv.org
This paper presents a brief survey on Automatic Speech Recognition and discusses the
major themes and advances made in the past 60 years of research, so as to provide a …
TLEFuzzyNet: fuzzy rank-based ensemble of transfer learning models for emotion recognition from human speeches
KK Sahoo, I Dutta, MF Ijaz, M Woźniak… - IEEE Access, 2021 - ieeexplore.ieee.org
Human speech is not only a verbose medium of communication but it also conveys
emotions. The past decade has seen a lot of research going on with speech data which …
Speech recognition technology: a survey on Indian languages
G Hemakumar, P Punitha - International Journal of Information …, 2013 - academia.edu
This paper presents a brief survey of Automatic Speech Recognition (ASR) and discusses
the major themes and advances made in the past 70 years of research, so as to provide a …
Sparse auditory reproducing kernel (SPARK) features for noise-robust speech recognition
A Fazel, S Chakrabartty - IEEE transactions on audio, speech …, 2011 - ieeexplore.ieee.org
In this paper, we present a novel speech feature extraction algorithm based on a
hierarchical combination of auditory similarity and pooling functions. The computationally …
A Fast and Low-Distortion Capacity Adaptive Synchronized Acoustic-to-Acoustic Steganography Scheme
Data transmissions in public communications systems are not secure because of the chance
of their being intercepted, and tampered with by eavesdroppers. The security of acoustic …
Controlling tradeoff between approximation accuracy and complexity of a smooth function in a reproducing kernel Hilbert space for noise reduction
Noise reduction algorithms are widely used to mitigate noise effects on speech to improve
the robustness of speech technology applications. However, they inevitably cause speech …
Audio-visual speech processing for human computer interaction
SW Chin, KP Seng, LM Ang - Advances in robotics and virtual reality, 2012 - Springer
This chapter presents an audio-visual speech recognition (AVSR) for Human Computer
Interaction (HCI) that mainly focuses on 3 modules: (i) the radial basis function neural …
Analog auditory perception model for robust speech recognition
Y Deng, S Chakrabartty… - 2004 IEEE International …, 2004 - ieeexplore.ieee.org
An auditory perception model for noise-robust speech feature extraction is presented. The
model assumes continuous-time filtering and rectification, amenable to real-time, low-power …
Continuous feature adaptation for non-native speech recognition
The current speech interfaces in many military applications may be adequate for native
speakers. However, the recognition rate drops quite a lot for non-native speakers (people …
A study of spoken audio processing using machine learning for libraries, archives and museums (LAM)
As the need to provide access to spoken word audio collections in libraries, archives, and
museums (LAM) increases, so does the need to process them efficiently and consistently …