End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition
In hidden Markov model (HMM) based automatic speech recognition (ASR) system,
modeling the statistical relationship between the acoustic speech signal and the HMM states …
Accented speech recognition: Benchmarking, pre-training, and diverse data
Building inclusive speech recognition systems is a crucial step towards developing
technologies that speakers of all language varieties can use. Therefore, ASR systems must …
Summary of the DISPLACE challenge 2023 - DIarization of SPeaker and LAnguage in Conversational Environments
In multi-lingual societies, where multiple languages are spoken in a small geographic
vicinity, informal conversations often involve mix of languages. Existing speech technologies …
Finnish parliament ASR corpus: Analysis, benchmarks and statistics
Public sources like parliament meeting recordings and transcripts provide ever-growing
material for the training and evaluation of automatic speech recognition (ASR) systems. In …
A longitudinal bilingual Frisian-Dutch radio broadcast database designed for code-switching research
E Yilmaz, M Andringa, S Kingma, J Dijkstra… - Proceedings of the …, 2016 - research.rug.nl
We present a new speech database containing 18.5 hours of annotated radio broadcasts in
the Frisian language. Frisian is mostly spoken in the province Fryslân and it is the second …
Automatic voice based disease detection method using one dimensional local binary pattern feature extraction network
Voices have been widely used for disease detection in the literature but these methods are
non-invasive. In this article a 1D local binary pattern (LBP) based feature extraction network …
IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition
Code-switching is a phenomenon in linguistics which refers to the use of two or more
languages, especially within the same discourse. This phenomenon has been observed in …
Semi-supervised acoustic model training for speech with code-switching
In the FAME! project, we aim to develop an automatic speech recognition (ASR) system for
Frisian-Dutch code-switching (CS) speech extracted from the archives of a local broadcaster …
Exploration of end-to-end framework for code-switching speech recognition task: Challenges and enhancements
G Sreeram, R Sinha - IEEE Access, 2020 - ieeexplore.ieee.org
The end-to-end (E2E) framework has emerged as a viable alternative to conventional hybrid
systems in automatic speech recognition (ASR) domain. Unlike the monolingual case, the …
Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch
Walliserdeutsch is a Swiss German dialect spoken in the south west of Switzerland.
To investigate the potential of automatic speech processing of Walliserdeutsch, a small …