End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition

D Palaz, M Magimai-Doss, R Collobert - Speech Communication, 2019 - Elsevier
In hidden Markov model (HMM) based automatic speech recognition (ASR) system,
modeling the statistical relationship between the acoustic speech signal and the HMM states …

Accented speech recognition: Benchmarking, pre-training, and diverse data

A Aksënova, Z Chen, CC Chiu, D van Esch… - arXiv preprint arXiv …, 2022 - arxiv.org
Building inclusive speech recognition systems is a crucial step towards developing
technologies that speakers of all language varieties can use. Therefore, ASR systems must …

Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments

S Baghel, S Ramoji, S Jain, PR Chowdhuri… - Speech …, 2024 - Elsevier
In multi-lingual societies, where multiple languages are spoken in a small geographic
vicinity, informal conversations often involve mix of languages. Existing speech technologies …

Finnish parliament ASR corpus: Analysis, benchmarks and statistics

A Virkkunen, A Rouhe, N Phan, M Kurimo - Language Resources and …, 2023 - Springer
Public sources like parliament meeting recordings and transcripts provide ever-growing
material for the training and evaluation of automatic speech recognition (ASR) systems. In …

A longitudinal bilingual Frisian-Dutch radio broadcast database designed for code-switching research

E Yilmaz, M Andringa, S Kingma, J Dijkstra… - Proceedings of the …, 2016 - research.rug.nl
We present a new speech database containing 18.5 hours of annotated radio broadcasts in
the Frisian language. Frisian is mostly spoken in the province Fryslân and it is the second …

Automatic voice based disease detection method using one dimensional local binary pattern feature extraction network

T Tuncer, S Dogan, F Ertam - Applied Acoustics, 2019 - Elsevier
Voices have been widely used for disease detection in the literature but these methods are
non-invasive. In this article a 1D local binary pattern (LBP) based feature extraction network …

IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition

S Ganji, K Dhawan, R Sinha - Speech Communication, 2019 - Elsevier
Code-switching is a phenomenon in linguistics which refers to the use of two or more
languages, especially within the same discourse. This phenomenon has been observed in …

Semi-supervised acoustic model training for speech with code-switching

E Yılmaz, M McLaren, H van den Heuvel… - Speech …, 2018 - Elsevier
In the FAME! project, we aim to develop an automatic speech recognition (ASR) system for
Frisian-Dutch code-switching (CS) speech extracted from the archives of a local broadcaster …

Exploration of end-to-end framework for code-switching speech recognition task: Challenges and enhancements

G Sreeram, R Sinha - IEEE Access, 2020 - ieeexplore.ieee.org
The end-to-end (E2E) framework has emerged as a viable alternative to conventional hybrid
systems in automatic speech recognition (ASR) domain. Unlike the monolingual case, the …

Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch

PN Garner, D Imseng, T Meyer - Proceedings of Interspeech, 2014 - infoscience.epfl.ch
Walliserdeutsch is a Swiss German dialect spoken in the south west of Switzerland.
To investigate the potential of automatic speech processing of Walliserdeutsch, a small …