MediaParl: Bilingual mixed language accented speech database

End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition

D Palaz, M Magimai-Doss, R Collobert - Speech Communication, 2019 - Elsevier

In hidden Markov model (HMM) based automatic speech recognition (ASR) system,
modeling the statistical relationship between the acoustic speech signal and the HMM states …

Cited by 159 · Related articles · All 9 versions

Accented speech recognition: Benchmarking, pre-training, and diverse data

A Aksënova, Z Chen, CC Chiu, D van Esch… - arXiv preprint arXiv …, 2022 - arxiv.org

Building inclusive speech recognition systems is a crucial step towards developing
technologies that speakers of all language varieties can use. Therefore, ASR systems must …

Cited by 18 · Related articles · All 4 versions

Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments

S Baghel, S Ramoji, S Jain, PR Chowdhuri… - Speech …, 2024 - Elsevier

In multi-lingual societies, where multiple languages are spoken in a small geographic
vicinity, informal conversations often involve mix of languages. Existing speech technologies …

Cited by 5 · Related articles · All 4 versions

Finnish parliament ASR corpus: Analysis, benchmarks and statistics

A Virkkunen, A Rouhe, N Phan, M Kurimo - Language Resources and …, 2023 - Springer

Public sources like parliament meeting recordings and transcripts provide ever-growing
material for the training and evaluation of automatic speech recognition (ASR) systems. In …

Cited by 11 · Related articles · All 10 versions

A longitudinal bilingual Frisian-Dutch radio broadcast database designed for code-switching research

E Yilmaz, M Andringa, S Kingma, J Dijkstra… - Proceedings of the …, 2016 - research.rug.nl

We present a new speech database containing 18.5 hours of annotated radio broadcasts in
the Frisian language. Frisian is mostly spoken in the province Fryslân and it is the second …

Cited by 51 · Related articles · All 9 versions

Automatic voice based disease detection method using one dimensional local binary pattern feature extraction network

T Tuncer, S Dogan, F Ertam - Applied Acoustics, 2019 - Elsevier

Voices have been widely used for disease detection in the literature but these methods are
non-invasive. In this article a 1D local binary pattern (LBP) based feature extraction network …

Cited by 25 · Related articles

IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition

S Ganji, K Dhawan, R Sinha - Speech Communication, 2019 - Elsevier

Code-switching is a phenomenon in linguistics which refers to the use of two or more
languages, especially within the same discourse. This phenomenon has been observed in …

Cited by 27 · Related articles · All 2 versions

Semi-supervised acoustic model training for speech with code-switching

E Yılmaz, M McLaren, H van den Heuvel… - Speech …, 2018 - Elsevier

In the FAME! project, we aim to develop an automatic speech recognition (ASR) system for
Frisian-Dutch code-switching (CS) speech extracted from the archives of a local broadcaster …

Cited by 30 · Related articles · All 9 versions

Exploration of end-to-end framework for code-switching speech recognition task: Challenges and enhancements

G Sreeram, R Sinha - IEEE Access, 2020 - ieeexplore.ieee.org

The end-to-end (E2E) framework has emerged as a viable alternative to conventional hybrid
systems in automatic speech recognition (ASR) domain. Unlike the monolingual case, the …

Cited by 14 · Related articles · All 4 versions

Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch

PN Garner, D Imseng, T Meyer - Proceedings of Interspeech, 2014 - infoscience.epfl.ch

Walliserdeutsch is a Swiss German dialect spoken in the south west of Switzerland.
To investigate the potential of automatic speech processing of Walliserdeutsch, a small …

Cited by 24 · Related articles · All 10 versions