Attention Based Hybrid i-Vector BLSTM Model for Language Recognition.

Z Fan, M Li, S Zhou, B Xu - arXiv preprint arXiv:2012.06185, 2020 - arxiv.org

Wav2vec 2.0 is a recently proposed self-supervised framework for speech representation
learning. It follows a two-stage training process of pre-training and fine-tuning, and performs …

被引用次数：226 相关文章所有 7 个版本

[PDF] arxiv.org

An overview of Indian spoken language recognition from machine learning perspective

S Dey, M Sahidullah, G Saha - ACM Transactions on Asian and Low …, 2022 - dl.acm.org

Automatic spoken language identification (LID) is a very important research field in the era of
multilingual voice-command-based human-computer interaction. A front-end LID module …

被引用次数：19 相关文章所有 9 个版本

[PDF] researchgate.net

End-to-end language diarization for bilingual code-switching speech

H Liu, LPG Perera, X Zhang, J Dauwels… - … Conference of the …, 2021 - research.tudelft.nl

We propose two end-to-end neural configurations for language diarization on bilingual code-
switching speech. The first, a BLSTM-E2E architecture, includes a set of stacked …

被引用次数：27 相关文章所有 7 个版本

[PDF] arxiv.org

Towards relevance and sequence modeling in language recognition

B Padi, A Mohan, S Ganapathy - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org

The task of automatic language identification (LID) involving multiple dialects of the same
language family in the presence of noise is a challenging problem. In these scenarios, the …

被引用次数：18 相关文章所有 7 个版本

Multi-domain attention fusion network for language recognition

M Ju, Y Xu, D Ke, K Su - SN Computer Science, 2022 - Springer

Attention-based convolutional neural network models are increasingly adopted for language
recognition tasks. In this paper, based on the self-attention mechanism, we solve the study of …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Improving language identification for multilingual speakers

A Titus, J Silovsky, N Chen, R Hsiao… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

Spoken language identification (LID) technologies have improved in recent years from
discriminating largely distinct languages to discriminating highly similar languages or even …

被引用次数：14 相关文章所有 4 个版本

[PDF] arxiv.org

Cross-corpora language recognition: A preliminary investigation with Indian languages

S Dey, G Saha, M Sahidullah - 2021 29th European Signal …, 2021 - ieeexplore.ieee.org

In this paper, we conduct one of the very first studies for cross-corpora performance
evaluation in the spoken language identification (LID) problem. Cross-corpora evaluation …

被引用次数：8 相关文章所有 9 个版本

[PDF] iiit.ac.in

Study on the effect of emotional speech on language identification

P Jain, K Gurugubelli… - 2020 national conference …, 2020 - ieeexplore.ieee.org

Identifying language information from speech utterance is referred to as spoken language
identification. Language Identification (LID) is essential in multilingual speech systems. The …

被引用次数：8 相关文章所有 3 个版本

Boosting Character-based Mandarin ASR via Chinese Pinyin Representation

L Li, Y Long, D Xu, Y Li - International Journal of Speech Technology, 2023 - Springer

Current end-to-end automatic speech recognition (ASR) models have achieved good results
in phonetic language such as English and French. However, Chinese character is a typical …

被引用次数：1 相关文章所有 2 个版本

Universal and accent-discriminative encoders for conformer-based accent-invariant speech recognition

X Wang, Y Long, D Xu - International Journal of Speech Technology, 2022 - Springer

Accent-variation is a challenging issue, either for traditional hybrid or current end-to-end
(E2E) automatic speech recognition (ASR). Building an accent-invariant and high quality …

被引用次数：1 相关文章所有 2 个版本