E2E-based multi-task learning approach to joint speech and accent recognition

J Zhang, Y Peng, P Van Tung, H Xu, H Huang… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we propose a single multi-task learning framework to perform End-to-End
(E2E) speech recognition (ASR) and accent recognition (AR) simultaneously. The proposed …

Automatic accent identification using Gaussian mixture models

T Chen, C Huang, E Chang… - IEEE Workshop on …, 2001 - ieeexplore.ieee.org
It is well known that speaker variability caused by accent is an important factor in speech
recognition. Some major accents in China are so different as to make this problem very …

Accent issues in large vocabulary continuous speech recognition

C Huang, T Chen, E Chang - International Journal of Speech Technology, 2004 - Springer
This paper addresses accent issues in large vocabulary continuous speech recognition.
Cross-accent experiments show that the accent problem is very dominant in speech …

AISpeech-SJTU accent identification system for the accented English speech recognition challenge

H Huang, X Xiang, Y Yang, R Ma… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
This paper describes the AISpeech-SJTU system for the accent identification track of the
Interspeech-2020 Accented English Speech Recognition Challenge. In this challenge track …

Automatic speech recognition of multiple accented English data.

D Vergyri, L Lamel, JL Gauvain - Interspeech, 2010 - isca-archive.org
Accent variability is an important factor in speech that can significantly degrade automatic
speech recognition performance. We investigate the effect of multiple accents on an English …

Feature subset selection for improved native accent identification

T Wu, J Duchateau, JP Martens… - Speech …, 2010 - Elsevier
In this paper, we develop methods to identify accents of native speakers. Accent
identification differs from other speaker classification tasks because accents may differ in a …

Automatic classification of speaker characteristics

P Nguyen, D Tran, X Huang… - … on Communications and …, 2010 - ieeexplore.ieee.org
An automatic voice-based classification system of speaker characteristics including age,
gender and accent is presented in this paper. Speakers are grouped according to their …

Spoken language characterization

MP Harper, M Maxwell - Springer handbook of speech processing, 2008 - Springer
This chapter describes the types of information that can be used to characterize spoken
languages. Automatic spoken language identification (LID) systems, which are tasked with …

Native vs. non-native accent identification using Japanese spoken telephone numbers

K Amino, T Osanai - Speech Communication, 2014 - Elsevier
In forensic investigations, it would be helpful to be able to identify a speaker's native
language based on the sound of their speech. Previous research on foreign accent …

Text classification based on nonlinear dimensionality reduction techniques and support vector machines

L Shi, J Zhang, E Liu, P He - Third International Conference on …, 2007 - ieeexplore.ieee.org
Text classification is an important task in the field of natural language processing. The
dimension of the text data is huge for the text documents are usually represented with the …