A study on robust detection of pronunciation erroneous tendency based on deep neural network. Y Gao, Y Xie, W Cao, J Zhang Interspeech, 693-696, 2015 | 44 | 2015 |
Improving mandarin tone recognition based on dnn by combining acoustic and articulatory features using extended recognition networks J Lin, W Li, Y Gao, Y Xie, NF Chen, SM Siniscalchi, J Zhang, CH Lee Journal of Signal Processing Systems 90, 1077-1087, 2018 | 29 | 2018 |
Articulatory Copy Synthesis Based on a Genetic Algorithm. Y Gao, S Stone, P Birkholz INTERSPEECH, 3770-3774, 2019 | 15 | 2019 |
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis J Xue, Y Deng, F Wang, Y Li, Y Gao, J Tao, J Sun, J Liang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 14 | 2023 |
Articulatory copy synthesis using long-short term memory networks Y Gao, P Steiner, P Birkholz Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung …, 2020 | 11 | 2020 |
Formant tracking using dilated convolutional networks through dense connection with gating mechanism W Dai, J Zhang, Y Gao, W Wei, D Ke, B Lin, Y Xie arXiv preprint arXiv:2005.10803, 2020 | 10 | 2020 |
Resynthesizing the geco speech corpus with vocaltractlab K Sering, N Stehwien, Y Gao, MV Butz, H Baayen Konferenz Elektronische Sprachsignalverarbeitung, 95-102, 2019 | 10 | 2019 |
Computational Modelling of Tone Perception Based on Direct Processing of f0 Contours Y Chen, Y Gao, Y Xu Brain Sciences 12 (3), 337, 2022 | 8 | 2022 |
An acoustic comparison of German tense and lax vowels produced by German native speakers and Mandarin Chinese learners Y Gao, H Ding, P Birkholz The Journal of the Acoustical Society of America 148 (1), EL112-EL118, 2020 | 8 | 2020 |
Auffusion: Leveraging the power of diffusion and large language models for text-to-audio generation J Xue, Y Deng, Y Gao, Y Li arXiv preprint arXiv:2401.01044, 2024 | 7 | 2024 |
Improving pronunciation erroneous tendency detection with multi-model soft targets J Lin, Y Gao, W Zhang, L Wei, Y Xie, J Zhang Journal of Signal Processing Systems 92, 793-803, 2020 | 7 | 2020 |
Improving Mandarin tone recognition based on DNN by combining acoustic and articulatory features J Lin, Y Xie, Y Gao, J Zhang 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016 | 7 | 2016 |
Improving pronunciation erroneous tendency detection with convolutional long short-term memory L Yang, Y Xie, Y Gao, J Zhang 2017 International Conference on Asian Language Processing (IALP), 52-56, 2017 | 6 | 2017 |
Text-aware end-to-end mispronunciation detection and diagnosis L Peng, Y Gao, B Lin, D Ke, Y Xie, J Zhang arXiv preprint arXiv:2206.07289, 2022 | 5 | 2022 |
A practical way to improve automatic phonetic segmentation performance W Peng, Y Gao, B Lin, J Zhang 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 5 | 2021 |
Modell einer Frauenstimme für die artikulatorische Sprachsynthese mit VocalTractLab S Drechsel, Y Gao, J Frahm, P Birkholz Konferenz Elektronische Sprachsignalverarbeitung, 239-246, 2019 | 5 | 2019 |
Cmcu-css: Enhancing naturalness via commonsense-based multi-modal context understanding in conversational speech synthesis Y Deng, J Xue, F Wang, Y Gao, Y Li Proceedings of the 31st ACM International Conference on Multimedia, 6081-6089, 2023 | 4 | 2023 |
End-to-End Mispronunciation Detection and Diagnosis Using Transfer Learning L Peng, Y Gao, R Bao, Y Li, J Zhang Applied Sciences 13 (11), 6793, 2023 | 4 | 2023 |
DNN based detection of Pronunciation Erroneous Tendency in data sparse condition Y Gao, Y Xie, J Lin, J Zhang 2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016 | 4 | 2016 |
Concss: Contrastive-based context comprehension for dialogue-appropriate prosody in conversational speech synthesis Y Deng, J Xue, Y Jia, Q Li, Y Han, F Wang, Y Gao, D Ke, Y Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |