关注
Yingming Gao
标题
引用次数
引用次数
年份
A study on robust detection of pronunciation erroneous tendency based on deep neural network.
Y Gao, Y Xie, W Cao, J Zhang
Interspeech, 693-696, 2015
442015
Improving mandarin tone recognition based on dnn by combining acoustic and articulatory features using extended recognition networks
J Lin, W Li, Y Gao, Y Xie, NF Chen, SM Siniscalchi, J Zhang, CH Lee
Journal of Signal Processing Systems 90, 1077-1087, 2018
292018
Articulatory Copy Synthesis Based on a Genetic Algorithm.
Y Gao, S Stone, P Birkholz
INTERSPEECH, 3770-3774, 2019
152019
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis
J Xue, Y Deng, F Wang, Y Li, Y Gao, J Tao, J Sun, J Liang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
142023
Articulatory copy synthesis using long-short term memory networks
Y Gao, P Steiner, P Birkholz
Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung …, 2020
112020
Formant tracking using dilated convolutional networks through dense connection with gating mechanism
W Dai, J Zhang, Y Gao, W Wei, D Ke, B Lin, Y Xie
arXiv preprint arXiv:2005.10803, 2020
102020
Resynthesizing the geco speech corpus with vocaltractlab
K Sering, N Stehwien, Y Gao, MV Butz, H Baayen
Konferenz Elektronische Sprachsignalverarbeitung, 95-102, 2019
102019
Computational Modelling of Tone Perception Based on Direct Processing of f0 Contours
Y Chen, Y Gao, Y Xu
Brain Sciences 12 (3), 337, 2022
82022
An acoustic comparison of German tense and lax vowels produced by German native speakers and Mandarin Chinese learners
Y Gao, H Ding, P Birkholz
The Journal of the Acoustical Society of America 148 (1), EL112-EL118, 2020
82020
Auffusion: Leveraging the power of diffusion and large language models for text-to-audio generation
J Xue, Y Deng, Y Gao, Y Li
arXiv preprint arXiv:2401.01044, 2024
72024
Improving pronunciation erroneous tendency detection with multi-model soft targets
J Lin, Y Gao, W Zhang, L Wei, Y Xie, J Zhang
Journal of Signal Processing Systems 92, 793-803, 2020
72020
Improving Mandarin tone recognition based on DNN by combining acoustic and articulatory features
J Lin, Y Xie, Y Gao, J Zhang
2016 10th International Symposium on Chinese Spoken Language Processing …, 2016
72016
Improving pronunciation erroneous tendency detection with convolutional long short-term memory
L Yang, Y Xie, Y Gao, J Zhang
2017 International Conference on Asian Language Processing (IALP), 52-56, 2017
62017
Text-aware end-to-end mispronunciation detection and diagnosis
L Peng, Y Gao, B Lin, D Ke, Y Xie, J Zhang
arXiv preprint arXiv:2206.07289, 2022
52022
A practical way to improve automatic phonetic segmentation performance
W Peng, Y Gao, B Lin, J Zhang
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
52021
Modell einer Frauenstimme für die artikulatorische Sprachsynthese mit VocalTractLab
S Drechsel, Y Gao, J Frahm, P Birkholz
Konferenz Elektronische Sprachsignalverarbeitung, 239-246, 2019
52019
Cmcu-css: Enhancing naturalness via commonsense-based multi-modal context understanding in conversational speech synthesis
Y Deng, J Xue, F Wang, Y Gao, Y Li
Proceedings of the 31st ACM International Conference on Multimedia, 6081-6089, 2023
42023
End-to-End Mispronunciation Detection and Diagnosis Using Transfer Learning
L Peng, Y Gao, R Bao, Y Li, J Zhang
Applied Sciences 13 (11), 6793, 2023
42023
DNN based detection of Pronunciation Erroneous Tendency in data sparse condition
Y Gao, Y Xie, J Lin, J Zhang
2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016
42016
Concss: Contrastive-based context comprehension for dialogue-appropriate prosody in conversational speech synthesis
Y Deng, J Xue, Y Jia, Q Li, Y Han, F Wang, Y Gao, D Ke, Y Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
32024
系统目前无法执行此操作,请稍后再试。
文章 1–20