Improving prosody with linguistic and bert derived features in multi-speaker based mandarin chinese neural tts Y Xiao, L He, H Ming, FK Soong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 60 | 2020 |
Paired phone-posteriors approach to ESL pronunciation quality assessment Y Xiao, FK Soong, W Hu bdl 1 (782d), 3, 2018 | 19 | 2018 |
Prosodyspeech: Towards advanced prosody model for neural text-to-speech Y Yi, L He, S Pan, X Wang, Y Xiao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
Proficiency Assessment of ESL Learner's Sentence Prosody with TTS Synthesized Voice as Reference. Y Xiao, FK Soong INTERSPEECH, 1755-1759, 2017 | 11 | 2017 |
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading Y Xiao, S Zhang, X Wang, X Tan, L He, S Zhao, FK Soong, T Lee arXiv preprint arXiv:2307.00782, 2023 | 5 | 2023 |
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning H Guo, F Xie, J Kang, Y Xiao, X Wu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 3 | 2024 |
Improving fastspeech tts with efficient self-attention and compact feed-forward network Y Xiao, X Wang, L He, FK Soong ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 2 | 2022 |
Contrastive Context-Speech Pretraining for Expressive Text-to-Speech Synthesis Y Xiao, X Wang, X Tan, L He, X Zhu, T Lee ACM Multimedia 2024, 2024 | | 2024 |
UniStyle: Unified Style Modeling for Speaking Style Captioning and Stylistic Speech Synthesis X Zhu, W Tian, X Wang, L He, Y Xiao, X Wang, X Tan, L Xie ACM Multimedia 2024, 2024 | | 2024 |