Seen and unseen emotional style transfer for voice conversion with a new emotional speech dataset K Zhou, B Sisman, R Liu, H Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 168 | 2021 |
Emotional voice conversion: Theory, databases and ESD K Zhou, B Sisman, R Liu, H Li Speech Communication 137, 1-18, 2022 | 119 | 2022 |
Expressive TTS training with frame and style reconstruction loss R Liu, B Sisman, G Gao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1806-1818, 2021 | 85 | 2021 |
Teacher-student training for robust tacotron-based tts R Liu, B Sisman, J Li, F Bao, G Gao, H Li ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020 | 64 | 2020 |
Reinforcement learning for emotional text-to-speech synthesis with improved emotion discriminability R Liu, B Sisman, H Li arXiv preprint arXiv:2104.01408, 2021 | 38 | 2021 |
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis R Liu, B Sisman, H Li IEEE ICASSP 2021. IEEE International Conference on Acoustics, Speech and …, 2021 | 34 | 2021 |
Mongolian text-to-speech system based on deep neural network R Liu, F Bao, G Gao, Y Wang Man-Machine Speech Communication: 14th National Conference, NCMMSC 2017 …, 2018 | 30 | 2018 |
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis R Liu, B Sisman, F Bao, J Yang, G Gao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 274-285, 2021 | 27 | 2021 |
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model. R Liu, F Bao, G Gao, H Zhang, Y Wang Interspeech, 57-61, 2018 | 22 | 2018 |
Modeling prosodic phrasing with multi-task learning in tacotron-based TTS R Liu, B Sisman, F Bao, G Gao, H Li IEEE Signal Processing Letters 27, 1470-1474, 2020 | 21 | 2020 |
Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities H Zuo, R Liu, J Zhao, G Gao, H Li ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 19 | 2023 |
Wavetts: Tacotron-based tts with joint time-frequency domain loss R Liu, B Sisman, F Bao, G Gao, H Li arXiv preprint arXiv:2002.00417, 2020 | 16 | 2020 |
Fasttalker: A neural text-to-speech architecture with shallow and group autoregression R Liu, B Sisman, Y Lin, H Li Neural Networks 141, 306-314, 2021 | 14 | 2021 |
Decoding knowledge transfer for neural text-to-speech training R Liu, B Sisman, G Gao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1789-1802, 2022 | 12 | 2022 |
Visualtts: Tts with accurate lip-speech synchronization for automatic voice over J Lu, B Sisman, R Liu, M Zhang, H Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
End-to-end mongolian text-to-speech system J Li, H Zhang, R Liu, X Zhang, F Bao 2018 11th international symposium on chinese spoken language processing …, 2018 | 11 | 2018 |
Multistage deep transfer learning for emIoT-enabled human–computer interaction R Liu, Q Liu, H Zhu, H Cao IEEE Internet of Things Journal 9 (16), 15128-15137, 2022 | 10 | 2022 |
A lstm approach with sub-word embeddings for mongolian phrase break prediction R Liu, F Bao, G Gao, H Zhang, Y Wang Proceedings of the 27th International Conference on Computational …, 2018 | 10 | 2018 |
Accurate emotion strength assessment for seen and unseen speech based on data-driven deep learning R Liu, B Sisman, B Schuller, G Gao, H Li arXiv preprint arXiv:2206.07229, 2022 | 9 | 2022 |
Text-to-speech for low-resource agglutinative language with morphology-aware language model pre-training R Liu, Y Hu, H Zuo, Z Luo, L Wang, G Gao IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 8 | 2024 |