Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 120 | 2019 |
Any-to-many voice conversion with location-relative sequence-to-sequence modeling S Liu, Y Cao, D Wang, X Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1717-1728, 2021 | 77 | 2021 |
DiffSVC: A diffusion probabilistic model for singing voice conversion S Liu, Y Cao, D Su, H Meng 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 43 | 2021 |
End-to-end code-switched TTS with mix of monolingual recordings Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 43 | 2019 |
End-to-end accent conversion without using native utterances S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 39 | 2020 |
FastSVC: Fast cross-domain singing voice conversion with feature-wise linear modulation S Liu, Y Cao, N Hu, D Su, H Meng 2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2021 | 35 | 2021 |
VARA-TTS: Non-autoregressive text-to-speech synthesis based on very deep VAE with residual attention P Liu, Y Cao, S Liu, N Hu, G Li, C Weng, D Su arXiv preprint arXiv:2102.06431, 2021 | 31 | 2021 |
Transferring source style in non-parallel voice conversion S Liu, Y Cao, S Kang, N Hu, X Liu, D Su, D Yu, H Meng arXiv preprint arXiv:2005.09178, 2020 | 24 | 2020 |
Speech emotion recognition using sequential capsule networks X Wu, Y Cao, H Lu, S Liu, D Wang, Z Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3280-3291, 2021 | 23 | 2021 |
Code-switched speech synthesis using bilingual phonetic posteriorgram with only monolingual corpora Y Cao, S Liu, X Wu, S Kang, P Liu, Z Wu, X Liu, D Su, D Yu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 21 | 2020 |
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis X Wu, Y Cao, M Wang, S Liu, S Kang, Z Wu, X Liu, D Su, D Yu, H Meng Interspeech, 3072-3076, 2018 | 15 | 2018 |
Emotional voice conversion with cycle-consistent adversarial network S Liu, Y Cao, H Meng arXiv preprint arXiv:2004.03781, 2020 | 13 | 2020 |
Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams S Liu, Y Cao, X Wu, L Sun, X Liu, H Meng Interspeech, 714-718, 2019 | 13 | 2019 |
Exemplar-based emotive speech synthesis X Wu, Y Cao, H Lu, S Liu, S Kang, Z Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 874-886, 2021 | 9 | 2021 |
Multi-target emotional voice conversion with neural vocoders S Liu, Y Cao, H Meng arXiv preprint arXiv:2004.03782, 2020 | 9 | 2020 |
Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models J Yu, MWY Lam, S Hu, X Wu, X Li, Y Cao, X Liu, H Meng Interspeech, 3510-3514, 2019 | 9 | 2019 |
Exploring cross-lingual singing voice synthesis using speech data Y Cao, S Liu, S Kang, N Hu, P Liu, X Liu, D Su, D Yu, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 3 | 2021 |