Learning latent representations for style control and transfer in end-to-end speech synthesis YJ Zhang, S Pan, L He, ZH Ling ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 288 | 2019 |
Automatic detection and classification of marmoset vocalizations using deep and recurrent neural networks YJ Zhang, JF Huang, N Gong, ZH Ling, Y Hu The Journal of the Acoustical Society of America 144 (1), 478-487, 2018 | 23 | 2018 |
Extracting and predicting word-level style variations for speech synthesis YJ Zhang, ZH Ling IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1582-1593, 2021 | 16 | 2021 |
MaskedSpeech: Context-aware speech synthesis with masking strategy YJ Zhang, W Song, Y Yue, Z Zhang, Y Wu, X He arXiv preprint arXiv:2211.06170, 2022 | 6 | 2022 |
Prosody modelling with pre-trained cross-utterance representations for improved speech synthesis YJ Zhang, C Zhang, W Song, Z Zhang, Y Wu, X He IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2812-2823, 2023 | 5 | 2023 |
Integrating Discrete Word-Level Style Variations into Non-Autoregressive Acoustic Models for Speech Synthesis Z Liu, NQ Wu, Y Zhang, Z Ling INTERSPEECH, 5508-5512, 2022 | 5 | 2022 |
Learning deep and wide contextual representations using BERT for statistical parametric speech synthesis YJ Zhang, ZH Ling Proceedings of the 2021 5th International Conference on Digital Signal …, 2021 | 5 | 2021 |
An Experimental Investigation on Excitation Representation of WaveNet-Based Neural Vocoders YJ Zhang, ZH Ling 2018 14th IEEE International Conference on Signal Processing (ICSP), 1119-1123, 2018 | | 2018 |