AudioLDM 2: Learning holistic audio generation with self-supervised pretraining H Liu, Y Yuan, X Liu, X Mei, Q Kong, Q Tian, Y Wang, W Wang, Y Wang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 54 | 2024 |
VoiceFixer: Toward general speech restoration with neural vocoder H Liu, Q Kong, Q Tian, Y Zhao, DL Wang, C Huang, Y Wang arXiv preprint arXiv:2109.13731, 2021 | 38 | 2021 |
VoiceFixer: A unified framework for high-fidelity speech restoration H Liu, X Liu, Q Kong, Q Tian, Y Zhao, DL Wang, C Huang, Y Wang arXiv preprint arXiv:2204.05841, 2022 | 34 | 2022 |
FeatherWave: An efficient high-fidelity neural vocoder with multi-band linear prediction Q Tian, Z Zhang, H Lu, LH Chen, S Liu arXiv preprint arXiv:2005.05551, 2020 | 34 | 2020 |
TFGAN: Time and frequency domain based generative adversarial network for high-fidelity speech synthesis Q Tian, Y Chen, Z Zhang, H Lu, L Chen, L Xie, S Liu arXiv preprint arXiv:2011.12206, 2020 | 32 | 2020 |
AudioLDM 2: Learning holistic audio generation with self-supervised pretraining H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, MD Plumbley arXiv preprint arXiv:2308.05734 8 (1), 2023 | 31 | 2023 |
Neural vocoder is all you need for speech super-resolution H Liu, W Choi, X Liu, Q Kong, Q Tian, DL Wang arXiv preprint arXiv:2203.14941, 2022 | 31 | 2022 |
AdaDurIAN: Few-shot adaptation for neural text-to-speech with DurIAN Z Zhang, Q Tian, H Lu, LH Chen, S Liu arXiv preprint arXiv:2005.05642, 2020 | 29 | 2020 |
Efficient neural music generation MWY Lam, Q Tian, T Li, Z Yin, S Feng, M Tu, Y Ji, R Xia, M Ma, X Song, ... Advances in Neural Information Processing Systems 36, 2024 | 28 | 2024 |
Neural Dubber: Dubbing for videos according to scripts C Hu, Q Tian, T Li, Y Wang, Y Wang, H Zhao Advances in Neural Information Processing Systems 34, 16582-16595, 2021 | 26 | 2021 |
LM-VC: Zero-shot voice conversion via speech generation based on language models Z Wang, Y Chen, L Xie, Q Tian, Y Wang IEEE Signal Processing Letters, 2023 | 18 | 2023 |
PolyVoice: Language Models for Speech to Speech Translation Q Dong, Z Huang, C Xu, Y Zhao, K Wang, X Cheng, T Ko, Q Tian, T Li, ... arXiv preprint arXiv:2306.02982v2, 2023 | 18 | 2023 |
NeuFA: Neural network based end-to-end forced alignment with bidirectional attention mechanism J Li, Y Meng, Z Wu, H Meng, Q Tian, Y Wang, Y Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 17 | 2022 |
Generative adversarial network based speaker adaptation for high fidelity WaveNet vocoder Q Tian, X Wan, S Liu arXiv preprint arXiv:1812.02339, 2018 | 12 | 2018 |
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, MD Plumbley arXiv preprint arXiv:2308.05734 3, 2023 | 11 | 2023 |
Controllable and lossless non-autoregressive end-to-end text-to-speech Z Liu, Q Tian, C Hu, X Liu, M Wu, Y Wang, H Zhao, Y Wang arXiv preprint arXiv:2207.06088, 2022 | 11 | 2022 |
Inferring speaking styles from multi-modal conversational context by multi-scale relational graph convolutional networks J Li, Y Meng, X Wu, Z Wu, J Jia, H Meng, Q Tian, Y Wang, Y Wang Proceedings of the 30th ACM International Conference on Multimedia, 5811-5820, 2022 | 10 | 2022 |
AudioSR: Versatile audio super-resolution at scale H Liu, K Chen, Q Tian, W Wang, MD Plumbley ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9 | 2024 |
Cloning one’s voice using very limited data in the wild D Dai, Y Chen, L Chen, M Tu, L Liu, R Xia, Q Tian, Y Wang, Y Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 9 | 2022 |
FeatherTTS: Robust and efficient attention based neural TTS Q Tian, Z Zhang, C Liu, H Lu, L Chen, B Wei, P He, S Liu arXiv preprint arXiv:2011.00935, 2020 | 8 | 2020 |