Prosospeech: Enhancing prosody with quantized vector pre-training in text-to-speech Y Ren, M Lei, Z Huang, S Zhang, Q Chen, Z Yan, Z Zhao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 41 | 2022 |
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models P Anastassiou, J Chen, J Chen, Y Chen, Z Chen, Z Chen, J Cong, L Deng, ... arXiv preprint arXiv:2406.02430, 2024 | 36 | 2024 |
Speaker adaptation of RNN-BLSTM for speech recognition based on speaker code Z Huang, J Tang, S Xue, L Dai 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 29 | 2016 |
Polyvoice: Language models for speech to speech translation Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ... arXiv preprint arXiv:2306.02982, 2023 | 20 | 2023 |
Linear networks based speaker adaptation for speech synthesis Z Huang, H Lu, M Lei, Z Yan 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 18 | 2018 |
Devicetts: A small-footprint, fast, stable network for on-device text-to-speech Z Huang, H Li, M Lei arXiv preprint arXiv:2010.15311, 2020 | 17 | 2020 |
Speech recognition method and apparatus Z Huang, S Xue, Z Yan US Patent App. 15/686,094, 2018 | 17 | 2018 |
基于深度学习的语音识别技术现状与展望 戴礼荣, 张仕良, 黄智颖 数据采集与处理 32 (2), 221-231, 2017 | 11 | 2017 |
RNN-BLSTM 声学模型的说话人自适应方法研究 黄智颖 中国科学技术大学, 2017 | 2 | 2017 |
Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR S Xue, Z Yan, Z Huang, L Dai 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016 | 2 | 2016 |
PolyVoice: Language Models for Speech to Speech Translation Q qian Dong, Z Huang, Q Tian, C Xu, T Ko, S Feng, T Li, K Wang, ... The Twelfth International Conference on Learning Representations, 2023 | 1 | 2023 |
Audio Tagging with Compact Feedforward Sequential Memory Network and Audio-to-Audio Ratio Based Data Augmentation. Z Huang, S Zhang, M Lei INTERSPEECH, 3377-3381, 2019 | | 2019 |
Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code Z Huang, S Xue, Z Yan, L Dai 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016 | | 2016 |