AVQVC: One-shot voice conversion by vector quantization with applying contrastive learning | H Tang, X Zhang, J Wang, N Cheng, J Xiao | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 48 | 2022 |
Research on singing voice detection based on a long-term recurrent convolutional network with vocal separation and temporal smoothing | X Zhang, Y Yu, Y Gao, X Chen, W Li | Electronics 9 (9), 1458, 2020 | 36 | 2020 |
Reputation revision method for selecting cloud services based on prior knowledge and a market mechanism | Q Wu, X Zhang, M Zhang, Y Lou, R Zheng, W Wei | The Scientific World Journal 2014 (1), 617087, 2014 | 30 | 2014 |
Vocal melody extraction via HRNet-based singing voice separation and encoder-decoder-based F0 estimation | Y Gao, X Zhang, W Li | Electronics 10 (3), 298, 2021 | 27 | 2021 |
DRVC: A framework of any-to-any voice conversion with self-supervised learning | Q Wang, X Zhang, J Wang, N Cheng, J Xiao | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 25 | 2022 |
Singer identification using deep timbre feature learning with KNN-Net | X Zhang, J Qian, Y Yu, Y Sun, W Li | ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 24 | 2021 |
Singer identification for metaverse with timbral and middle-level perceptual features | X Zhang, J Wang, N Cheng, J Xiao | 2022 International Joint Conference on Neural Networks (IJCNN), 1-7, 2022 | 22 | 2022 |
TGAVC: Improving autoencoder voice conversion with text-guided and adversarial training | H Tang, X Zhang, J Wang, N Cheng, Z Zeng, E Xiao, J Xiao | 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 21 | 2021 |
nnSpeech: Speaker-guided conditional variational autoencoder for zero-shot multi-speaker text-to-speech | B Zhao, X Zhang, J Wang, N Cheng, J Xiao | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 20 | 2022 |
EmoMix: Emotion mixing via diffusion models for emotional speech synthesis | H Tang, X Zhang, J Wang, N Cheng, J Xiao | arXiv preprint arXiv:2306.00648, 2023 | 15 | 2023 |
TDASS: Target domain adaptation speech synthesis framework for multi-speaker low-resource TTS | X Zhang, J Wang, N Cheng, J Xiao | 2022 International Joint Conference on Neural Networks (IJCNN), 1-7, 2022 | 15 | 2022 |
SUSing: SU-net for singing voice synthesis | X Zhang, J Wang, N Cheng, J Xiao | 2022 International Joint Conference on Neural Networks (IJCNN), 1-7, 2022 | 15 | 2022 |
CycleGEAN: cycle generative enhanced adversarial network for voice conversion | X Zhang, J Wang, N Cheng, E Xiao, J Xiao | 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 15 | 2021 |
MetaSID: Singer identification with domain adaptation for metaverse | X Zhang, J Wang, N Cheng, J Xiao | 2022 International Joint Conference on Neural Networks (IJCNN), 1-7, 2022 | 14 | 2022 |
QI-TTS: Questioning intonation control for emotional speech synthesis | H Tang, X Zhang, J Wang, N Cheng, J Xiao | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Tiny-Sepformer: A tiny time-domain transformer network for speech separation | J Luo, J Wang, N Cheng, E Xiao, X Zhang, J Xiao | arXiv preprint arXiv:2206.13689, 2022 | 11 | 2022 |
Sparks of Large Audio Models: A Survey and Outlook | S Latif, M Shoukat, F Shamshad, M Usama, Y Ren, H Cuayáhuitl, W Wang, ... | arXiv preprint arXiv:2308.12792, 2023 | 10 | 2023 |
Music artist classification with WaveNet classifier for raw waveform audio data | X Zhang, Y Gao, Y Yu, W Li | arXiv preprint arXiv:2004.04371, 2020 | 9 | 2020 |
Investigation of singing voice separation for singing voice detection in polyphonic music | Y Sun, X Zhang, X Chen, Y Yu, W Li | Proceedings of the 9th Conference on Sound and Music Technology: Revised …, 2022 | 8 | 2022 |
MDCNN-SID: Multi-scale dilated convolution network for singer identification | X Zhang, J Wang, N Cheng, J Xiao | 2022 International Joint Conference on Neural Networks (IJCNN), 1-7, 2022 | 8 | 2022 |