High-resolution piano transcription with pedals by regressing onsets and offsets times Q Kong, B Li, X Song, Y Wan, Y Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2021 | 94 | 2021 |
Source separation with weakly labelled data: An approach to computational auditory scene analysis Q Kong, Y Wang, X Song, Y Cao, W Wang, MD Plumbley ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 47 | 2020 |
SpecTNT: a Time-Frequency Transformer for Music Audio WT Lu, JC Wang, M Won, K Choi, X Song 22nd International Society for Music Information Retrieval Conference (ISMIR …, 2021 | 33 | 2021 |
Modeling beats and downbeats with a time-frequency transformer YN Hung, JC Wang, X Song, WT Lu, M Won ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 28 | 2022 |
Efficient Neural Music Generation MWY Lam, Q Tian, T Li, Z Yin, S Feng, M Tu, Y Ji, R Xia, M Ma, X Song, ... NeurIPS 2023; Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 22 | 2023 |
Supervised Metric Learning for Music Structure Feature JC Wang, JBL Smith, WT Lu, X Song 22nd International Society for Music Information Retrieval Conference (ISMIR …, 2021 | 18 | 2021 |
A handwritten Chinese characters recognition method based on sample set expansion and CNN X Song, X Gao, Y Ding, Z Wang 2016 3rd International Conference on Systems and Informatics (ICSAI), 843-849, 2016 | 16 | 2016 |
Speech recognition using natural language understanding related knowledge via deep feedforward neural networks Z Zhou, X Song US Patent 11,615,785, 2023 | 15 | 2023 |
Supervised chorus detection for popular music using convolutional neural network and multi-task learning JC Wang, JBL Smith, J Chen, X Song, Y Wang ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 12 | 2021 |
CatNet: Music source separation system with mix-audio augmentation X Song, Q Kong, X Du, Y Wang arXiv preprint arXiv:2102.09966, 2021 | 11 | 2021 |
InstructME: An instruction guided music edit and remix framework with latent diffusion models B Han, J Dai, X Song, W Hao, X He, D Guo, J Chen, Y Wang, Y Qian 2024 International Joint Conference on Artificial Intelligence (IJCAI), 2023 | 10 | 2023 |
Noise robust tts for low resource speakers using pre-trained model and speech enhancement D Dai, L Chen, Y Wang, M Wang, R Xia, X Song, Z Wu, Y Wang arXiv preprint arXiv:2005.12531, 2020 | 10 | 2020 |
Modeling the Compatibility of Stem Tracks to Generate Music Mashups J Huang, JC Wang, JBL Smith, X Song, Y Wang Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 35. No …, 2021 | 9 | 2021 |
A Neural Network Based Ranking Framework to Improve ASR with NLU Related Knowledge Deployed Z Zhou, X Song, R Botros, L Zhao ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 3 | 2019 |
ALCAP: Alignment-Augmented Music Captioner Z He, W Hao, WT Lu, C Chen, K Lerman, X Song EMNLP 2023 The 2023 Conference on Empirical Methods in Natural Language …, 2022 | 2* | 2022 |
System and method for training a transformer-in-transformer-based neural network model for audio data WT Lu, JC Wang, WON Minz, C Keunwoo, X Song US Patent 11,854,558, 2023 | | 2023 |
Transition type determination method and apparatus, and electronic device and storage medium X Jin, X Song, LI Gen, Y Wang, S Xiaohui US Patent App. 18/450,207, 2023 | | 2023 |
Transition type determination method and apparatus, and electronic device and storage medium X Jin, X Song, LI Gen, Y Wang, S Xiaohui US Patent 11,783,861, 2023 | | 2023 |
A Long-Tail Friendly Representation Framework for Artist and Music Similarity H Xiang, J Dai, X Song, F Shen arXiv preprint arXiv:2309.04182, 2023 | | 2023 |
Production method of multimedia work, apparatus, and computer-readable storage medium CAI Xiaojuan, X Song, LI Gen, H Zhong, MO Weishu, H Li US Patent App. 18/069,031, 2023 | | 2023 |