关注
Zhen Ye
Zhen Ye
在 connect.ust.hk 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
BLVD: Building a large-scale 5D semantics benchmark for autonomous driving
J Xue, J Fang, T Li, B Zhang, P Zhang, Z Ye, J Dou
2019 International Conference on Robotics and Automation (ICRA), 6685-6691, 2019
652019
Comospeech: One-step speech and singing voice synthesis via consistency model
Z Ye, W Xue, X Tan, J Chen, Q Liu, Y Guo
Proceedings of the 31st ACM International Conference on Multimedia, 1831-1839, 2023
262023
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Z Ye, Z Ju, H Liu, X Tan, J Chen, Y Lu, P Sun, J Pan, W Bian, S He, Q Liu, ...
arXiv preprint arXiv:2404.14700, 2024
52024
CoMoSVC: Consistency Model-based Singing Voice Conversion
Y Lu, Z Ye, W Xue, X Tan, Q Liu, Y Guo
arXiv preprint arXiv:2401.01792, 2024
52024
NAS-FM: neural architecture search for tunable and interpretable sound synthesis based on frequency modulation
Z Ye, W Xue, X Tan, Q Liu, Y Guo
arXiv preprint arXiv:2305.12868, 2023
42023
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Z Ye, P Sun, J Lei, H Lin, X Tan, Z Dai, Q Kong, J Chen, J Pan, Q Liu, ...
arXiv preprint arXiv:2408.17175, 2024
12024
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
S Wang, H Lin, Z Luo, Z Ye, G Chen, J Ma
arXiv preprint arXiv:2406.11288, 2024
12024
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
J Chen, W Xue, X Tan, Z Ye, Q Liu, Y Guo
arXiv preprint arXiv:2405.07682, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–8