Graph neural networks for natural language processing: A survey L Wu, Y Chen, K Shen, X Guo, H Gao, S Li, J Pei, B Long Foundations and Trends® in Machine Learning 16 (2), 119-328, 2023 | 262 | 2023 |
NaturalSpeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian arXiv preprint arXiv:2304.09116, 2023 | 101 | 2023 |
A study on ReLU and softmax in transformer K Shen, J Guo, X Tan, S Tang, R Wang, J Bian arXiv preprint arXiv:2302.06461, 2023 | 33 | 2023 |
Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description. K Shen, L Wu, F Xu, S Tang, J Xiao, Y Zhuang IJCAI, 941-947, 2020 | 23 | 2020 |
NaturalSpeech 3: Zero-shot speech synthesis with factorized codec and diffusion models Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ... arXiv preprint arXiv:2403.03100, 2024 | 20 | 2024 |
PromptTTS 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... arXiv preprint arXiv:2309.02285, 2023 | 16 | 2023 |
Learning to Generate Visual Questions with Noisy Supervision K Shen, L Wu, S Tang, Y Zhuang, Z Ding, Y Xiao, B Long Thirty-Fifth Conference on Neural Information Processing Systems, 2021 | 12* | 2021 |
Mask the correct tokens: An embarrassingly simple approach for error correction K Shen, Y Leng, X Tan, S Tang, Y Zhang, W Liu, E Lin arXiv preprint arXiv:2211.13252, 2022 | 10 | 2022 |
Graph neural networks for natural language processing: A survey. arXiv 2021 L Wu, Y Chen, K Shen, X Guo, H Gao, S Li, J Pei, B Long arXiv preprint arXiv:2106.06090, 2021 | 8 | 2021 |
Scenario-based multi-product advertising copywriting generation for e-commerce X Zhang, K Shen, C Zhang, X Fan, Y Xiao, Z He, B Long, L Wu arXiv preprint arXiv:2205.10530, 2022 | 5 | 2022 |
RALL-E: Robust codec language modeling with chain-of-thought prompting for text-to-speech synthesis D Xin, X Tan, K Shen, Z Ju, D Yang, Y Wang, S Takamichi, H Saruwatari, ... arXiv preprint arXiv:2404.03204, 2024 | 4 | 2024 |
Graph Neural Networks in Computer Vision S Tang, W Zhang, Z Mu, K Shen, J Li, J Li, L Wu Graph Neural Networks: Foundations, Frontiers, and Applications, 447-462, 2022 | 1 | 2022 |
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text A Yin, H Li, K Shen, S Tang, Y Zhuang arXiv preprint arXiv:2406.07119, 2024 | | 2024 |
Ask Question with Double Hints: Visual Question Generation with Answer-awareness and Region-reference K Shen, L Wu, S Tang, F Xu, Z Zhang, Y Qiang, Y Zhuang | | |