Remote sensing image captioning via variational autoencoder and reinforcement learning X Shen, B Liu, Y Zhou, J Zhao, M Liu Knowledge-Based Systems 203, 105920, 2020 | 72 | 2020 |
Remote sensing image caption generation via transformer and reinforcement learning X Shen, B Liu, Y Zhou, J Zhao Multimedia Tools and Applications 79 (35), 26661-26682, 2020 | 38 | 2020 |
Commonsense knowledge graph completion via contrastive pretraining and node clustering S Wu, X Shen, R Xia arXiv preprint arXiv:2305.17019, 2023 | 5 | 2023 |
Dense-atomic: Towards densely-connected ATOMIC with high knowledge coverage and massive multi-hop paths X Shen, S Wu, R Xia arXiv preprint arXiv:2210.07621, 2022 | 5* | 2022 |
Personality-aware human-centric multimodal reasoning: A new task Y Zhu, X Shen, R Xia CoRR, abs/2304.02313, 2023 | 4 | 2023 |
VCD: Knowledge Base Guided Visual Commonsense Discovery in Images X Shen, Y Song, S Wu, R Xia arXiv preprint arXiv:2402.17213, 2024 | 2 | 2024 |
A New Dialogue Response Generation Agent for Large Language Models by Asking Questions to Detect User's Intentions S Wu, X Shen, R Xia arXiv preprint arXiv:2310.03293, 2023 | 1 | 2023 |
Personality-aware Human-centric Multimodal Reasoning: A New Task, Dataset and Baselines Y Zhu, X Shen, R Xia arXiv preprint arXiv:2304.02313, 2023 | | 2023 |
Observe before Generate: Emotion-Cause aware Video Caption for Multimodal Emotion Cause Generation in Conversations F Wang, H Ma, X Shen, J Yu, R Xia ACM Multimedia 2024, 0 | | |