NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions J Xiao, X Shang, A Yao, TS Chua IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 9777-9786, 2021 | 222 | 2021 |
Annotating objects and relations in user-generated videos X Shang, D Di, J Xiao, Y Cao, X Yang, TS Chua International Conference on Multimedia Retrieval (ICMR), 279-287, 2019 | 153 | 2019 |
Invariant Grounding for Video Question Answering Y Li, X Wang, J Xiao, W Ji, TS Chua IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2928-2937, 2022 | 100 | 2022 |
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering J Xiao, A Yao, Z Liu, Y Li, W Ji, TS Chua Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2804-2812, 2022 | 100 | 2022 |
Video Graph Transformer for Video Question Answering J Xiao, P Zhou, TS Chua, S Yan European Conference on Computer Vision (ECCV), 39-58, 2022 | 70 | 2022 |
Video Question Answering: Datasets, Algorithms and Challenges Y Zhong, J Xiao, W Ji, Y Li, W Deng, TS Chua Empirical Methods in Natural Language Processing (EMNLP), 6439-6455, 2022 | 64 | 2022 |
Visual Relation Grounding in Videos J Xiao, X Shang, X Yang, S Tang, TS Chua European Conference on Computer Vision (ECCV), 447-464, 2020 | 46 | 2020 |
Video Visual Relation Detection via Iterative Inference X Shang, Y Li, J Xiao, W Ji, TS Chua ACM International Conference on Multimedia (MM), 3654-3663, 2021 | 30 | 2021 |
Equivariant and Invariant Grounding for Video Question Answering Y Li, X Wang, J Xiao, TS Chua ACM International Conference on Multimedia (MM), 4714-4722, 2022 | 24 | 2022 |
FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms P Qi, Y Bu, J Cao, W Ji, R Shui, J Xiao, D Wang, TS Chua Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 14444 …, 2023 | 21 | 2023 |
Contrastive Video Question Answering via Video Graph Transformer J Xiao, P Zhou, A Yao, Y Li, R Hong, S Yan, TS Chua IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI …, 2023 | 18 | 2023 |
Relation Understanding in Videos: A Grand Challenge Overview X Shang, J Xiao, D Di, TS Chua ACM International Conference on Multimedia (MM), 2652-2656, 2019 | 16 | 2019 |
Detection and tracking based tubelet generation for video object detection B Wang, S Tang, J Xiao, QF Yan, YD Zhang Journal of Visual Communication and Image Representation 58, 102-111, 2019 | 16 | 2019 |
Can I Trust Your Answer? Visually Grounded Video Question Answering J Xiao, A Yao, Y Li, TS Chua IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 13204 …, 2024 | 15 | 2024 |
VidVRD 2021: The Third Grand Challenge on Video Relation Detection W Ji, Y Li, M Wei, X Shang, J Xiao, T Ren, TS Chua ACM International Conference on Multimedia (MM), 4779-4783, 2021 | 12 | 2021 |
Discovering Spatio-Temporal Rationales for Video Question Answering Y Li, J Xiao, C Feng, X Wang, TS Chua IEEE/CVF International Conference on Computer Vision (ICCV), 13869-13878, 2023 | 6 | 2023 |
SoarGraph: Numerical reasoning over financial table-text data via semantic-oriented hierarchical graphs F Zhu, M Li, J Xiao, F Feng, C Wang, TS Chua Companion Proceedings of the ACM Web Conference 2023 (WWW), 1236-1244, 2023 | 6 | 2023 |
Joint Learning of Binary Classifiers and Pairwise Label Correlations for Multi-label Image Classification J Xiao, S Tang IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), 25-30, 2020 | 6 | 2020 |
Transformer-Empowered Invariant Grounding for Video Question Answering Y Li, X Wang, J Xiao, W Ji, TS Chua IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2023 | 4 | 2023 |
Abductive Ego-View Accident Video Understanding for Safe Driving Perception J Fang, L Li, J Zhou, J Xiao, H Yu, C Lv, J Xue, TS Chua IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 22030 …, 2024 | 2 | 2024 |