Weakly supervised dense event captioning in videos X Duan, W Huang, C Gan, J Wang, W Zhu, J Huang Advances in Neural Information Processing Systems 31, 2018 | 158 | 2018 |
Avqa: A dataset for audio-visual question answering on videos P Yang, X Wang, X Duan, H Chen, R Hou, C Jin, W Zhu Proceedings of the 30th ACM international conference on multimedia, 3480-3491, 2022 | 42 | 2022 |
Memor: A dataset for multimodal emotion reasoning in videos G Shen, X Wang, X Duan, H Li, W Zhu Proceedings of the 28th ACM international conference on multimedia, 493-502, 2020 | 34 | 2020 |
Disenbooth: Disentangled parameter-efficient tuning for subject-driven text-to-image generation H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu arXiv preprint arXiv:2305.03374 3, 2023 | 28 | 2023 |
Disenbooth: Identity-preserving disentangled tuning for subject-driven text-to-image generation H Chen, Y Zhang, S Wu, X Wang, X Duan, Y Zhou, W Zhu arXiv preprint arXiv:2305.03374, 2023 | 25 | 2023 |
STDMANet: Spatio-temporal differential multiscale attention network for small moving infrared target detection P Yan, R Hou, X Duan, C Yue, X Wang, X Cao IEEE transactions on geoscience and remote sensing 61, 1-16, 2023 | 19 | 2023 |
Learning-to-ask: Knowledge acquisition via 20 questions Y Chen, B Chen, X Duan, JG Lou, Y Wang, W Zhu, Y Cao Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018 | 17 | 2018 |
Curriculum-nas: Curriculum weight-sharing neural architecture search Y Zhou, X Wang, H Chen, X Duan, C Guan, W Zhu Proceedings of the 30th ACM International Conference on Multimedia, 6792-6801, 2022 | 12 | 2022 |
Dynamic spatio-temporal modular network for video question answering Z Qian, X Wang, X Duan, H Chen, W Zhu Proceedings of the 30th ACM International Conference on Multimedia, 4466-4477, 2022 | 11 | 2022 |
Deeplogic: Joint learning of neural perception and logical reasoning X Duan, X Wang, P Zhao, G Shen, W Zhu IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 4321-4334, 2022 | 10 | 2022 |
Multi-modal contextual graph neural network for text visual question answering Y Liang, X Wang, X Duan, W Zhu 2020 25th International Conference on Pattern Recognition (ICPR), 3491-3498, 2021 | 8 | 2021 |
Decouple before interact: Multi-modal prompt learning for continual visual question answering Z Qian, X Wang, X Duan, P Qin, Y Li, W Zhu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 7 | 2023 |
Watch, reason and code: Learning to represent videos using program X Duan, Q Wu, C Gan, Y Zhang, W Huang, A Van Den Hengel, W Zhu Proceedings of the 27th ACM International Conference on Multimedia, 1543-1551, 2019 | 7 | 2019 |
Curriculum-listener: Consistency-and complementarity-aware audio-enhanced temporal sentence grounding H Chen, X Wang, X Lan, H Chen, X Duan, J Jia, W Zhu Proceedings of the 31st ACM International Conference on Multimedia, 3117-3128, 2023 | 5 | 2023 |
DisenDreamer: Subject-Driven Text-to-Image Generation with Sample-aware Disentangled Tuning H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu IEEE Transactions on Circuits and Systems for Video Technology, 2024 | 4 | 2024 |
Intra-and Inter-Modal Curriculum for Multimodal Learning Y Zhou, X Wang, H Chen, X Duan, W Zhu Proceedings of the 31st ACM International Conference on Multimedia, 3724-3735, 2023 | 3 | 2023 |
Parametric visual program induction with function modularization X Duan, X Wang, Z Zhang, W Zhu International Conference on Machine Learning, 5643-5658, 2022 | 2 | 2022 |
H2V4Sports: Real-Time Horizontal-to-Vertical Video Converter for Sports Lives via Fast Object Detection and Tracking Y Han, K Li, Z Song, W Feng, X Cao, S Guo, X Wang, X Duan, W Zhu Proceedings of the 31st ACM International Conference on Multimedia, 9376-9378, 2023 | 1 | 2023 |
Unsupervised Image Sequence Registration and Enhancement for Infrared Small Target Detection R Hou, P Yan, X Duan, X Wang IEEE Transactions on Geoscience and Remote Sensing, 2024 | | 2024 |
Modularized parametric visual program induction algorithm, device, medium and product W Zhu, X Wang, D Xuguang US Patent App. 18/197,746, 2024 | | 2024 |