Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations P Jin, J Huang, F Liu, X Wu, S Ge, G Song, D Clifton, J Chen NeurIPS 2022, Spotlight, 30291-30306, 2022 | 41 | 2022 |
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning P Jin, J Huang, P Xiong, S Tian, C Liu, X Ji, L Yuan, J Chen CVPR 2023, Highlight, 2023 | 40 | 2023 |
MoE-LLaVA: Mixture of experts for large vision-language models B Lin, Z Tang, Y Ye, J Cui, B Zhu, P Jin, J Huang, J Zhang, M Ning, ... arXiv preprint arXiv:2401.15947, 2024 | 35 | 2024 |
A Survey of Large Language Models in Medicine: Principles, Applications, and Challenges H Zhou, F Liu, B Gu, X Zou, J Huang, J Wu, Y Li, SS Chen, P Zhou, J Liu, ... arXiv preprint arXiv:2311.05112, 2023 | 35* | 2023 |
Weakly-supervised 3D spatial reasoning for text-based visual question answering H Li, J Huang, P Jin, G Song, Q Wu, J Chen IEEE Transactions on Image Processing 32, 3367-3382, 2023 | 28* | 2023 |
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment P Jin, H Li, Z Cheng, J Huang, Z Wang, L Yuan, C Liu, J Chen IJCAI 2023, 2023 | 18 | 2023 |
Guoym at SemEval-2020 Task 8: Ensemble-based Classification of Visuo-lingual Metaphor in Memes Y Guo, J Huang, Y Dong, M Xu Proceedings of the Fourteenth Workshop on Semantic Evaluation, 1120-1125, 2020 | 15 | 2020 |
GPT-4V(ision) as a Social Media Analysis Engine H Lyu, J Huang, D Zhang, Y Yu, X Mou, J Pan, Z Yang, Z Wei, J Luo arXiv preprint arXiv:2311.07547, 2023 | 14 | 2023 |
Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs J Wang, J Huang, C Zhang, Z Deng ICRA 2023, 2023 | 4 | 2023 |
LLMBind: A unified modality-task integration framework B Zhu, P Jin, M Ning, B Lin, J Huang, Q Song, M Pan, L Yuan arXiv preprint arXiv:2402.14891, 2024 | 3 | 2024 |
Improving Scene Graph Generation with Superpixel-Based Interaction Learning J Wang, C Zhang, J Huang, B Ren, Z Deng ACMMM 2023, 2023 | 3 | 2023 |
LDNN: Linguistic Knowledge Injectable Deep Neural Network for Group Cohesiveness Understanding Y Wang, J Wu, J Huang, G Hattori, Y Takishima, S Wada, R Kimura, ... Proceedings of the 2020 International Conference on Multimodal Interaction …, 2020 | 3 | 2020 |
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators S Yuan, J Huang, Y Shi, Y Xu, R Zhu, B Lin, X Cheng, L Yuan, J Luo arXiv preprint arXiv:2404.05014, 2024 | 2 | 2024 |
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach S Zhang, J Huang, Q Zhou, Z Wang, F Wang, J Luo, J Yan ICLR 2024, 2024 | 2 | 2024 |
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter M Cao, H Tang, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, L Yuan, ... ACL 2024 Findings, 2024 | | 2024 |