Jinfa Huang
University of Rochester, Peking University
Verified email at ur.rochester.edu - Homepage
Title
Cited by
Year
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
P Jin, J Huang, F Liu, X Wu, S Ge, G Song, D Clifton, J Chen
NeurIPS 2022, Spotlight, 30291-30306, 2022
Cited by: 41 | Year: 2022
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
P Jin, J Huang, P Xiong, S Tian, C Liu, X Ji, L Yuan, J Chen
CVPR 2023, Highlight, 2023
Cited by: 40 | Year: 2023
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
B Lin, Z Tang, Y Ye, J Cui, B Zhu, P Jin, J Huang, J Zhang, M Ning, ...
arXiv preprint arXiv:2401.15947, 2024
Cited by: 35 | Year: 2024
A Survey of Large Language Models in Medicine: Principles, Applications, and Challenges
H Zhou, F Liu, B Gu, X Zou, J Huang, J Wu, Y Li, SS Chen, P Zhou, J Liu, ...
arXiv preprint arXiv:2311.05112, 2023
Cited by: 35* | Year: 2023
Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual Question Answering
H Li, J Huang, P Jin, G Song, Q Wu, J Chen
IEEE Transactions on Image Processing 32, 3367-3382, 2023
Cited by: 28* | Year: 2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
P Jin, H Li, Z Cheng, J Huang, Z Wang, L Yuan, C Liu, J Chen
IJCAI 2023, 2023
Cited by: 18 | Year: 2023
Guoym at SemEval-2020 task 8: Ensemble-based Classification of Visuo-lingual Metaphor in Memes
Y Guo, J Huang, Y Dong, M Xu
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 1120-1125, 2020
Cited by: 15 | Year: 2020
GPT-4V(ision) as a Social Media Analysis Engine
H Lyu, J Huang, D Zhang, Y Yu, X Mou, J Pan, Z Yang, Z Wei, J Luo
arXiv preprint arXiv:2311.07547, 2023
Cited by: 14 | Year: 2023
Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs
J Wang, J Huang, C Zhang, Z Deng
ICRA 2023, 2023
Cited by: 4 | Year: 2023
LLMBind: A Unified Modality-Task Integration Framework
B Zhu, P Jin, M Ning, B Lin, J Huang, Q Song, M Pan, L Yuan
arXiv preprint arXiv:2402.14891, 2024
Cited by: 3 | Year: 2024
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
J Wang, C Zhang, J Huang, B Ren, Z Deng
ACMMM 2023, 2023
Cited by: 3 | Year: 2023
LDNN: Linguistic Knowledge Injectable Deep Neural Network for Group Cohesiveness Understanding
Y Wang, J Wu, J Huang, G Hattori, Y Takishima, S Wada, R Kimura, ...
Proceedings of the 2020 International Conference on Multimodal Interaction …, 2020
Cited by: 3 | Year: 2020
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
S Yuan, J Huang, Y Shi, Y Xu, R Zhu, B Lin, X Cheng, L Yuan, J Luo
arXiv preprint arXiv:2404.05014, 2024
Cited by: 2 | Year: 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach
S Zhang, J Huang, Q Zhou, Z Wang, F Wang, J Luo, J Yan
ICLR 2024, 2024
Cited by: 2 | Year: 2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
M Cao, H Tang, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, L Yuan, ...
ACL 2024 Findings, 2024
Year: 2024
Articles 1–15