Hierarchical question-image co-attention for visual question answering J Lu, J Yang, D Batra, D Parikh Advances in neural information processing systems 29, 2016 | 1950 | 2016 |
Vinvl: Revisiting visual representations in vision-language models P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 1063* | 2021 |
Joint unsupervised learning of deep representations and image clusters J Yang, D Parikh, D Batra Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016 | 968 | 2016 |
Graph R-CNN for Scene Graph Generation J Yang*, J Lu*, S Lee, D Batra, D Parikh arXiv preprint arXiv:1808.00191, 2018 | 936 | 2018 |
Grounding dino: Marrying dino with grounded pre-training for open-set object detection S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, C Li, J Yang, H Su, J Zhu, ... arXiv preprint arXiv:2303.05499, 2023 | 772 | 2023 |
Grounded language-image pre-training LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 766 | 2022 |
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 754 | 2021 |
Neural Baby Talk J Lu*, J Yang*, D Batra, D Parikh arXiv preprint arXiv:1803.09845, 2018 | 548 | 2018 |
Focal attention for long-range interactions in vision transformers J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao Advances in Neural Information Processing Systems 34, 30008-30022, 2021 | 537* | 2021 |
Learn convolutional neural network for face anti-spoofing J Yang, Z Lei, SZ Li arXiv preprint arXiv:1408.5601, 2014 | 533 | 2014 |
Regionclip: Region-based language-image pretraining Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 403 | 2022 |
Gligen: Open-set grounded text-to-image generation Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 368* | 2023 |
Multi-scale vision longformer: A new vision transformer for high-resolution image encoding P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 334 | 2021 |
Face liveness detection with component dependent descriptor J Yang, Z Lei, S Liao, SZ Li 2013 International Conference on Biometrics (ICB), 1-6, 2013 | 317 | 2013 |
Segment everything everywhere all at once X Zou*, J Yang*, H Zhang*, F Li*, L Li, J Wang, L Wang, J Gao, YJ Lee Advances in Neural Information Processing Systems 36, 2024 | 306 | 2024 |
Lr-gan: Layered recursive generative adversarial networks for image generation J Yang, A Kannan, D Batra, D Parikh arXiv preprint arXiv:1703.01560, 2017 | 281 | 2017 |
Dynamic detr: End-to-end object detection with dynamic attention X Dai, Y Chen, J Yang, P Zhang, L Yuan, L Zhang Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 251 | 2021 |
Llava-med: Training a large language-and-vision assistant for biomedicine in one day C Li, C Wong, S Zhang, N Usuyama, H Liu, J Yang, T Naumann, H Poon, ... Advances in Neural Information Processing Systems 36, 2024 | 239 | 2024 |
Efficient self-supervised vision transformers for representation learning C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai, L Yuan, J Gao arXiv preprint arXiv:2106.09785, 2021 | 210 | 2021 |
Unified contrastive learning in image-text-label space J Yang*, C Li*, P Zhang*, B Xiao*, C Liu, L Yuan, J Gao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 173 | 2022 |