Self-supervised vision-language pretraining for medial visual question answering P Li, G Liu, L Tan, J Liao, S Zhong 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), 1-5, 2023 | 38 | 2023 |
Masked vision and language pre-training with unimodal and multimodal contrastive losses for medical visual question answering P Li, G Liu, J He, Z Zhao, S Zhong International Conference on Medical Image Computing and Computer-Assisted …, 2023 | 25 | 2023 |
Pefomed: Parameter efficient fine-tuning on multimodal large language models for medical visual question answering J He, P Li, G Liu, Z Zhao, S Zhong arXiv preprint arXiv:2401.02797, 2024 | 7 | 2024 |
Unified Transformer with Cross-Modal Mixture Experts for Remote-Sensing Visual Question Answering G Liu, J He, P Li, S Zhong, H Li, G He Remote Sensing 15 (19), 4682, 2023 | 2 | 2023 |
Cross-Modal Self-Supervised Vision Language Pre-training with Multiple Objectives for Medical Visual Question Answering G Liu, P Li, Z Zhao, J He, G He, S Zhong Authorea Preprints, 2023 | 1 | 2023 |
PERS: Parameter-Efficient Multi-modal Transfer Learning for Remote Sensing Visual Question Answering J He, G Liu, P Li, X Su, W Jiang, D Zhang, S Zhong IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024 | | 2024 |
RSMoDM: Multimodal Momentum Distillation Model for Remote Sensing Visual Question Answering P Li, G Liu, J He, X Meng, S Zhong, X Chen IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024 | | 2024 |
PECR: Parameter-Efficient Transfer Learning with Cross-Modal Representation Learning for Remote Sensing Visual Question Answering P Li, J He, G Liu, S Zhong ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Mlp-Based Vision Language Pre-Training for Medical Visual Question Answering J He, P Li, G Liu, J Liao, W Jiang, S Zhong Available at SSRN 4797404, 0 | | |