CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models P Xia, Z Chen, J Tian, Y Gong, R Hou, Y Xu, Z Wu, Z Fan, Y Zhou, K Zhu, ... NeurIPS 2024, 2024 | 17 | 2024 |
NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding M Hu, L Wang, S Yan, D Ma, Q Ren, P Xia, W Feng, P Duan, L Ju, Z Ge NeurIPS 2023, 2023 | 8 | 2023 |
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models P Xia, K Zhu, H Li, H Zhu, Y Li, G Li, L Zhang, H Yao EMNLP 2024 Main, 2024 | 7 | 2024 |
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding M Hu*, P Xia*, L Wang*, S Yan, F Tang, Z Xu, Y Luo, K Song, J Leitner, ... ECCV 2024, 2024 | 6 | 2024 |
LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-Tailed Multi-Label Visual Recognition P Xia, D Xu, M Hu, L Ju, Z Ge ACLW 2024, 2023 | 6 | 2023 |
Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations P Xia, M Hu, F Tang, W Li, W Zheng, L Ju, P Duan, H Yao, Z Ge MICCAI 2024 (Early Accept), 2024 | 5 | 2024 |
HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding P Xia, X Yu, M Hu, L Ju, Z Wang, P Duan, Z Ge COLING 2025, 2023 | 5 | 2023 |
TP-DRSeg: Improving Diabetic Retinopathy Lesion Segmentation with Explicit Text-Prompts Assisted SAM W Li, X Xiong, P Xia, L Ju, Z Ge MICCAI 2024, 2024 | 4 | 2024 |
Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification M Hu, S Yan, P Xia, F Tang, W Li, P Duan, L Zhang, Z Ge arXiv preprint arXiv:2405.11289, 2024 | 2 | 2024 |
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models P Xia, K Zhu, H Li, T Wang, W Shi, S Wang, L Zhang, J Zou, H Yao arXiv preprint arXiv:2410.13085, 2024 | 1 | 2024 |
Chinese Grammatical Error Correction Based on Knowledge Distillation P Xia, Y Zhou, Z Zhang, Z Tang, J Li Technical Report, 2022 | 1 | 2022 |
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models P Xia, S Han, S Qiu, Y Zhou, Z Wang, W Zheng, Z Chen, C Cui, M Ding, ... arXiv preprint arXiv:2410.10139, 2024 | | 2024 |
Explore Vision-Language Model with Hierarchical Information for Multiple Retinal Disease Recognition L Ju, Y Zhou, P Xia, D Alexander, PA Keane, Z Ge Investigative Ophthalmology & Visual Science 65 (7), 1593-1593, 2024 | | 2024 |