LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark Z Yin, J Wang, J Cao, Z Shi, D Liu, M Li, L Sheng, L Bai, X Huang, Z Wang, ... NeurIPS2023, 2023 | 100 | 2023 |
Bilateral cross-modality graph matching attention for feature fusion in visual question answering J Cao, X Qin, S Zhao, J Shen IEEE Transactions on Neural Networks and Learning Systems, 2022 | 23 | 2022 |
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer J Cao, P Ye, S Li, C Yu, Y Tang, J Lu, T Chen CVPR2024, 2024 | 4 | 2024 |
A2S-NAS: Asymmetric Spectral-Spatial Neural Architecture Search for Hyperspectral Image Classification L Zhan, J Fan, P Ye, J Cao ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation S Li, J Cao, P Ye, Y Ding, C Tu, T Chen arXiv preprint arXiv:2401.12665, 2024 | 3 | 2024 |
Δ-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers P Chen, M Shen, P Ye, J Cao, C Tu, CS Bouganis, Y Zhao, T Chen arXiv preprint arXiv:2406.01125, 2024 | 2 | 2024 |
JNDMix: JND-based data augmentation for no-reference image quality assessment J Sheng, J Fan, P Ye, J Cao ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Collaborative Position Reasoning Network for Referring Image Segmentation J Cao, B Dai, Y Li, X Qin, J Wang arXiv preprint arXiv:2401.11775, 2024 | | 2024 |