Multimodal Sentiment Analysis with Preferential Fusion and Distance-aware Contrastive Learning F Ma, Y Zhang, X Sun 2023 IEEE International Conference on Multimedia and Expo (ICME), 1367-1372, 2023 | 4 | 2023 |
A similarity alignment model for video copy segment matching Z Liu, F Ma, T Wang, F Rao arXiv preprint arXiv:2305.15679, 2023 | 3 | 2023 |
A dual-level detection method for video copy detection T Wang, F Ma, Z Liu, F Rao arXiv preprint arXiv:2305.12361, 2023 | 3 | 2023 |
Image captioning with multi-context synthetic data F Ma, Y Zhou, F Rao, Y Zhang, X Sun Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4089-4097, 2024 | 2 | 2024 |
Visual Perception by Large Language Model's Weights F Ma, H Xue, G Wang, Y Zhou, F Rao, S Yan, Y Zhang, S Wu, MZ Shou, ... arXiv preprint arXiv:2405.20339, 2024 | | 2024 |
Multi-Modal Generative Embedding Model F Ma, H Xue, G Wang, Y Zhou, F Rao, S Yan, Y Zhang, S Wu, MZ Shou, ... arXiv preprint arXiv:2405.19333, 2024 | | 2024 |
Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models F Ma, Y Zhou, Y Zhang, S Wu, Z Zhang, Z He, F Rao, X Sun Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |