MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval

F Shu, B Chen, Y Liao, J Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
We present a simple yet effective end-to-end Video-language Pre-training (VidLP)
framework, Masked Contrastive Video-language Pre-training (MAC), for video-text retrieval …