关注
Zerun Feng
标题
引用次数
引用次数
年份
Exploiting visual semantic reasoning for video-text retrieval
Z Feng, Z Zeng, C Guo, Z Li
arXiv preprint arXiv:2006.08889, 2020
372020
Multi-View Visual Semantic Embedding.
Z Li, C Guo, Z Feng, JN Hwang, X Xue
IJCAI 2 (6), 7, 2022
312022
Temporal multimodal graph transformer with global-local alignment for video-text retrieval
Z Feng, Z Zeng, C Guo, Z Li
IEEE Transactions on Circuits and Systems for Video Technology 33 (3), 1438-1453, 2022
182022
Integrating language guidance into image-text matching for correcting false negatives
Z Li, C Guo, Z Feng, JN Hwang, Z Du
IEEE Transactions on Multimedia 26, 103-116, 2023
92023
A novel convolutional architecture for video-text retrieval
Z Li, C Guo, B Yang, Z Feng, H Zhang
2020 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2020
92020
LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling
K Ma, X Zang, Z Feng, H Fang, C Ban, Y Wei, Z He, Y Li, H Sun
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
72023
Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching
Z Li, C Guo, X Wang, Z Feng, Z Du
arXiv preprint arXiv:2303.00181, 2023
32023
Alignment and generation adapter for efficient video-text understanding
H Fang, Z Yang, Y Wei, X Zang, C Ban, Z Feng, Z He, Y Li, H Sun
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
32023
Learning from noisy correspondence with tri-partition for cross-modal matching
Z Feng, Z Zeng, C Guo, Z Li, L Hu
IEEE Transactions on Multimedia, 2023
22023
Image-text retrieval with binary and continuous label supervision
Z Li, C Guo, Z Feng, JN Hwang, Y Jin, Y Zhang
arXiv preprint arXiv:2210.11319, 2022
22022
Unified loss of pair similarity optimization for vision-language retrieval
Z Li, C Guo, X Wang, Z Feng, JN Hwang, Z Du
arXiv preprint arXiv:2209.13869, 2022
22022
Spatial-Temporal Alignment via Optimal Transport for Video-Text Retrieval
Z Feng, Z Zeng, C Guo, Z Li, Y Zhang
2022 IEEE International Conference on Multimedia and Expo (ICME), 01-06, 2022
12022
ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
H Fang, X Zang, C Ban, Z Feng, L Zhou, Z He, Y Li, H Sun
arXiv preprint arXiv:2404.12216, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–13