A survey on video moment localization
Video moment localization, also known as video moment retrieval, aims to search a target
segment within a video described by a given natural language query. Beyond the task of …
Temporal sentence grounding in videos: A survey and future directions
Temporal sentence grounding in videos (TSGV), a.k.a. natural language video localization
(NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment that …
Weakly supervised temporal sentence grounding with gaussian-based contrastive proposal learning
Temporal sentence grounding aims to detect the most salient moment corresponding to the
natural language query from untrimmed videos. As labeling the temporal boundaries is labor …
You can ground earlier than see: An effective and efficient pipeline for temporal sentence grounding in compressed videos
Given an untrimmed video, temporal sentence grounding (TSG) aims to locate a target
moment semantically according to a sentence query. Although previous respectable works …
Are binary annotations sufficient? Video moment retrieval via hierarchical uncertainty-based active learning
Recent research on video moment retrieval has mostly focused on enhancing the
performance of accuracy, efficiency, and robustness, all of which largely rely on the …
Weakly supervised video moment localization with contrastive negative sample mining
Video moment localization aims at localizing the video segments which are most related to
the given free-form natural language query. The weakly supervised setting, where only …
Weakly supervised temporal sentence grounding with uncertainty-guided self-training
The task of weakly supervised temporal sentence grounding aims at finding the
corresponding temporal moments of a language description in the video, given video …
Cascaded prediction network via segment tree for temporal video grounding
Temporal video grounding aims to localize the target segment which is semantically aligned
with the given sentence in an untrimmed video. Existing methods can be divided into two …
Cross-sentence temporal and semantic relations in video activity localisation
Video activity localisation has recently attained increasing attention due to its practical
values in automatically localising the most salient visual segments corresponding to their …
Unsupervised temporal video grounding with deep semantic clustering
Temporal video grounding (TVG) aims to localize a target segment in a video according to a
given sentence query. Though respectable works have made decent achievements in this …