Detecting moments and highlights in videos via natural language queries

J Lei, TL Berg, M Bansal - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Detecting customized moments and highlights from videos given natural language (NL) user
queries is an important but under-studied topic. One of the challenges in pursuing this …

Bridging the gap: A unified video comprehension framework for moment retrieval and highlight detection

Y Xiao, Z Luo, Y Liu, Y Ma, H Bian… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Video Moment Retrieval (MR) and Highlight Detection (HD) have attracted
significant attention due to the growing demand for video analysis. Recent approaches treat …

Joint visual and audio learning for video highlight detection

T Badamdorj, M Rochan, Y Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In video highlight detection, the goal is to identify the interesting moments within an unedited
video. Although the audio component of the video provides important cues for highlight …

Correlation-guided query-dependency calibration in video representation learning for temporal grounding

WJ Moon, S Hyun, SB Lee, JP Heo - CoRR, 2023 - openreview.net
Temporal Grounding is to identify specific moments or highlights from a video corresponding
to textual descriptions. Typical approaches in temporal grounding treat all video clips …

Contrastive learning for unsupervised video highlight detection

T Badamdorj, M Rochan, Y Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Video highlight detection can greatly simplify video browsing, potentially paving the way for
a wide range of applications. Existing efforts are mostly fully-supervised, requiring humans …

Multiple pairwise ranking networks for personalized video summarization

Y Saquil, D Chen, Y He, C Li… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In this paper, we investigate video summarization in the supervised setting. Since video
summarization is subjective to the preference of the end-user, the design of a unique model …

Learning pixel-level distinctions for video highlight detection

F Wei, B Wang, T Ge, Y Jiang, W Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
The goal of video highlight detection is to select the most attractive segments from a long
video to depict the most interesting parts of the video. Existing methods typically focus on …

RealityReplay: Detecting and Replaying Temporal Changes In Situ Using Mixed Reality

H Cho, ML Komar, D Lindlbauer - Proceedings of the ACM on Interactive …, 2023 - dl.acm.org
Humans easily miss events in their surroundings due to limited short-term memory and field
of view. This happens, for example, while watching an instructor's machine repair …

Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval

Y Jiang, W Zhang, X Zhang, XY Wei… - Proceedings of the 32nd …, 2024 - dl.acm.org
In this paper, we explore the use of large language models (LLMs) to enhance video
moment retrieval (VMR) by integrating general knowledge and pseudo-events as priors. We …

Gaze estimation via modulation-based adaptive network with auxiliary self-learning

Y Wu, G Li, Z Liu, M Huang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Given a face image, most of previous works in gaze estimation infer the gaze via a well-
trained model with supervised training. However, the distribution of test data may be very …