Detecting moments and highlights in videos via natural language queries
Detecting customized moments and highlights from videos given natural language (NL) user
queries is an important but under-studied topic. One of the challenges in pursuing this …
queries is an important but under-studied topic. One of the challenges in pursuing this …
Bridging the gap: A unified video comprehension framework for moment retrieval and highlight detection
Abstract Video Moment Retrieval (MR) and Highlight Detection (HD) have attracted
significant attention due to the growing demand for video analysis. Recent approaches treat …
significant attention due to the growing demand for video analysis. Recent approaches treat …
Joint visual and audio learning for video highlight detection
In video highlight detection, the goal is to identify the interesting moments within an unedited
video. Although the audio component of the video provides important cues for highlight …
video. Although the audio component of the video provides important cues for highlight …
Correlation-guided query-dependency calibration in video representation learning for temporal grounding
Temporal Grounding is to identify specific moments or highlights from a video corresponding
to textual descriptions. Typical approaches in temporal grounding treat all video clips …
to textual descriptions. Typical approaches in temporal grounding treat all video clips …
Contrastive learning for unsupervised video highlight detection
Video highlight detection can greatly simplify video browsing, potentially paving the way for
a wide range of applications. Existing efforts are mostly fully-supervised, requiring humans …
a wide range of applications. Existing efforts are mostly fully-supervised, requiring humans …
Multiple pairwise ranking networks for personalized video summarization
In this paper, we investigate video summarization in the supervised setting. Since video
summarization is subjective to the preference of the end-user, the design of a unique model …
summarization is subjective to the preference of the end-user, the design of a unique model …
Learning pixel-level distinctions for video highlight detection
The goal of video highlight detection is to select the most attractive segments from a long
video to depict the most interesting parts of the video. Existing methods typically focus on …
video to depict the most interesting parts of the video. Existing methods typically focus on …
RealityReplay: Detecting and Replaying Temporal Changes In Situ Using Mixed Reality
H Cho, ML Komar, D Lindlbauer - Proceedings of the ACM on Interactive …, 2023 - dl.acm.org
Humans easily miss events in their surroundings due to limited short-term memory and field
of view. This happens, for example, while watching an instructor's machine repair …
of view. This happens, for example, while watching an instructor's machine repair …
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
In this paper, we explore the use of large language models (LLMs) to enhance video
moment retrieval (VMR) by integrating general knowledge and pseudo-events as priors. We …
moment retrieval (VMR) by integrating general knowledge and pseudo-events as priors. We …
Gaze estimation via modulation-based adaptive network with auxiliary self-learning
Given a face image, most of previous works in gaze estimation infer the gaze via a well-
trained model with supervised training. However, the distribution of test data may be very …
trained model with supervised training. However, the distribution of test data may be very …