Video summarization using deep neural networks: A survey

E Apostolidis, E Adamantidou, AI Metsai… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Video summarization technologies aim to create a concise and complete synopsis by
selecting the most informative parts of the video content. Several approaches have been …

Align and attend: Multimodal summarization with dual contrastive losses

B He, J Wang, J Qiu, T Bui… - Proceedings of the …, 2023 - openaccess.thecvf.com
The goal of multimodal summarization is to extract the most important information from
different modalities to form summaries. Unlike unimodal summarization, the multimodal …

CLIP-It! language-guided video summarization

M Narasimhan, A Rohrbach… - Advances in neural …, 2021 - proceedings.neurips.cc
A generic video summary is an abridged version of a video that conveys the whole story and
features the most important scenes. Yet the importance of scenes in a video is often …

DSNet: A flexible detect-to-summarize network for video summarization

W Zhu, J Lu, J Li, J Zhou - IEEE Transactions on Image …, 2020 - ieeexplore.ieee.org
In this paper, we propose a Detect-to-Summarize network (DSNet) framework for supervised
video summarization. Our DSNet contains anchor-based and anchor-free counterparts. The …

Reconstructive sequence-graph network for video summarization

B Zhao, H Li, X Lu, X Li - IEEE Transactions on Pattern Analysis …, 2021 - ieeexplore.ieee.org
Exploiting the inner-shot and inter-shot dependencies is essential for key-shot based video
summarization. Current approaches mainly devote to modeling the video as a frame …

TL;DW? Summarizing instructional videos with task relevance and cross-modal saliency

M Narasimhan, A Nagrani, C Sun, M Rubinstein… - … on Computer Vision, 2022 - Springer
YouTube users looking for instructions for a specific task may spend a long time browsing
content trying to find the right video that matches their needs. Creating a visual summary …

Video summarization through reinforcement learning with a 3D spatio-temporal U-Net

T Liu, Q Meng, JJ Huang, A Vlontzos… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Intelligent video summarization algorithms allow to quickly convey the most relevant
information in videos through the identification of the most essential and explanatory content …

Hierarchical multimodal transformer to summarize videos

B Zhao, M Gong, X Li - Neurocomputing, 2022 - Elsevier
Although video summarization has achieved tremendous success benefiting from Recurrent
Neural Networks (RNN), RNN-based methods neglect the global dependencies and multi …

Deep hierarchical LSTM networks with attention for video summarization

J Lin, S Zhong, A Fares - Computers & Electrical Engineering, 2022 - Elsevier
This paper studies the video summarization task by formulating it as a sequential decision-
making process, in which the input is a sequence of video frames and the output is a subset …

Summarizing videos using concentrated attention and considering the uniqueness and diversity of the video frames

E Apostolidis, G Balaouras, V Mezaris… - Proceedings of the 2022 …, 2022 - dl.acm.org
In this work, we describe a new method for unsupervised video summarization. To overcome
limitations of existing unsupervised video summarization approaches, that relate to the …