CSTA: CNN-based Spatiotemporal Attention for Video Summarization

J Son, J Park, K Kim - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Video summarization aims to generate a concise representation of a video capturing its
essential content and key moments while reducing its overall length. Although several …

[HTML][HTML] Sustainable Personalized E-Learning through Integrated Cross-Course Learning Path Planning

Q Xiao, YW Zhang, XQ Xin, LW Cai - Sustainability, 2024 - mdpi.com
This study addresses the growing need for sustainable and personalized learning solutions
in online education by optimizing cross-course learning paths. With the increasing volume of …

A deep audio-visual model for efficient dynamic video summarization

G El-Nagar, A El-Sawy, M Rashad - Journal of Visual Communication and …, 2024 - Elsevier
The adage “a picture is worth a thousand words” resonates in the digital video domain,
suggesting that a video could be seen as a composition of millions of these words. Videos …

Dynamic video summarisation using stacked encoder-decoder architecture with residual learning network

M Dhanushree, R Priya, P Aruna… - … Journal of Intelligent …, 2024 - inderscienceonline.com
In the past decade, video summarisation has emerged as one of the most challenging
research fields in video understanding. Video summarisation is abstracting an original video …

Video Summarization using Denoising Diffusion Probabilistic Model

Z Shang, Y Zhu, H Li, X Wu - arXiv preprint arXiv:2412.08357, 2024 - arxiv.org
Video summarization aims to eliminate visual redundancy while retaining key parts of video
to construct concise and comprehensive synopses. Most existing methods use …

Deep Residual Network Video Summarization for Face Detection and Person Re-Identification

S babu Veesam, AR Satish - 2023 International Conference on …, 2023 - ieeexplore.ieee.org
The surge in the adoption of video-based applications can be attributed to the growing
accessibility of video data. Videos, in particular, serve as extensive, multimodal records of …