Space-time prompting for video class-incremental learning
Recently, prompt-based learning has made impressive progress on image class-
incremental learning, but it still lacks sufficient exploration in the video domain. In this paper …
incremental learning, but it still lacks sufficient exploration in the video domain. In this paper …
Learning deep multimodal feature representation with asymmetric multi-layer fusion
We propose a compact and effective framework to fuse multimodal features at multiple
layers in a single network. The framework consists of two innovative fusion schemes. Firstly …
layers in a single network. The framework consists of two innovative fusion schemes. Firstly …
Gcm: Efficient video recognition with glance and combine module
In this work, we present an efficient and powerful building block for video action recognition,
dubbed Glance and Combine Module (GCM). In order to obtain a broader perspective of the …
dubbed Glance and Combine Module (GCM). In order to obtain a broader perspective of the …
Local-global fusion network for video super-resolution
D Su, H Wang, L Jin, X Sun, X Peng - IEEE Access, 2020 - ieeexplore.ieee.org
The goal of video super-resolution technique is to address the problem of effectively
restoring high-resolution (HR) videos from low-resolution (LR) ones. Previous methods …
restoring high-resolution (HR) videos from low-resolution (LR) ones. Previous methods …
Dynamic normalization and relay for video action recognition
Abstract Convolutional Neural Networks (CNNs) have been the dominant model for video
action recognition. Due to the huge memory and compute demand, popular action …
action recognition. Due to the huge memory and compute demand, popular action …
Auto-X3D: Ultra-efficient video understanding via finer-grained neural architecture search
Efficient video architecture is the key to the deployment of video action recognition systems
on devices with limited computing capabilities. Unfortunately, existing video architectures …
on devices with limited computing capabilities. Unfortunately, existing video architectures …
Video content recognition method and apparatus, storage medium, and computer device
Y Li, B Ji, X Shi, K Bin - US Patent 11,983,926, 2024 - Google Patents
A video content recognition method is performed by a computer device, the method
including: obtaining an image feature corresponding to a video frame set extracted from a …
including: obtaining an image feature corresponding to a video frame set extracted from a …