Space-time prompting for video class-incremental learning

Y Pei, Z Qing, S Zhang, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, prompt-based learning has made impressive progress on image class-
incremental learning, but it still lacks sufficient exploration in the video domain. In this paper …

Learning deep multimodal feature representation with asymmetric multi-layer fusion

Y Wang, F Sun, M Lu, A Yao - Proceedings of the 28th ACM International …, 2020 - dl.acm.org
We propose a compact and effective framework to fuse multimodal features at multiple
layers in a single network. The framework consists of two innovative fusion schemes. Firstly …

GCM: Efficient video recognition with glance and combine module

Y Zhou, Z Huang, X Yang, M Ang, TK Ng - Pattern Recognition, 2023 - Elsevier
In this work, we present an efficient and powerful building block for video action recognition,
dubbed Glance and Combine Module (GCM). In order to obtain a broader perspective of the …

Local-global fusion network for video super-resolution

D Su, H Wang, L Jin, X Sun, X Peng - IEEE Access, 2020 - ieeexplore.ieee.org
The goal of video super-resolution technique is to address the problem of effectively
restoring high-resolution (HR) videos from low-resolution (LR) ones. Previous methods …

Dynamic normalization and relay for video action recognition

D Cai, A Yao, Y Chen - Advances in neural information …, 2021 - proceedings.neurips.cc
Convolutional Neural Networks (CNNs) have been the dominant model for video
action recognition. Due to the huge memory and compute demand, popular action …

Auto-X3D: Ultra-efficient video understanding via finer-grained neural architecture search

Y Jiang, X Gong, J Wu, H Shi… - Proceedings of the …, 2022 - openaccess.thecvf.com
Efficient video architecture is the key to the deployment of video action recognition systems
on devices with limited computing capabilities. Unfortunately, existing video architectures …

Video content recognition method and apparatus, storage medium, and computer device

Y Li, B Ji, X Shi, K Bin - US Patent 11,983,926, 2024 - Google Patents
A video content recognition method is performed by a computer device, the method
including: obtaining an image feature corresponding to a video frame set extracted from a …