Learning efficient video representation with video shuffle networks

Y Pei, Z Qing, S Zhang, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recently, prompt-based learning has made impressive progress on image class-
incremental learning, but it still lacks sufficient exploration in the video domain. In this paper …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Learning deep multimodal feature representation with asymmetric multi-layer fusion

Y Wang, F Sun, M Lu, A Yao - Proceedings of the 28th ACM International …, 2020 - dl.acm.org

We propose a compact and effective framework to fuse multimodal features at multiple
layers in a single network. The framework consists of two innovative fusion schemes. Firstly …

被引用次数：42 相关文章所有 5 个版本

Gcm: Efficient video recognition with glance and combine module

Y Zhou, Z Huang, X Yang, M Ang, TK Ng - Pattern Recognition, 2023 - Elsevier

In this work, we present an efficient and powerful building block for video action recognition,
dubbed Glance and Combine Module (GCM). In order to obtain a broader perspective of the …

被引用次数：9 相关文章所有 3 个版本

[PDF] ieee.org

Local-global fusion network for video super-resolution

D Su, H Wang, L Jin, X Sun, X Peng - IEEE Access, 2020 - ieeexplore.ieee.org

The goal of video super-resolution technique is to address the problem of effectively
restoring high-resolution (HR) videos from low-resolution (LR) ones. Previous methods …

被引用次数：12 相关文章所有 2 个版本

[PDF] neurips.cc

Dynamic normalization and relay for video action recognition

D Cai, A Yao, Y Chen - Advances in neural information …, 2021 - proceedings.neurips.cc

Abstract Convolutional Neural Networks (CNNs) have been the dominant model for video
action recognition. Due to the huge memory and compute demand, popular action …

被引用次数：5 相关文章所有 6 个版本

[PDF] thecvf.com

Auto-X3D: Ultra-efficient video understanding via finer-grained neural architecture search

Y Jiang, X Gong, J Wu, H Shi… - Proceedings of the …, 2022 - openaccess.thecvf.com

Efficient video architecture is the key to the deployment of video action recognition systems
on devices with limited computing capabilities. Unfortunately, existing video architectures …

被引用次数：2 相关文章所有 7 个版本

[PDF] googleapis.com

Video content recognition method and apparatus, storage medium, and computer device

Y Li, B Ji, X Shi, K Bin - US Patent 11,983,926, 2024 - Google Patents

A video content recognition method is performed by a computer device, the method
including: obtaining an image feature corresponding to a video frame set extracted from a …