Inceptionnext: When inception meets convnext

Q Fan, H Huang, M Chen, H Liu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Abstract Vision Transformer (ViT) has gained increasing attention in the computer vision
community in recent years. However the core component of ViT Self-Attention lacks explicit …

被引用次数：61 相关文章所有 3 个版本

[PDF] arxiv.org

DenseNets reloaded: paradigm shift beyond ResNets and ViTs

D Kim, B Heo, D Han - European Conference on Computer Vision, 2025 - Springer

Abstract This paper revives Densely Connected Convolutional Networks (DenseNets) and
reveals the underrated effectiveness over predominant ResNet-style architectures. We …

被引用次数：9 相关文章所有 2 个版本

[PDF] arxiv.org

MambaOut: Do We Really Need Mamba for Vision?

W Yu, X Wang - arXiv preprint arXiv:2405.07992, 2024 - arxiv.org

Mamba, an architecture with RNN-like token mixer of state space model (SSM), was recently
introduced to address the quadratic complexity of the attention mechanism and …

被引用次数：42 相关文章所有 2 个版本

[PDF] thecvf.com

PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution

H Chen, X Chu, Y Ren, X Zhao… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recently some large kernel convnets strike back with appealing performance and efficiency.
However given the square complexity of convolution scaling up kernels can bring about an …

被引用次数：25 相关文章所有 3 个版本

[PDF] mdpi.com

Mixed receptive fields augmented YOLO with multi-path spatial pyramid pooling for steel surface defect detection

K Xia, Z Lv, C Zhou, G Gu, Z Zhao, K Liu, Z Li - Sensors, 2023 - mdpi.com

Aiming at the problems of low detection efficiency and poor detection accuracy caused by
texture feature interference and dramatic changes in the scale of defect on steel surfaces, an …

被引用次数：30 相关文章所有 7 个版本

[PDF] thecvf.com

Gramian Attention Heads are Strong yet Efficient Vision Learners

J Ryu, D Han, J Lim - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

We introduce a novel architecture design that enhances expressiveness by incorporating
multiple head classifiers (ie, classification heads) instead of relying on channel expansion or …

被引用次数：4 相关文章所有 5 个版本

[PDF] thecvf.com

Poly kernel inception network for remote sensing detection

X Cai, Q Lai, Y Wang, W Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Object detection in remote sensing images (RSIs) often suffers from several increasing
challenges including the large variation in object scales and the diverse-ranging context …

被引用次数：66 相关文章所有 3 个版本

[HTML] nih.gov

MLP-based classification of COVID-19 and skin diseases

R Zhang, L Wang, S Cheng, S Song - Expert Systems with Applications, 2023 - Elsevier

Recent years have witnessed a growing interest in neural network-based medical image
classification methods, which have demonstrated remarkable performance in this field …

被引用次数：14 相关文章所有 5 个版本

[PDF] nature.com

YOLOFM: an improved fire and smoke object detection algorithm based on YOLOv5n

X Geng, Y Su, X Cao, H Li, L Liu - Scientific Reports, 2024 - nature.com

To address the current difficulties in fire detection algorithms, including inadequate feature
extraction, excessive computational complexity, limited deployment on devices with limited …

被引用次数：20 相关文章所有 8 个版本

An efficient medical image classification network based on multi-branch CNN, token grouping Transformer and mixer MLP

S Liu, L Wang, W Yue - Applied Soft Computing, 2024 - Elsevier

In recent years, medical image classification techniques based on deep learning have made
remarkable achievements, but most of the current models sacrifice the efficiency of the …

被引用次数：18 相关文章