SIL-Net: A Semi-Isotropic L-shaped network for dermoscopic image segmentation

Z Zhang, Y Jiang, H Qiao, M Wang, W Yan… - Computers in Biology and …, 2022 - Elsevier
Background: Dermoscopic image segmentation using deep learning algorithms is a critical
technology for skin cancer detection and therapy. Specifically, this technology is a spatially …

[HTML][HTML] Multi-granularity feature aggregation with self-attention and spatial reasoning for fine-grained crop disease classification

X Zuo, J Chu, J Shen, J Sun - Agriculture, 2022 - mdpi.com
Combining disease categories and crop species leads to complex intra-class and inter-class
differences. Significant intra-class difference and subtle inter-class difference pose a great …

A GCN-based fast CU partition method of intra-mode VVC

S Zhang, S Feng, J Chen, C Zhou, F Yang - Journal of Visual …, 2022 - Elsevier
In this paper, a global convolutional network (GCN)-based fast coding unit (CU) partition
method of intra-mode VVC is proposed. By using the GCN module with large kernel size …

QbyE-MLPMixer: query-by-example open-vocabulary keyword spotting using MLPMixer

J Huang, W Gharbieh, Q Wan, HS Shim… - arXiv preprint arXiv …, 2022 - arxiv.org
Current keyword spotting systems are typically trained with a large amount of pre-defined
keywords. Recognizing keywords in an open-vocabulary setting is essential for …

Convexifying transformers: Improving optimization and understanding of transformer networks

T Ergen, B Neyshabur, H Mehta - arXiv preprint arXiv:2211.11052, 2022 - arxiv.org
Understanding the fundamental mechanism behind the success of transformer networks is
still an open problem in the deep learning literature. Although their remarkable performance …

Research trends and applications of data augmentation algorithms

J Fonseca, F Bacao - arXiv preprint arXiv:2207.08817, 2022 - arxiv.org
In the Machine Learning research community, there is a consensus regarding the
relationship between model complexity and the required amount of data and computation …

R2-MLP: Round-roll MLP for multi-view 3D object recognition

S Chen, T Yu, P Li - arXiv preprint arXiv:2211.11085, 2022 - arxiv.org
Recently, vision architectures based exclusively on multi-layer perceptrons (MLPs) have
gained much attention in the computer vision community. MLP-like models achieve …

[HTML][HTML] Medical image segmentation model based on triple gate MultiLayer perceptron

J Yan, X Wang, J Cai, Q Qin, H Yang, Q Wang… - Scientific Reports, 2022 - nature.com
To alleviate the social contradiction between limited medical resources and increasing
medical needs, the medical image-assisted diagnosis based on deep learning has become …

Analysis of quantization on mlp-based vision models

L Zhao, Z Dong, K Keutzer - arXiv preprint arXiv:2209.06383, 2022 - arxiv.org
Quantization is wildly taken as a model compression technique, which obtains efficient
models by converting floating-point weights and activations in the neural network into lower …

Parameterization of cross-token relations with relative positional encoding for vision MLP

Z Wang, Y Hao, X Gao, H Zhang, S Wang… - Proceedings of the 30th …, 2022 - dl.acm.org
Vision multi-layer perceptrons (MLPs) have shown promising performance in computer
vision tasks, and become the main competitor of CNNs and vision Transformers. They use …