[HTML][HTML] Applying deep learning for breast cancer detection in radiology

E Mahoro, MA Akhloufi - Current Oncology, 2022 - mdpi.com
Recent advances in deep learning have enhanced medical imaging research. Breast cancer
is the most prevalent cancer among women, and many applications have been developed to …

[HTML][HTML] A review of deep learning-based visual multi-object tracking algorithms for autonomous driving

S Guo, S Wang, Z Yang, L Wang, H Zhang, P Guo… - Applied Sciences, 2022 - mdpi.com
Multi-target tracking, a high-level vision job in computer vision, is crucial to understanding
autonomous driving surroundings. Numerous top-notch multi-object tracking algorithms …

Uniformer: Unifying convolution and self-attention for visual recognition

K Li, Y Wang, J Zhang, P Gao, G Song… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
It is a challenging task to learn discriminative representation from images and videos, due to
large local redundancy and complex global dependency in these visual data. Convolution …

Enhancing multiscale representations with transformer for remote sensing image semantic segmentation

T Xiao, Y Liu, Y Huang, M Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Semantic segmentation is an extremely challenging task in high-resolution remote sensing
(HRRS) images as objects have complex spatial layouts and enormous variations in …

A crack-segmentation algorithm fusing transformers and convolutional neural networks for complex detection scenarios

C Xiang, J Guo, R Cao, L Deng - Automation in Construction, 2023 - Elsevier
The performance of crack segmentation is influenced by complex scenes, including
irregularly shaped cracks, complex image backgrounds, and limitations in acquiring global …

[HTML][HTML] Transformer-based decoder designs for semantic segmentation on remotely sensed images

T Panboonyuen, K Jitkajornwanich, S Lawawirojwong… - Remote Sensing, 2021 - mdpi.com
Transformers have demonstrated remarkable accomplishments in several natural language
processing (NLP) tasks as well as image processing tasks. Herein, we present a deep …

[HTML][HTML] RemainNet: explore road extraction from remote sensing image using mask image modeling

Z Li, H Chen, N Jing, J Li - Remote Sensing, 2023 - mdpi.com
Road extraction from a remote sensing image is a research hotspot due to its broad range of
applications. Despite recent advancements, achieving precise road extraction remains …

Morphmlp: An efficient mlp-like backbone for spatial-temporal representation learning

DJ Zhang, K Li, Y Wang, Y Chen, S Chandra… - … on Computer Vision, 2022 - Springer
Recently, MLP-Like networks have been revived for image recognition. However, whether it
is possible to build a generic MLP-Like architecture on video domain has not been explored …

SegTransConv: Transformer and CNN hybrid method for real-time semantic segmentation of autonomous vehicles

J Fan, B Gao, Q Ge, Y Ran, J Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Real-time and high-performance semantic segmentation is a crucial task in the scene
understanding of autonomous vehicles. This paper focuses on this issue and proposes a …

The effectiveness of T5, GPT-2, and BERT on text-to-image generation task

M Bahani, A El Ouaazizi, K Maalmi - Pattern Recognition Letters, 2023 - Elsevier
Abstract Text-to-image (T2I) generation, which involves synthesizing an image from a textual
description, has emerged as a popular research topic in computer vision. Meanwhile …