Trseg: Transformer for semantic segmentation

E Mahoro, MA Akhloufi - Current Oncology, 2022 - mdpi.com

Recent advances in deep learning have enhanced medical imaging research. Breast cancer
is the most prevalent cancer among women, and many applications have been developed to …

被引用次数：39 相关文章所有 5 个版本

[HTML] mdpi.com

[HTML][HTML] A review of deep learning-based visual multi-object tracking algorithms for autonomous driving

S Guo, S Wang, Z Yang, L Wang, H Zhang, P Guo… - Applied Sciences, 2022 - mdpi.com

Multi-target tracking, a high-level vision job in computer vision, is crucial to understanding
autonomous driving surroundings. Numerous top-notch multi-object tracking algorithms …

被引用次数：34 相关文章所有 6 个版本

[PDF] arxiv.org

Uniformer: Unifying convolution and self-attention for visual recognition

K Li, Y Wang, J Zhang, P Gao, G Song… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

It is a challenging task to learn discriminative representation from images and videos, due to
large local redundancy and complex global dependency in these visual data. Convolution …

被引用次数：263 相关文章所有 6 个版本

Enhancing multiscale representations with transformer for remote sensing image semantic segmentation

T Xiao, Y Liu, Y Huang, M Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Semantic segmentation is an extremely challenging task in high-resolution remote sensing
(HRRS) images as objects have complex spatial layouts and enormous variations in …

被引用次数：40 相关文章所有 4 个版本

A crack-segmentation algorithm fusing transformers and convolutional neural networks for complex detection scenarios

C Xiang, J Guo, R Cao, L Deng - Automation in Construction, 2023 - Elsevier

The performance of crack segmentation is influenced by complex scenes, including
irregularly shaped cracks, complex image backgrounds, and limitations in acquiring global …

被引用次数：28 相关文章

[HTML] mdpi.com

[HTML][HTML] Transformer-based decoder designs for semantic segmentation on remotely sensed images

T Panboonyuen, K Jitkajornwanich, S Lawawirojwong… - Remote Sensing, 2021 - mdpi.com

Transformers have demonstrated remarkable accomplishments in several natural language
processing (NLP) tasks as well as image processing tasks. Herein, we present a deep …

被引用次数：44 相关文章所有 8 个版本

[HTML] mdpi.com

[HTML][HTML] RemainNet: explore road extraction from remote sensing image using mask image modeling

Z Li, H Chen, N Jing, J Li - Remote Sensing, 2023 - mdpi.com

Road extraction from a remote sensing image is a research hotspot due to its broad range of
applications. Despite recent advancements, achieving precise road extraction remains …

被引用次数：7 相关文章所有 4 个版本

[PDF] arxiv.org

Morphmlp: An efficient mlp-like backbone for spatial-temporal representation learning

DJ Zhang, K Li, Y Wang, Y Chen, S Chandra… - … on Computer Vision, 2022 - Springer

Recently, MLP-Like networks have been revived for image recognition. However, whether it
is possible to build a generic MLP-Like architecture on video domain has not been explored …

被引用次数：30 相关文章所有 4 个版本

SegTransConv: Transformer and CNN hybrid method for real-time semantic segmentation of autonomous vehicles

J Fan, B Gao, Q Ge, Y Ran, J Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Real-time and high-performance semantic segmentation is a crucial task in the scene
understanding of autonomous vehicles. This paper focuses on this issue and proposes a …

被引用次数：9 相关文章所有 3 个版本

[PDF] academia.edu

The effectiveness of T5, GPT-2, and BERT on text-to-image generation task

M Bahani, A El Ouaazizi, K Maalmi - Pattern Recognition Letters, 2023 - Elsevier

Abstract Text-to-image (T2I) generation, which involves synthesizing an image from a textual
description, has emerged as a popular research topic in computer vision. Meanwhile …

被引用次数：10 相关文章所有 4 个版本