Advances in medical image analysis with vision transformers: a comprehensive review
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …
has recently also triggered broad interest in Computer Vision. Among other merits …
Star-transformer: a spatio-temporal cross attention transformer for human action recognition
In action recognition, although the combination of spatio-temporal videos and skeleton
features can improve the recognition performance, a separate model and balancing feature …
features can improve the recognition performance, a separate model and balancing feature …
Swinmm: masked multi-view with swin transformers for 3d medical image segmentation
Recent advancements in large-scale Vision Transformers have made significant strides in
improving pre-trained models for medical image segmentation. However, these methods …
improving pre-trained models for medical image segmentation. However, these methods …
A survey of neural trees
Neural networks (NNs) and decision trees (DTs) are both popular models of machine
learning, yet coming with mutually exclusive advantages and limitations. To bring the best of …
learning, yet coming with mutually exclusive advantages and limitations. To bring the best of …
Fine-grained visual classification with high-temperature refinement and background suppression
Fine-grained visual classification is a challenging task due to the high similarity between
categories and distinct differences among data within one single category. To address the …
categories and distinct differences among data within one single category. To address the …
Protopformer: Concentrating on prototypical parts in vision transformers for interpretable image recognition
Prototypical part network (ProtoPNet) has drawn wide attention and boosted many follow-up
studies due to its self-explanatory property for explainable artificial intelligence (XAI) …
studies due to its self-explanatory property for explainable artificial intelligence (XAI) …
Transformers pay attention to convolutions leveraging emerging properties of ViTs by dual attention-image network
Although purely transformer-based architectures pretrained on large datasets are introduced
as foundation models for general computer vision tasks, hybrid models that incorporate …
as foundation models for general computer vision tasks, hybrid models that incorporate …
Learning support and trivial prototypes for interpretable image classification
Prototypical part network (ProtoPNet) methods have been designed to achieve interpretable
classification by associating predictions with a set of training prototypes, which we refer to as …
classification by associating predictions with a set of training prototypes, which we refer to as …
Pixel-grounded prototypical part networks
Prototypical part neural networks (ProtoPartNNs), namely ProtoPNet and its derivatives, are
an intrinsically interpretable approach to machine learning. Their prototype learning scheme …
an intrinsically interpretable approach to machine learning. Their prototype learning scheme …
FET-FGVC: Feature-enhanced transformer for fine-grained visual classification
The challenge of Fine-grained visual classification (FGVC) comes from the small variations
between classes and the large variations within classes. Inspired by the fact that identifying …
between classes and the large variations within classes. Inspired by the fact that identifying …