Multimodal co-learning: Challenges, applications with datasets, recent advances and future directions

A Rahate, R Walambe, S Ramanna, K Kotecha - Information Fusion, 2022 - Elsevier
Multimodal deep learning systems that employ multiple modalities like text, image, audio,
video, etc., are showing better performance than individual modalities (ie, unimodal) …

A systematic review of robustness in deep learning for computer vision: Mind the gap?

N Drenkow, N Sani, I Shpitser, M Unberath - arXiv preprint arXiv …, 2021 - arxiv.org
Deep neural networks for computer vision are deployed in increasingly safety-critical and
socially-impactful applications, motivating the need to close the gap in model performance …

Efficient test-time model adaptation without forgetting

S Niu, J Wu, Y Zhang, Y Chen… - International …, 2022 - proceedings.mlr.press
Test-time adaptation provides an effective means of tackling the potential distribution shift
between model training and inference, by dynamically updating the model at test time. This …

The many faces of robustness: A critical analysis of out-of-distribution generalization

D Hendrycks, S Basart, N Mu… - Proceedings of the …, 2021 - openaccess.thecvf.com
We introduce four new real-world distribution shift datasets consisting of changes in image
style, image blurriness, geographic location, camera operation, and more. With our new …

MedViT: a robust vision transformer for generalized medical image classification

ON Manzari, H Ahmadabadi, H Kashiani… - Computers in Biology …, 2023 - Elsevier
Abstract Convolutional Neural Networks (CNNs) have advanced existing medical systems
for automatic disease diagnosis. However, there are still concerns about the reliability of …

Tent: Fully test-time adaptation by entropy minimization

D Wang, E Shelhamer, S Liu, B Olshausen… - arXiv preprint arXiv …, 2020 - arxiv.org
A model must adapt itself to generalize to new and different data during testing. In this
setting of fully test-time adaptation the model has only the test data and its own parameters …

Improving robustness against common corruptions by covariate shift adaptation

S Schneider, E Rusak, L Eck… - Advances in neural …, 2020 - proceedings.neurips.cc
Today's state-of-the-art machine vision models are vulnerable to image corruptions like
blurring or compression artefacts, limiting their performance in many real-world applications …

Back to the source: Diffusion-driven adaptation to test-time corruption

J Gao, J Zhang, X Liu, T Darrell… - Proceedings of the …, 2023 - openaccess.thecvf.com
Test-time adaptation harnesses test inputs to improve the accuracy of a model trained on
source data when tested on shifted target data. Most methods update the source model by …

3d common corruptions and data augmentation

OF Kar, T Yeo, A Atanov… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We introduce a set of image transformations that can be used as corruptions to evaluate the
robustness of models as well as data augmentation mechanisms for training neural …

Towards robust vision transformer

X Mao, G Qi, Y Chen, X Li, R Duan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract Recent advances on Vision Transformer (ViT) and its improved variants have
shown that self-attention-based networks surpass traditional Convolutional Neural Networks …