- 学术资源搜索

Multimodal co-learning: Challenges, applications with datasets, recent advances and future directions

A Rahate, R Walambe, S Ramanna, K Kotecha - Information Fusion, 2022 - Elsevier

Multimodal deep learning systems that employ multiple modalities like text, image, audio,
video, etc., are showing better performance than individual modalities (ie, unimodal) …

被引用次数：143 相关文章所有 4 个版本

[PDF] arxiv.org

A systematic review of robustness in deep learning for computer vision: Mind the gap?

N Drenkow, N Sani, I Shpitser, M Unberath - arXiv preprint arXiv …, 2021 - arxiv.org

Deep neural networks for computer vision are deployed in increasingly safety-critical and
socially-impactful applications, motivating the need to close the gap in model performance …

被引用次数：102 相关文章所有 3 个版本

[PDF] mlr.press

Efficient test-time model adaptation without forgetting

S Niu, J Wu, Y Zhang, Y Chen… - International …, 2022 - proceedings.mlr.press

Test-time adaptation provides an effective means of tackling the potential distribution shift
between model training and inference, by dynamically updating the model at test time. This …

被引用次数：306 相关文章所有 6 个版本

[PDF] thecvf.com

The many faces of robustness: A critical analysis of out-of-distribution generalization

D Hendrycks, S Basart, N Mu… - Proceedings of the …, 2021 - openaccess.thecvf.com

We introduce four new real-world distribution shift datasets consisting of changes in image
style, image blurriness, geographic location, camera operation, and more. With our new …

被引用次数：1707 相关文章所有 7 个版本

[PDF] arxiv.org

MedViT: a robust vision transformer for generalized medical image classification

ON Manzari, H Ahmadabadi, H Kashiani… - Computers in Biology …, 2023 - Elsevier

Abstract Convolutional Neural Networks (CNNs) have advanced existing medical systems
for automatic disease diagnosis. However, there are still concerns about the reliability of …

被引用次数：138 相关文章所有 5 个版本

[PDF] arxiv.org

Tent: Fully test-time adaptation by entropy minimization

D Wang, E Shelhamer, S Liu, B Olshausen… - arXiv preprint arXiv …, 2020 - arxiv.org

A model must adapt itself to generalize to new and different data during testing. In this
setting of fully test-time adaptation the model has only the test data and its own parameters …

被引用次数：1148 相关文章所有 7 个版本

[PDF] neurips.cc

Improving robustness against common corruptions by covariate shift adaptation

S Schneider, E Rusak, L Eck… - Advances in neural …, 2020 - proceedings.neurips.cc

Today's state-of-the-art machine vision models are vulnerable to image corruptions like
blurring or compression artefacts, limiting their performance in many real-world applications …

被引用次数：511 相关文章所有 11 个版本

[PDF] thecvf.com

Back to the source: Diffusion-driven adaptation to test-time corruption

J Gao, J Zhang, X Liu, T Darrell… - Proceedings of the …, 2023 - openaccess.thecvf.com

Test-time adaptation harnesses test inputs to improve the accuracy of a model trained on
source data when tested on shifted target data. Most methods update the source model by …

被引用次数：99 相关文章所有 6 个版本

[PDF] thecvf.com

3d common corruptions and data augmentation

OF Kar, T Yeo, A Atanov… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

We introduce a set of image transformations that can be used as corruptions to evaluate the
robustness of models as well as data augmentation mechanisms for training neural …

被引用次数：118 相关文章所有 6 个版本

[PDF] thecvf.com

Towards robust vision transformer

X Mao, G Qi, Y Chen, X Li, R Duan… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract Recent advances on Vision Transformer (ViT) and its improved variants have
shown that self-attention-based networks surpass traditional Convolutional Neural Networks …

被引用次数：217 相关文章所有 8 个版本