Attack to explain deep representation- 学术资源搜索

Attack to explain deep representation

MAAK Jalwana, N Akhtar… - Proceedings of the …, 2020 - openaccess.thecvf.com

… deep models in terms of salient visual features for class labels and highlighting the alignment
of deep representation with human perception, our attack … the first attack on deep learning …

被引用次数：14 相关文章所有 5 个版本

[PDF] arxiv.org

Attack to fool and explain deep networks

N Akhtar, MAAK Jalwana… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

… 5.2 Explanation Experiments Preliminary results of model interpretation through our attack
to explain deep representation were presented in CVPR2020 [18]. Setup: Our setup for these …

被引用次数：35 相关文章所有 10 个版本

[PDF] thecvf.com

Exploiting explanations for model inversion attacks

X Zhao, W Zhang, X Xiao, B Lim - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

… risks for privacy attacks. Hence, providing explanation harms privacy. We study this risk
for image-based model inversion attacks and identified several attack architectures with …

被引用次数：99 相关文章所有 5 个版本

[PDF] springer.com

Unveiling the Anatomy of Adversarial Attacks: Concept-Based XAI Dissection of CNNs

G Mikriukov, G Schwalbe, F Motzkus… - World Conference on …, 2024 - Springer

… In these layers, we assess the impact of adversarial attacks on the internal representations
… To assess this, we evaluate the cosine similarities between attacked and non-attacked …

Towards explainable model extraction attacks

A Yan, R Hou, X Liu, H Yan, T Huang… - International Journal of …, 2022 - Wiley Online Library

… layer to construct a general and localizable deep representation. … of representation-guided
reveals more privacy. In view of different attack datasets, we analyze that even if the attack …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

The Anatomy of Adversarial Attacks: Concept-based XAI Dissection

G Mikriukov, G Schwalbe, F Motzkus… - arXiv preprint arXiv …, 2024 - arxiv.org

Representation Quality Explain Adversarial Attacks

DV Vargas, S Kotyan, M Matsuki - openreview.net

… representation of machine learning methods. Based on these metrics, we reveal a link between
deep representations’ quality and attack … two metrics to evaluate DNN’s representations. …

被引用次数：1 相关文章

[PDF] ieee.org

Robust adversarial attack against explainable deep classification models based on adversarial images with different patch sizes and perturbation ratios

H Kang, H Kim - IEEE Access, 2021 - ieeexplore.ieee.org

… • Experiment 3: To validate our proposed algorithm, we test the proposed attack model on
two representative pre-trained models, including feature module and no feature module. For …

被引用次数：10 相关文章所有 3 个版本

[PDF] wiley.com

Adversarial Attacks in Explainable Machine Learning: A Survey of Threats Against Models and Humans

J Vadillo, R Santana, JA Lozano - … Reviews: Data Mining and …, 2024 - Wiley Online Library

… attack paradigms existing in this domain, identify current gaps and future research directions,
and illustrate the main attack … We will consider two representative explanation methods in …

[PDF] aaai.org

Backdoor attacks on the DNN interpretation system

S Fang, A Choromanska - Proceedings of the AAAI Conference on …, 2022 - ojs.aaai.org

… a deep model and its influence on model training is conditioned upon the presence of a trigger.
We design two types of attacks: a targeted attack … cases the hidden representations of the …

被引用次数：21 相关文章所有 6 个版本