Attack to explain deep representation
MAAK Jalwana, N Akhtar… - Proceedings of the …, 2020 - openaccess.thecvf.com
… deep models in terms of salient visual features for class labels and highlighting the alignment
of deep representation with human perception, our attack … the first attack on deep learning …
of deep representation with human perception, our attack … the first attack on deep learning …
Attack to fool and explain deep networks
N Akhtar, MAAK Jalwana… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
… 5.2 Explanation Experiments Preliminary results of model interpretation through our attack
to explain deep representation were presented in CVPR2020 [18]. Setup: Our setup for these …
to explain deep representation were presented in CVPR2020 [18]. Setup: Our setup for these …
Exploiting explanations for model inversion attacks
… risks for privacy attacks. Hence, providing explanation harms privacy. We study this risk
for image-based model inversion attacks and identified several attack architectures with …
for image-based model inversion attacks and identified several attack architectures with …
Unveiling the Anatomy of Adversarial Attacks: Concept-Based XAI Dissection of CNNs
… In these layers, we assess the impact of adversarial attacks on the internal representations
… To assess this, we evaluate the cosine similarities between attacked and non-attacked …
… To assess this, we evaluate the cosine similarities between attacked and non-attacked …
Towards explainable model extraction attacks
A Yan, R Hou, X Liu, H Yan, T Huang… - International Journal of …, 2022 - Wiley Online Library
… layer to construct a general and localizable deep representation. … of representation-guided
reveals more privacy. In view of different attack datasets, we analyze that even if the attack …
reveals more privacy. In view of different attack datasets, we analyze that even if the attack …
The Anatomy of Adversarial Attacks: Concept-based XAI Dissection
… In these layers, we assess the impact of adversarial attacks on the internal representations
… To assess this, we evaluate the cosine similarities between attacked and nonattacked …
… To assess this, we evaluate the cosine similarities between attacked and nonattacked …
Representation Quality Explain Adversarial Attacks
… representation of machine learning methods. Based on these metrics, we reveal a link between
deep representations’ quality and attack … two metrics to evaluate DNN’s representations. …
deep representations’ quality and attack … two metrics to evaluate DNN’s representations. …
Robust adversarial attack against explainable deep classification models based on adversarial images with different patch sizes and perturbation ratios
H Kang, H Kim - IEEE Access, 2021 - ieeexplore.ieee.org
… • Experiment 3: To validate our proposed algorithm, we test the proposed attack model on
two representative pre-trained models, including feature module and no feature module. For …
two representative pre-trained models, including feature module and no feature module. For …
Adversarial Attacks in Explainable Machine Learning: A Survey of Threats Against Models and Humans
… attack paradigms existing in this domain, identify current gaps and future research directions,
and illustrate the main attack … We will consider two representative explanation methods in …
and illustrate the main attack … We will consider two representative explanation methods in …
Backdoor attacks on the DNN interpretation system
S Fang, A Choromanska - Proceedings of the AAAI Conference on …, 2022 - ojs.aaai.org
… a deep model and its influence on model training is conditioned upon the presence of a trigger.
We design two types of attacks: a targeted attack … cases the hidden representations of the …
We design two types of attacks: a targeted attack … cases the hidden representations of the …