Anti-backdoor learning: Training clean models on poisoned data

Y Li, X Lyu, N Koren, L Lyu, B Li… - Advances in Neural …, 2021 - proceedings.neurips.cc
Backdoor attacks have emerged as a major security threat to deep neural networks (DNNs).
While existing defense methods have demonstrated promising results in detecting or …

Backdoor learning: A survey

Y Li, Y Jiang, Z Li, ST Xia - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
A backdoor attack intends to embed hidden backdoors into deep neural networks (DNNs), so
that the attacked models perform well on benign samples, whereas their predictions will be …
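
To make this threat model concrete, here is a minimal BadNets-style data-poisoning sketch: stamp a small trigger patch on a fraction of the training images and relabel them to the attacker's target class. This is the classic trigger-patching attack in general form, not the method of any specific paper listed here; `poison_badnets` and all hyperparameters are illustrative.

```python
import numpy as np

def poison_badnets(images, labels, target_label=7, rate=0.1,
                   patch_size=3, patch_value=1.0, seed=0):
    """Stamp a small solid patch (the trigger) onto a random fraction of
    training images and relabel them to the attacker's target class."""
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    idx = rng.choice(len(images), size=int(rate * len(images)), replace=False)
    # Trigger in the bottom-right corner of each selected image.
    images[idx, -patch_size:, -patch_size:] = patch_value
    labels[idx] = target_label
    return images, labels

# Toy usage: 100 grayscale 28x28 images with 10 classes.
x = np.random.rand(100, 28, 28).astype(np.float32)
y = np.random.randint(0, 10, size=100)
x_poisoned, y_poisoned = poison_badnets(x, y)
```

A model trained on `(x_poisoned, y_poisoned)` behaves normally on clean inputs but predicts the target class whenever the patch is present, which is exactly the "perform well on benign samples" asymmetry the abstract describes.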

Revisiting adversarial robustness distillation: Robust soft labels make student better

B Zi, S Zhao, X Ma, YG Jiang - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Adversarial training is an effective approach for training deep neural networks that are
robust against adversarial attacks. While it can deliver reliable robustness, adversarial …
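
For reference, a minimal sketch of one adversarial training update, using the single-step FGSM variant for brevity (published methods typically use multi-step PGD and other refinements; `eps` and the assumption of inputs in [0, 1] are illustrative):

```python
import torch
import torch.nn.functional as F

def adversarial_training_step(model, x, y, optimizer, eps=8 / 255):
    """One single-step (FGSM) adversarial training update: craft a worst-case
    perturbation within an L-infinity ball of radius eps, then train on it."""
    x_adv = x.detach().clone().requires_grad_(True)
    grad = torch.autograd.grad(F.cross_entropy(model(x_adv), y), x_adv)[0]
    # Loss-maximizing perturbation, clipped back to the valid pixel range.
    x_adv = (x + eps * grad.sign()).clamp(0.0, 1.0).detach()
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```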

A survey of neural trojan attacks and defenses in deep learning

J Wang, GM Hassan, N Akhtar - arXiv preprint arXiv:2202.07183, 2022 - arxiv.org
Artificial Intelligence (AI) relies heavily on deep learning, a technology that is becoming
increasingly popular in real-life applications of AI, even in the safety-critical and high-risk …

Better safe than sorry: Preventing delusive adversaries with adversarial training

L Tao, L Feng, J Yi, SJ Huang… - Advances in Neural …, 2021 - proceedings.neurips.cc
Delusive attacks aim to substantially deteriorate the test accuracy of the learning model by
slightly perturbing the features of correctly labeled training examples. By formalizing this …
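
One concrete instantiation of a delusive perturbation, sketched below in the spirit of error-minimizing noise: perturb correctly labeled inputs within an L-infinity budget in the direction that *minimizes* training loss, so the data carries little learnable signal. This is an illustrative example of the attack class, not the construction from the paper above; all hyperparameters are assumptions.

```python
import torch
import torch.nn.functional as F

def delusive_perturbation(model, x, y, eps=8 / 255, steps=10, alpha=2 / 255):
    """Clean-label delusive perturbation sketch: labels stay correct, but the
    features are nudged (within an eps ball) to make the loss already small,
    starving the learner of useful gradient signal."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = F.cross_entropy(model((x + delta).clamp(0.0, 1.0)), y)
        grad = torch.autograd.grad(loss, delta)[0]
        # Gradient *descent* on the input: shrink the loss, keep labels intact.
        delta = (delta - alpha * grad.sign()).clamp(-eps, eps)
        delta = delta.detach().requires_grad_(True)
    return (x + delta).clamp(0.0, 1.0).detach()
```

The paper's defense pairs naturally with the adversarial training step sketched earlier, since training on worst-case perturbations counteracts the attacker's loss-minimizing ones.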

Training with more confidence: Mitigating injected and natural backdoors during training

Z Wang, H Ding, J Zhai, S Ma - Advances in Neural …, 2022 - proceedings.neurips.cc
The backdoor or Trojan attack is a severe threat to deep neural networks (DNNs).
Researchers have found that DNNs trained on benign data in benign settings can also learn backdoor …

Beating backdoor attack at its own game

M Liu, A Sangiovanni-Vincentelli… - Proceedings of the …, 2023 - openaccess.thecvf.com
Deep neural networks (DNNs) are vulnerable to backdoor attacks, which do not affect the
network's performance on clean data but manipulate its behavior once a …

Distilling cognitive backdoor patterns within an image

H Huang, X Ma, S Erfani, J Bailey - arXiv preprint arXiv:2301.10908, 2023 - arxiv.org
This paper proposes a simple method to distill and detect backdoor patterns within an
image: Cognitive Distillation (CD). The idea is to extract the "minimal essence" from …
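
A simplified sketch in the spirit of this idea: optimize a sparse per-pixel mask so that the masked image preserves the model's output, and take the surviving pixels as the image's "minimal essence". The paper's exact objective and regularization differ; the loss terms and hyperparameters below are illustrative.

```python
import torch

def cognitive_pattern(model, x, steps=100, lr=0.1, lam=1e-2):
    """Find a sparse mask whose masked input reproduces the model's output;
    backdoored inputs tend to yield distinctive (e.g., unusually small) masks."""
    with torch.no_grad():
        ref = model(x)  # reference logits on the unmasked input
    logit_mask = torch.zeros_like(x, requires_grad=True)
    opt = torch.optim.Adam([logit_mask], lr=lr)
    for _ in range(steps):
        mask = torch.sigmoid(logit_mask)  # keep mask values in (0, 1)
        out = model(x * mask)
        # Match the original prediction while pushing the mask toward sparsity.
        loss = (out - ref).abs().mean() + lam * mask.abs().mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return torch.sigmoid(logit_mask).detach()
```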

BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection

T Xie, X Qi, P He, Y Li, JT Wang, P Mittal - arXiv preprint arXiv:2308.12439, 2023 - arxiv.org
We present a novel defense against backdoor attacks on deep neural networks (DNNs),
wherein adversaries covertly implant malicious behaviors (backdoors) into DNNs. Our …

Towards Modeling Uncertainties of Self-Explaining Neural Networks via Conformal Prediction

W Qian, C Zhao, Y Li, F Ma, C Zhang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Despite the recent progress in deep neural networks (DNNs), it remains challenging to
explain the predictions made by DNNs. Existing explanation methods for DNNs mainly focus …
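
For background on the conformal prediction machinery this paper builds on, here is a textbook split-conformal sketch for classification, not the paper's self-explaining-network construction: calibrate a score threshold on held-out data, then emit prediction sets with roughly (1 - alpha) marginal coverage.

```python
import numpy as np

def split_conformal_sets(cal_probs, cal_labels, test_probs, alpha=0.1):
    """Generic split conformal prediction for classification: prediction sets
    contain every class whose nonconformity score clears a calibrated threshold."""
    n = len(cal_labels)
    # Nonconformity score: 1 minus the softmax probability of the true class.
    scores = 1.0 - cal_probs[np.arange(n), cal_labels]
    # Finite-sample-corrected empirical quantile of the calibration scores.
    level = min(1.0, np.ceil((n + 1) * (1.0 - alpha)) / n)
    q = np.quantile(scores, level, method="higher")
    return [np.where(1.0 - p <= q)[0] for p in test_probs]

# Toy usage with random "softmax" outputs over 10 classes.
rng = np.random.default_rng(0)
cal_p = rng.dirichlet(np.ones(10), size=200)
cal_y = rng.integers(0, 10, size=200)
test_p = rng.dirichlet(np.ones(10), size=5)
sets = split_conformal_sets(cal_p, cal_y, test_p)
```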