Fusion is Not Enough: Single Modal Attacks on Fusion Models for 3D Object Detection

Toward Robust 3D Perception for Autonomous Vehicles: A Review of Adversarial Attacks and Countermeasures

KTY Mahima, AG Perera, S Anavatti… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

At present the perception system of autonomous vehicles is grounded on 3D vision
technologies along with deep learning to process depth information. Although deep learning …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Poly-CAM: High resolution class activation map for convolutional neural networks

A Englebert, O Cornu, CD Vleeschouwer - Machine Vision and …, 2024 - Springer

The demand for explainable AI continues to rise alongside advancements in deep learning
technology. Existing methods such as convolutional neural networks often struggle to …

被引用次数：12 相关文章所有 5 个版本

Face3DAdv: Exploiting Robust Adversarial 3D Patches on Physical Face Recognition

X Yang, L Xu, T Pang, Y Dong, Y Wang, H Su… - International Journal of …, 2024 - Springer

Recent research has elucidated the susceptibility of face recognition models to physical
adversarial patches, thus provoking security concerns about the deployed face recognition …

3D Visual Grounding-Audio: 3D scene object detection based on audio

C Zhang, Z Cai, X Chen, F Da, S Gai - Neurocomputing, 2024 - Elsevier

Abstract 3D Visual Grounding (3DVG) is a prevalent multi-modal information fusion task
capable of accurately localizing target objects referenced in natural language descriptions …

[PDF] arxiv.org

Prototypical Transformer as Unified Motion Learners

C Han, Y Lu, G Sun, JC Liang, Z Cao, Q Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

In this work, we introduce the Prototypical Transformer (ProtoFormer), a general and unified
framework that approaches various motion tasks from a prototype perspective. ProtoFormer …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks

Z Cheng, C Han, J Liang, Q Wang, X Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Monocular Depth Estimation (MDE) plays a vital role in applications such as autonomous
driving. However, various attacks target MDE models, with physical attacks posing …

ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving

C Ma, N Wang, Z Zhao, Q Wang, QA Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent research in adversarial machine learning has focused on visual perception in
Autonomous Driving (AD) and has shown that printed adversarial patches can attack object …

被引用次数：1 相关文章所有 2 个版本

Dyna-MSDepth: multi-scale self-supervised monocular depth estimation network for visual SLAM in dynamic scenes

J Yao, Y Li, J Li - Machine Vision and Applications, 2024 - Springer

Abstract Monocular Simultaneous Localization And Mapping (SLAM) suffers from scale drift,
leading to tracking failure due to scale ambiguity. Deep learning has significantly advanced …

[PDF][PDF] Trustworthy and Robust Machine Learning for Multimedia: Challenges and Perspectives

K Nakano, M Zuzak, C Merkel, AC Loui - mzuzak.github.io

Multimedia applications for machine learning models are characterized by the fusion of
multiple modalities of data. In this work, we highlight the trust and robustness challenges of …