Early-Exit Deep Neural Network-A Comprehensive Survey

H Rahmath P, V Srivastava, K Chaurasia… - ACM Computing …, 2024 - dl.acm.org
Deep neural networks (DNNs) typically have a single exit point that makes predictions by
running the entire stack of neural layers. Since not all inputs require the same amount of …

Harmful fine-tuning attacks and defenses for large language models: A survey

T Huang, S Hu, F Ilhan, SF Tekin, L Liu - arXiv preprint arXiv:2409.18169, 2024 - arxiv.org
Recent research demonstrates that the nascent fine-tuning-as-a-service business model
exposes serious safety concerns--fine-tuning over a few harmful data uploaded by the users …

Learning-based edge-device collaborative dnn inference in iovt networks

X Xu, K Yan, S Han, B Wang, X Tao… - IEEE Internet of Things …, 2023 - ieeexplore.ieee.org
Deep neural network (DNN) is a promising technology for Internet of Visual Things (IoVT)
devices to extrct their visual information from unstructured data. However, it is hard to deploy …

EDANAS: adaptive neural architecture search for early exit neural networks

M Gambella, M Roveri - 2023 International Joint Conference on …, 2023 - ieeexplore.ieee.org
Early Exit Neural Networks (EENNs) endow neural network architectures with auxiliary
classifiers to progressively process the input and make decisions at intermediate points of …

Energy-Efficient Inference With Software-Hardware Co-Design for Sustainable Artificial Intelligence of Things

S Dai, Z Luo, W Luo, S Wang, C Dai… - IEEE Internet of …, 2024 - ieeexplore.ieee.org
The emerging field of artificial intelligence of things (AIoT) is propelled by the remarkable
success of deep learning and hardware evolution, which has a significant impact on our …

Multimodal adaptive inference for document image classification with anytime early exiting

O Hamed, S Bakkali, M Blaschko, S Moens… - … on Document Analysis …, 2024 - Springer
This work addresses the need for a balanced approach between performance and efficiency
in scalable production environments for visually-rich document understanding (VDU) tasks …

Jointly-learnt exit and inference for dynamic neural networks

J Chataoui - 2024 - escholarship.mcgill.ca
Les réseaux neuronaux artificiels dynamiques à sortie anticipée (RNDSA) ont pour but de
réduire le coût des prédictions en sautant les couches les plus profondes du réseau pour …

CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration

H Jin, Y Wu - arXiv preprint arXiv:2411.02829, 2024 - arxiv.org
Large Language Models (LLMs) have achieved remarkable success in serving end-users
with human-like intelligence. However, LLMs demand high computational resources …

Early Classification for Dynamic Inference of Neural Networks

J Wang, B Li, GL Zhang - arXiv preprint arXiv:2309.13443, 2023 - arxiv.org
Deep neural networks (DNNs) have been successfully applied in various fields. In DNNs, a
large number of multiply-accumulate (MAC) operations is required to be performed, posing …

Accelerated AI Inference via Dynamic Execution Methods

H Barad, J Achterberg, TP Chou, J Yu - arXiv preprint arXiv:2411.00853, 2024 - arxiv.org
In this paper, we focus on Dynamic Execution techniques that optimize the computation flow
based on input. This aims to identify simpler problems that can be solved using fewer …