Early-Exit Deep Neural Network-A Comprehensive Survey
Deep neural networks (DNNs) typically have a single exit point that makes predictions by
running the entire stack of neural layers. Since not all inputs require the same amount of …
running the entire stack of neural layers. Since not all inputs require the same amount of …
Harmful fine-tuning attacks and defenses for large language models: A survey
Recent research demonstrates that the nascent fine-tuning-as-a-service business model
exposes serious safety concerns--fine-tuning over a few harmful data uploaded by the users …
exposes serious safety concerns--fine-tuning over a few harmful data uploaded by the users …
Learning-based edge-device collaborative dnn inference in iovt networks
Deep neural network (DNN) is a promising technology for Internet of Visual Things (IoVT)
devices to extrct their visual information from unstructured data. However, it is hard to deploy …
devices to extrct their visual information from unstructured data. However, it is hard to deploy …
EDANAS: adaptive neural architecture search for early exit neural networks
M Gambella, M Roveri - 2023 International Joint Conference on …, 2023 - ieeexplore.ieee.org
Early Exit Neural Networks (EENNs) endow neural network architectures with auxiliary
classifiers to progressively process the input and make decisions at intermediate points of …
classifiers to progressively process the input and make decisions at intermediate points of …
Energy-Efficient Inference With Software-Hardware Co-Design for Sustainable Artificial Intelligence of Things
S Dai, Z Luo, W Luo, S Wang, C Dai… - IEEE Internet of …, 2024 - ieeexplore.ieee.org
The emerging field of artificial intelligence of things (AIoT) is propelled by the remarkable
success of deep learning and hardware evolution, which has a significant impact on our …
success of deep learning and hardware evolution, which has a significant impact on our …
Multimodal adaptive inference for document image classification with anytime early exiting
This work addresses the need for a balanced approach between performance and efficiency
in scalable production environments for visually-rich document understanding (VDU) tasks …
in scalable production environments for visually-rich document understanding (VDU) tasks …
Jointly-learnt exit and inference for dynamic neural networks
J Chataoui - 2024 - escholarship.mcgill.ca
Les réseaux neuronaux artificiels dynamiques à sortie anticipée (RNDSA) ont pour but de
réduire le coût des prédictions en sautant les couches les plus profondes du réseau pour …
réduire le coût des prédictions en sautant les couches les plus profondes du réseau pour …
CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration
Large Language Models (LLMs) have achieved remarkable success in serving end-users
with human-like intelligence. However, LLMs demand high computational resources …
with human-like intelligence. However, LLMs demand high computational resources …
Early Classification for Dynamic Inference of Neural Networks
J Wang, B Li, GL Zhang - arXiv preprint arXiv:2309.13443, 2023 - arxiv.org
Deep neural networks (DNNs) have been successfully applied in various fields. In DNNs, a
large number of multiply-accumulate (MAC) operations is required to be performed, posing …
large number of multiply-accumulate (MAC) operations is required to be performed, posing …
Accelerated AI Inference via Dynamic Execution Methods
In this paper, we focus on Dynamic Execution techniques that optimize the computation flow
based on input. This aims to identify simpler problems that can be solved using fewer …
based on input. This aims to identify simpler problems that can be solved using fewer …