Hardware-Assisted Virtualization of Neural Processing Units for Cloud Platforms

Y Xue, Y Liu, L Nai, J Huang - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
Cloud platforms today have been deploying hardware accelerators like neural processing
units (NPUs) for powering machine learning (ML) inference services. To maximize the …

Research on High-Performance Fourier Transform Algorithms Based on the NPU

Q Li, D Zuo, Y Feng, D Wen - Applied Sciences, 2024 - mdpi.com
Backpack computers require powerful, intelligent computing capabilities for field wearables
while taking energy consumption into careful consideration. A recommended solution for this …

sNPU: Trusted Execution Environments on Integrated NPUs

E Feng, D Feng, D Du, Y Xia… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Trusted execution environment (TEE) promises strong security guarantee with hardware
extensions for security-sensitive tasks. Due to its numerous benefits, TEE has gained …

System Virtualization for Neural Processing Units

Y Xue, Y Liu, J Huang - Proceedings of the 19th Workshop on Hot Topics …, 2023 - dl.acm.org
Modern cloud platforms have been employing hardware accelerators such as neural
processing units (NPUs) to meet the increasing demand for computing resources for AI …

M2M: A Fine-Grained Mapping Framework to Accelerate Multiple DNNs on a Multi-Chiplet Architecture

J Zhang, X Wang, Y Ye, D Lyu, G Xiong… - … Transactions on Very …, 2024 - ieeexplore.ieee.org
With the advancement of artificial intelligence, the collaboration of multiple deep neural
networks (DNNs) has been crucial to existing embedded systems and cloud systems …

[HTML][HTML] Hardware-Assisted Low-Latency NPU Virtualization Method for Multi-Sensor AI Systems

JH Jean, DS Kim - Sensors, 2024 - mdpi.com
Recently, AI systems such as autonomous driving and smart homes have become integral to
daily life. Intelligent multi-sensors, once limited to single data types, now process complex …

GTA: a new General Tensor Accelerator with Better Area Efficiency and Data Reuse

C Ai, L Zhao, Z Huang, C Li, X Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, tensor algebra have witnessed significant applications across various domains.
Each operator in tensor algebra features different computational workload and precision …

[PDF][PDF] Hardware-assisted virtualization for neural processing units

Y Xue, Y Liu, L Nai, J Huang - The 1st Workshop on Hot Topics …, 2023 - hotinfra23.github.io
Modern cloud platforms have deployed neural processing units (NPUs) to meet the
increasing demand for machine learning (ML) services. However, the current way of using …