Hypertune: Dynamic hyperparameter tuning for efficient distribution of dnn training over heterogeneous systems

A HeydariGorji, S Rezaei, M Torabzadehkashi… - Proceedings of the 39th …, 2020 - dl.acm.org
Distributed training is a novel approach to accelerating training of Deep Neural Networks
(DNN), but common training libraries fall short of addressing the distributed nature of …

Encodeep: Realizing bit-flexible encoding for deep neural networks

M Samragh, M Javaheripi, F Koushanfar - ACM Transactions on …, 2020 - dl.acm.org
This article proposes EncoDeep, an end-to-end framework that facilitates encoding, bitwidth
customization, fine-tuning, and implementation of neural networks on FPGA platforms …

A Portable Linux-based Firmware for NVMe Computational Storage Devices

R Wertenbroek, Y Thoma, A Dassatti - ACM Transactions on Storage, 2024 - dl.acm.org
Over the years, interest in computational storage devices has been growing steadily. This is
largely due to the rise of data-intensive applications, such as machine learning, online video …

ISP Agent: A Generalized In-storage-processing Workload Offloading Framework by Providing Multiple Optimization Opportunities

S Kang, J Kim, G Lee, J Lee, J Seo, H Jung… - ACM Transactions on …, 2024 - dl.acm.org
As solid-state drives (SSDs) with sufficient computing power have recently become the
dominant devices in modern computer systems, in-storage processing (ISP), which …

Horae: A Hybrid I/O Request Scheduling Technique for Near-Data Processing-Based SSD

J Li, X Chen, D Liu, L Li, J Wang, Z Zeng… - … on Computer-Aided …, 2022 - ieeexplore.ieee.org
Near-data processing (NDP) architecture is promised to break the bottleneck of data
movement in many scenarios (eg, databases and recommendation systems), which limits …

PELSI: Power-Efficient Layer-Switched Inference

E Aghapour, D Sapra, AD Pimentel… - 2023 IEEE 29th …, 2023 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) are now quintessential kernels within embedded
computer vision applications deployed in edge devices. Heterogeneous Multi-Processor …

Experience and practice teaching an undergraduate course on diverse heterogeneous architectures

E Frachtenberg - 2021 IEEE/ACM Ninth Workshop on …, 2021 - ieeexplore.ieee.org
Heterogeneous computing is growing as an important hardware and software paradigm,
both in high-performance computing and in application computing in general. Nevertheless …

Towards a scalable and distributed infrastructure for deep learning applications

B Hasheminezhad, S Shirzad, N Wu… - 2020 IEEE/ACM …, 2020 - ieeexplore.ieee.org
Although recent scaling up approaches to train deep neural networks have proven to be
effective, the computational intensity of large and complex models, as well as the availability …

In-storage processing of I/O intensive applications on computational storage drives

A HeydariGorji, M Torabzadehkashi… - … on Quality Electronic …, 2022 - ieeexplore.ieee.org
Computational storage drives (CSD) are solid-state drives (SSD) empowered by general-
purpose processors that can perform in-storage processing. They have the potential to …

Domain-specific computational storage for serverless computing

R Mahapatra, S Ghodrati, BH Ahn, S Kinzer… - arXiv preprint arXiv …, 2023 - arxiv.org
While (1) serverless computing is emerging as a popular form of cloud execution,
datacenters are going through major changes:(2) storage dissaggregation in the system …