Hypertune: Dynamic hyperparameter tuning for efficient distribution of dnn training over heterogeneous systems
Distributed training is a novel approach to accelerating training of Deep Neural Networks
(DNN), but common training libraries fall short of addressing the distributed nature of …
(DNN), but common training libraries fall short of addressing the distributed nature of …
Encodeep: Realizing bit-flexible encoding for deep neural networks
This article proposes EncoDeep, an end-to-end framework that facilitates encoding, bitwidth
customization, fine-tuning, and implementation of neural networks on FPGA platforms …
customization, fine-tuning, and implementation of neural networks on FPGA platforms …
A Portable Linux-based Firmware for NVMe Computational Storage Devices
Over the years, interest in computational storage devices has been growing steadily. This is
largely due to the rise of data-intensive applications, such as machine learning, online video …
largely due to the rise of data-intensive applications, such as machine learning, online video …
ISP Agent: A Generalized In-storage-processing Workload Offloading Framework by Providing Multiple Optimization Opportunities
As solid-state drives (SSDs) with sufficient computing power have recently become the
dominant devices in modern computer systems, in-storage processing (ISP), which …
dominant devices in modern computer systems, in-storage processing (ISP), which …
Horae: A Hybrid I/O Request Scheduling Technique for Near-Data Processing-Based SSD
Near-data processing (NDP) architecture is promised to break the bottleneck of data
movement in many scenarios (eg, databases and recommendation systems), which limits …
movement in many scenarios (eg, databases and recommendation systems), which limits …
PELSI: Power-Efficient Layer-Switched Inference
Convolutional Neural Networks (CNNs) are now quintessential kernels within embedded
computer vision applications deployed in edge devices. Heterogeneous Multi-Processor …
computer vision applications deployed in edge devices. Heterogeneous Multi-Processor …
Experience and practice teaching an undergraduate course on diverse heterogeneous architectures
E Frachtenberg - 2021 IEEE/ACM Ninth Workshop on …, 2021 - ieeexplore.ieee.org
Heterogeneous computing is growing as an important hardware and software paradigm,
both in high-performance computing and in application computing in general. Nevertheless …
both in high-performance computing and in application computing in general. Nevertheless …
Towards a scalable and distributed infrastructure for deep learning applications
Although recent scaling up approaches to train deep neural networks have proven to be
effective, the computational intensity of large and complex models, as well as the availability …
effective, the computational intensity of large and complex models, as well as the availability …
In-storage processing of I/O intensive applications on computational storage drives
A HeydariGorji, M Torabzadehkashi… - … on Quality Electronic …, 2022 - ieeexplore.ieee.org
Computational storage drives (CSD) are solid-state drives (SSD) empowered by general-
purpose processors that can perform in-storage processing. They have the potential to …
purpose processors that can perform in-storage processing. They have the potential to …
Domain-specific computational storage for serverless computing
While (1) serverless computing is emerging as a popular form of cloud execution,
datacenters are going through major changes:(2) storage dissaggregation in the system …
datacenters are going through major changes:(2) storage dissaggregation in the system …