DAMOV: A new methodology and benchmark suite for evaluating data movement bottlenecks

GF Oliveira, J Gómez-Luna, L Orosa, S Ghose… - IEEE …, 2021 - ieeexplore.ieee.org
Data movement between the CPU and main memory is a first-order obstacle against improv
ing performance, scalability, and energy efficiency in modern systems. Computer systems …

Flash-Cosmos: In-flash bulk bitwise operations using inherent computation capability of nand flash memory

J Park, R Azizi, GF Oliveira… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Bulk bitwise operations, ie, bitwise operations on large bit vectors, are prevalent in a wide
range of important application domains, including databases, graph processing, genome …

Fpga-based deep learning inference accelerators: Where are we standing?

A Nechi, L Groth, S Mulhem, F Merchant… - ACM Transactions on …, 2023 - dl.acm.org
Recently, artificial intelligence applications have become part of almost all emerging
technologies around us. Neural networks, in particular, have shown significant advantages …

Optimstore: In-storage optimization of large scale dnns with on-die processing

J Kim, M Kang, Y Han, YG Kim… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Training deep neural network (DNN) models is a resource-intensive, iterative process. For
this reason, nowadays, complex optimizers like Adam are widely adopted as it increases the …

Is-hbase: An in-storage computing optimized hbase with i/o offloading and self-adaptive caching in compute-storage disaggregated infrastructure

Z Cao, H Dong, Y Wei, S Liu, DHC Du - ACM Transactions on Storage …, 2022 - dl.acm.org
Active storage devices and in-storage computing are proposed and developed in recent
years to effectively reduce the amount of required data traffic and to improve the overall …

{ScalaAFA}: Constructing {User-Space}{All-Flash} Array Engine with Holistic Designs

S Yi, X Pan, Q Li, Q Li, C Wang, B Mao, M Jung… - 2024 USENIX Annual …, 2024 - usenix.org
All-flash array (AFA) is a popular approach to aggregate the capacity of multiple solid-state
drives (SSDs) while guaranteeing fault tolerance. Unfortunately, existing AFA engines inflict …

PASM: Parallelism Aware Space Management strategy for hybrid SSD towards in-storage DNN training acceleration

C Xiao, S Qiu, D Xu - Journal of Systems Architecture, 2022 - Elsevier
With the explosive growth of data volume and great improvement in flash technologies, SSD-
based In-Storage Computing (ISC) is becoming one of the most important means to …

MegIS: High-Performance, Energy-Efficient, and Low-Cost Metagenomic Analysis with In-Storage Processing

NM Ghiasi, M Sadrosadati, H Mustafa… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Metagenomics, the study of the genome sequences of diverse organisms in a common
environment, has led to significant advances in many fields. Since the species present in a …

ReIPE: Recycling Idle PEs in CNN Accelerator for Vulnerable Filters Soft-Error Detection

X Wei, C Wang, H Yue, J Tan, Z Guan, N Jiang… - ACM Transactions on …, 2024 - dl.acm.org
To satisfy prohibitively massive computational requirements of current deep Convolutional
Neural Networks (CNNs), CNN-specific accelerators are widely deployed in large-scale …

Deploying Computational Storage for HTAP DBMSs Takes More Than Just Computation Offloading

K Lee, I Jo, J Ahn, H Lee, H Lee, W Sul… - Proceedings of the VLDB …, 2023 - dl.acm.org
Hybrid transactional/analytical processing (HTAP) would overload database systems. To
alleviate performance interference between transactions and analytics, recent research …