Architectural implications of function-as-a-service computing

M Shahrad, J Balkind, D Wentzlaff - … of the 52nd annual IEEE/ACM …, 2019 - dl.acm.org
Serverless computing is a rapidly growing cloud application model, popularized by
Amazon's Lambda platform. Serverless cloud services provide fine-grained provisioning of …

Crash consistency in encrypted non-volatile main memory systems

S Liu, A Kolli, J Ren, S Khan - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Non-Volatile Main Memory (NVMM) systems provide high performance by directly
manipulating persistent data in-memory, but require crash consistency support to recover …

Adapt-noc: A flexible network-on-chip design for heterogeneous manycore architectures

H Zheng, K Wang, A Louri - 2021 IEEE international symposium …, 2021 - ieeexplore.ieee.org
The increased computational capability in heterogeneous manycore architectures facilitates
the concurrent execution of many applications. This requires, among other things, a flexible …

Density tradeoffs of non-volatile memory as a replacement for SRAM based last level cache

K Korgaonkar, I Bhati, H Liu, J Gaur… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Increasing the capacity of the Last Level Cache (LLC) can help scale the memory wall. Due
to prohibitive area and leakage power, however, growing conventional SRAM LLC already …

Opportunistic computing in gpu architectures

A Pattnaik, X Tang, O Kayiran, A Jog, A Mishra… - Proceedings of the 46th …, 2019 - dl.acm.org
Data transfer overhead between computing cores and memory hierarchy has been a
persistent issue for von Neumann architectures and the problem has only become more …

Job scheduling for large-scale machine learning clusters

H Wang, Z Liu, H Shen - … of the 16th International Conference on …, 2020 - dl.acm.org
With the rapid proliferation of Machine Learning (ML) and Deep learning (DL) applications
running on modern platforms, it is crucial to satisfy application performance requirements …

AMPT-GA: automatic mixed precision floating point tuning for GPU applications

PV Kotipalli, R Singh, P Wood, I Laguna… - Proceedings of the ACM …, 2019 - dl.acm.org
Mixed precision computations improve high performance computing throughput for
applications that can tolerate decreased mathematical precision in their computations …

Machine learning feature based job scheduling for distributed machine learning clusters

H Wang, Z Liu, H Shen - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
With the rapid proliferation of Machine Learning (ML) and Deep learning (DL) applications
running on modern platforms, it is crucial to satisfy application performance requirements …

Morpheus: Extending the last level cache capacity in GPU systems using idle GPU core resources

S Darabi, M Sadrosadati, N Akbarzadeh… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Graphics Processing Units (GPUs) are widely-used accelerators for data-parallel
applications. In many GPU applications, GPU memory bandwidth bottlenecks performance …

The advances, challenges and future possibilities of millimeter-wave chip-to-chip interconnections for multi-chip systems

A Ganguly, MM Ahmed, R Singh Narde… - Journal of Low Power …, 2018 - mdpi.com
With aggressive scaling of device geometries, density of manufacturing faults is expected to
increase. Therefore, yield of complex Multi-Processor Systems-on-Chips (MP-SoCs) will …