Architectural implications of function-as-a-service computing
Serverless computing is a rapidly growing cloud application model, popularized by
Amazon's Lambda platform. Serverless cloud services provide fine-grained provisioning of …
Amazon's Lambda platform. Serverless cloud services provide fine-grained provisioning of …
Crash consistency in encrypted non-volatile main memory systems
Non-Volatile Main Memory (NVMM) systems provide high performance by directly
manipulating persistent data in-memory, but require crash consistency support to recover …
manipulating persistent data in-memory, but require crash consistency support to recover …
Adapt-noc: A flexible network-on-chip design for heterogeneous manycore architectures
The increased computational capability in heterogeneous manycore architectures facilitates
the concurrent execution of many applications. This requires, among other things, a flexible …
the concurrent execution of many applications. This requires, among other things, a flexible …
Density tradeoffs of non-volatile memory as a replacement for SRAM based last level cache
Increasing the capacity of the Last Level Cache (LLC) can help scale the memory wall. Due
to prohibitive area and leakage power, however, growing conventional SRAM LLC already …
to prohibitive area and leakage power, however, growing conventional SRAM LLC already …
Opportunistic computing in gpu architectures
Data transfer overhead between computing cores and memory hierarchy has been a
persistent issue for von Neumann architectures and the problem has only become more …
persistent issue for von Neumann architectures and the problem has only become more …
Job scheduling for large-scale machine learning clusters
With the rapid proliferation of Machine Learning (ML) and Deep learning (DL) applications
running on modern platforms, it is crucial to satisfy application performance requirements …
running on modern platforms, it is crucial to satisfy application performance requirements …
AMPT-GA: automatic mixed precision floating point tuning for GPU applications
Mixed precision computations improve high performance computing throughput for
applications that can tolerate decreased mathematical precision in their computations …
applications that can tolerate decreased mathematical precision in their computations …
Machine learning feature based job scheduling for distributed machine learning clusters
With the rapid proliferation of Machine Learning (ML) and Deep learning (DL) applications
running on modern platforms, it is crucial to satisfy application performance requirements …
running on modern platforms, it is crucial to satisfy application performance requirements …
Morpheus: Extending the last level cache capacity in GPU systems using idle GPU core resources
Graphics Processing Units (GPUs) are widely-used accelerators for data-parallel
applications. In many GPU applications, GPU memory bandwidth bottlenecks performance …
applications. In many GPU applications, GPU memory bandwidth bottlenecks performance …
The advances, challenges and future possibilities of millimeter-wave chip-to-chip interconnections for multi-chip systems
With aggressive scaling of device geometries, density of manufacturing faults is expected to
increase. Therefore, yield of complex Multi-Processor Systems-on-Chips (MP-SoCs) will …
increase. Therefore, yield of complex Multi-Processor Systems-on-Chips (MP-SoCs) will …