A survey of machine learning for computer architecture and systems
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …
A survey of machine learning applied to computer architecture design
DD Penney, L Chen - arXiv preprint arXiv:1909.12373, 2019 - arxiv.org
Machine learning has enabled significant benefits in diverse fields, but, with a few
exceptions, has had limited impact on computer architecture. Recent work, however, has …
exceptions, has had limited impact on computer architecture. Recent work, however, has …
Mira: A program-behavior-guided far memory system
Far memory, where memory accesses are non-local, has become more popular in recent
years as a solution to expand memory size and avoid memory stranding. Prior far memory …
years as a solution to expand memory size and avoid memory stranding. Prior far memory …
Fine-grained address segmentation for attention-based variable-degree prefetching
Machine learning algorithms have shown potential to improve prefetching performance by
accurately predicting future memory accesses. Existing approaches are based on the …
accurately predicting future memory accesses. Existing approaches are based on the …
Cache in hand: Expander-driven cxl prefetcher for next generation cxl-ssd
Integrating compute express link (CXL) with SSDs allows scalable access to large memory
but has slower speeds than DRAMs. We present ExPAND, an expander-driven CXL …
but has slower speeds than DRAMs. We present ExPAND, an expander-driven CXL …
Twig: Profile-guided btb prefetching for data center applications
Modern data center applications have deep software stacks, with instruction footprints that
are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently …
are orders of magnitude larger than typical instruction cache (I-cache) sizes. To efficiently …
CRISP: critical slice prefetching
H Litz, G Ayers, P Ranganathan - Proceedings of the 27th ACM …, 2022 - dl.acm.org
The high access latency of DRAM continues to be a performance challenge for
contemporary microprocessor systems. Prefetching is a well-established technique to …
contemporary microprocessor systems. Prefetching is a well-established technique to …
Improving HLS efficiency by combining hardware flow optimizations with LSTMs via hardware-software co-design
H Sadasivan, F Lai, H Al Muraf… - Journal of Engineering and …, 2020 - mzjournal.com
The translation of C programs to Verilog can present significant challenges for programmers
aiming to synthesize hardware. To address these challenges, several High-Level Synthesis …
aiming to synthesize hardware. To address these challenges, several High-Level Synthesis …
Improving the accuracy, adaptability, and interpretability of SSD failure prediction models
C Chakraborttii, H Litz - Proceedings of the 11th ACM Symposium on …, 2020 - dl.acm.org
Flash-based solid state drives represent an important storage tier in today's hyperscale data
centers. Although solid state drives (SSDs) are relatively reliable, data center operators are …
centers. Although solid state drives (SSDs) are relatively reliable, data center operators are …
Raop: Recurrent neural network augmented offset prefetcher
The rapid development of Big Data coupled with slowing down of Moore's law has made the
memory performance a bottleneck in the von Neumann architecture. Machine learning has …
memory performance a bottleneck in the von Neumann architecture. Machine learning has …