SEUSS: skip redundant paths to make serverless fast

J Cadden, T Unger, Y Awad, H Dong… - Proceedings of the …, 2020 - dl.acm.org
This paper presents a system-level method for achieving the rapid deployment and high-
density caching of serverless functions in a FaaS environment. For reduced start times …

Towards general-purpose neural network computing

S Eldridge, A Waterland, M Seltzer… - 2015 International …, 2015 - ieeexplore.ieee.org
Machine learning is becoming pervasive; decades of research in neural network
computation is now being leveraged to learn patterns in data and perform computations that …

Boosting efficiency of external pipelines by blurring application boundaries

A Herlihy, P Chrysogelos… - 12th Annual Conference …, 2022 - infoscience.epfl.ch
Modern application development addresses increasingly specialized problems using
domain-specific utilities, such as Optical Code Recognition and standalone statistical tools …

ExanaDBT: A dynamic compilation system for transparent polyhedral optimizations at runtime

Y Sato, T Yuki, T Endo - Proceedings of the Computing Frontiers …, 2017 - dl.acm.org
In this paper, we present a dynamic compilation system called ExanaDBT for transparently
optimizing and parallelizing binaries at runtime based on the polyhedral model. Starting …

Automatic parallelization of sequential programs

P Kraft, A Waterland, DY Fu, A Gollamudi… - arXiv preprint arXiv …, 2018 - arxiv.org
Prior work on Automatically Scalable Computation (ASC) suggests that it is possible to
parallelize sequential computation by building a model of whole-program execution, using …

Lerna: parallelizing dependent loops using speculation

MM Saad, R Palmieri, B Ravindran - ACM Transactions on Storage (TOS …, 2019 - dl.acm.org
We present Lerna, an end-to-end tool that automatically and transparently detects and
extracts parallelism from data-dependent sequential loops. Lerna uses speculation …

Exploiting hidden layer modular redundancy for fault-tolerance in neural network accelerators

S Eldridge, A Joshi - Proc. Boston area ARChitecture (BARC) …, 2015 - people.bu.edu
Neural network accelerators are an increasingly utilized component of heterogeneous
multicore architectures. This new utilization stems from their capability to improve the power …

Generating Thread-Level Parallelism in Nondeterministic Programs

EA Deiana - 2023 - search.proquest.com
Chip Multiprocessors (CMP) are everywhere, from mobile systems to servers. Thread-Level
Parallelism (TLP) is the characteristic of a program that makes use of the parallel cores of a …

Nonclassical Temporal Correlations Under Finite-Memory Constraints

LV Barbosa - 2024 - phaidra.univie.ac.at
Any concrete information processing task can be viewed as the sequential manipulation of
information over time, receiving inputs and producing outputs. To accomplish this …

Structured parallelism discovery with hybrid static-dynamic analysis and evaluation technique

C Vasiladiotis - 2023 - era.ed.ac.uk
Parallel computer architectures have dominated the computing landscape for the past two
decades; a trend that is only expected to continue and intensify, with increasing …