SEUSS: skip redundant paths to make serverless fast
This paper presents a system-level method for achieving the rapid deployment and high-
density caching of serverless functions in a FaaS environment. For reduced start times …
density caching of serverless functions in a FaaS environment. For reduced start times …
Towards general-purpose neural network computing
Machine learning is becoming pervasive, decades of research in neural network
computation is now being leveraged to learn patterns in data and perform computations that …
computation is now being leveraged to learn patterns in data and perform computations that …
Boosting efficiency of external pipelines by blurring application boundaries
A Herlihy, P Chrysogelos… - 12th Annual Conference …, 2022 - infoscience.epfl.ch
Modern application development addresses increasingly specialized problems using
domain-specific utilities, such as Optical Code Recognition and standalone statistical tools …
domain-specific utilities, such as Optical Code Recognition and standalone statistical tools …
ExanaDBT: A dynamic compilation system for transparent polyhedral optimizations at runtime
In this paper, we present a dynamic compilation system called ExanaDBT for transparently
optimizing and parallelizing binaries at runtime based on the polyhedral model. Starting …
optimizing and parallelizing binaries at runtime based on the polyhedral model. Starting …
Automatic parallelization of sequential programs
Prior work on Automatically Scalable Computation (ASC) suggests that it is possible to
parallelize sequential computation by building a model of whole-program execution, using …
parallelize sequential computation by building a model of whole-program execution, using …
Lerna: parallelizing dependent loops using speculation
MM Saad, R Palmieri, B Ravindran - ACM Transactions on Storage (TOS …, 2019 - dl.acm.org
We present Lerna, an end-to-end tool that automatically and transparently detects and
extracts parallelism from data-dependent sequential loops. Lerna uses speculation …
extracts parallelism from data-dependent sequential loops. Lerna uses speculation …
[PDF][PDF] Exploiting hidden layer modular redundancy for fault-tolerance in neural network accelerators
S Eldridge, A Joshi - Proc. Boston area ARChitecture (BARC) …, 2015 - people.bu.edu
Neural network accelerators are an increasingly utilized component of heterogeneous
multicore architectures. This new utilization stems from their capability to improve the power …
multicore architectures. This new utilization stems from their capability to improve the power …
Generating Thread-Level Parallelism in Nondeterministic Programs
EA Deiana - 2023 - search.proquest.com
Chip Multiprocessors (CMP) are everywhere, from mobile systems to servers. Thread-Level
Parallelism (TLP) is the characteristic of a program that makes use of the parallel cores of a …
Parallelism (TLP) is the characteristic of a program that makes use of the parallel cores of a …
[PDF][PDF] Nonclassical Temporal Correlations Under Finite-Memory Constraints
LV Barbosa - 2024 - phaidra.univie.ac.at
Any concrete information processing task can be viewed as the sequential manipulation of
information over time, receiving inputs and producing outputs. To accomplish this …
information over time, receiving inputs and producing outputs. To accomplish this …
Structured parallelism discovery with hybrid static-dynamic analysis and evaluation technique
C Vasiladiotis - 2023 - era.ed.ac.uk
Parallel computer architectures have dominated the computing landscape for the past two
decades; a trend that is only expected to continue and intensify, with increasing …
decades; a trend that is only expected to continue and intensify, with increasing …