Massively parallel computation: Algorithms and applications
The algorithms community has been modeling the underlying key features and constraints of
massively parallel frameworks and using these models to discover new algorithmic …
massively parallel frameworks and using these models to discover new algorithmic …
Shuffles and circuits (on lower bounds for modern parallel computation)
The goal of this article is to identify fundamental limitations on how efficiently algorithms
implemented on platforms such as MapReduce and Hadoop can compute the central …
implemented on platforms such as MapReduce and Hadoop can compute the central …
Optimal parallel algorithms in the binary-forking model
In this paper we develop optimal algorithms in the binary-forking model for a variety of
fundamental problems, including sorting, semisorting, list ranking, tree contraction, range …
fundamental problems, including sorting, semisorting, list ranking, tree contraction, range …
Equivalence classes and conditional hardness in massively parallel computations
D Nanongkai, M Scquizzato - Distributed Computing, 2022 - Springer
Abstract The Massively Parallel Computation (MPC) model serves as a common abstraction
of many modern large-scale data processing frameworks, and has been receiving …
of many modern large-scale data processing frameworks, and has been receiving …
Tight bounds on the round complexity of the distributed maximum coverage problem
We study the maximum $ k $-set coverage problem in the following distributed setting. A
collection of sets $ S_1,\ldots, S_m $ over a universe $[n] $ is partitioned across $ p …
collection of sets $ S_1,\ldots, S_m $ over a universe $[n] $ is partitioned across $ p …
Reproducible floating-point aggregation in RDBMSs
Industry-grade database systems are expected to produce the same result if the same query
is repeatedly run on the same input. However, the numerous sources of non-determinism in …
is repeatedly run on the same input. However, the numerous sources of non-determinism in …
The read-only semi-external model
Abstract We introduce the Read-Only Semi-External (ROSE) Model for the design and
analysis of algorithms on large graphs. As in the well-studied semi-external model for graph …
analysis of algorithms on large graphs. As in the well-studied semi-external model for graph …
A conditional lower bound on graph connectivity in mapreduce
MapReduce (and its open source implementation Hadoop) has become the de facto
platform for processing large data sets. MapReduce offers a streamlined computational …
platform for processing large data sets. MapReduce offers a streamlined computational …
[PDF][PDF] Engineering Aggregation Operators for Relational In-Memory Database Systems
I Müller - 2016 - researchgate.net
Relational Aggregation is one of the major means to analyze large data sets since the
creation of the first database systems. Available hardware performance continues to grow at …
creation of the first database systems. Available hardware performance continues to grow at …
Treewidth computation and kernelization in the parallel external memory model
We present a randomized algorithm which computes, for any fixed k, a tree decomposition of
width at most k of any input graph. We analyze it in the parallel external memory (PEM) …
width at most k of any input graph. We analyze it in the parallel external memory (PEM) …