Massively parallel computation: Algorithms and applications

S Im, R Kumar, S Lattanzi, B Moseley… - … and Trends® in …, 2023 - nowpublishers.com
The algorithms community has been modeling the underlying key features and constraints of
massively parallel frameworks and using these models to discover new algorithmic …

Shuffles and circuits (on lower bounds for modern parallel computation)

T Roughgarden, S Vassilvitskii, JR Wang - Journal of the ACM (JACM), 2018 - dl.acm.org
The goal of this article is to identify fundamental limitations on how efficiently algorithms
implemented on platforms such as MapReduce and Hadoop can compute the central …

Optimal parallel algorithms in the binary-forking model

GE Blelloch, JT Fineman, Y Gu, Y Sun - … of the 32nd ACM Symposium on …, 2020 - dl.acm.org
In this paper we develop optimal algorithms in the binary-forking model for a variety of
fundamental problems, including sorting, semisorting, list ranking, tree contraction, range …

Equivalence classes and conditional hardness in massively parallel computations

D Nanongkai, M Scquizzato - Distributed Computing, 2022 - Springer
Abstract The Massively Parallel Computation (MPC) model serves as a common abstraction
of many modern large-scale data processing frameworks, and has been receiving …

Tight bounds on the round complexity of the distributed maximum coverage problem

S Assadi, S Khanna - arXiv preprint arXiv:1801.02793, 2018 - arxiv.org
We study the maximum $ k $-set coverage problem in the following distributed setting. A
collection of sets $ S_1,\ldots, S_m $ over a universe $[n] $ is partitioned across $ p …

Reproducible floating-point aggregation in RDBMSs

I Müller, A Arteaga, T Hoefler… - 2018 IEEE 34th …, 2018 - ieeexplore.ieee.org
Industry-grade database systems are expected to produce the same result if the same query
is repeatedly run on the same input. However, the numerous sources of non-determinism in …

The read-only semi-external model

GE Blelloch, L Dhulipala, PB Gibbons, Y Gu… - … on Algorithmic Principles …, 2021 - SIAM
Abstract We introduce the Read-Only Semi-External (ROSE) Model for the design and
analysis of algorithms on large graphs. As in the well-studied semi-external model for graph …

A conditional lower bound on graph connectivity in mapreduce

S Im, B Moseley - arXiv preprint arXiv:1904.08954, 2019 - arxiv.org
MapReduce (and its open source implementation Hadoop) has become the de facto
platform for processing large data sets. MapReduce offers a streamlined computational …

[PDF][PDF] Engineering Aggregation Operators for Relational In-Memory Database Systems

I Müller - 2016 - researchgate.net
Relational Aggregation is one of the major means to analyze large data sets since the
creation of the first database systems. Available hardware performance continues to grow at …

Treewidth computation and kernelization in the parallel external memory model

R Jacob, T Lieber, M Mnich - Theoretical Computer Science: 8th IFIP TC 1 …, 2014 - Springer
We present a randomized algorithm which computes, for any fixed k, a tree decomposition of
width at most k of any input graph. We analyze it in the parallel external memory (PEM) …