A catalog of stream processing optimizations

M Hirzel, R Soulé, S Schneider, B Gedik… - ACM Computing Surveys …, 2014 - dl.acm.org
Various research communities have independently arrived at stream processing as a
programming model for efficient and parallel computing. These communities include digital …

A comprehensive survey on parallelization and elasticity in stream processing

H Röger, R Mayer - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Stream Processing (SP) has evolved as the leading paradigm to process and gain value
from the high volume of streaming data produced, e.g., in the domain of the Internet of Things …

[BOOK][B] Benchmarking modern multiprocessors

C Bienia - 2011 - search.proquest.com
Benchmarking has become one of the most important methods for quantitative performance
evaluation of processor and computer system designs. Benchmarking of modern …

Futhark: purely functional GPU-programming with nested parallelism and in-place array updates

T Henriksen, NGW Serup, M Elsman… - Proceedings of the 38th …, 2017 - dl.acm.org
Futhark is a purely functional data-parallel array language that offers a machine-neutral
programming model and an optimising compiler that generates OpenCL code for GPUs …

The tao of parallelism in algorithms

K Pingali, D Nguyen, M Kulkarni, M Burtscher… - Proceedings of the …, 2011 - dl.acm.org
For more than thirty years, the parallel programming community has used the dependence
graph as the main abstraction for reasoning about and exploiting parallelism in "regular" …

PolyMage: Automatic optimization for image processing pipelines

RT Mullapudi, V Vasista, U Bondhugula - ACM SIGARCH Computer …, 2015 - dl.acm.org
This paper presents the design and implementation of PolyMage, a domain-specific
language and compiler for image processing pipelines. An image processing pipeline can …

Do code clones matter?

E Juergens, F Deissenboeck… - 2009 IEEE 31st …, 2009 - ieeexplore.ieee.org
Code cloning is not only assumed to inflate maintenance costs but also considered
defect-prone as inconsistent changes to code duplicates can lead to unexpected behavior …

Elastic scaling for data stream processing

B Gedik, S Schneider, M Hirzel… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
This article addresses the profitability problem associated with auto-parallelization of
general-purpose distributed data stream processing applications. Auto-parallelization …

Fifer: Practical acceleration of irregular applications on reconfigurable architectures

QM Nguyen, D Sanchez - MICRO-54: 54th Annual IEEE/ACM …, 2021 - dl.acm.org
Coarse-grain reconfigurable arrays (CGRAs) can achieve much higher performance and
efficiency than general-purpose cores, approaching the performance of a specialized design …

Cooperative caching for chip multiprocessors

J Chang, GS Sohi - ACM SIGARCH Computer Architecture News, 2006 - dl.acm.org
This paper presents CMP Cooperative Caching, a unified framework to manage a CMP's
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …