Towards general purpose acceleration by exploiting common data-dependence forms

V Dadu, J Weng, S Liu, T Nowatzki - … of the 52nd Annual IEEE/ACM …, 2019 - dl.acm.org
With slowing technology scaling, specialized accelerators are increasingly attractive
solutions to continue expected generational scaling of performance. However, in order to …

Accelerating pattern matching queries in hybrid CPU-FPGA architectures

D Sidler, Z István, M Owaida, G Alonso - Proceedings of the 2017 ACM …, 2017 - dl.acm.org
Taking advantage of recently released hybrid multicore architectures, such as the Intel's
Xeon+ FPGA machine, where the FPGA has coherent access to the main memory through …

SeedEx: A genome sequencing accelerator for optimal alignments in subminimal space

D Fujiki, S Wu, N Ozog, K Goliya… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
Innovations in genome sequencing techniques are enabling remarkably fast and low cost
production of raw genome data. As Moore's law tapers off, bottlenecks in genome …

Cache automaton

A Subramaniyan, J Wang… - Proceedings of the 50th …, 2017 - dl.acm.org
Finite State Automata are widely used to accelerate pattern matching in many emerging
application domains like DNA sequencing and XML parsing. Conventional CPUs and …

Fleet: A framework for massively parallel streaming on FPGAs

J Thomas, P Hanrahan, M Zaharia - Proceedings of the Twenty-Fifth …, 2020 - dl.acm.org
We present Fleet, a framework that offers a massively parallel streaming model for FPGAs
and is effective in a number of domains well-suited for FPGA acceleration, including parsing …

ANMLzoo: a benchmark suite for exploring bottlenecks in automata processing engines and architectures

J Wadden, V Dang, N Brunelle, T Tracy II… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
High-performance automata-processing engines are traditionally evaluated using a limited
set of regular expression-rulesets. While regular expression rulesets are valid real-world …

HARE: Hardware accelerator for regular expressions

V Gogte, A Kolli, MJ Cafarella… - 2016 49th Annual …, 2016 - ieeexplore.ieee.org
Rapidly processing text data is critical for many technical and business applications.
Traditional software-based tools for processing large text corpora use memory bandwidth …

UDP: a programmable accelerator for extract-transform-load workloads and more

Y Fang, C Zou, AJ Elmore, AA Chien - … of the 50th Annual IEEE/ACM …, 2017 - dl.acm.org
Big data analytic applications give rise to large-scale extract-transform-load (ETL) as a
fundamental step to transform new data into a native representation. ETL workloads pose …

Impala: Algorithm/architecture co-design for in-memory multi-stride pattern matching

E Sadredini, R Rahimi, M Lenjani… - … symposium on high …, 2020 - ieeexplore.ieee.org
High-throughput and concurrent processing of thousands of patterns on each byte of an
input stream is critical for many applications with real-time processing needs, such as …

Demystifying automata processing: GPUs, FPGAs or Micron's AP?

M Nourian, X Wang, X Yu, W Feng… - Proceedings of the …, 2017 - dl.acm.org
Many established and emerging applications perform at their core some form of pattern
matching, a computation that maps naturally onto finite automata abstractions. As a …