Conventional, Heuristic and Learning-Based Robot Motion Planning: Reviewing Frameworks of Current Practical Significance
Motion planning algorithms have seen considerable progress and expansion across various
domains of science and technology during the last few decades, where rapid advancements …
domains of science and technology during the last few decades, where rapid advancements …
Optimal parallel algorithms in the binary-forking model
In this paper we develop optimal algorithms in the binary-forking model for a variety of
fundamental problems, including sorting, semisorting, list ranking, tree contraction, range …
fundamental problems, including sorting, semisorting, list ranking, tree contraction, range …
Brief announcement: Open cilk
Open Cilk is a new open-source platform to support Cilk multithreaded programming,
especially for researchers and teachers. Open Cilk aims to provide a full-featured …
especially for researchers and teachers. Open Cilk aims to provide a full-featured …
Multidimensional intratile parallelization for memory-starved stencil computations
Optimizing the performance of stencil algorithms has been the subject of intense research
over the last two decades. Since many stencil schemes have low arithmetic intensity, most …
over the last two decades. Since many stencil schemes have low arithmetic intensity, most …
Parallel Longest Increasing Subsequence and van Emde Boas Trees
This paper studies parallel algorithms for the longest increasing subsequence (LIS)
problem. Let n be the input size and k be the LIS length of the input. Sequentially, LIS is a …
problem. Let n be the input size and k be the LIS length of the input. Sequentially, LIS is a …
Revisiting Temporal Blocking Stencil Optimizations
Iterative stencils are used widely across the spectrum of High Performance Computing
(HPC) applications. Many efforts have been put into optimizing stencil GPU kernels, given …
(HPC) applications. Many efforts have been put into optimizing stencil GPU kernels, given …
Tessellating stencils
L Yuan, Y Zhang, P Guo, S Huang - Proceedings of the International …, 2017 - dl.acm.org
Stencil computations represent a very common class of nested loops in scientific and
engineering applications. The exhaustively studied tiling is one of the most powerful …
engineering applications. The exhaustively studied tiling is one of the most powerful …
Provably efficient scheduling of cache-oblivious wavefront algorithms
Iterative wavefront algorithms for evaluating dynamic programming recurrences exploit
optimal parallelism but show poor cache performance. Tiled-iterative wavefront algorithms …
optimal parallelism but show poor cache performance. Tiled-iterative wavefront algorithms …
Extending the nested parallel model to the nested dataflow model with provably efficient schedulers
D Dinh, HV Simhadri, Y Tang - … of the 28th ACM Symposium on …, 2016 - dl.acm.org
The nested parallel (aka fork-join) model is widely used for writing parallel programs.
However, the two composition constructs, ie"||"(parallel) and";"(serial), that comprise the …
However, the two composition constructs, ie"||"(parallel) and";"(serial), that comprise the …
Autogen: Automatic discovery of efficient recursive divide-8-conquer algorithms for solving dynamic programming problems
We present Autogen—an algorithm that for a wide class of dynamic programming (DP)
problems automatically discovers highly efficient cache-oblivious parallel recursive divide …
problems automatically discovers highly efficient cache-oblivious parallel recursive divide …