A case for malleable thread-level linear algebra libraries: The LU factorization with partial pivoting
We propose two novel techniques for overcoming the load imbalance encountered when
implementing so-called look-ahead mechanisms in relevant dense matrix factorizations for
the solution of linear systems. Both techniques target the scenario where two thread teams
are created/activated during the factorization, with each team in charge of performing an
independent task/branch of execution. The first technique promotes worker sharing (WS)
between the two tasks, allowing the threads of the task that completes first to be reallocated …