Harnessing the computing continuum for programming our world
This chapter outlines a vision for how best to harness the computing continuum of
interconnected sensors, actuators, instruments, and computing systems, from small numbers …
interconnected sensors, actuators, instruments, and computing systems, from small numbers …
A survey on evaluating and optimizing performance of Intel Xeon Phi
S Mittal - Concurrency and Computation: Practice and …, 2020 - Wiley Online Library
Summary Intel's Xeon Phi combines the parallel processing power of a many‐core
accelerator with the programming ease of CPUs. In this paper, we present a survey of works …
accelerator with the programming ease of CPUs. In this paper, we present a survey of works …
Dlobd: A comprehensive study of deep learning over big data stacks on hpc clusters
D eep L earning o ver B ig D ata (DLoBD) is an emerging paradigm to mine value from the
massive amount of gathered data. Many Deep Learning frameworks, like Caffe, TensorFlow …
massive amount of gathered data. Many Deep Learning frameworks, like Caffe, TensorFlow …
Effect of distributed directories in mesh interconnects
Recent manycore processors are kept coherent using scalable distributed directories. A
paramount example is the Xeon Phi Knights Landing. It features 38 tiles packed in a single …
paramount example is the Xeon Phi Knights Landing. It features 38 tiles packed in a single …
HeAT–a distributed and GPU-accelerated tensor framework for data analytics
To cope with the rapid growth in available data, the efficiency of data analysis and machine
learning libraries has recently received increased attention. Although great advancements …
learning libraries has recently received increased attention. Although great advancements …
HySec-Flow: privacy-preserving genomic computing with SGX-based big-data analytics framework
Trusted execution environments (TEE) such as In-tel's Software Guard Extension (SGX)
have been widely studied to boost security and privacy protection for the computation of …
have been widely studied to boost security and privacy protection for the computation of …
Harpgbdt: Optimizing gradient boosting decision tree for parallel efficiency
Gradient Boosting Decision Tree (GBDT) is a widely used machine learning algorithm,
whose training involves both irregular computation and random memory access and is …
whose training involves both irregular computation and random memory access and is …
Exploring HPC and big data convergence: A graph processing study on Intel Knights Landing
A Uta, AL Varbanescu, A Musaafir… - 2018 IEEE …, 2018 - ieeexplore.ieee.org
The question" Can big data and HPC infrastructure converge?" has important implications
for many operators and clients of modern computing. However, answering it is challenging …
for many operators and clients of modern computing. However, answering it is challenging …
Optimizing coherence traffic in manycore processors using closed-form caching/home agent mappings
Manycore processors feature a high number of general-purpose cores designed to work in a
multithreaded fashion. Recent manycore processors are kept coherent using scalable …
multithreaded fashion. Recent manycore processors are kept coherent using scalable …
Simulating the network activity of modern manycores
M Horro, G Rodríguez, J Touriño - IEEE Access, 2019 - ieeexplore.ieee.org
Manycore architectures are one of the most promising candidates to reach the exascale.
However, the increase in the number of cores on a single die exacerbates the memory wall …
However, the increase in the number of cores on a single die exacerbates the memory wall …