Harnessing the computing continuum for programming our world

P Beckman, J Dongarra, N Ferrier, G Fox… - … : Theory and Practice, 2020 - Wiley Online Library
This chapter outlines a vision for how best to harness the computing continuum of
interconnected sensors, actuators, instruments, and computing systems, from small numbers …

A survey on evaluating and optimizing performance of Intel Xeon Phi

S Mittal - Concurrency and Computation: Practice and …, 2020 - Wiley Online Library
Summary Intel's Xeon Phi combines the parallel processing power of a many‐core
accelerator with the programming ease of CPUs. In this paper, we present a survey of works …

Dlobd: A comprehensive study of deep learning over big data stacks on hpc clusters

X Lu, H Shi, R Biswas, MH Javed… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
D eep L earning o ver B ig D ata (DLoBD) is an emerging paradigm to mine value from the
massive amount of gathered data. Many Deep Learning frameworks, like Caffe, TensorFlow …

Effect of distributed directories in mesh interconnects

M Horro, MT Kandemir, LN Pouchet… - Proceedings of the 56th …, 2019 - dl.acm.org
Recent manycore processors are kept coherent using scalable distributed directories. A
paramount example is the Xeon Phi Knights Landing. It features 38 tiles packed in a single …

HeAT–a distributed and GPU-accelerated tensor framework for data analytics

M Götz, C Debus, D Coquelin, K Krajsek… - … Conference on Big …, 2020 - ieeexplore.ieee.org
To cope with the rapid growth in available data, the efficiency of data analysis and machine
learning libraries has recently received increased attention. Although great advancements …

HySec-Flow: privacy-preserving genomic computing with SGX-based big-data analytics framework

C Widanage, W Liu, J Li, H Chen… - 2021 IEEE 14th …, 2021 - ieeexplore.ieee.org
Trusted execution environments (TEE) such as In-tel's Software Guard Extension (SGX)
have been widely studied to boost security and privacy protection for the computation of …

Harpgbdt: Optimizing gradient boosting decision tree for parallel efficiency

B Peng, L Chen, J Li, M Jiang, S Akkas… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
Gradient Boosting Decision Tree (GBDT) is a widely used machine learning algorithm,
whose training involves both irregular computation and random memory access and is …

Exploring HPC and big data convergence: A graph processing study on Intel Knights Landing

A Uta, AL Varbanescu, A Musaafir… - 2018 IEEE …, 2018 - ieeexplore.ieee.org
The question" Can big data and HPC infrastructure converge?" has important implications
for many operators and clients of modern computing. However, answering it is challenging …

Optimizing coherence traffic in manycore processors using closed-form caching/home agent mappings

S Kommrusch, M Horro, LN Pouchet… - IEEE …, 2021 - ieeexplore.ieee.org
Manycore processors feature a high number of general-purpose cores designed to work in a
multithreaded fashion. Recent manycore processors are kept coherent using scalable …

Simulating the network activity of modern manycores

M Horro, G Rodríguez, J Touriño - IEEE Access, 2019 - ieeexplore.ieee.org
Manycore architectures are one of the most promising candidates to reach the exascale.
However, the increase in the number of cores on a single die exacerbates the memory wall …