Affinity-based thread and data mapping in shared memory systems

M Diener, EHM Cruz, MAZ Alves, POA Navaux… - ACM Computing …, 2016 - dl.acm.org
Shared memory architectures have recently experienced a large increase in thread-level
parallelism, leading to complex memory hierarchies with multiple cache memory levels and …

Bandwidth-aware page placement in numa

D Gureya, J Neto, R Karimi, J Barreto… - 2020 IEEE …, 2020 - ieeexplore.ieee.org
Page placement is a critical problem for memory-intensive applications running on a shared-
memory multiprocessor with a non-uniform memory access (NUMA) architecture. State-of …

Optimizing machine learning algorithms on multi-core and many-core architectures using thread and data mapping

MS Serpa, AM Krause, EHM Cruz… - 2018 26th Euromicro …, 2018 - ieeexplore.ieee.org
Driven by the development of new technologies such as personal assistants or autonomous
cars, machine learning has rapidly become one of the most active fields in computer …

PMAlloc: A Holistic Approach to Improving Persistent Memory Allocation

Z Dang, S He, X Zhang, P Hong, Z Li, X Chen… - ACM Transactions on …, 2024 - dl.acm.org
Persistent memory allocation is a fundamental building block for developing high-
performance and in-memory applications. Existing persistent memory allocators suffer from …

Memory‐aware kernel mechanism and policies for improving internode load balancing on NUMA systems

ML Chiang, WL Su, SW Tu… - Software: Practice and …, 2019 - Wiley Online Library
Although nonuniform memory access architecture provides better scalability for multicore
systems, cores accessing memory on remote nodes take longer than those accessing on …

Thread-aware mechanism to enhance inter-node load balancing for multithreaded applications on numa systems

ML Chiang, WL Su - Applied Sciences, 2021 - mdpi.com
NUMA multi-core systems divide system resources into several nodes. When an imbalance
in the load between cores occurs, the kernel scheduler's load balancing mechanism then …

Improving communication and load balancing with thread mapping in manycore systems

EHM Cruz, M Diener, MS Serpa… - 2018 26th Euromicro …, 2018 - ieeexplore.ieee.org
Communication and load balancing have a significant impact on the performance of parallel
applications and have been the subject of extensive research in multicore architectures …

Resource management and security issues in mobile phone operating systems: A comparative analysis

KM Awan, M Waqar, M Faseeh, F Ullah, MQ Saleem - 2017 - peerj.com
Nowadays Mobile phones are becoming more popular in our daily lives. Mobile technology
has a great effect on human life. Our daily tasks are dependent on mobile devices. Memory …

Combining thread throttling and mapping to optimize the edp of parallel applications

GP Berned, TS Medeiros, M Serpa… - 2021 29th Euromicro …, 2021 - ieeexplore.ieee.org
Thread-throttling and mapping strategies have been used together to make better use of
hardware resources and improve the energy-delay product (EDP) of high-performance …

A performance-stable NUMA management scheme for linux-based HPC systems

J Song, M Ahn, G Lee, E Seo, J Jeong - IEEE Access, 2021 - ieeexplore.ieee.org
Linux is becoming the de-facto standard operating system for today's high-performance
computing (HPC) systems because it can satisfy the demands of many HPC systems for rich …