Affinity-based thread and data mapping in shared memory systems
Shared memory architectures have recently experienced a large increase in thread-level
parallelism, leading to complex memory hierarchies with multiple cache memory levels and …
parallelism, leading to complex memory hierarchies with multiple cache memory levels and …
Bandwidth-aware page placement in numa
Page placement is a critical problem for memory-intensive applications running on a shared-
memory multiprocessor with a non-uniform memory access (NUMA) architecture. State-of …
memory multiprocessor with a non-uniform memory access (NUMA) architecture. State-of …
Optimizing machine learning algorithms on multi-core and many-core architectures using thread and data mapping
Driven by the development of new technologies such as personal assistants or autonomous
cars, machine learning has rapidly become one of the most active fields in computer …
cars, machine learning has rapidly become one of the most active fields in computer …
PMAlloc: A Holistic Approach to Improving Persistent Memory Allocation
Persistent memory allocation is a fundamental building block for developing high-
performance and in-memory applications. Existing persistent memory allocators suffer from …
performance and in-memory applications. Existing persistent memory allocators suffer from …
Memory‐aware kernel mechanism and policies for improving internode load balancing on NUMA systems
ML Chiang, WL Su, SW Tu… - Software: Practice and …, 2019 - Wiley Online Library
Although nonuniform memory access architecture provides better scalability for multicore
systems, cores accessing memory on remote nodes take longer than those accessing on …
systems, cores accessing memory on remote nodes take longer than those accessing on …
Thread-aware mechanism to enhance inter-node load balancing for multithreaded applications on numa systems
ML Chiang, WL Su - Applied Sciences, 2021 - mdpi.com
NUMA multi-core systems divide system resources into several nodes. When an imbalance
in the load between cores occurs, the kernel scheduler's load balancing mechanism then …
in the load between cores occurs, the kernel scheduler's load balancing mechanism then …
Improving communication and load balancing with thread mapping in manycore systems
Communication and load balancing have a significant impact on the performance of parallel
applications and have been the subject of extensive research in multicore architectures …
applications and have been the subject of extensive research in multicore architectures …
Resource management and security issues in mobile phone operating systems: A comparative analysis
Nowadays Mobile phones are becoming more popular in our daily lives. Mobile technology
has a great effect on human life. Our daily tasks are dependent on mobile devices. Memory …
has a great effect on human life. Our daily tasks are dependent on mobile devices. Memory …
Combining thread throttling and mapping to optimize the edp of parallel applications
GP Berned, TS Medeiros, M Serpa… - 2021 29th Euromicro …, 2021 - ieeexplore.ieee.org
Thread-throttling and mapping strategies have been used together to make better use of
hardware resources and improve the energy-delay product (EDP) of high-performance …
hardware resources and improve the energy-delay product (EDP) of high-performance …
A performance-stable NUMA management scheme for linux-based HPC systems
Linux is becoming the de-facto standard operating system for today's high-performance
computing (HPC) systems because it can satisfy the demands of many HPC systems for rich …
computing (HPC) systems because it can satisfy the demands of many HPC systems for rich …