Performance evaluation of Intel® transactional synchronization extensions for high-performance computing
Intel has recently introduced Intel® Transactional Synchronization Extensions (Intel® TSX)
in the Intel 4th Generation Core™ Processors. With Intel TSX, a processor can dynamically …
in the Intel 4th Generation Core™ Processors. With Intel TSX, a processor can dynamically …
Hardware transactional memory for GPU architectures
WWL Fung, I Singh, A Brownsword… - Proceedings of the 44th …, 2011 - dl.acm.org
Graphics processor units (GPUs) are designed to efficiently exploit thread level parallelism
(TLP), multiplexing execution of 1000s of concurrent threads on a relatively smaller set of …
(TLP), multiplexing execution of 1000s of concurrent threads on a relatively smaller set of …
Leveraging crowd knowledge for software comprehension and development
L Ponzanelli, A Bacchelli… - 2013 17th European …, 2013 - ieeexplore.ieee.org
Question and Answer (Q&A) services, such as Stack Overflow, rely on a community of
programmers who post questions, provide and rate answers, to create what is termed" crowd …
programmers who post questions, provide and rate answers, to create what is termed" crowd …
Energy efficient GPU transactional memory via space-time optimizations
Many applications with regular parallelism have been shown to benefit from using Graphics
Processing Units (GPUs). However, employing GPUs for applications with irregular …
Processing Units (GPUs). However, employing GPUs for applications with irregular …
Software partitioning of hardware transactions
Best-effort hardware transactional memory (HTM) allows complex operations to execute
atomically and in parallel, so long as hardware buffers do not overflow, and conflicts are not …
atomically and in parallel, so long as hardware buffers do not overflow, and conflicts are not …
Fault tolerance for multi-threaded applications by leveraging hardware transactional memory
Providing fault tolerance especially to mission critical applications in order to detect transient
and permanent faults and to recover from them is one of the main necessity for processor …
and permanent faults and to recover from them is one of the main necessity for processor …
Quantifying the performance and energy-efficiency impact of hardware transactional memory on scientific applications on large-scale NUMA systems
Hardware transactional memory (HTM) is supported by widely-used commodity processors.
While the effectiveness of HTM has been evaluated based on small-scale multi-core …
While the effectiveness of HTM has been evaluated based on small-scale multi-core …
High-performance GPU transactional memory via eager conflict detection
GPUs transactional memory (TM) proposals to date have relied on lazy, value-based conflict
detection, assuming that GPUs can amortize the latency by executing other warps. In …
detection, assuming that GPUs can amortize the latency by executing other warps. In …
REMEDIATE: A scalable fault-tolerant architecture for low-power NUCA cache in tiled CMPs
A BanaiyanMofrad, H Homayoun… - 2013 International …, 2013 - ieeexplore.ieee.org
Technology scaling and process variation severely degrade the reliability of Chip
Multiprocessors (CMPs), especially their large cache blocks. To improve cache reliability, we …
Multiprocessors (CMPs), especially their large cache blocks. To improve cache reliability, we …
Stm2: A parallel stm for high performance simultaneous multithreading systems
Extracting high performance from modern chip multithreading (CMT) processors is a
complex task, especially for large CMT systems. Programmers must efficiently parallelize …
complex task, especially for large CMT systems. Programmers must efficiently parallelize …