Power efficient processor architecture and the Cell processor

R Wilhelm, J Engblom, A Ermedahl, N Holsti… - ACM Transactions on …, 2008 - dl.acm.org

The determination of upper bounds on execution times, commonly called worst-case
execution times (WCETs), is a necessary step in the development and validation process for …

被引用次数：2715 相关文章所有 27 个版本

[HTML] nih.gov

GPU-accelerated molecular modeling coming of age

JE Stone, DJ Hardy, IS Ufimtsev, K Schulten - Journal of Molecular …, 2010 - Elsevier

Graphics processing units (GPUs) have traditionally been used in molecular modeling solely
for visualization of molecular structures and animation of trajectories resulting from …

被引用次数：527 相关文章所有 9 个版本

[HTML] nih.gov

[HTML][HTML] OpenCL: A parallel programming standard for heterogeneous computing systems

JE Stone, D Gohara, G Shi - Computing in science & engineering, 2010 - ncbi.nlm.nih.gov

OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems - PMC Back
to Top Skip to main content NIH NLM Logo Access keys NCBI Homepage MyNCBI Homepage …

被引用次数：2368 相关文章所有 15 个版本

[PDF] utexas.edu

Introduction to the Cell multiprocessor

JA Kahle, MN Day, HP Hofstee… - IBM journal of …, 2005 - ieeexplore.ieee.org

This paper provides an introductory overview of the Cell multiprocessor. Cell represents a
revolutionary extension of conventional microprocessor architecture and organization. The …

被引用次数：1449 相关文章所有 22 个版本

[PDF] microsoft.com

Exploiting coarse-grained task, data, and pipeline parallelism in stream programs

MI Gordon, W Thies, S Amarasinghe - ACM SIGPLAN Notices, 2006 - dl.acm.org

As multicore architectures enter the mainstream, there is a pressing demand for high-level
programming models that can effectively map to them. Stream programming offers an …

被引用次数：768 相关文章所有 24 个版本

Interconnects in the third dimension: Design challenges for 3D ICs

K Bernstein, P Andry, J Cann, P Emma… - Proceedings of the 44th …, 2007 - dl.acm.org

Despite generation upon generation of scaling, computer chips have until now remained
essentially 2-dimensional. Improvements in on-chip wire delay and in the maximum number …

被引用次数：442 相关文章所有 9 个版本

[PDF] tolia.org

GViM: GPU-accelerated virtual machines

V Gupta, A Gavrilovska, K Schwan, H Kharche… - Proceedings of the 3rd …, 2009 - dl.acm.org

The use of virtualization to abstract underlying hardware can aid in sharing such resources
and in efficiently managing their use by high performance applications. Unfortunately …

被引用次数：373 相关文章所有 13 个版本

[PDF] wustl.edu

Synergistic processing in cell's multicore architecture

M Gschwind, HP Hofstee, B Flachs, M Hopkins… - IEEE micro, 2006 - ieeexplore.ieee.org

Eight synergistic processor units enable the Cell Broadband Engine's breakthrough
performance. The SPU architecture implements a novel, pervasively data-parallel …

被引用次数：576 相关文章所有 22 个版本

[PDF] anu.edu.au

Cell multiprocessor communication network: Built for speed

M Kistler, M Perrone, F Petrini - IEEE micro, 2006 - ieeexplore.ieee.org

Multicore designs promise various power-performance and area-performance benefits. But
inadequate design of the on-chip communication network can deprive applications of these …

被引用次数：517 相关文章所有 16 个版本

[PDF] auburn.edu

Extending Amdahl's law for energy-efficient computing in the many-core era

DH Woo, HHS Lee - Computer, 2008 - ieeexplore.ieee.org

Extending Amdahl's Law for Energy-Efficient Computing in the Many-Core Era Page 1
Extending Amdahl’s Law for Energy-Efficient Computing in the Many-Core Era Dong Hyuk Woo …

被引用次数：331 相关文章所有 16 个版本