Program scalability analysis for HPC cloud: Applying Amdahl's law to NAS benchmarks

JY Shi, M Taifi, A Pradeep… - 2012 SC Companion …, 2012 - ieeexplore.ieee.org
The availability of high performance computing (HPC) clouds requires scalability analysis of
parallel programs for multiple different environments in order to maximize the promised …

AnkaCom: A development and experiment for extreme scale computing

Y Celik, A Pradeep, JY Shi - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
Extreme scale computing has no implied scaling limit. The impossibility of implementing
reliable communication between crashing hosts prohibits explicit-communication primitives …

Seeking the principles of sustainable software engineering

J Shi - arXiv preprint arXiv:1405.4464, 2014 - arxiv.org
Like other engineering disciplines, software engineering should also have principles to
guide the construction of sustainable computer applications. Tangible properties include a) …

OSI standards and the top fallacy of distributed computing

JY Shi - 2016 45th International Conference on Parallel …, 2016 - ieeexplore.ieee.org
In 1994, the OSI (Open System Interconnection)/IEC 7498-1: 1994 Model was published.
The two-sided point-to-point communication TCP/IP protocol became widely accepted as the …

Use HDLC principles for distributed applications

JY Shi - 2015 IEEE International Conference on Computer and …, 2015 - ieeexplore.ieee.org
A dependable computing system should not have uncontrolled data losses. This simple
requirement is easy to satisfy at the systems level but seems impossible at the application …

Jenergy: A fault tolerant stateless architecture for high performance computing

M Taifi, JY Shi, Y Celik - 2015 IEEE Symposium on Service …, 2015 - ieeexplore.ieee.org
Large scale HPC (high performance computing) applications require thousands of nodes for
computing parallel scientific applications. At this scale, hardware and software failures …

On the resilience of mission critical applications

JY Shi - 2015 Resilience Week (RWS), 2015 - ieeexplore.ieee.org
Most mission critical applications are distributed onto multiple computers. They need to
survive extreme conditions. Today, regardless methodologies, all existing mission critical …

[BOOK][B] Stateless parallel processing architecture for extreme scale HPC and auction-based clouds

M Taifi - 2013 - search.proquest.com
Extreme scale HPC (high performance computing) applications require massively many
nodes. At these scales, transient hardware and software failures, as well as network …

Single-Sided Statistic Multiplexed High Performance Computing

J Shi, Y Celik - … Workshop on Software Engineering for High …, 2016 - ieeexplore.ieee.org
For the last three decades, end-to-end computing has been the de facto paradigm for
distributed and parallel computing. MPI (Message Passing Interface), RPC (Remote …

Symposium: Cap-plus for big data

JY Shi - Proceedings of the 2014 International Conference on …, 2014 - dl.acm.org
Data intensive parallel applications are harder to protect against transient software and
hardware failures compared to traditional parallel applications. Due to the need for …