[PDF] Cloud computing: architecture and key technologies

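Luo Junzhou, Jin Jiahui, Song Aibo, Dong Fang - Journal on Communications, 2011 - fs.gongkong.com
Systematically analyzes and summarizes the state of cloud computing research and divides the cloud computing architecture into three layers: core services, service management,
and user access interfaces. Centered on research goals such as low cost, high reliability, high availability, and scalability …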

Twister: a runtime for iterative MapReduce

J Ekanayake, H Li, B Zhang, T Gunarathne… - Proceedings of the 19th …, 2010 - dl.acm.org
MapReduce programming model has simplified the implementation of many data parallel
applications. The simplicity of the programming model and the quality of services provided …

Cloud computing in e-Science: research challenges and opportunities

X Yang, D Wallom, S Waddington, J Wang… - The Journal of …, 2014 - Springer
Abstract Service-oriented architecture (SOA), workflow, the Semantic Web, and Grid
computing are key enabling information technologies in the development of increasingly …

Load balancing for MapReduce-based entity resolution

L Kolb, A Thor, E Rahm - 2012 IEEE 28th international …, 2012 - ieeexplore.ieee.org
The effectiveness and scalability of MapReduce-based implementations of complex
data-intensive tasks depend on an even redistribution of data between map and reduce tasks. In …

High performance parallel computing with clouds and cloud technologies

J Ekanayake, G Fox - … , CloudComp 2009 Munich, Germany, October 19 …, 2010 - Springer
Infrastructure services (Infrastructure-as-a-service), provided by cloud vendors, allow any
user to provision a large number of compute instances fairly easily. Whether leased from …

Multi-objective scheduling of many tasks in cloud platforms

F Zhang, J Cao, K Li, SU Khan, K Hwang - Future Generation Computer …, 2014 - Elsevier
The scheduling of a many-task workflow in a distributed computing platform is a well known
NP-hard problem. The problem is even more complex and challenging when the virtualized …

Optimizing load balancing and data-locality with data-aware scheduling

K Wang, X Zhou, T Li, D Zhao, M Lang… - … Conference on Big …, 2014 - ieeexplore.ieee.org
Load balancing techniques (e.g., work stealing) are important to obtain the best performance
for distributed task scheduling systems that have multiple schedulers making scheduling …

Cloud technologies for bioinformatics applications

J Ekanayake, T Gunarathne… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Executing large number of independent jobs or jobs comprising of large number of tasks that
perform minimal intertask communication is a common requirement in many domains …

[HTML] Genomic big data hitting the storage bottleneck

L Papageorgiou, P Eleni, S Raftopoulou… - EMBnet …, 2018 - ncbi.nlm.nih.gov
During the last decades, there is a vast data explosion in bioinformatics. Big data centres are
trying to face this data crisis, reaching high storage capacity levels. Although several …

Challenges and approaches for distributed workflow-driven analysis of large-scale biological data: vision paper

I Altintas, J Wang, D Crawl, W Li - Proceedings of the 2012 Joint EDBT …, 2012 - dl.acm.org
Next-generation DNA sequencing machines are generating a very large amount of
sequence data with applications in many scientific challenges and placing unprecedented …