High-Performance Spatial Data Analytics: Systematic R&D for Scale-Out and Scale-Up Solutions from the Past to Now
We released open-source software Hadoop-GIS in 2011, and presented and published the
work in VLDB 2013. This work initiated the development of a new spatial data analytical …
work in VLDB 2013. This work initiated the development of a new spatial data analytical …
Imageminer: a software system for comparative analysis of tissue microarrays using content-based image retrieval, high-performance computing, and grid technology
DJ Foran, L Yang, W Chen, J Hu… - Journal of the …, 2011 - academic.oup.com
Objective and design The design and implementation of ImageMiner, a software platform for
performing comparative analysis of expression patterns in imaged microscopy specimens …
performing comparative analysis of expression patterns in imaged microscopy specimens …
Robust heuristic algorithms for exploiting the common tasks of relational cloud database queries
Cloud computing enables a conventional relational database system's hardware to be
adjusted dynamically according to query workload, performance and deadline constraints …
adjusted dynamically according to query workload, performance and deadline constraints …
Grid-based parallel data streaming implemented for the gyrokinetic toroidal code
We have developed a threaded parallel data streaming approach using Globus to transfer
multi-terabyte simulation data from a remote supercomputer to the scientistýs home …
multi-terabyte simulation data from a remote supercomputer to the scientistýs home …
Image processing for the grid: a toolkit for building grid-enabled image processing applications
S Hastings, T Kurc, S Langella… - CCGrid 2003. 3rd …, 2003 - ieeexplore.ieee.org
Analyzing large and distributed image datasets is a crucial step in understanding the
structural and functional characteristics of biological systems. In this paper, we present the …
structural and functional characteristics of biological systems. In this paper, we present the …
Textiverse: A scalable visual analytics system for exploring geotagged and timestamped text corpora
We propose Textiverse, a big data approach for mining geotagged timestamped textual data
on a map, such as for Twitter feeds, crime reports, or restaurant reviews. We use a scalable …
on a map, such as for Twitter feeds, crime reports, or restaurant reviews. We use a scalable …
Improving the performance of Hadoop Hive by sharing scan and computation tasks
MapReduce is a popular programming model for executing time-consuming analytical
queries as a batch of tasks on large scale data clusters. In environments where multiple …
queries as a batch of tasks on large scale data clusters. In environments where multiple …
Executing multiple pipelined data analysis operations in the grid
M Spencer, R Ferreira, M Beynon… - SC'02: Proceedings …, 2002 - ieeexplore.ieee.org
Processing of data in many data analysis applications can be represented as an acyclic,
coarse grain data flow, from data sources to the client. This paper is concerned with …
coarse grain data flow, from data sources to the client. This paper is concerned with …
Cumulvs: Interacting with high-performance scientific simulations, for visualization, steering and fault tolerance
JA Kohl, T Wilde, DE Bernholdt - The International Journal …, 2006 - journals.sagepub.com
High-performance computer simulations are an increasingly popular alternative or
complement to physical experiments or prototypes. However, as these simulations grow …
complement to physical experiments or prototypes. However, as these simulations grow …
Anthill: A scalable run-time environment for data mining applications
Data mining techniques are becoming increasingly more popular as a reasonable means to
collect summaries from the rapidly growing datasets in many areas. However, as the size of …
collect summaries from the rapidly growing datasets in many areas. However, as the size of …