High-Performance Spatial Data Analytics: Systematic R&D for Scale-Out and Scale-Up Solutions from the Past to Now

F Wang, R Lee, D Teng, X Zhang, J Saltz - Proceedings of the VLDB …, 2024 - dl.acm.org
We released open-source software Hadoop-GIS in 2011, and presented and published the
work in VLDB 2013. This work initiated the development of a new spatial data analytical …

Imageminer: a software system for comparative analysis of tissue microarrays using content-based image retrieval, high-performance computing, and grid technology

DJ Foran, L Yang, W Chen, J Hu… - Journal of the …, 2011 - academic.oup.com
Objective and design The design and implementation of ImageMiner, a software platform for
performing comparative analysis of expression patterns in imaged microscopy specimens …

Robust heuristic algorithms for exploiting the common tasks of relational cloud database queries

T Dokeroglu, MA Bayir, A Cosar - Applied Soft Computing, 2015 - Elsevier
Cloud computing enables a conventional relational database system's hardware to be
adjusted dynamically according to query workload, performance and deadline constraints …

Grid-based parallel data streaming implemented for the gyrokinetic toroidal code

S Klasky, S Ethier, Z Lin, K Martins, D McCune… - Proceedings of the …, 2003 - dl.acm.org
We have developed a threaded parallel data streaming approach using Globus to transfer
multi-terabyte simulation data from a remote supercomputer to the scientistýs home …

Image processing for the grid: a toolkit for building grid-enabled image processing applications

S Hastings, T Kurc, S Langella… - CCGrid 2003. 3rd …, 2003 - ieeexplore.ieee.org
Analyzing large and distributed image datasets is a crucial step in understanding the
structural and functional characteristics of biological systems. In this paper, we present the …

Textiverse: A scalable visual analytics system for exploring geotagged and timestamped text corpora

C Berger, H Xian, K Madhavan, N Elmqvist - arXiv preprint arXiv …, 2023 - arxiv.org
We propose Textiverse, a big data approach for mining geotagged timestamped textual data
on a map, such as for Twitter feeds, crime reports, or restaurant reviews. We use a scalable …

Improving the performance of Hadoop Hive by sharing scan and computation tasks

T Dokeroglu, S Ozal, MA Bayir, MS Cinar… - Journal of Cloud …, 2014 - Springer
MapReduce is a popular programming model for executing time-consuming analytical
queries as a batch of tasks on large scale data clusters. In environments where multiple …

Executing multiple pipelined data analysis operations in the grid

M Spencer, R Ferreira, M Beynon… - SC'02: Proceedings …, 2002 - ieeexplore.ieee.org
Processing of data in many data analysis applications can be represented as an acyclic,
coarse grain data flow, from data sources to the client. This paper is concerned with …

Cumulvs: Interacting with high-performance scientific simulations, for visualization, steering and fault tolerance

JA Kohl, T Wilde, DE Bernholdt - The International Journal …, 2006 - journals.sagepub.com
High-performance computer simulations are an increasingly popular alternative or
complement to physical experiments or prototypes. However, as these simulations grow …

Anthill: A scalable run-time environment for data mining applications

RA Ferreira, W Meira, D Guedes… - … (SBAC-PAD'05), 2005 - ieeexplore.ieee.org
Data mining techniques are becoming increasingly more popular as a reasonable means to
collect summaries from the rapidly growing datasets in many areas. However, as the size of …