Big Data and cloud computing: innovation opportunities and challenges

C Yang, Q Huang, Z Li, K Liu, F Hu - International Journal of Digital …, 2017 - Taylor & Francis
Big Data has emerged in the past few years as a new paradigm providing abundant data
and opportunities to improve and/or enable research and decision-support applications with …

Geospatial big data: challenges and opportunities

JG Lee, M Kang - Big Data Research, 2015 - Elsevier
Geospatial big data refers to spatial data sets exceeding capacity of current computing
systems. A significant portion of big data is actually geospatial data, and the size of such …

Geospark: A cluster computing framework for processing large-scale spatial data

J Yu, J Wu, M Sarwat - … of the 23rd SIGSPATIAL international conference …, 2015 - dl.acm.org
This paper introduces GeoSpark an in-memory cluster computing framework for processing
large-scale spatial data. GeoSpark consists of three layers: Apache Spark Layer, Spatial …

Large-scale spatial join query processing in cloud

S You, J Zhang, L Gruenwald - 2015 31st IEEE international …, 2015 - ieeexplore.ieee.org
The rapidly increasing amount of location data available in many applications has made it
desirable to process their large-scale spatial queries in Cloud for performance and …

[PDF][PDF] A review paper on big data and hadoop

HS Bhosale, DP Gadekar - International Journal of Scientific and …, 2014 - Citeseer
The term 'Big Data'describes innovative techniques and technologies to capture, store,
distribute, manage and analyze petabyte-or larger-sized datasets with high-velocity and …

Spatial partitioning techniques in SpatialHadoop

A Eldawy, L Alarabi, MF Mokbel - Proceedings of the VLDB Endowment, 2015 - dl.acm.org
SpatialHadoop is an extended MapReduce framework that supports global indexing that
spatial partitions the data across machines providing orders of magnitude speedup …

CG_Hadoop: computational geometry in MapReduce

A Eldawy, Y Li, MF Mokbel, R Janardan - Proceedings of the 21st ACM …, 2013 - dl.acm.org
Hadoop, employing the MapReduce programming paradigm, has been widely accepted as
the standard framework for analyzing big data in distributed environments. Unfortunately …

Shahed: A mapreduce-based system for querying and visualizing spatio-temporal satellite data

A Eldawy, MF Mokbel, S Alharthi… - 2015 IEEE 31st …, 2015 - ieeexplore.ieee.org
Remote sensing data collected by satellites are now made publicly available by several
space agencies. This data is very useful for scientists pursuing research in several …

Big spatial vector data management: a review

X Yao, G Li - Big Earth Data, 2018 - Taylor & Francis
Spatial vector data with high-precision and wide-coverage has exploded globally, such as
land cover, social media, and other data-sets, which provides a good opportunity to enhance …

The STARK framework for spatio-temporal data analytics on spark

S Hagedorn, P Gotze, KU Sattler - 2017 - dl.gi.de
Big Data sets can contain all types of information: from server log files to tracking information
of mobile users with their location at a point in time. Apache Spark has been widely …