A survey on NoSQL stores

A Davoudian, L Chen, M Liu - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Recent demands for storing and querying big data have revealed various shortcomings of
traditional relational database systems. This, in turn, has led to the emergence of a new kind …

Federated learning on non-iid data silos: An experimental study

Q Li, Y Diao, Q Chen, B He - 2022 IEEE 38th international …, 2022 - ieeexplore.ieee.org
Due to the increasing privacy concerns and data regulations, training data have been
increasingly fragmented, forming distributed databases of multiple “data silos” (e.g., within …

Sprocket: A serverless video processing framework

L Ao, L Izhikevich, GM Voelker, G Porter - Proceedings of the ACM …, 2018 - dl.acm.org
Sprocket is a highly configurable, stage-based, scalable, serverless video processing
framework that exploits intra-video parallelism to achieve low latency. Sprocket enables …

[BOOK] Magellan: Toward building entity matching management systems

PV Konda - 2018 - search.proquest.com
Entity matching (EM) identifies data instances that refer to the same real-world entity, such
as (David Smith, UWMadison) and (DM Smith, UWM). This problem has been a long …

[PDF] Reining in the outliers in Map-Reduce clusters using Mantri

G Ananthanarayanan, S Kandula… - … USENIX Symposium on …, 2010 - usenix.org
Experience from an operational Map-Reduce cluster reveals that outliers significantly
prolong job completion. The causes for outliers include run-time contention for processor …

SkewTune: mitigating skew in MapReduce applications

YC Kwon, M Balazinska, B Howe, J Rolia - Proceedings of the 2012 …, 2012 - dl.acm.org
We present an automatic skew mitigation approach for user-defined MapReduce programs
and present SkewTune, a system that implements this approach as a drop-in replacement …

Profiling, what-if analysis, and cost-based optimization of MapReduce programs

H Herodotou, S Babu - Proceedings of the VLDB Endowment, 2011 - dl.acm.org
MapReduce has emerged as a viable competitor to database systems in big data analytics.
MapReduce programs are being written for a wide variety of application domains including …

A survey of large-scale analytical query processing in MapReduce

C Doulkeridis, K Nørvåg - The VLDB journal, 2014 - Springer
Enterprises today acquire vast volumes of data from different sources and leverage this
information by means of data analysis to support effective decision-making and provide new …

Processing theta-joins using MapReduce

A Okcan, M Riedewald - Proceedings of the 2011 ACM SIGMOD …, 2011 - dl.acm.org
Joins are essential for many data analysis tasks, but are not supported directly by the
MapReduce paradigm. While there has been progress on equi-joins, implementation of join …

How good are modern spatial analytics systems?

V Pandey, A Kipf, T Neumann, A Kemper - Proceedings of the VLDB …, 2018 - dl.acm.org
Spatial data is pervasive. Large amount of spatial data is produced every day from GPS-
enabled devices such as cell phones, cars, sensors, and various consumer based …