A survey on spark ecosystem: Big data processing infrastructure, machine learning, and applications

S Tang, B He, C Yu, Y Li, K Li - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
With the explosive increase of big data in industry and academic fields, it is important to
apply large-scale data processing systems to analyze Big Data. Arguably, Spark is the state …

Tensorscone: A secure tensorflow framework using intel sgx

R Kunkel, DL Quoc, F Gregor, S Arnautov… - arXiv preprint arXiv …, 2019 - arxiv.org
Machine learning has become a critical component of modern data-driven online services.
Typically, the training phase of machine learning techniques requires to process large-scale …

Approxjoin: Approximate distributed joins

DL Quoc, IE Akkus, P Bhatotia, S Blanas… - Proceedings of the …, 2018 - dl.acm.org
A distributed join is a fundamental operation for processing massive datasets in parallel.
Unfortunately, computing an equi-join over such datasets is very resource-intensive, even …

[PDF][PDF] Optimized bootstrap sampling for σ-aqp error estimation: A pilot study

S Cal, E Cheng, F Yu - Proceedings of ISCA 30th International Confer, 2021 - easychair.org
Approximate query processing (AQP) aims to provide an approximated answer close to the
exact answer efficiently for a complex query on large datasets, especially big data. It brings …

Approximate edge analytics for the iot ecosystem

Z Wen, DL Quoc, P Bhatotia, R Chen, M Lee - arXiv preprint arXiv …, 2018 - arxiv.org
IoT-enabled devices continue to generate a massive amount of data. Transforming this
continuously arriving raw data into timely insights is critical for many modern online services …

[PDF][PDF] Non-Parametric Error Estimation for σ-AQP using Optimized Bootstrap Sampling.

F Yu, S Cal, E Cheng, L Kerns, W Xiong - International Journal for …, 2022 - isca-hq.org
Approximate query processing (or AQP) aims to quickly provide approximated answers for
time-consuming search queries on large datasets. It brings enormous benefits in data …

[PDF][PDF] ApproxJoin: Approximate Distributed Joins

S Blanas, R Chen, C Fetzer, T Strufe - 2018 - iakkus.github.io
ABSTRACT A distributed join is a fundamental operation for processing massive datasets in
parallel. Unfortunately, computing an equi-join over such datasets is very resource …