Approximate distributed joins in Apache Spark

S Tang, B He, C Yu, Y Li, K Li - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

With the explosive increase of big data in industry and academic fields, it is important to
apply large-scale data processing systems to analyze Big Data. Arguably, Spark is the state …

Cited by 91 · Related articles · All 6 versions

[PDF] arxiv.org

TensorSCONE: A secure TensorFlow framework using Intel SGX

R Kunkel, DL Quoc, F Gregor, S Arnautov… - arXiv preprint arXiv …, 2019 - arxiv.org

Machine learning has become a critical component of modern data-driven online services.
Typically, the training phase of machine learning techniques requires to process large-scale …

Cited by 81 · Related articles · All 2 versions

[PDF] nsf.gov

ApproxJoin: Approximate distributed joins

DL Quoc, IE Akkus, P Bhatotia, S Blanas… - Proceedings of the …, 2018 - dl.acm.org

A distributed join is a fundamental operation for processing massive datasets in parallel.
Unfortunately, computing an equi-join over such datasets is very resource-intensive, even …

Cited by 22 · Related articles · All 5 versions

[PDF] easychair.org

[PDF] Optimized bootstrap sampling for σ-AQP error estimation: A pilot study

S Cal, E Cheng, F Yu - Proceedings of ISCA 30th International Confer, 2021 - easychair.org

Approximate query processing (AQP) aims to provide an approximated answer close to the
exact answer efficiently for a complex query on large datasets, especially big data. It brings …

Cited by 2 · Related articles · All 2 versions

[PDF] arxiv.org

Approximate edge analytics for the IoT ecosystem

Z Wen, DL Quoc, P Bhatotia, R Chen, M Lee - arXiv preprint arXiv …, 2018 - arxiv.org

IoT-enabled devices continue to generate a massive amount of data. Transforming this
continuously arriving raw data into timely insights is critical for many modern online services …

Cited by 2 · Related articles · All 2 versions

[PDF] isca-hq.org

[PDF] Non-Parametric Error Estimation for σ-AQP using Optimized Bootstrap Sampling.

F Yu, S Cal, E Cheng, L Kerns, W Xiong - International Journal for …, 2022 - isca-hq.org

Approximate query processing (or AQP) aims to quickly provide approximated answers for
time-consuming search queries on large datasets. It brings enormous benefits in data …

[PDF] ApproxJoin: Approximate Distributed Joins

S Blanas, R Chen, C Fetzer, T Strufe - 2018 - iakkus.github.io

ABSTRACT A distributed join is a fundamental operation for processing massive datasets in
parallel. Unfortunately, computing an equi-join over such datasets is very resource …