A survey on spark ecosystem: Big data processing infrastructure, machine learning, and applications
With the explosive increase of big data in industry and academic fields, it is important to
apply large-scale data processing systems to analyze Big Data. Arguably, Spark is the state …
apply large-scale data processing systems to analyze Big Data. Arguably, Spark is the state …
Tensorscone: A secure tensorflow framework using intel sgx
R Kunkel, DL Quoc, F Gregor, S Arnautov… - arXiv preprint arXiv …, 2019 - arxiv.org
Machine learning has become a critical component of modern data-driven online services.
Typically, the training phase of machine learning techniques requires to process large-scale …
Typically, the training phase of machine learning techniques requires to process large-scale …
Approxjoin: Approximate distributed joins
A distributed join is a fundamental operation for processing massive datasets in parallel.
Unfortunately, computing an equi-join over such datasets is very resource-intensive, even …
Unfortunately, computing an equi-join over such datasets is very resource-intensive, even …
[PDF][PDF] Optimized bootstrap sampling for σ-aqp error estimation: A pilot study
Approximate query processing (AQP) aims to provide an approximated answer close to the
exact answer efficiently for a complex query on large datasets, especially big data. It brings …
exact answer efficiently for a complex query on large datasets, especially big data. It brings …
Approximate edge analytics for the iot ecosystem
IoT-enabled devices continue to generate a massive amount of data. Transforming this
continuously arriving raw data into timely insights is critical for many modern online services …
continuously arriving raw data into timely insights is critical for many modern online services …
[PDF][PDF] Non-Parametric Error Estimation for σ-AQP using Optimized Bootstrap Sampling.
Approximate query processing (or AQP) aims to quickly provide approximated answers for
time-consuming search queries on large datasets. It brings enormous benefits in data …
time-consuming search queries on large datasets. It brings enormous benefits in data …
[PDF][PDF] ApproxJoin: Approximate Distributed Joins
ABSTRACT A distributed join is a fundamental operation for processing massive datasets in
parallel. Unfortunately, computing an equi-join over such datasets is very resource …
parallel. Unfortunately, computing an equi-join over such datasets is very resource …