[HTML][HTML] Big data analytics on Apache Spark

S Salloum, R Dautov, X Chen, PX Peng… - International Journal of …, 2016 - Springer
Apache Spark has emerged as the de facto framework for big data analytics with its
advanced in-memory programming model and upper-level libraries for scalable machine …

Attribute-based encryption with parallel outsourced decryption for edge intelligent IoV

C Feng, K Yu, M Aloqaily, M Alazab… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Edge intelligence is an emerging concept referring to processes in which data are collected
and analyzed and insights are delivered close to where the data are captured in a network …

Skyfire: Data-driven seed generation for fuzzing

J Wang, B Chen, L Wei, Y Liu - 2017 IEEE Symposium on …, 2017 - ieeexplore.ieee.org
Programs that take highly-structured files as inputs normally process inputs in stages: syntax
parsing, semantic checking, and application execution. Deep bugs are often hidden in the …

[HTML][HTML] A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench

N Ahmed, ALC Barczak, T Susnjak, MA Rashid - Journal of Big Data, 2020 - Springer
Big Data analytics for storing, processing, and analyzing large-scale datasets has become
an essential tool for the industry. The advent of distributed computing frameworks such as …

Big data in cloud computing: A resource management perspective

S Ullah, MD Awan… - Scientific …, 2018 - Wiley Online Library
The modern day advancement is increasingly digitizing our lives which has led to a rapid
growth of data. Such multidimensional datasets are precious due to the potential of …

An experimental survey on big data frameworks

W Inoubli, S Aridhi, H Mezni, M Maddouri… - Future Generation …, 2018 - Elsevier
Recently, increasingly large amounts of data are generated from a variety of sources.
Existing data processing technologies are not suitable to cope with the huge amounts of …

Spark versus flink: Understanding performance in big data analytics frameworks

OC Marcu, A Costan, G Antoniu… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Big Data analytics has recently gained increasing popularity as a tool to process large
amounts of data on-demand. Spark and Flink are two Apache-hosted data analytics …

Datasize-aware high dimensional configurations auto-tuning of in-memory cluster computing

Z Yu, Z Bei, X Qian - Proceedings of the Twenty-Third International …, 2018 - dl.acm.org
In-Memory cluster Computing (IMC) frameworks (eg, Spark) have become increasingly
important because they typically achieve more than 10× speedups over the traditional On …

An integrated big and fast data analytics platform for smart urban transportation management

S Fiore, D Elia, CE Pires, DG Mestre, C Cappiello… - IEEE …, 2019 - ieeexplore.ieee.org
Smart urban transportation management can be considered as a multifaceted big data
challenge. It strongly relies on the information collected into multiple, widespread, and …

[HTML][HTML] Processing big data with apache hadoop in the current challenging era of COVID-19

O Azeroual, R Fabre - Big Data and Cognitive Computing, 2021 - mdpi.com
Big data have become a global strategic issue, as increasingly large amounts of
unstructured data challenge the IT infrastructure of global organizations and threaten their …