Chi squared feature selection over Apache Spark

M Nassar, H Safa, AA Mutawa, A Helal… - Proceedings of the 23rd …, 2019 - dl.acm.org
We live in the age of big data and distributed computing. The current large scale
computation frameworks are based on a scaling-out approach for distributing tasks over a …

Fine and coarse grained composition and adaptation of spark applications

Z Shmeis, M Jaber - Future Generation Computer Systems, 2018 - Elsevier
Spark is a framework used to analyze big data applications. In this paper, we introduce a
framework to build complex Spark applications by composing simpler ones. We use two …

[引用][C] Component and transformation based frameworks for building and optimizing Spark programs

ZH Shmeiss - 2018