[PDF][PDF] Big data clustering techniques based on spark: a literature review

MM Saeed, Z Al Aghbari, M Alsharidah - PeerJ Computer Science, 2020 - peerj.com
A popular unsupervised learning method, known as clustering, is extensively used in data
mining, machine learning and pattern recognition. The procedure involves grouping of …

Distributed deep learning for remote sensing data interpretation

JM Haut, ME Paoletti, S Moreno-Álvarez… - Proceedings of the …, 2021 - ieeexplore.ieee.org
As a newly emerging technology, deep learning (DL) is a very promising field in big data
applications. Remote sensing often involves huge data volumes obtained daily by numerous …

A big data approach to black Friday sales

M Javed Awan, MS Mohd Rahim… - … OI Khalaf et al.," A big …, 2021 - papers.ssrn.com
Retail companies recognize the need to analyze and predict their sales and customer
behavior against their products and product categories. Our study aims to help retail …

An experimental survey on big data frameworks

W Inoubli, S Aridhi, H Mezni, M Maddouri… - Future Generation …, 2018 - Elsevier
Recently, increasingly large amounts of data are generated from a variety of sources.
Existing data processing technologies are not suitable to cope with the huge amounts of …

Role of IoT technologies in big data management systems: A review and Smart Grid case study

AR Al-Ali, R Gupta, I Zualkernan, SK Das - Pervasive and Mobile …, 2024 - Elsevier
Abstract Empowered by Internet of Things (IoT) and cloud computing platforms, the concept
of smart cities is making a transition from conceptual models to development and …

A survey on securing federated learning: Analysis of applications, attacks, challenges, and trends

HNC Neto, J Hribar, I Dusparic, DMF Mattos… - IEEE …, 2023 - ieeexplore.ieee.org
The growth of data generation capabilities, facilitated by advancements in communication
and computation technologies, as well as the rise of the Internet of Things (IoT), results in …

Ten quick tips for bioinformatics analyses using an Apache Spark distributed computing environment

D Chicco, U Ferraro Petrillo… - PLOS Computational …, 2023 - journals.plos.org
Some scientific studies involve huge amounts of bioinformatics data that cannot be analyzed
on personal computers usually employed by researchers for day-to-day activities but rather …

State of art of data mining and learning analytics tools in higher education

M Salihoun - International Journal of Emerging Technologies in …, 2020 - learntechlib.org
In this decade, the use of learning management systems (LMS) does not cease to increase,
becoming one of the most popular approaches adopted and widely used in the learning …

Trust assessment in online social networks

G Liu, Q Yang, H Wang, AX Liu - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Assessing trust in online social networks (OSNs) is critical for many applications such as
online marketing and network security. It is a challenging problem, however, due to the …

Efficient performance prediction for apache spark

G Cheng, S Ying, B Wang, Y Li - Journal of Parallel and Distributed …, 2021 - Elsevier
Spark is a more efficient distributed big data processing framework following Hadoop. It
provides users with more than 180 adjustable configuration parameters, and how to choose …