Priority research directions for in situ data management: Enabling scientific discovery from diverse data sources

T Peterka, D Bard, JC Bennett… - … Journal of High …, 2020 - journals.sagepub.com
In January 2019, the US Department of Energy, Office of Science program in Advanced
Scientific Computing Research, convened a workshop to identify priority research directions …

Rider chaotic biography optimization-driven deep stacked auto-encoder for big data classification using spark architecture: rider chaotic biography optimization

AV Brahmane, CB Krishna - International Journal of Web Services …, 2021 - igi-global.com
The novelty in big data is rising day-by-day in such a way that the existing software tools
face difficulty in supervision of big data. Furthermore, the rate of the imbalanced data in the …

Collaborative Filtering‐Based Music Recommendation in Spark Architecture

Y Niu - Mathematical Problems in Engineering, 2022 - Wiley Online Library
The use of recommendation algorithms to recommend music MOOC resources is a method
that is gradually gaining ground in people's lives along with the development of the Internet …

Toward high-performance computing and big data analytics convergence: The case of spark-diy

S Caino-Lores, J Carretero, B Nicolae, O Yildiz… - IEEE …, 2019 - ieeexplore.ieee.org
Convergence between high-performance computing (HPC) and big data analytics (BDA) is
currently an established research area that has spawned new opportunities for unifying the …

ASCR workshop on in situ data management: Enabling scientific discovery from diverse data sources

T Peterka, D Bard, J Bennett, E Bethel, R Oldfield… - 2019 - osti.gov
In January 2019, the US Department of Energy, Office of Science program in Advanced
Scientific Computing Research, convened a workshop to identify priority research directions …

Data balancing-based intermediate data partitioning and check point-based cache recovery in Spark environment

C Li, Q Cai, Y Luo - The Journal of Supercomputing, 2022 - Springer
Both data shuffling and cache recovery are essential parts of the Spark system, and they
directly affect Spark parallel computing performance. Existing dynamic partitioning schemes …

Convergence of HPC and Big Data in extreme-scale data analysis through the DCEx programming model

J Garcia-Blas, JF Muñoz, J Carretero… - 2022 IEEE 34th …, 2022 - ieeexplore.ieee.org
High-level programming models can help application developers to access and use
resources without the need to manage low-level architectural entities, as a parallel …

Tile & merge: Distributed delaunay triangulations for cloud computing

L Caraffa, P Memari, M Yirci… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Motivated by the needs of a scalable out-of-core surface reconstruction algorithm available
on the cloud, this paper addresses the computation of distributed Delaunay triangulations of …

Big data classification using deep learning and apache spark architecture

AV Brahmane, BC Krishna - Neural Computing and Applications, 2021 - Springer
The oddity in large information is rising step by step so that the current programming
instruments faces trouble in supervision of huge information. Moreover, the pace of the …

[HTML][HTML] DDS: integrating data analytics transformations in task-based workflows

N Mammadli, J Ejarque, J Alvarez… - Open Research …, 2022 - ncbi.nlm.nih.gov
High-performance data analytics (HPDA) is a current trend in e-science research that aims
to integrate traditional HPC with recent data analytic frameworks. Most of the work done in …