Small files' problem in Hadoop: A systematic literature review

R Aggarwal, J Verma, M Siwach - … of King Saud University-Computer and …, 2022 - Elsevier
Apache Hadoop is an open-source software library which integrates a wide variety of
software tools and utilities to facilitate the distributed batch processing of big data sets …

Large scale data analysis using MLlib

AH Ali, MN Abbod, MK Khaleel… - Telkomnika …, 2021 - telkomnika.uad.ac.id
Recent advancements in the internet, social media, and internet of things (IoT) devices have
significantly increased the amount of data generated in a variety of formats. The data must …

Big data classification based on improved parallel k-nearest neighbor

AH Ali, MA Mohammed, RA Hasan… - TELKOMNIKA …, 2023 - telkomnika.uad.ac.id
In response to the rapid growth of many sorts of information, highway data has continued to
evolve in the direction of big data in terms of scale, type, and structure, exhibiting …

Big Data Analytics: Integrating Machine Learning with Big Data Using Hadoop and Mahout

P Rani, R Lamba, RK Sachdeva… - … Systems and Smart …, 2023 - taylorfrancis.com
As technology is advancing, data generated by users is increasing day by day. Traditional
database management systems are not capable of storing and processing large sets of data …

Research on mass image data storage method for data center

S Pan, J Jiang, H Qiu, J Qiao, M Xu - 3D Imaging—Multidimensional …, 2023 - Springer
With the advancement of technology and the development of the electric power business,
power enterprises have generated a large amount of image data, which contains rich …

Application Analysis of Hadoop in Big Data Processing

Z Zheng - 2021 International Conference on Aviation Safety and …, 2021 - dl.acm.org
How to use big data technology to effectively excavate and identify data information
contained in user behaviors and further innovate services has become a development trend …

Tiny datablock in saving Hadoop distributed file system wasted memory.

MB Al-Masadeh, MS Azmi… - International Journal of …, 2023 - academia.edu
Hadoop distributed file system (HDFS) is the file system that Hadoop uses to store all
incoming data. Since its introduction, HDFS has consumed a huge amount of memory …

Analisa Performa Klastering Data Besar pada Hadoop

HM Putra, T Akbar, A Ahmadi… - Infotek: Jurnal Informatika …, 2021 - academia.edu
Big Data is a large and complex collection of data, consisting of
various data types and obtained from a wide range of sources, growing rapidly in …

Pengolahan data sensor IOT dengan Apache Spark menggunakan metode Batch Processing

AA Nuzuli, MD Khairy, AN Defitri… - Journal of Network …, 2023 - jurnal.netplg.com
This study explores the processing of IoT sensor data using Apache Spark
through batch processing. In the IoT era, where sensor data from various online sources …

An Information Fusion Technology for Statistical Analysis of Computer Professional Certificates

J Liao, B Li, Z Ren - … on 3D Immersion, Interaction and Multi …, 2022 - ieeexplore.ieee.org
In order to adapt to the information construction, improve work efficiency and reduce the
work pressure of staff, this set of certificate statistical analysis system is developed. Using …