[HTML][HTML] Small files' problem in Hadoop: A systematic literature review
R Aggarwal, J Verma, M Siwach - … of King Saud University-Computer and …, 2022 - Elsevier
Apache Hadoop is an open-source software library which integrates a wide variety of
software tools and utilities to facilitate the distributed batch processing of big data sets …
software tools and utilities to facilitate the distributed batch processing of big data sets …
Large scale data analysis using MLlib
AH Ali, MN Abbod, MK Khaleel… - Telkomnika …, 2021 - telkomnika.uad.ac.id
Recent advancements in the internet, social media, and internet of things (IoT) devices have
significantly increased the amount of data generated in a variety of formats. The data must …
significantly increased the amount of data generated in a variety of formats. The data must …
Big data classification based on improved parallel k-nearest neighbor
In response to the rapid growth of many sorts of information, highway data has continued to
evolve in the direction of big data in terms of scale, type, and structure, exhibiting …
evolve in the direction of big data in terms of scale, type, and structure, exhibiting …
Big Data Analytics: Integrating Machine Learning with Big Data Using Hadoop and Mahout
As technology is advancing, data generated by users is increasing day by day. Traditional
database management systems are not capable of storing and processing large sets of data …
database management systems are not capable of storing and processing large sets of data …
Research on mass image data storage method for data center
S Pan, J Jiang, H Qiu, J Qiao, M Xu - 3D Imaging—Multidimensional …, 2023 - Springer
With the advancement of technology and the development of the electric power business,
power enterprises have generated a large amount of image data, which contains rich …
power enterprises have generated a large amount of image data, which contains rich …
Application Analysis of Hadoop in Big Data Processing
Z Zheng - 2021 International Conference on Aviation Safety and …, 2021 - dl.acm.org
How to use big data technology to effectively excavate and identify data information
contained in user behaviors and further innovate services has become a development trend …
contained in user behaviors and further innovate services has become a development trend …
[PDF][PDF] Tiny datablock in saving Hadoop distributed file system wasted memory.
MB Al-Masadeh, MS Azmi… - International Journal of …, 2023 - academia.edu
Hadoop distributed file system (HDFS) is the file system whereby Hadoop is use it to store all
the upcoming data inside it. Since it been declared, HDFS is consuming a huge memory …
the upcoming data inside it. Since it been declared, HDFS is consuming a huge memory …
[PDF][PDF] Analisa Performa Klastering Data Besar pada Hadoop
Data Besar adalah suatu kumpulan data dengan ukuran besar dan komplek, terdiri dari
berbagai tipe data serta diperoleh dari berbagai macam sumber, berkembang pesar dalam …
berbagai tipe data serta diperoleh dari berbagai macam sumber, berkembang pesar dalam …
Pengolahan data sensor IOT dengan Apache Spark menggunakan metode Batch Processing
AA Nuzuli, MD Khairy, AN Defitri… - Journal of Network …, 2023 - jurnal.netplg.com
Penelitian ini mengeksplorasi pengolahan data sensor IoT menggunakan Apache Spark
melalui batch processing. Dalam era IoT, di mana data sensor dari berbagai sumber online …
melalui batch processing. Dalam era IoT, di mana data sensor dari berbagai sumber online …
An Information Fusion Technology for Statistical Analysis of Computer Professional Certificates
J Liao, B Li, Z Ren - … on 3D Immersion, Interaction and Multi …, 2022 - ieeexplore.ieee.org
In order to adapt to the information construction, improve work efficiency and reduce the
work pressure of staff, this set of certificate statistical analysis system is developed. Using …
work pressure of staff, this set of certificate statistical analysis system is developed. Using …