Cleaning Big Data Streams: A Systematic Literature Review

O Alotaibi, E Pardede, S Tomy - Technologies, 2023 - mdpi.com
In today's big data era, cleaning big data streams has become a challenging task because of
the different formats of big data and the massive amount of big data which is being …

Content-Defined Chunking Algorithms in Data Deduplication: Performance, Trade-Offs and Future-Oriented Techniques

SAA Hussein, RB Ahmad, N Yaakob… - … in Applied Sciences …, 2025 - semarakilmu.com.my
In the digital era, the exponential growth of data presents significant challenges for storage
efficiency and processing speed. This paper reviews Content-Defined Chunking (CDC), a …

A Thorough Investigation of Content-Defined Chunking Algorithms for Data Deduplication

M Gregoriadis, L Balduf, B Scheuermann… - arXiv preprint arXiv …, 2024 - arxiv.org
Data deduplication emerged as a powerful solution for reducing storage and bandwidth
costs in cloud settings by eliminating redundancies at the level of chunks. This has spurred …

Enhancing Deduplication Efficiency Using Triple Bytes Cutters and Multi Hash Function

HB Jehlol, LE George - International Journal of Intelligent Engineering & …, 2023 - inass.org
Managing big data backups is challenging due to high volumes of redundant data. Data
deduplication is widely used but incurs significant computational and time costs. This paper …