A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

FastCDC: A fast and efficient Content-Defined chunking approach for data deduplication

W Xia, Y Zhou, H Jiang, D Feng, Y Hua, Y Hu… - 2016 USENIX Annual …, 2016 - usenix.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems in the past 15 years or so due to its high redundancy detection ability. However …

AE: An asymmetric extremum content defined chunking algorithm for fast and bandwidth-efficient data deduplication

Y Zhang, H Jiang, D Feng, W Xia, M Fu… - … IEEE Conference on …, 2015 - ieeexplore.ieee.org
Data deduplication, a space-efficient and bandwidth-saving technology, plays an important
role in bandwidth-efficient data transmission in various data-intensive network and cloud …

The design of fast content-defined chunking for data deduplication based storage systems

W Xia, X Zou, H Jiang, Y Zhou, C Liu… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems recently due to its high redundancy detection ability. However, existing CDC-based …

Ddelta: A deduplication-inspired fast delta compression approach

W Xia, H Jiang, D Feng, L Tian, M Fu, Y Zhou - Performance Evaluation, 2014 - Elsevier
Delta compression is an efficient data reduction approach to removing redundancy among
similar data chunks and files in storage systems. One of the main challenges facing delta …

A fast asymmetric extremum content defined chunking algorithm for data deduplication in backup storage systems

Y Zhang, D Feng, H Jiang, W Xia, M Fu… - IEEE Transactions …, 2016 - ieeexplore.ieee.org
Chunk-level deduplication plays an important role in backup storage systems. Existing
Content-Defined Chunking (CDC) algorithms, while robust in finding suitable chunk …

A survey on novel classification of deduplication storage systems

SMA Mohamed, Y Wang - Distributed and Parallel Databases, 2021 - Springer
The huge blast of information caused a lot of dilemmas in both storage and retrieval
procedures. The enlargement in a massive quantity of digital data requirements imposes …

Accelerating content-defined-chunking based data deduplication by exploiting parallelism

W Xia, D Feng, H Jiang, Y Zhang, V Chang… - Future Generation …, 2019 - Elsevier
Data deduplication, a data reduction technique that efficiently detects and eliminates
redundant data chunks and files, has been widely applied in large-scale storage systems …

QuickDedup: Efficient VM deduplication in cloud computing environments

S Saharan, G Somani, G Gupta, R Verma… - Journal of Parallel and …, 2020 - Elsevier
Deduplication is one of the major storage optimisation techniques for Virtual Machines
(VMs) in cloud environment. Usually, hashing of blocks helps in identifying duplicate data …

CIDR: A cost-effective in-line data reduction system for terabit-per-second scale SSD arrays

M Ajdari, P Park, J Kim, D Kwon… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
An SSD array, a storage system consisting of multiple SSDs per node, has become a design
choice to implement a fast primary storage system, and modern storage architects now aim …