A survey and classification of storage deduplication systems

J Paulo, J Pereira - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
The automatic elimination of duplicate data in a storage system, commonly known as
deduplication, is increasingly accepted as an effective technique to reduce storage costs …

Data deduplication techniques for efficient cloud storage management: a systematic review

R Kaur, I Chana, J Bhattacharya - The Journal of Supercomputing, 2018 - Springer
The exponential growth of digital data in cloud storage systems is a critical issue presently
as a large amount of duplicate data in the storage systems exerts an extra load on it …

System for redirecting requests after a secondary storage computing device failure

MK Vijayan, JO Kochunni, DR Attarde… - US Patent …, 2019 - Google Patents
Systems and methods are provided herein for automatically configuring newly
installed secondary storage computing devices and managing secondary storage …

Distributed deduplicated storage system

MKV Retnamma, R Kottomtharayil… - US Patent 9,020,900, 2015 - Google Patents
A distributed, deduplicated storage system according to certain embodiments is arranged in
a parallel configuration including multiple deduplication nodes. Deduplicated data is …

A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

High availability distributed deduplicated storage system

MK Vijayan, JO Kochunni, S Agrawal… - US Patent …, 2017 - Google Patents
A high availability distributed, deduplicated storage system according to
certain embodiments is arranged to include multiple deduplication database media agents …

A study of practical deduplication

DT Meyer, WJ Bolosky - ACM Transactions on Storage (ToS), 2012 - dl.acm.org
We collected file system content data from 857 desktop computers at Microsoft over a span
of 4 weeks. We analyzed the data to determine the relative efficacy of data deduplication …

Systems and methods for retaining and using data block signatures in data protection operations

MK Vijayan, DR Attarde - US Patent 9,239,687, 2016 - Google Patents
A system according to certain embodiments associates a signature value corresponding to a
data block with one or more data blocks and a reference to the data block to form a …

CAFTL: A Content-Aware flash translation layer enhancing the lifespan of flash memory based solid state drives

F Chen, T Luo, X Zhang - 9th USENIX Conference on File and Storage …, 2011 - usenix.org
Although Flash Memory based Solid State Drive (SSD) exhibits high performance
and low power consumption, a critical concern is its limited lifespan along with the …

iDedup: latency-aware, inline data deduplication for primary storage

K Srinivasan, T Bisson, GR Goodson, K Voruganti - FAST, 2012 - usenix.org
Deduplication technologies are increasingly being deployed to reduce cost and increase
space-efficiency in corporate data centers. However, prior research has not applied …