Scalable metadata management techniques for ultra-large distributed storage systems--A systematic review

HJ Singh, S Bawa - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
The provisioning of an efficient ultra-large scalable distributed storage system for expanding
cloud applications has been a challenging job for researchers in academia and industry. In …

MAD2: A scalable high-throughput exact deduplication approach for network backup services

J Wei, H Jiang, K Zhou, D Feng - 2010 IEEE 26th Symposium …, 2010 - ieeexplore.ieee.org
Deduplication has been widely used in disk-based secondary storage systems to improve
space efficiency. However, there are two challenges facing scalable high-throughput …

SmartStore: A new metadata organization paradigm with semantic-awareness for next-generation file systems

Y Hua, H Jiang, Y Zhu, D Feng, L Tian - Proceedings of the conference …, 2009 - dl.acm.org
Existing storage systems using hierarchical directory tree do not meet scalability and
functionality requirements for exponentially growing datasets and increasingly complex …

Locality-sensitive bloom filter for approximate membership query

Y Hua, B Xiao, B Veeravalli… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
In many network applications, Bloom filters are used to support exact-matching membership
query for their randomized space-efficient data structure with a small probability of false …

Metadata distribution and consistency techniques for large-scale cluster file systems

J Xiong, Y Hu, G Li, R Tang… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Most supercomputers nowadays are based on large clusters, which call for sophisticated,
scalable, and decentralized metadata processing techniques. From the perspective of …

Adaptive and scalable metadata management to support a trillion files

J Xing, J Xiong, N Sun, J Ma - Proceedings of the Conference on High …, 2009 - dl.acm.org
Nowadays more and more applications require file systems to efficiently maintain million or
more files. How to provide high access performance with such a huge number of files and …

Using parallel bloom filters for multiattribute representation on network services

B Xiao, Y Hua - IEEE Transactions on parallel and distributed …, 2009 - ieeexplore.ieee.org
One widely used mechanism for representing membership of a set of items is the simple
space-efficient randomized data structure known as Bloom filters. Yet, Bloom filters are not …

Direct lookup and hash-based metadata placement for local file systems

PH Lensing, T Cortes, A Brinkmann - Proceedings of the 6th …, 2013 - dl.acm.org
New challenges to file systems' metadata performance are imposed by the continuously
growing number of files existing in file systems. The total amount of metadata can become …

Variability driven quality evaluation in software product lines

L Etxeberria, G Sagardui - 2008 12th International Software …, 2008 - ieeexplore.ieee.org
Variability is a key aspect in software product lines. Functional variability has been largely
studied as a way to obtain all the desired products for a line. Quality variability, less …

Supporting scalable and adaptive metadata management in ultralarge-scale file systems

Y Hua, Y Zhu, H Jiang, D Feng… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
This paper presents a scalable and adaptive decentralized metadata lookup scheme for
ultralarge-scale file systems (more than Petabytes or even Exabytes). Our scheme logically …