作者
Tungadri Bose, Monzoorul Haque Mohammed, Anirban Dutta, Sharmila S Mande
发表日期
2012/9
期刊
Journal of biosciences
卷号
37
页码范围
785-789
出版商
Springer-Verlag
简介
Recent advances in DNA sequencing technologies have enabled the current generation of life science researchers to probe deeper into the genomic blueprint. The amount of data generated by these technologies has been increasing exponentially since the last decade. Storage, archival and dissemination of such huge data sets require efficient solutions, both from the hardware as well as software perspective. The present paper describes BIND – an algorithm specialized for compressing nucleotide sequence data. By adopting a unique ‘block-length’ encoding for representing binary data (as a key step), BIND achieves significant compression gains as compared to the widely used general purpose compression algorithms (gzip, bzip2 and lzma). Moreover, in contrast to implementations of existing specialized genomic compression approaches, the implementation of BIND is enabled to handle non-ATGC …
引用总数
2014201520162017201820192020202120222023202416743385111
学术搜索中的文章