Big data in biology: the hope and present-day challenges in it
The wave of new technologies has opened up the opportunity for cost-effective generation of
high-throughput profiles of biological systems. This is generating tons of biological data. It is …
high-throughput profiles of biological systems. This is generating tons of biological data. It is …
High-throughput DNA sequence data compression
The exponential growth of high-throughput DNA sequence data has posed great challenges
to genomic data storage, retrieval and transmission. Compression is a critical tool to address …
to genomic data storage, retrieval and transmission. Compression is a critical tool to address …
Efficient storage of high throughput DNA sequencing data using reference-based compression
MHY Fritz, R Leinonen, G Cochrane… - Genome research, 2011 - genome.cshlp.org
Data storage costs have become an appreciable proportion of total cost in the creation and
analysis of DNA sequence data. Of particular concern is that the rate of increase in DNA …
analysis of DNA sequence data. Of particular concern is that the rate of increase in DNA …
[PDF][PDF] DNACompress: fast and effective DNA sequence compression
While achieving the best compression ratios for DNA sequences, our new DNACompress
program significantly improves the running time of all previous DNA compression programs …
program significantly improves the running time of all previous DNA compression programs …
A simple statistical algorithm for biological sequence compression
This paper introduces a novel algorithm for biological sequence compression that makes
use of both statistical properties and repetition within sequences. A panel of experts is …
use of both statistical properties and repetition within sequences. A panel of experts is …
A survey on data compression methods for biological sequences
The ever increasing growth of the production of high-throughput sequencing data poses a
serious challenge to the storage, processing and transmission of these data. As frequently …
serious challenge to the storage, processing and transmission of these data. As frequently …
GReEn: a tool for efficient compression of genome resequencing data
Research in the genomic sciences is confronted with the volume of sequencing and
resequencing data increasing at a higher pace than that of data storage and communication …
resequencing data increasing at a higher pace than that of data storage and communication …
DNA sequence compression using adaptive particle swarm optimization-based memetic algorithm
With the rapid development of high-throughput DNA sequencing technologies, the amount
of DNA sequence data is accumulating exponentially. The huge influx of data creates new …
of DNA sequence data is accumulating exponentially. The huge influx of data creates new …
Textual data compression in computational biology: a synopsis
R Giancarlo, D Scaturro, F Utro - Bioinformatics, 2009 - academic.oup.com
Motivation: Textual data compression, and the associated techniques coming from
information theory, are often perceived as being of interest for data communication and …
information theory, are often perceived as being of interest for data communication and …
Efficient DNA sequence compression with neural networks
Background The increasing production of genomic data has led to an intensified need for
models that can cope efficiently with the lossless compression of DNA sequences. Important …
models that can cope efficiently with the lossless compression of DNA sequences. Important …