Computational pan-genomics: status, promises and challenges

Briefings in bioinformatics, 2018 - academic.oup.com
Many disciplines, from human genetics and oncology to plant breeding, microbiology and
virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes …

A survey of sequence alignment algorithms for next-generation sequencing

H Li, N Homer - Briefings in bioinformatics, 2010 - academic.oup.com
Rapidly evolving sequencing technologies produce data on an unparalleled scale. A central
challenge to the analysis of this data is sequence alignment, whereby sequence reads must …

Fully functional suffix trees and optimal text searching in BWT-runs bounded space

T Gagie, G Navarro, N Prezza - Journal of the ACM (JACM), 2020 - dl.acm.org
Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …

Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly

H Li - Bioinformatics, 2012 - academic.oup.com
Motivation: Eugene Myers in his string graph paper suggested that in a string graph
or equivalently a unitig graph, any path spells a valid assembly. As a string/unitig graph also …

[Book] Genome-scale algorithm design

V Mäkinen, D Belazzougui, F Cunial, AI Tomescu - 2015 - books.google.com
High-throughput sequencing has revolutionised the field of biological sequence analysis. Its
application has enabled researchers to address important biological questions, often for the …

Optimal-time text indexing in BWT-runs bounded space

T Gagie, G Navarro, N Prezza - Proceedings of the Twenty-Ninth Annual ACM …, 2018 - SIAM
Indexing highly repetitive texts—such as genomic databases, software repositories and
versioned text collections—has become an important problem since the turn of the …

Relative Lempel-Ziv compression of genomes for large-scale storage and retrieval

S Kuruppu, SJ Puglisi, J Zobel - International Symposium on String …, 2010 - Springer
Self-indexes–data structures that simultaneously provide fast search of and access to
compressed text–are promising for genomic data but in their usual form are not able to …

Alignment of next-generation sequencing reads

K Reinert, B Langmead, D Weese… - Annual review of …, 2015 - annualreviews.org
High-throughput DNA sequencing has considerably changed the possibilities for conducting
biomedical research by measuring billions of short DNA or RNA fragments. A central …

Short read alignment with populations of genomes

L Huang, V Popic, S Batzoglou - Bioinformatics, 2013 - academic.oup.com
The increasing availability of high-throughput sequencing technologies has led to
thousands of human genomes having been sequenced in the past years. Efforts such as the …

Optimized succinct data structures for massive data

S Gog, M Petri - Software: Practice and Experience, 2014 - Wiley Online Library
Succinct data structures provide the same functionality as their corresponding traditional
data structure in compact space. We improve on functions rank and select, which are the …