Toward better understanding of artifacts in variant calling from high-coverage samples

H Li - Bioinformatics, 2014 - academic.oup.com
Motivation: Whole-genome high-coverage sequencing has been widely used for personal
and cancer genomics as well as in various research areas. However, in the lack of an …

[PDF][PDF] Computational pan-genomics: status, promises and challenges

Briefings in bioinformatics, 2018 - academic.oup.com
Many disciplines, from human genetics and oncology to plant breeding, microbiology and
virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes …

Indexing graphs for path queries with applications in genome research

J Sirén, N Välimäki, V Mäkinen - IEEE/ACM transactions on …, 2014 - ieeexplore.ieee.org
We propose a generic approach to replace the canonical sequence representation of
genomes with graph representations, and study several applications of such extensions. We …

Improved genome inference in the MHC using a population reference graph

A Dilthey, C Cox, Z Iqbal, MR Nelson, G McVean - Nature genetics, 2015 - nature.com
Although much is known about human genetic variation, such information is typically
ignored in assembling new genomes. Instead, reads are mapped to a single reference …

Short read alignment with populations of genomes

L Huang, V Popic, S Batzoglou - Bioinformatics, 2013 - academic.oup.com
The increasing availability of high-throughput sequencing technologies has led to
thousands of human genomes having been sequenced in the past years. Efforts such as the …

RCSI: Scalable similarity search in thousand (s) of genomes

S Wandelt, J Starlinger, M Bux, U Leser - Proceedings of the VLDB …, 2013 - dl.acm.org
Until recently, genomics has concentrated on comparing sequences between species.
However, due to the sharply falling cost of sequencing technology, studies of populations of …

Pan-genome storage and analysis techniques

T Zekic, G Holley, J Stoye - Comparative Genomics: Methods and …, 2018 - Springer
Computational pan-genome analysis has emerged from the rapid increase of available
genome sequencing data. Starting from a microbial pan-genome, the concept has spread to …

Journaled string tree—a scalable data structure for analyzing thousands of similar genomes on your laptop

R Rahn, D Weese, K Reinert - Bioinformatics, 2014 - academic.oup.com
Motivation: Next-generation sequencing (NGS) has revolutionized biomedical research in
the past decade and led to a continuous stream of developments in bioinformatics …

PanSVR: Pan-genome augmented short read realignment for sensitive detection of structural variations

G Li, T Jiang, J Li, Y Wang - Frontiers in Genetics, 2021 - frontiersin.org
The comprehensive discovery of structure variations (SVs) is fundamental to many genomics
studies and high-throughput sequencing has become a common approach to this task …

Graphical pangenomics

E Garrison - 2019 - repository.cam.ac.uk
Completely sequencing genomes is expensive, and to save costs we often analyze new
genomic data in the context of a reference genome. This approach distorts our image of the …