Pan-genomics in the human genome era
RM Sherman, SL Salzberg - Nature Reviews Genetics, 2020 - nature.com
Since the early days of the genome era, the scientific community has relied on a single
'reference' genome for each species, which is used as the basis for a wide range of genetic …
Data structures based on k-mers for querying large collections of sequencing data sets
High-throughput sequencing data sets are usually deposited in public repositories (e.g., the
European Nucleotide Archive) to ensure reproducibility. As the amount of data has reached …
Bifrost: highly parallel construction and indexing of colored and compacted de Bruijn graphs
Memory consumption of de Bruijn graphs is often prohibitive. Most de Bruijn graph-based
assemblers reduce the complexity by compacting paths into single vertices, but this is …
Ultrafast search of all deposited bacterial and viral genomic data
Exponentially increasing amounts of unprocessed bacterial and viral genomic sequence
data are stored in the global archives. The ability to query these data for sequence search …
Themisto: a scalable colored k-mer index for sensitive pseudoalignment against hundreds of thousands of bacterial genomes
Motivation Huge datasets containing whole-genome sequences of bacterial strains are now
commonplace and represent a rich and important resource for modern genomic …
Metabolic framework of spontaneous and synthetic sourdough metacommunities to reveal microbial players responsible for resilience and performance
Background In nature, microbial communities undergo changes in composition that threaten
their resiliency. Here, we interrogated sourdough, a natural cereal-fermenting …
Current affairs of microbial genome-wide association studies: approaches, bottlenecks and analytical pitfalls
Microbial genome-wide association studies (mGWAS) are a new and exciting research field
that is adapting human GWAS methods to understand how variations in microbial genomes …
Mantis: a fast, small, and exact large-scale sequence-search index
Sequence-level searches on large collections of RNA sequencing experiments, such as the
NCBI Sequence Read Archive (SRA), would enable one to ask many questions about the …
Genome-wide somatic variant calling using localized colored de Bruijn graphs
Reliable detection of somatic variations is of critical importance in cancer research. Here we
present Lancet, an accurate and sensitive somatic variant caller, which detects SNVs and …
COBS: a compact bit-sliced signature index
We present COBS, a COmpact Bit-sliced Signature index, which is a cross-over between an
inverted index and Bloom filters. Our target application is to index k-mers of DNA samples or …