Repetitive DNA and next-generation sequencing: computational challenges and solutions
TJ Treangen, SL Salzberg - Nature Reviews Genetics, 2012 - nature.com
Repetitive DNA sequences are abundant in a broad range of species, from bacteria to
mammals, and they cover nearly half of the human genome. Repeats have always …
Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2
RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based
clustering of next-generation sequencing reads for characterization of repetitive DNA in …
Repetitive DNA in eukaryotic genomes
MA Biscotti, E Olmo, JS Heslop-Harrison - Chromosome Research, 2015 - Springer
Repetitive DNA—sequence motifs repeated hundreds or thousands of times in the genome—
makes up the major proportion of all the nuclear DNA in most eukaryotic genomes …
RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads
Motivation: Repetitive DNA makes up large portions of plant and animal nuclear genomes,
yet it remains the least-characterized genome component in most species studied so far …
Repseek, a tool to retrieve approximate repeats from large DNA sequences
Chromosomes or other long DNA sequences contain many highly similar repeated sub-
sequences. While there are efficient methods for detecting strict repeats or detecting already …
Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases
The widespread occurrence of repetitive stretches of DNA in genomes of organisms across
the tree of life imposes fundamental challenges for sequencing, genome assembly, and …
De novo identification of repeat families in large genomes
AL Price, NC Jones, PA Pevzner - Bioinformatics, 2005 - Citeseer
Motivation: De novo repeat family identification is a challenging algorithmic problem of great
practical importance. As the number of genome sequencing projects increases, there is a …
Tandem repeats over the edit distance
Motivation: A tandem repeat in DNA is a sequence of two or more contiguous, approximate
copies of a pattern of nucleotides. Tandem repeats occur in the genomes of both eukaryotic …
REPuter: the manifold applications of repeat analysis on a genomic scale
S Kurtz, JV Choudhuri, E Ohlebusch… - Nucleic acids …, 2001 - academic.oup.com
The repetitive structure of genomic DNA holds many secrets to be discovered. A systematic
study of repetitive DNA on a genomic or inter-genomic scale requires extensive algorithmic …
Resolving complex tandem repeats with long reads
A Ummat, A Bashir - Bioinformatics, 2014 - academic.oup.com
Motivation: Resolving tandemly repeated genomic sequences is a necessary step in
improving our understanding of the human genome. Short tandem repeats (TRs), or …