Repetitive DNA and next-generation sequencing: computational challenges and solutions

TJ Treangen, SL Salzberg - Nature Reviews Genetics, 2012 - nature.com
Repetitive DNA sequences are abundant in a broad range of species, from bacteria to
mammals, and they cover nearly half of the human genome. Repeats have always …

Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2

P Novák, P Neumann, J Macas - Nature Protocols, 2020 - nature.com
RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based
clustering of next-generation sequencing reads for characterization of repetitive DNA in …

Repetitive DNA in eukaryotic genomes

MA Biscotti, E Olmo, JS Heslop-Harrison - Chromosome Research, 2015 - Springer
Repetitive DNA—sequence motifs repeated hundreds or thousands of times in the genome—
makes up the major proportion of all the nuclear DNA in most eukaryotic genomes …

RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads

P Novák, P Neumann, J Pech, J Steinhaisl… - …, 2013 - academic.oup.com
Motivation: Repetitive DNA makes up large portions of plant and animal nuclear genomes,
yet it remains the least-characterized genome component in most species studied so far …

Repseek, a tool to retrieve approximate repeats from large DNA sequences

G Achaz, F Boyer, EPC Rocha, A Viari… - Bioinformatics, 2007 - academic.oup.com
Chromosomes or other long DNA sequences contain many highly similar repeated sub-
sequences. While there are efficient methods for detecting strict repeats or detecting already …

Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases

OK Tørresen, B Star, P Mier… - Nucleic acids …, 2019 - academic.oup.com
The widespread occurrence of repetitive stretches of DNA in genomes of organisms across
the tree of life imposes fundamental challenges for sequencing, genome assembly, and …

[PDF][PDF] De novo identification of repeat families in large genomes

AL Price, NC Jones, PA Pevzner - Bioinformatics, 2005 - Citeseer
Motivation: De novo repeat family identification is a challenging algorithmic problem of great
practical importance. As the number of genome sequencing projects increases, there is a …

Tandem repeats over the edit distance

D Sokol, G Benson, J Tojeira - Bioinformatics, 2007 - academic.oup.com
Motivation: A tandem repeat in DNA is a sequence of two or more contiguous, approximate
copies of a pattern of nucleotides. Tandem repeats occur in the genomes of both eukaryotic …

REPuter: the manifold applications of repeat analysis on a genomic scale

S Kurtz, JV Choudhuri, E Ohlebusch… - Nucleic acids …, 2001 - academic.oup.com
The repetitive structure of genomic DNA holds many secrets to be discovered. A systematic
study of repetitive DNA on a genomic or inter-genomic scale requires extensive algorithmic …

Resolving complex tandem repeats with long reads

A Ummat, A Bashir - Bioinformatics, 2014 - academic.oup.com
Motivation: Resolving tandemly repeated genomic sequences is a necessary step in
improving our understanding of the human genome. Short tandem repeats (TRs), or …