Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases

OK Tørresen, B Star, P Mier… - Nucleic acids …, 2019 - academic.oup.com
The widespread occurrence of repetitive stretches of DNA in genomes of organisms across
the tree of life imposes fundamental challenges for sequencing, genome assembly, and …

Repetitive DNA and next-generation sequencing: computational challenges and solutions

TJ Treangen, SL Salzberg - Nature Reviews Genetics, 2012 - nature.com
Repetitive DNA sequences are abundant in a broad range of species, from bacteria to
mammals, and they cover nearly half of the human genome. Repeats have always …

[图书][B] Next generation sequencing technologies and challenges in sequence assembly

S El-Metwally, OM Ouda, M Helmy - 2014 - books.google.com
The introduction of Next Generation Sequencing (NGS) technologies resulted in a major
transformation in the way scientists extract genetic information from biological systems …

Twelve quick steps for genome assembly and annotation in the classroom

H Jung, T Ventura, JS Chung, WJ Kim… - PLoS computational …, 2020 - journals.plos.org
Eukaryotic genome sequencing and de novo assembly, once the exclusive domain of well-
funded international consortia, have become increasingly affordable, thus fitting the budgets …

RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures

L Paladin, M Bevilacqua, S Errigo… - Nucleic Acids …, 2021 - academic.oup.com
The RepeatsDB database (URL: https://repeatsdb. org/) provides annotations and
classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein …

Characterization and visualization of tandem repeats at genome scale

E Dolzhenko, A English, H Dashnow… - Nature …, 2024 - nature.com
Tandem repeat (TR) variation is associated with gene expression changes and numerous
rare monogenic diseases. Although long-read sequencing provides accurate full-length …

Bioinformatics challenges of new sequencing technology

M Pop, SL Salzberg - Trends in genetics, 2008 - cell.com
New DNA sequencing technologies can sequence up to one billion bases in a single day at
low cost, putting large-scale sequencing within the reach of many scientists. Many …

Tigmint: correcting assembly errors using linked reads from large molecules

SD Jackman, L Coombe, J Chu, RL Warren… - BMC …, 2018 - Springer
Background Genome sequencing yields the sequence of many short snippets of DNA
(reads) from a genome. Genome assembly attempts to reconstruct the original genome from …

RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads

P Novák, P Neumann, J Pech, J Steinhaisl… - …, 2013 - academic.oup.com
Motivation: Repetitive DNA makes up large portions of plant and animal nuclear genomes,
yet it remains the least-characterized genome component in most species studied so far …

An improved genome assembly uncovers prolific tandem repeats in Atlantic cod

OK Tørresen, B Star, S Jentoft, WB Reinar, H Grove… - BMC genomics, 2017 - Springer
Abstract Background The first Atlantic cod (Gadus morhua) genome assembly published in
2011 was one of the early genome assemblies exclusively based on high-throughput 454 …