Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases
The widespread occurrence of repetitive stretches of DNA in genomes of organisms across
the tree of life imposes fundamental challenges for sequencing, genome assembly, and …
Repetitive DNA and next-generation sequencing: computational challenges and solutions
TJ Treangen, SL Salzberg - Nature Reviews Genetics, 2012 - nature.com
Repetitive DNA sequences are abundant in a broad range of species, from bacteria to
mammals, and they cover nearly half of the human genome. Repeats have always …
[BOOK][B] Next generation sequencing technologies and challenges in sequence assembly
The introduction of Next Generation Sequencing (NGS) technologies resulted in a major
transformation in the way scientists extract genetic information from biological systems …
Twelve quick steps for genome assembly and annotation in the classroom
Eukaryotic genome sequencing and de novo assembly, once the exclusive domain of well-
funded international consortia, have become increasingly affordable, thus fitting the budgets …
RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures
L Paladin, M Bevilacqua, S Errigo… - Nucleic Acids …, 2021 - academic.oup.com
The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and
classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein …
Characterization and visualization of tandem repeats at genome scale
Tandem repeat (TR) variation is associated with gene expression changes and numerous
rare monogenic diseases. Although long-read sequencing provides accurate full-length …
Bioinformatics challenges of new sequencing technology
M Pop, SL Salzberg - Trends in genetics, 2008 - cell.com
New DNA sequencing technologies can sequence up to one billion bases in a single day at
low cost, putting large-scale sequencing within the reach of many scientists. Many …
Tigmint: correcting assembly errors using linked reads from large molecules
Background Genome sequencing yields the sequence of many short snippets of DNA
(reads) from a genome. Genome assembly attempts to reconstruct the original genome from …
RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads
Motivation: Repetitive DNA makes up large portions of plant and animal nuclear genomes,
yet it remains the least-characterized genome component in most species studied so far …
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
Abstract Background The first Atlantic cod (Gadus morhua) genome assembly published in
2011 was one of the early genome assemblies exclusively based on high-throughput 454 …