Minirmd: accurate and fast duplicate removal tool for short reads via multiple minimizers
Removing duplicate and near-duplicate reads, generated by high-throughput sequencing
technologies, is able to reduce computational resources in downstream applications. Here …
technologies, is able to reduce computational resources in downstream applications. Here …
ParDRe: faster parallel duplicated reads removal tool for sequencing studies
J González-Domínguez, B Schmidt - Bioinformatics, 2016 - academic.oup.com
Current next generation sequencing technologies often generate duplicated or near-
duplicated reads that (depending on the application scenario) do not provide any interesting …
duplicated reads that (depending on the application scenario) do not provide any interesting …
FastUniq: A Fast De Novo Duplicates Removal Tool for Paired Short Reads
H Xu, X Luo, J Qian, X Pang, J Song, G Qian, J Chen… - PloS one, 2012 - journals.plos.org
The presence of duplicates introduced by PCR amplification is a major issue in paired short
reads from next-generation sequencing platforms. These duplicates might have a serious …
reads from next-generation sequencing platforms. These duplicates might have a serious …
Nubeam-dedup: a fast and RAM-efficient tool to de-duplicate sequencing reads without mapping
We present Nubeam-dedup, a fast and RAM-efficient tool to de-duplicate sequencing reads
without reference genome. Nubeam-dedup represents nucleotides by matrices, transforms …
without reference genome. Nubeam-dedup represents nucleotides by matrices, transforms …
Removing duplicate reads using graphics processing units
Background During library construction polymerase chain reaction is used to enrich the DNA
before sequencing. Typically, this process generates duplicate read sequences. Removal of …
before sequencing. Typically, this process generates duplicate read sequences. Removal of …
Fulcrum: condensing redundant reads from high-throughput sequencing studies
MS Burriesci, EM Lehnert, JR Pringle - Bioinformatics, 2012 - academic.oup.com
Motivation: Ultra-high-throughput sequencing produces duplicate and near-duplicate reads,
which can consume computational resources in downstream applications. A tool that …
which can consume computational resources in downstream applications. A tool that …
Gencore: an efficient tool to generate consensus reads for error suppressing and duplicate removing of NGS data
Background Removing duplicates might be considered as a well-resolved problem in next-
generation sequencing (NGS) data processing domain. However, as NGS technology gains …
generation sequencing (NGS) data processing domain. However, as NGS technology gains …
NGmerge: merging paired-end reads via novel empirically-derived models of sequencing errors
JM Gaspar - BMC bioinformatics, 2018 - Springer
Abstract Background Advances in Illumina DNA sequencing technology have produced
longer paired-end reads that increasingly have sequence overlaps. These reads can be …
longer paired-end reads that increasingly have sequence overlaps. These reads can be …
Super deduper, fast PCR duplicate detection in fastq files
KR Petersen, DA Streett, AT Gerritsen… - Proceedings of the 6th …, 2015 - dl.acm.org
Our goal was to explore the accuracy and utility of identifying and removing PCR duplicates
from HTS data using Super Deduper. Super Deduper is a pre-alignment, sequence read …
from HTS data using Super Deduper. Super Deduper is a pre-alignment, sequence read …
Bias from removing read duplication in ultra-deep sequencing experiments
Motivation: Identifying subclonal mutations and their implications requires accurate
estimation of mutant allele fractions from possibly duplicated sequencing reads. Removing …
estimation of mutant allele fractions from possibly duplicated sequencing reads. Removing …