RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation

W Li, KR O'Neill, DH Haft, M DiCuccio… - Nucleic acids …, 2021 - academic.oup.com
Abstract The Reference Sequence (RefSeq) project at the National Center for Biotechnology
Information (NCBI) contains nearly 200 000 bacterial and archaeal genomes and 150 …

RefSeq: an update on prokaryotic genome annotation and curation

DH Haft, M DiCuccio, A Badretdin, V Brover… - Nucleic acids …, 2018 - academic.oup.com
Abstract The Reference Sequence (RefSeq) project at the National Center for Biotechnology
Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet …

RefSeq and the prokaryotic genome annotation pipeline in the age of metagenomes

DH Haft, A Badretdin, G Coulouris… - Nucleic Acids …, 2024 - academic.oup.com
Abstract The Reference Sequence (RefSeq) project at the National Center for Biotechnology
Information (NCBI) contains over 315 000 bacterial and archaeal genomes and 236 million …

Update on RefSeq microbial genomes resources

T Tatusova, S Ciufo, S Federhen, B Fedorov… - Nucleic acids …, 2015 - academic.oup.com
NCBI RefSeq genome collection http://www. ncbi. nlm. nih. gov/genome represents all three
major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic …

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics

U Omasits, AR Varadarajan, M Schmid… - Genome …, 2017 - genome.cshlp.org
Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to
fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes …

NCBI prokaryotic genome annotation pipeline

T Tatusova, M DiCuccio, A Badretdin… - Nucleic acids …, 2016 - academic.oup.com
Recent technological advances have opened unprecedented opportunities for large-scale
sequencing and analysis of populations of pathogenic species in disease outbreaks, as well …

No one tool to rule them all: prokaryotic gene prediction tool annotations are highly dependent on the organism of study

NJ Dimonaco, W Aubrey, K Kenobi, A Clare… - …, 2022 - academic.oup.com
Abstract Motivation The biases in CoDing Sequence (CDS) prediction tools, which have
been based on historic genomic annotations from model organisms, impact our …

DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication

Y Tanizawa, T Fujisawa, Y Nakamura - Bioinformatics, 2018 - academic.oup.com
We developed a prokaryotic genome annotation pipeline, DFAST, that also supports
genome submission to public sequence databases. DFAST was originally started as an on …

Large-scale prokaryotic gene prediction and comparison to genome annotation

P Nielsen, A Krogh - Bioinformatics, 2005 - academic.oup.com
Motivation: Prokaryotic genomes are sequenced and annotated at an increasing rate. The
methods of annotation vary between sequencing groups. It makes genome comparison …

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation

NA O'Leary, MW Wright, JR Brister, S Ciufo… - Nucleic acids …, 2016 - academic.oup.com
The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains
and curates a publicly available database of annotated genomic, transcript, and protein …