作者
Julia V Halo, Amanda L Pendleton, Feichen Shen, Aurélien J Doucet, Thomas Derrien, Christophe Hitte, Laura E Kirby, Bridget Myers, Elzbieta Sliwerska, Sarah Emery, John V Moran, Adam R Boyko, Jeffrey M Kidd
发表日期
2021/3/16
期刊
Proceedings of the National Academy of Sciences
卷号
118
期号
11
页码范围
e2016274118
出版商
National Academy of Sciences
简介
Technological advances have allowed improvements in genome reference sequence assemblies. Here, we combined long- and short-read sequence resources to assemble the genome of a female Great Dane dog. This assembly has improved continuity compared to the existing Boxer-derived (CanFam3.1) reference genome. Annotation of the Great Dane assembly identified 22,182 protein-coding gene models and 7,049 long noncoding RNAs, including 49 protein-coding genes not present in the CanFam3.1 reference. The Great Dane assembly spans the majority of sequence gaps in the CanFam3.1 reference and illustrates that 2,151 gaps overlap the transcription start site of a predicted protein-coding gene. Moreover, a subset of the resolved gaps, which have an 80.95% median GC content, localize to transcription start sites and recombination hotspots more often than expected by chance, suggesting the …
引用总数
202020212022202320242611158
学术搜索中的文章