Authors
Federico Abascal, David Juan, Irwin Jungreis, Laura Martinez, Maria Rigau, Jose Manuel Rodriguez, Jesus Vazquez, Michael L Tress
Publication date
2018/8/21
Journal
Nucleic Acids Research
Volume
46
Issue
14
Pages
7070-7084
Publisher
Oxford University Press
Description
Seventeen years after the sequencing of the human genome, the human proteome is still under revision. One in eight of the 22 210 coding genes listed by the Ensembl/GENCODE, RefSeq and UniProtKB reference databases are annotated differently across the three sets. We have carried out an in-depth investigation on the 2764 genes classified as coding by one or more sets of manual curators and not coding by others. Data from large-scale genetic variation analyses suggests that most are not under protein-like purifying selection and so are unlikely to code for functional proteins. A further 1470 genes annotated as coding in all three reference sets have characteristics that are typical of non-coding genes or pseudogenes. These potential non-coding genes also appear to be undergoing neutral evolution and have considerably less supporting transcript and protein evidence than other coding genes. We …
Total citations
[Chart: citations per year, 2018-2024]
Scholar articles
F Abascal, D Juan, I Jungreis, L Martinez, M Rigau… - Nucleic acids research, 2018