Authors
Xikun Zhang, Deepak Ramachandran, Ian Tenney, Yanai Elazar, Dan Roth
Publication date
2020/10/11
Journal
arXiv preprint arXiv:2010.05345
Description
Pretrained Language Models (LMs) have been shown to possess significant linguistic, common sense, and factual knowledge. One form of knowledge that has not been studied yet in this context is information about the scalar magnitudes of objects. We show that pretrained language models capture a significant amount of this information but are short of the capability required for general common-sense reasoning. We identify contextual information in pre-training and numeracy as two key factors affecting their performance and show that a simple method of canonicalizing numbers can have a significant effect on the results.
Total citations
Scholar articles
X Zhang, D Ramachandran, I Tenney, Y Elazar, D Roth - arXiv preprint arXiv:2010.05345, 2020