Whose Truth Is the "Ground Truth"? College Admissions Essays and Bias in Word Vector Evaluation Methods.

Authors

Noah Arthurs, AJ Alvero

Publication date

2020/7

Journal

International Educational Data Mining Society

Publisher

International Educational Data Mining Society

Description

Word vectors are widely used as input features in natural language processing (NLP) tasks. Researchers have found that word vectors often encode the biases of society, and steps have been taken towards debiasing the vectors themselves. However, little has been said about the fairness of the methods used to evaluate the quality of vectors. Analogical and word similarity tasks are commonplace, but both rely on purportedly ground truth statements about the semantic relationships between words (e.g. "man is to woman as king is to queen"). These analogies look reasonable when only taking into account the literal meanings of words, but two issues arise: (1) people don't always use words in a literal sense, and (2) the same word may be used differently by different groups of people. In this paper, we split a dataset of over 800,000 college admissions essays into quartiles based on reported household income (RHI) and train sets of word vectors on each quartile. We then test these

Total citations

Cited by 24

Citations per year: 2020: 1, 2021: 5, 2022: 7, 2023: 9, 2024: 2

Scholar articles

Whose Truth Is the "Ground Truth"? College Admissions Essays and Bias in Word Vector Evaluation Methods.

N Arthurs, AJ Alvero - International Educational Data Mining Society, 2020
