Beyond kappa: A review of interrater agreement measures

M Banerjee, M Capozzoli… - Canadian journal of …, 1999 - Wiley Online Library
In 1960, Cohen introduced the kappa coefficient to measure chance‐corrected nominal
scale agreement between two raters. Since then, numerous extensions and generalizations …
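
Cohen's coefficient is κ = (p_o − p_e) / (1 − p_e), where p_o is the observed proportion of agreement and p_e is the agreement expected by chance from the two raters' marginal distributions. A minimal Python sketch of that computation (the function name and counts are illustrative, not taken from the article):

```python
import numpy as np

def cohens_kappa(confusion):
    """Cohen's (1960) kappa from a square confusion matrix of two raters' labels."""
    c = np.asarray(confusion, dtype=float)
    n = c.sum()
    p_o = np.trace(c) / n                                # observed agreement
    p_e = (c.sum(axis=0) / n) @ (c.sum(axis=1) / n)      # chance agreement from the marginals
    return (p_o - p_e) / (1.0 - p_e)

# Two raters classifying 100 items into two categories
print(cohens_kappa([[45, 5], [10, 40]]))  # ~0.70
```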

Interrater Agreement Measures: Comments on Kappaₙ, Cohen's Kappa, Scott's π, and Aickin's α

LM Hsu, R Field - Understanding Statistics, 2003 - Taylor & Francis
The Cohen (1960) kappa interrater agreement coefficient has been criticized for penalizing
raters (e.g., diagnosticians) for their a priori agreement about the base rates of categories (e.g., …
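
The criticism is that κ's chance term depends on the raters' marginal base rates: with heavily skewed categories, p_e is large, so even very high observed agreement produces a modest kappa. A small hypothetical illustration, reusing the cohens_kappa sketch above (the counts are invented; both tables show 96% observed agreement):

```python
print(cohens_kappa([[45, 2], [2, 51]]))  # balanced base rates -> ~0.92
print(cohens_kappa([[94, 2], [2, 2]]))   # one rare category   -> ~0.48, same observed agreement
```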

Computing inter‐rater reliability and its variance in the presence of high agreement

KL Gwet - British Journal of Mathematical and Statistical …, 2008 - Wiley Online Library
Pi (π) and kappa (κ) statistics are widely used in the areas of psychiatry and psychological
testing to compute the extent of agreement between raters on nominally scaled data. It is a …
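
Gwet's proposed alternative (the AC1 statistic) keeps the chance-corrected form (p_o − p_e)/(1 − p_e) but computes the chance term as p_e = Σ_q π_q(1 − π_q)/(Q − 1), where π_q is the average of the two raters' marginal proportions for category q. A sketch under that definition (the function name and counts are illustrative):

```python
import numpy as np

def gwet_ac1(confusion):
    """Gwet's AC1 for two raters from a Q x Q confusion matrix."""
    c = np.asarray(confusion, dtype=float)
    n, q = c.sum(), c.shape[0]
    p_o = np.trace(c) / n
    pi = (c.sum(axis=0) + c.sum(axis=1)) / (2 * n)   # average marginal proportion per category
    p_e = (pi * (1 - pi)).sum() / (q - 1)
    return (p_o - p_e) / (1.0 - p_e)

# The skewed table from above: AC1 stays high (~0.96) where kappa drops to ~0.48
print(gwet_ac1([[94, 2], [2, 2]]))
```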

Inequalities between multi-rater kappas

MJ Warrens - Advances in data analysis and classification, 2010 - Springer
The paper presents inequalities between four descriptive statistics that have been used to
measure the nominal agreement between two or more raters. Each of the four statistics is a …

Sample size determinations for the two rater kappa statistic

VF Flack, AA Afifi, PA Lachenbruch, HJA Schouten - Psychometrika, 1988 - Springer
This paper gives a method for determining a sample size that will achieve a prespecified
bound on confidence interval width for the interrater agreement measure, κ. The same …
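
The underlying reasoning: if the large-sample variance of the estimate can be written as var(κ̂) ≈ V/n for anticipated values of the agreement parameters, then a confidence interval with half-width w at level 1 − α needs roughly n ≥ (z_{1−α/2})² · V / w². A rough sketch using the crude approximation V ≈ p_o(1 − p_o)/(1 − p_e)², which treats the chance term as known; the paper's procedure is more exact:

```python
from math import ceil
from statistics import NormalDist

def kappa_sample_size(p_o, p_e, half_width, alpha=0.05):
    """Rough n so that the kappa confidence interval half-width is about `half_width`."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    v = p_o * (1 - p_o) / (1 - p_e) ** 2      # crude approximation to n * var(kappa-hat)
    return ceil(z ** 2 * v / half_width ** 2)

print(kappa_sample_size(p_o=0.85, p_e=0.50, half_width=0.10))  # ~196 subjects
```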

Kappa statistic is not satisfactory for assessing the extent of agreement between raters

K Gwet - Statistical methods for inter-rater reliability assessment, 2002 - agreestat.com
Evaluating the extent of agreement between two raters, or among several raters, is common in the
social, behavioral and medical sciences. The objective of this paper is to provide a detailed …

The measurement of interrater agreement

JL Fleiss, B Levin, MC Paik - Statistical methods for rates and …, 1981 - Citeseer
The statistical methods described in the preceding chapter for controlling for error are
applicable only when the rates of misclassification are known from external sources or are …

How robust are multirater interrater reliability indices to changes in frequency distribution?

D Quarfoot, RA Levine - The American Statistician, 2016 - Taylor & Francis
Interrater reliability studies are used in a diverse set of fields. Often, these investigations
involve three or more raters and thus require the use of indices such as Fleiss's kappa …
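
Fleiss's kappa covers the case of m raters per subject: with n_ij raters assigning subject i to category j, per-subject agreement is P_i = (Σ_j n_ij² − m) / (m(m − 1)), chance agreement is P̄_e = Σ_j p_j² with p_j the overall proportion of assignments to category j, and κ = (P̄ − P̄_e)/(1 − P̄_e). A sketch under those definitions (the example counts are illustrative):

```python
import numpy as np

def fleiss_kappa(counts):
    """Fleiss's kappa; counts[i, j] = number of raters placing subject i in category j."""
    c = np.asarray(counts, dtype=float)
    n_subjects, m = c.shape[0], c[0].sum()            # assumes the same number of raters per subject
    p_j = c.sum(axis=0) / (n_subjects * m)            # overall category proportions
    p_i = ((c ** 2).sum(axis=1) - m) / (m * (m - 1))  # per-subject agreement
    p_bar, p_e = p_i.mean(), (p_j ** 2).sum()
    return (p_bar - p_e) / (1.0 - p_e)

# 4 subjects, 3 raters each, 2 categories
print(fleiss_kappa([[3, 0], [2, 1], [0, 3], [1, 2]]))  # ~0.33
```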

A generalized kappa coefficient

JS Uebersax - Educational and Psychological Measurement, 1982 - journals.sagepub.com
Previously proposed methods for calculating the kappa measure of nominal rating
agreement among multiple raters are not applicable in many situations. This paper presents …

Five ways to look at Cohen's kappa

MJ Warrens - Journal of Psychology & Psychotherapy, 2015 - research.rug.nl
The kappa statistic is commonly used for quantifying inter-rater agreement on a nominal
scale. In this review article we discuss five interpretations of this popular coefficient. Kappa is …