Statistical independence for the evaluation of classifier-based diagnosis
Brain informatics, 2015•Springer
Abstract Machine learning techniques are increasingly adopted in computer-aided
diagnosis. Evaluation methods for classification results that are based on the study of one or
more metrics can be unable to distinguish between cases in which the classifier is
discriminating the classes from cases in which it is not. In the binary setting, such
circumstances can be encountered when data are unbalanced with respect to the diagnostic
groups. Having more healthy controls than pathological subjects, datasets meant for …
diagnosis. Evaluation methods for classification results that are based on the study of one or
more metrics can be unable to distinguish between cases in which the classifier is
discriminating the classes from cases in which it is not. In the binary setting, such
circumstances can be encountered when data are unbalanced with respect to the diagnostic
groups. Having more healthy controls than pathological subjects, datasets meant for …
Abstract
Machine learning techniques are increasingly adopted in computer-aided diagnosis. Evaluation methods for classification results that are based on the study of one or more metrics can be unable to distinguish between cases in which the classifier is discriminating the classes from cases in which it is not. In the binary setting, such circumstances can be encountered when data are unbalanced with respect to the diagnostic groups. Having more healthy controls than pathological subjects, datasets meant for diagnosis frequently show a certain degree of unbalancedness. In this work, we propose to recast the evaluation of classification results as a test of statistical independence between the predicted and the actual diagnostic groups. We address the problem within the Bayesian hypothesis testing framework. Different from the standard metrics, the proposed method is able to handle unbalanced data and takes into account the size of the available data. We show experimental evidence of the efficacy of the approach both on simulated data and on real data about the diagnosis of the Attention Deficit Hyperactivity Disorder (ADHD).
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果