作者
Mark J Van der Laan, Eric C Polley, Alan E Hubbard
发表日期
2007/9/16
期刊
Statistical applications in genetics and molecular biology
卷号
6
期号
1
出版商
De Gruyter
简介
When trying to learn a model for the prediction of an outcome given a set of covariates, a statistician has many estimation procedures in their toolbox. A few examples of these candidate learners are: least squares, least angle regression, random forests, and spline regression. Previous articles (van der Laan and Dudoit (2003); van der Laan et al. (2006); Sinisi et al. (2007)) theoretically validated the use of cross validation to select an optimal learner among many candidate learners. Motivated by this use of cross validation, we propose a new prediction method for creating a weighted combination of many candidate learners to build the super learner. This article proposes a fast algorithm for constructing a super learner in prediction which uses V-fold cross-validation to select weights to combine an initial set of candidate learners. In addition, this paper contains a practical demonstration of the adaptivity of this so …
引用总数
2009201020112012201320142015201620172018201920202021202220232024919263649685979106153168230305323355225
学术搜索中的文章
MJ Van der Laan, EC Polley, AE Hubbard - Statistical applications in genetics and molecular …, 2007