Authors
Anne‐Laure Boulesteix, Silke Janitza, Jochen Kruppa, Inke R König
Publication date
2012/11
Source
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Volume
2
Issue
6
Pages
493-507
Publisher
John Wiley & Sons, Inc.
Description
The random forest (RF) algorithm by Leo Breiman has become a standard data analysis tool in bioinformatics. It has shown excellent performance in settings where the number of variables is much larger than the number of observations, can cope with complex interaction structures as well as highly correlated variables, and returns measures of variable importance. This paper synthesizes 10 years of RF development with emphasis on applications to bioinformatics and computational biology. Special attention is paid to practical aspects such as the selection of parameters, available RF implementations, and important pitfalls and biases of RF and its variable importance measures (VIMs). The paper surveys recent developments of the methodology relevant to bioinformatics as well as some representative examples of RF applications in this context and possible directions for future research. © 2012 Wiley Periodicals …
Total citations
Citations per year (bar chart), 2013-2024: 9, 26, 46, 63, 59, 80, 96, 108, 121, 142, 140, 80