Hyperparameters and tuning strategies for random forest

P Probst, MN Wright… - … Reviews: data mining and …, 2019 - Wiley Online Library
The random forest (RF) algorithm has several hyperparameters that have to be set by the
user, for example, the number of observations drawn randomly for each tree and whether …

Ensembles for feature selection: A review and future trends

V Bolón-Canedo, A Alonso-Betanzos - Information fusion, 2019 - Elsevier
Ensemble learning is a prolific field in Machine Learning since it is based on the assumption
that combining the output of multiple models is better than using a single model, and it …

An improved random forest based on the classification accuracy and correlation measurement of decision trees

Z Sun, G Wang, P Li, H Wang, M Zhang… - Expert Systems with …, 2024 - Elsevier
Random forest is one of the most widely used machine learning algorithms. Decision trees
used to construct the random forest may have low classification accuracies or high …

Evolutionary bagging for ensemble learning

G Ngo, R Beard, R Chandra - Neurocomputing, 2022 - Elsevier
Ensemble learning has gained success in machine learning with major advantages over
other learning methods. Bagging is a prominent ensemble learning method that creates …

Surrogate-assisted evolutionary deep learning using an end-to-end random forest-based performance predictor

Y Sun, H Wang, B Xue, Y Jin, GG Yen… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) have shown remarkable performance in various real-
world applications. Unfortunately, the promising performance of CNNs can be achieved only …

ATTED-II v11: a plant gene coexpression database using a sample balancing technique by subagging of principal components

T Obayashi, H Hibara, Y Kagaya, Y Aoki… - Plant and Cell …, 2022 - academic.oup.com
Abstract ATTED-II (https://atted. jp) is a gene coexpression database for nine plant species
based on publicly available RNAseq and microarray data. One of the challenges in …

Biochar application as a tool to decrease soil nitrogen losses (NH3 volatilization, N2O emissions, and N leaching) from croplands: Options and mitigation strength in …

Q Liu, B Liu, Y Zhang, T Hu, Z Lin, G Liu… - Global Change …, 2019 - Wiley Online Library
Biochar application to croplands has been proposed as a potential strategy to decrease
losses of soil‐reactive nitrogen (N) to the air and water. However, the extent and spatial …

CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests

L Ma, S Fan - BMC bioinformatics, 2017 - Springer
Background The random forests algorithm is a type of classifier with prominent universality,
a wide application range, and robustness for avoiding overfitting. But there are still some …

Offline data-driven evolutionary optimization using selective surrogate ensembles

H Wang, Y Jin, C Sun, J Doherty - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
In solving many real-world optimization problems, neither mathematical functions nor
numerical simulations are available for evaluating the quality of candidate solutions. Instead …

Optimal weighted nearest neighbour classifiers

RJ Samworth - 2012 - projecteuclid.org
Supplement to “Optimal weighted nearest neighbour classifiers”. We complete the proof of
Theorem 1, and give the proofs of the other results in the paper. We also discuss minimax …