A comparative study on performance of basic and ensemble classifiers with various datasets

Authors

Archana Gunakala, Afzal Hussain Shahid

Publication date

2023

Journal

Applied Computer Science

Volume

Issue

Description

Classification plays a critical role in machine learning (ML) systems for processing images, text, and high-dimensional data. Predicting class labels from training data is the primary goal of classification. An optimal model for a particular classification problem is chosen based on the model's performance and execution time. This paper compares and analyzes the performance of basic and ensemble classifiers using 10-fold cross-validation, and discusses their essential concepts, advantages, and disadvantages. Five basic classifiers, namely Naïve Bayes (NB), Multilayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), and Random Forest (RF), together with the ensemble of all five classifiers and a few other combinations, are compared on five University of California Irvine (UCI) ML Repository datasets and a Diabetes Health Indicators dataset from the Kaggle repository. To analyze and compare the performance of the classifiers, the evaluation metrics Accuracy, Recall, Precision, Area Under Curve (AUC), and F-Score are used. Experimental results showed that SVM performs best on two of the six datasets (Diabetes Health Indicators and Waveform), RF performs best on the Arrhythmia, Sonar, and Tic-tac-toe datasets, and the best ensemble combination is found to be DT + SVM + RF on the Ionosphere dataset, with respective accuracies of 72.58%, 90.38%, 81.63%, 73.59%, 94.78%, and 94.01%. The proposed ensemble combinations outperformed the conventional models on a few datasets.
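The evaluation protocol named in the abstract (10-fold cross-validation, an ensemble of several classifiers, and Accuracy, Precision, Recall, and F-Score) can be sketched with the Python standard library alone. This is a minimal illustration, not the authors' implementation. The abstract does not say how the ensemble combines its members, so hard majority voting is an assumption here. The function names `k_fold_indices`, `majority_vote`, and `binary_metrics` and the toy labels are hypothetical. AUC is omitted because it needs classifier scores rather than hard labels, and the folds are neither shuffled nor stratified.

```python
from collections import Counter
from typing import Dict, List, Sequence, Tuple


def k_fold_indices(n_samples: int, k: int = 10) -> List[Tuple[List[int], List[int]]]:
    """Return k (train, test) index pairs covering range(n_samples).

    Folds are contiguous and differ in size by at most one sample.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    base, extra = divmod(n_samples, k)
    folds = []
    start = 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


def majority_vote(predictions: Sequence[Sequence[int]]) -> List[int]:
    """Combine per-classifier label lists by hard majority voting (assumed rule).

    On a tie, the label seen first among the voters wins.
    """
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*predictions)]


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int],
                   positive: int = 1) -> Dict[str, float]:
    """Compute accuracy, precision, recall, and F-score for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f_score = (2 * precision * recall / (precision + recall)
               if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f_score": f_score}


if __name__ == "__main__":
    # Toy labels only; these are not results from the paper.
    y_true = [1, 0, 1, 1, 0, 0]
    member_predictions = [
        [1, 0, 1, 0, 0, 1],  # hypothetical classifier A
        [1, 0, 0, 1, 0, 0],  # hypothetical classifier B
        [0, 0, 1, 1, 1, 0],  # hypothetical classifier C
    ]
    ensemble = majority_vote(member_predictions)
    print("ensemble:", binary_metrics(y_true, ensemble))
    print("fold sizes:", [len(test) for _, test in k_fold_indices(25, 10)])
```

In a full experiment, each classifier would be trained on the `train` indices of every fold and scored on the matching `test` indices, with the per-fold metrics averaged over the ten folds.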

Total citations

Cited by 2

2024: 2
