ISSN 2353-6977 (Online)

A COMPARATIVE STUDY ON PERFORMANCE OF BASIC AND ENSEMBLE CLASSIFIERS WITH VARIOUS DATASETS

Archana GUNAKALA, Afzal Hussain SHAHID

Classification plays a critical role in machine learning (ML) systems for processing images, text and high -dimensional data. Predicting class labels from training data is the primary goal of classification. An optimal model for a particular classification problem is chosen based on the model's performance and execution time. This paper compares and analyzes the performance of basic as well as ensemble classifiers utilizing 10-fold cross validation and also discusses their essential concepts, advantages, and disadvantages. In this study five basic classifiers namely Naïve Bayes (NB), Multi-layer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), and Random Forest (RF) and the ensemble of all the five classifiers along with few more combinations are compared with five University of California Irvine (UCI) ML Repository datasets and a Diabetes Health Indicators dataset from Kaggle repository. To analyze and compare the performance of classifiers, evaluation metrics like Accuracy, Recall, Precision, Area Under Curve (AUC) and F-Score are used. Experimental results showed that SVM performs best on two out of the six datasets (Diabetes Health Indicators and waveform), RF performs best for Arrhythmia, Sonar, Tic-tac-toe datasets, and the best ensemble combination is found to be DT+SVM+RF on Ionosphere dataset having respective accuracies 72.58%, 90.38%, 81.63%, 73.59%, 94.78% and 94.01%. The proposed ensemble combinations outperformed the conven¬tional models for few datasets.

+ - FULL TEXT Click to collapse

Download article

+ - HOW TO CITE THIS PAPER Click to collapse

APA 7th style

Gunakala, A., & Shahid, A. H. (2023). A comparative study on performance of basic and ensemble classifiers with various datasets. Applied Computer Science, 19(1), 107-132. https://doi.org/10.35784/acs-2023-08

Chicago style

Gunakala, Archana, and Afzal Hussain Shahid. "A comparative study on performance of basic and ensemble classifiers with various datasets." Applied Computer Science 19, no. 1 (2023): 107-132.

IEEE style

A. Gunakala and A. H. Shahid, "A comparative study on performance of basic and ensemble classifiers with various datasets," Applied Computer Science, vol. 19, no. 1, pp.107-132, 2023, doi: 10.35784/acs-2023-08.

Vancouver style

Gunakala A, Shahid A H. A comparative study on performance of basic and ensemble classifiers with various datasets. Applied Computer Science. 2023;19(1):107-132.

< Prev		Next >

ISSN 2353-6977 (Online)

A COMPARATIVE STUDY ON PERFORMANCE OF BASIC AND ENSEMBLE CLASSIFIERS WITH VARIOUS DATASETS

News

Submit

Time of publication