Improving support vector machine and backpropagation performance for diabetes mellitus classification
Angga Prastyo, Sutikno Sutikno, Khadijah Khadijah
Abstract
Diabetes mellitus (DM) is a glucose disorder disease in the human body that contributes significantly to the high mortality rate. Various studies on early detection and classification have been conducted as a DM prevention effort by applying a machine learning model. The problems that may occur are weak model performance and misclassification caused by imbalanced data. The existence of dominating (majority) data causes poor model performance in identifying minority data. This paper proposed handling the problem of imbalanced data by performing the synthetic minority oversampling technique (SMOTE) and observing its effect on the classification performance of the support vector machine (SVM) and Backpropagation artificial neural network (ANN) methods. The experiment showed that the SVM method and imbalanced data achieved 94.31% accuracy, and the Backpropagation ANN achieved 91.56% accuracy. At the same time, the SVM method and balanced data produced an accuracy of 98.85%, while the Backpropagation ANN method and balanced data produced an accuracy of 94.90%. The results show that oversampling techniques can improve the performance of the classification model for each data class.
Keywords
Backpropagation; Diabetes mellitus; Imbalanced data; Support vector machine; Synthetic minority oversampling technique
DOI:
https://doi.org/10.11591/csit.v5i2.p140-149
Refbacks
There are currently no refbacks.
Computer Science and Information Technologies p-ISSN: 2722-323X, e-ISSN: 2722-3221 This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Universitas Ahmad Dahlan (UAD) .
<a title="Web Analytics" href="https://statcounter.com/" target="_blank"<img class="statcounter" src="https://c.statcounter.com/11992001/0/5fa2f457/0/" alt="Web Analytics"</div> <br> CSIT Visitor Stats
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License .