An Efficient Categorization of Diabetes Imbalanced Data Using SMOTE-ENN With Fine-Tuned LS-SVM Algorithm
DOI:
https://doi.org/10.25195/ijci.v51i1.579
Keywords:
Diabetes Mellitus; Imbalanced Datasets; Preprocessing; Resampling; SMOTE-ENN; Least Squares Support Vector Machine; Hyperparameter; Optimization.
Abstract
Diabetes is a chronic disease and a recognized major cause of death. In recent years its impact has increased dramatically, and it has become a global threat. Type 2 diabetes is indicated by abnormally high blood glucose levels attributable to insulin resistance and reduced pancreatic insulin production. Machine learning (ML) is a family of computational algorithms designed to imitate human intelligence by learning from the surrounding environment. This study uses two diabetes datasets, the Pima Indians Diabetes dataset and the Iraqi Society Diabetes (ISD) dataset, both of which are characterized by an imbalanced class distribution and the presence of outliers. Both datasets are preprocessed before classification. Classifying imbalanced datasets is a crucial problem in machine learning, and many methods, including data resampling, have been proposed to address it. We use the SMOTE-ENN resampling technique, together with imputation, to address the imbalance in the diabetes datasets. The classifier used to categorize diabetes patients is the Least Squares Support Vector Machine (LS-SVM). The behavior of an ML algorithm is governed by a set of hyperparameters, so their values should be chosen carefully; we use a grid search to optimize the LS-SVM hyperparameters, which improved the classification results. Applying SMOTE-ENN resampling to the diabetes datasets further enhanced the performance of the fine-tuned LS-SVM. The proposed combination of SMOTE-ENN and fine-tuned LS-SVM is evaluated using accuracy, recall, and precision, and all three metrics improved markedly when it was used to categorize diabetes patients.
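The grid-search tuning of LS-SVM described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The RBF kernel, the hyperparameter names `gamma` (regularization) and `sigma` (kernel width), the hold-out validation split, the grid values, and the six-point toy dataset are all assumptions made for illustration. The SMOTE-ENN resampling step and the Pima and ISD datasets are not reproduced here. The sketch uses only the Python standard library, so the linear system that defines LS-SVM is solved with a small Gaussian elimination routine.

```python
import math
from itertools import product


def rbf(a, b, sigma):
    """Gaussian (RBF) kernel between two feature vectors (kernel choice is an assumption)."""
    sq_dist = sum((p - q) ** 2 for p, q in zip(a, b))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def solve(A, rhs):
    """Solve A x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    M = [row[:] + [r] for row, r in zip(A, rhs)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        tail = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x


def fit_lssvm(X, y, gamma, sigma):
    """Fit an LS-SVM classifier on labels in {+1, -1}; returns (alpha, b).

    Solves [[0, y^T], [y, Omega + I/gamma]] [b; alpha] = [0; 1],
    where Omega[i][j] = y[i] * y[j] * K(x_i, x_j).
    """
    n = len(X)
    A = [[0.0] + [float(v) for v in y]]
    for i in range(n):
        row = [float(y[i])]
        for j in range(n):
            omega = y[i] * y[j] * rbf(X[i], X[j], sigma)
            row.append(omega + (1.0 / gamma if i == j else 0.0))
        A.append(row)
    sol = solve(A, [0.0] + [1.0] * n)
    return sol[1:], sol[0]


def predict(X_train, y_train, alpha, b, sigma, x):
    """Classify one sample by the sign of the LS-SVM decision function."""
    score = b + sum(a * yi * rbf(x, xi, sigma)
                    for a, yi, xi in zip(alpha, y_train, X_train))
    return 1 if score >= 0 else -1


def scores(y_true, y_pred):
    """Return (accuracy, precision, recall) with +1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    fp = sum(t == -1 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == -1 for t, p in pairs)
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall


def grid_search(X_tr, y_tr, X_val, y_val, gammas, sigmas):
    """Try every (gamma, sigma) pair; keep the first pair with the best validation accuracy."""
    best = None
    for gamma, sigma in product(gammas, sigmas):
        alpha, b = fit_lssvm(X_tr, y_tr, gamma, sigma)
        preds = [predict(X_tr, y_tr, alpha, b, sigma, x) for x in X_val]
        acc = scores(y_val, preds)[0]
        if best is None or acc > best[0]:
            best = (acc, gamma, sigma)
    return best


# Hypothetical, already-scaled toy data: two features, labels -1 / +1.
X_train = [[0.0, 0.0], [0.5, 0.2], [0.2, 0.6], [3.0, 3.0], [3.4, 2.8], [2.7, 3.3]]
y_train = [-1, -1, -1, 1, 1, 1]
X_val = [[0.3, 0.3], [3.1, 3.0]]
y_val = [-1, 1]

best = grid_search(X_train, y_train, X_val, y_val, gammas=[1.0, 10.0], sigmas=[0.5, 1.0])
print("best (accuracy, gamma, sigma):", best)
```

On real data one would typically replace the single hold-out split with cross-validation and use a numerical library for the linear solve; the structure of the search stays the same.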
License
Copyright (c) 2025 Iraqi Journal for Computers and Informatics

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.