Predicting Diabetes Mellitus with Machine Learning Techniques

Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal structures. If diabetes remains untreated and undiagnosed, it can cause blood sugar levels to vary significantly, potentially damaging essential orga...

Full description

Saved in:
Bibliographic Details
Published in:Al-Iraqia Journal for Scientific Engineering Research Vol. 4; no. 2; pp. 20 - 32
Main Authors: Ahmed Jassim, Heba, R. Kadhim, Omar, Khduair Taha, Zahraa, Siaw Paw, Johnny Koh, Tak, Yaw Chong, Kiong, Tiong Sieh
Format: Journal Article
Language:English
Published: Al-Iraqia University - College of Engineering 19.06.2025
Subjects:
ISSN:2710-2165, 2710-2165
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal structures. If diabetes remains untreated and undiagnosed, it can cause blood sugar levels to vary significantly, potentially damaging essential organs like the kidneys, eyes, and arteries of the heart in critical cases. As a result, there is an increasing focus on the prevention and early detection of diabetes mellitus within the medical community. Utilizing machine learning algorithms to analyze appropriate datasets for early disease prediction could prove life-saving. The objective of this paper is to examine four algorithms that are proposed to enhance the diagnosis of diabetes. This research analyzes the effectiveness of various machine learning algorithms in processing datasets with minority classes. The evaluation was based on the classification report (including accuracy, precision, recall, and F1-score), the confusion matrix, and the ROC AUC. The Diabetes Prediction Dataset is used to evaluate four machine learning algorithms. The classifier that deserves a singular mention is the Artificial Neural Network (ANN), which achieves a 97% accuracy rate. This demonstrates its capability of classifying instances that are common and less common types. The Random Forest and Decision Tree models also perform well in terms of their ability to deliver strong performance, and the outcome shows some incremental differences, suggesting their ability to manage the dataset is quite high. However, the Support Vector Machine (SVM) model performs worse than all the above models at 96.36% and seems to struggle with the correct classification of less frequent instances. Therefore, it would be problematic to distinguish between classes that are prominent and those that are not. Notably, the ANN, Random Forest, and Decision Tree models effectively identify cases that are more likely to capture rare cases, an important aspect when dealing with datasets that have class imbalance.
AbstractList Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal structures. If diabetes remains untreated and undiagnosed, it can cause blood sugar levels to vary significantly, potentially damaging essential organs like the kidneys, eyes, and arteries of the heart in critical cases. As a result, there is an increasing focus on the prevention and early detection of diabetes mellitus within the medical community. Utilizing machine learning algorithms to analyze appropriate datasets for early disease prediction could prove life-saving. The objective of this paper is to examine four algorithms that are proposed to enhance the diagnosis of diabetes. This research analyzes the effectiveness of various machine learning algorithms in processing datasets with minority classes. The evaluation was based on the classification report (including accuracy, precision, recall, and F1-score), the confusion matrix, and the ROC AUC. The Diabetes Prediction Dataset is used to evaluate four machine learning algorithms. The classifier that deserves a singular mention is the Artificial Neural Network (ANN), which achieves a 97% accuracy rate. This demonstrates its capability of classifying instances that are common and less common types. The Random Forest and Decision Tree models also perform well in terms of their ability to deliver strong performance, and the outcome shows some incremental differences, suggesting their ability to manage the dataset is quite high. However, the Support Vector Machine (SVM) model performs worse than all the above models at 96.36% and seems to struggle with the correct classification of less frequent instances. Therefore, it would be problematic to distinguish between classes that are prominent and those that are not. Notably, the ANN, Random Forest, and Decision Tree models effectively identify cases that are more likely to capture rare cases, an important aspect when dealing with datasets that have class imbalance.
Author Ahmed Jassim, Heba
Siaw Paw, Johnny Koh
Kiong, Tiong Sieh
Khduair Taha, Zahraa
R. Kadhim, Omar
Tak, Yaw Chong
Author_xml – sequence: 1
  givenname: Heba
  surname: Ahmed Jassim
  fullname: Ahmed Jassim, Heba
– sequence: 2
  givenname: Omar
  surname: R. Kadhim
  fullname: R. Kadhim, Omar
– sequence: 3
  givenname: Zahraa
  surname: Khduair Taha
  fullname: Khduair Taha, Zahraa
– sequence: 4
  givenname: Johnny Koh
  surname: Siaw Paw
  fullname: Siaw Paw, Johnny Koh
– sequence: 5
  givenname: Yaw Chong
  surname: Tak
  fullname: Tak, Yaw Chong
– sequence: 6
  givenname: Tiong Sieh
  surname: Kiong
  fullname: Kiong, Tiong Sieh
BookMark eNpNkM1KAzEUhYNUsNY-gZt5gRnzM0kmCxdSq1ZaFK3rkElu2pRxRpMp4tvbHxFX93A5fAe-czRouxYQuiS44BUX5dXs8XX6UpQFLSimvGCEn6AhlQTnlAg--JfP0DilDcaYMcIkl0N0_RzBBduHdpXdBlNDDylbQNOEfpuyr9Cvs4Wx69BCNgcT231vCXbdhs8tpAt06k2TYPx7R-jtbrqcPOTzp_vZ5GaeW8I5z73kJVRW-dJZ45iQHoMSmBPnKBipgHulZAW0rqvaGMkNp8oCJXXFGHWSjdDsyHWd2eiPGN5N_NadCfrw6OJKm9gH24CulGFYydrv57zyylEhlFTYeFsTIXYsdmTZ2KUUwf_xCNYHofogVJea6r1QvRPKfgAzh2r6
Cites_doi 10.1016/j.procs.2015.03.182
10.1109/HI-POCT45284.2019.8962811
10.1177/1460458216675500
10.33889/IJMEMS.2019.4.3-057
10.3390/s21113704
10.1186/s13098-021-00767-9
10.1016/j.imu.2016.02.001
10.1186/s12911-020-01318-4
10.1155/2021/5854966
10.1016/j.procs.2016.04.016
10.1186/s13638-020-01765-7
10.35940/ijitee.B7586.129219
10.35940/ijeat.C4819.029320
10.38094/jastt20165
10.5220/0009839405330540
10.1590/1517-3151.0608
10.1016/j.procs.2015.10.014
10.31590/ejosat.899716
10.1016/j.procs.2019.08.140
10.1016/j.procs.2017.08.193
10.58564/IJSER.3.4.2024.275
10.1007/s13755-019-0095-z
ContentType Journal Article
DBID AAYXX
CITATION
DOA
DOI 10.58564/IJSER.4.2.2025.315
DatabaseName CrossRef
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
EISSN 2710-2165
EndPage 32
ExternalDocumentID oai_doaj_org_article_89a3097bfc9f4f9f9d2669790afcb166
10_58564_IJSER_4_2_2025_315
GroupedDBID AAYXX
ALMA_UNASSIGNED_HOLDINGS
CITATION
GROUPED_DOAJ
ID FETCH-LOGICAL-c1555-f754e8c9f4dcad367f0e96051dd2ea79e5f9978e2bb8baa75a529ce21b8332d73
IEDL.DBID DOA
ISSN 2710-2165
IngestDate Fri Oct 03 12:39:20 EDT 2025
Sat Nov 29 07:45:26 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
License https://creativecommons.org/licenses/by-sa/4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1555-f754e8c9f4dcad367f0e96051dd2ea79e5f9978e2bb8baa75a529ce21b8332d73
OpenAccessLink https://doaj.org/article/89a3097bfc9f4f9f9d2669790afcb166
PageCount 13
ParticipantIDs doaj_primary_oai_doaj_org_article_89a3097bfc9f4f9f9d2669790afcb166
crossref_primary_10_58564_IJSER_4_2_2025_315
PublicationCentury 2000
PublicationDate 2025-06-19
PublicationDateYYYYMMDD 2025-06-19
PublicationDate_xml – month: 06
  year: 2025
  text: 2025-06-19
  day: 19
PublicationDecade 2020
PublicationTitle Al-Iraqia Journal for Scientific Engineering Research
PublicationYear 2025
Publisher Al-Iraqia University - College of Engineering
Publisher_xml – name: Al-Iraqia University - College of Engineering
References 7740
7721
7720
7719
7734
7733
7714
7736
7735
7716
7738
7715
7737
7718
7717
7739
7730
7732
7731
7723
7722
7725
7724
7727
7726
7729
7728
References_xml – ident: 7717
  doi: 10.1016/j.procs.2015.03.182
– ident: 7725
  doi: 10.1109/HI-POCT45284.2019.8962811
– ident: 7722
  doi: 10.1177/1460458216675500
– ident: 7724
  doi: 10.33889/IJMEMS.2019.4.3-057
– ident: 7733
  doi: 10.3390/s21113704
– ident: 7736
  doi: 10.1186/s13098-021-00767-9
– ident: 7719
  doi: 10.1016/j.imu.2016.02.001
– ident: 7727
  doi: 10.1186/s12911-020-01318-4
– ident: 7732
  doi: 10.1155/2021/5854966
– ident: 7715
  doi: 10.1016/j.procs.2016.04.016
– ident: 7731
  doi: 10.1186/s13638-020-01765-7
– ident: 7726
  doi: 10.35940/ijitee.B7586.129219
– ident: 7728
  doi: 10.35940/ijeat.C4819.029320
– ident: 7738
  doi: 10.38094/jastt20165
– ident: 7729
  doi: 10.5220/0009839405330540
– ident: 7730
– ident: 7718
  doi: 10.1590/1517-3151.0608
– ident: 7716
  doi: 10.1016/j.procs.2015.10.014
– ident: 7721
– ident: 7734
  doi: 10.31590/ejosat.899716
– ident: 7723
  doi: 10.1016/j.procs.2019.08.140
– ident: 7739
– ident: 7720
  doi: 10.1016/j.procs.2017.08.193
– ident: 7740
  doi: 10.58564/IJSER.3.4.2024.275
– ident: 7735
– ident: 7737
– ident: 7714
  doi: 10.1007/s13755-019-0095-z
SSID ssj0003313757
Score 2.2950401
Snippet Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal...
SourceID doaj
crossref
SourceType Open Website
Index Database
StartPage 20
SubjectTerms AES, RSA, ECC, ChaCha20, Hybrid Algorithms and Encryption Algorithms
Title Predicting Diabetes Mellitus with Machine Learning Techniques
URI https://doaj.org/article/89a3097bfc9f4f9f9d2669790afcb166
Volume 4
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 2710-2165
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0003313757
  issn: 2710-2165
  databaseCode: DOA
  dateStart: 20220101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV07T8MwGLRQxcCCQIAoL3lgJG3iR-xvYADUCpBaVVCkbpafiKWgPvj92E5AZWJhjZLIvi_K-ZLzfQhd-mAodcxFbWJIwVyoCy2NKUouS6JNIo2Qm02I8VjOZjDZaPWVPGFNPHADXF-CpiUIEywEFiCAi5QCAkodrKnqHLZdCtgQU-kdTGlFBRdNzFBcEdes__D4PHjqsV7afUV41Kr8FxVtJPZnahnuod12TYhvmrHsoy0_P0DXk0X6h5Jcybi1rSzxKMVnrtZLnD6f4lE2QnrcZqS-4ul3IOvyEL0MB9O7-6LtdVDYyOi8CIIzL9McndWO1iKUPooLXjlHvBbgeYAo-DwxRhqtBdecgPWkMpJS4gQ9Qp35-9wfI0wcB1sFEbSumbZWRtERry4lMKutYV109T1t9dFEWqgoBTJKKqOkmCIqoaQiSl10m6D5OTXlUecDsUqqrZL6q0on_3GTU7STBpWMWhWcoc5qsfbnaNt-rt6Wi4v8AHwBJve1cQ
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Predicting+Diabetes+Mellitus+with+Machine+Learning+Techniques&rft.jtitle=Al-Iraqia+Journal+for+Scientific+Engineering+Research&rft.au=Ahmed+Jassim%2C+Heba&rft.au=R.+Kadhim%2C+Omar&rft.au=Khduair+Taha%2C+Zahraa&rft.au=Siaw+Paw%2C+Johnny+Koh&rft.date=2025-06-19&rft.issn=2710-2165&rft.eissn=2710-2165&rft.volume=4&rft.issue=2&rft.spage=20&rft.epage=32&rft_id=info:doi/10.58564%2FIJSER.4.2.2025.315&rft.externalDBID=n%2Fa&rft.externalDocID=10_58564_IJSER_4_2_2025_315
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2710-2165&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2710-2165&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2710-2165&client=summon