Predicting Diabetes Mellitus with Machine Learning Techniques
Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal structures. If diabetes remains untreated and undiagnosed, it can cause blood sugar levels to vary significantly, potentially damaging essential orga...
Uložené v:
| Vydané v: | Al-Iraqia Journal for Scientific Engineering Research Ročník 4; číslo 2; s. 20 - 32 |
|---|---|
| Hlavní autori: | , , , , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
Al-Iraqia University - College of Engineering
19.06.2025
|
| Predmet: | |
| ISSN: | 2710-2165, 2710-2165 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal structures. If diabetes remains untreated and undiagnosed, it can cause blood sugar levels to vary significantly, potentially damaging essential organs like the kidneys, eyes, and arteries of the heart in critical cases. As a result, there is an increasing focus on the prevention and early detection of diabetes mellitus within the medical community. Utilizing machine learning algorithms to analyze appropriate datasets for early disease prediction could prove life-saving. The objective of this paper is to examine four algorithms that are proposed to enhance the diagnosis of diabetes. This research analyzes the effectiveness of various machine learning algorithms in processing datasets with minority classes. The evaluation was based on the classification report (including accuracy, precision, recall, and F1-score), the confusion matrix, and the ROC AUC. The Diabetes Prediction Dataset is used to evaluate four machine learning algorithms. The classifier that deserves a singular mention is the Artificial Neural Network (ANN), which achieves a 97% accuracy rate. This demonstrates its capability of classifying instances that are common and less common types. The Random Forest and Decision Tree models also perform well in terms of their ability to deliver strong performance, and the outcome shows some incremental differences, suggesting their ability to manage the dataset is quite high. However, the Support Vector Machine (SVM) model performs worse than all the above models at 96.36% and seems to struggle with the correct classification of less frequent instances. Therefore, it would be problematic to distinguish between classes that are prominent and those that are not. Notably, the ANN, Random Forest, and Decision Tree models effectively identify cases that are more likely to capture rare cases, an important aspect when dealing with datasets that have class imbalance. |
|---|---|
| AbstractList | Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal structures. If diabetes remains untreated and undiagnosed, it can cause blood sugar levels to vary significantly, potentially damaging essential organs like the kidneys, eyes, and arteries of the heart in critical cases. As a result, there is an increasing focus on the prevention and early detection of diabetes mellitus within the medical community. Utilizing machine learning algorithms to analyze appropriate datasets for early disease prediction could prove life-saving. The objective of this paper is to examine four algorithms that are proposed to enhance the diagnosis of diabetes. This research analyzes the effectiveness of various machine learning algorithms in processing datasets with minority classes. The evaluation was based on the classification report (including accuracy, precision, recall, and F1-score), the confusion matrix, and the ROC AUC. The Diabetes Prediction Dataset is used to evaluate four machine learning algorithms. The classifier that deserves a singular mention is the Artificial Neural Network (ANN), which achieves a 97% accuracy rate. This demonstrates its capability of classifying instances that are common and less common types. The Random Forest and Decision Tree models also perform well in terms of their ability to deliver strong performance, and the outcome shows some incremental differences, suggesting their ability to manage the dataset is quite high. However, the Support Vector Machine (SVM) model performs worse than all the above models at 96.36% and seems to struggle with the correct classification of less frequent instances. Therefore, it would be problematic to distinguish between classes that are prominent and those that are not. Notably, the ANN, Random Forest, and Decision Tree models effectively identify cases that are more likely to capture rare cases, an important aspect when dealing with datasets that have class imbalance. |
| Author | Ahmed Jassim, Heba Siaw Paw, Johnny Koh Kiong, Tiong Sieh Khduair Taha, Zahraa R. Kadhim, Omar Tak, Yaw Chong |
| Author_xml | – sequence: 1 givenname: Heba surname: Ahmed Jassim fullname: Ahmed Jassim, Heba – sequence: 2 givenname: Omar surname: R. Kadhim fullname: R. Kadhim, Omar – sequence: 3 givenname: Zahraa surname: Khduair Taha fullname: Khduair Taha, Zahraa – sequence: 4 givenname: Johnny Koh surname: Siaw Paw fullname: Siaw Paw, Johnny Koh – sequence: 5 givenname: Yaw Chong surname: Tak fullname: Tak, Yaw Chong – sequence: 6 givenname: Tiong Sieh surname: Kiong fullname: Kiong, Tiong Sieh |
| BookMark | eNpNkM1KAzEUhYNUsNY-gZt5gRnzM0kmCxdSq1ZaFK3rkElu2pRxRpMp4tvbHxFX93A5fAe-czRouxYQuiS44BUX5dXs8XX6UpQFLSimvGCEn6AhlQTnlAg--JfP0DilDcaYMcIkl0N0_RzBBduHdpXdBlNDDylbQNOEfpuyr9Cvs4Wx69BCNgcT231vCXbdhs8tpAt06k2TYPx7R-jtbrqcPOTzp_vZ5GaeW8I5z73kJVRW-dJZ45iQHoMSmBPnKBipgHulZAW0rqvaGMkNp8oCJXXFGHWSjdDsyHWd2eiPGN5N_NadCfrw6OJKm9gH24CulGFYydrv57zyylEhlFTYeFsTIXYsdmTZ2KUUwf_xCNYHofogVJea6r1QvRPKfgAzh2r6 |
| Cites_doi | 10.1016/j.procs.2015.03.182 10.1109/HI-POCT45284.2019.8962811 10.1177/1460458216675500 10.33889/IJMEMS.2019.4.3-057 10.3390/s21113704 10.1186/s13098-021-00767-9 10.1016/j.imu.2016.02.001 10.1186/s12911-020-01318-4 10.1155/2021/5854966 10.1016/j.procs.2016.04.016 10.1186/s13638-020-01765-7 10.35940/ijitee.B7586.129219 10.35940/ijeat.C4819.029320 10.38094/jastt20165 10.5220/0009839405330540 10.1590/1517-3151.0608 10.1016/j.procs.2015.10.014 10.31590/ejosat.899716 10.1016/j.procs.2019.08.140 10.1016/j.procs.2017.08.193 10.58564/IJSER.3.4.2024.275 10.1007/s13755-019-0095-z |
| ContentType | Journal Article |
| DBID | AAYXX CITATION DOA |
| DOI | 10.58564/IJSER.4.2.2025.315 |
| DatabaseName | CrossRef Directory of Open Access Journals |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| EISSN | 2710-2165 |
| EndPage | 32 |
| ExternalDocumentID | oai_doaj_org_article_89a3097bfc9f4f9f9d2669790afcb166 10_58564_IJSER_4_2_2025_315 |
| GroupedDBID | AAYXX ALMA_UNASSIGNED_HOLDINGS CITATION GROUPED_DOAJ |
| ID | FETCH-LOGICAL-c1555-f754e8c9f4dcad367f0e96051dd2ea79e5f9978e2bb8baa75a529ce21b8332d73 |
| IEDL.DBID | DOA |
| ISSN | 2710-2165 |
| IngestDate | Fri Oct 03 12:39:20 EDT 2025 Sat Nov 29 07:45:26 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Language | English |
| License | https://creativecommons.org/licenses/by-sa/4.0 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c1555-f754e8c9f4dcad367f0e96051dd2ea79e5f9978e2bb8baa75a529ce21b8332d73 |
| OpenAccessLink | https://doaj.org/article/89a3097bfc9f4f9f9d2669790afcb166 |
| PageCount | 13 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_89a3097bfc9f4f9f9d2669790afcb166 crossref_primary_10_58564_IJSER_4_2_2025_315 |
| PublicationCentury | 2000 |
| PublicationDate | 2025-06-19 |
| PublicationDateYYYYMMDD | 2025-06-19 |
| PublicationDate_xml | – month: 06 year: 2025 text: 2025-06-19 day: 19 |
| PublicationDecade | 2020 |
| PublicationTitle | Al-Iraqia Journal for Scientific Engineering Research |
| PublicationYear | 2025 |
| Publisher | Al-Iraqia University - College of Engineering |
| Publisher_xml | – name: Al-Iraqia University - College of Engineering |
| References | 7740 7721 7720 7719 7734 7733 7714 7736 7735 7716 7738 7715 7737 7718 7717 7739 7730 7732 7731 7723 7722 7725 7724 7727 7726 7729 7728 |
| References_xml | – ident: 7717 doi: 10.1016/j.procs.2015.03.182 – ident: 7725 doi: 10.1109/HI-POCT45284.2019.8962811 – ident: 7722 doi: 10.1177/1460458216675500 – ident: 7724 doi: 10.33889/IJMEMS.2019.4.3-057 – ident: 7733 doi: 10.3390/s21113704 – ident: 7736 doi: 10.1186/s13098-021-00767-9 – ident: 7719 doi: 10.1016/j.imu.2016.02.001 – ident: 7727 doi: 10.1186/s12911-020-01318-4 – ident: 7732 doi: 10.1155/2021/5854966 – ident: 7715 doi: 10.1016/j.procs.2016.04.016 – ident: 7731 doi: 10.1186/s13638-020-01765-7 – ident: 7726 doi: 10.35940/ijitee.B7586.129219 – ident: 7728 doi: 10.35940/ijeat.C4819.029320 – ident: 7738 doi: 10.38094/jastt20165 – ident: 7729 doi: 10.5220/0009839405330540 – ident: 7730 – ident: 7718 doi: 10.1590/1517-3151.0608 – ident: 7716 doi: 10.1016/j.procs.2015.10.014 – ident: 7721 – ident: 7734 doi: 10.31590/ejosat.899716 – ident: 7723 doi: 10.1016/j.procs.2019.08.140 – ident: 7739 – ident: 7720 doi: 10.1016/j.procs.2017.08.193 – ident: 7740 doi: 10.58564/IJSER.3.4.2024.275 – ident: 7735 – ident: 7737 – ident: 7714 doi: 10.1007/s13755-019-0095-z |
| SSID | ssj0003313757 |
| Score | 2.2949438 |
| Snippet | Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal... |
| SourceID | doaj crossref |
| SourceType | Open Website Index Database |
| StartPage | 20 |
| SubjectTerms | AES, RSA, ECC, ChaCha20, Hybrid Algorithms and Encryption Algorithms |
| Title | Predicting Diabetes Mellitus with Machine Learning Techniques |
| URI | https://doaj.org/article/89a3097bfc9f4f9f9d2669790afcb166 |
| Volume | 4 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 2710-2165 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0003313757 issn: 2710-2165 databaseCode: DOA dateStart: 20220101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV27TsMwFLVQxcCCQIAoL3lgxG38qu2BAVArQGpVQZG6WX4iloKalu_HdgIqEwtr5CT2uVGuj318LgCXzuJKMoyR5bJCjDqFDKER-UQvOPXGGOlKsQkxmcj5XE03Sn1lTVhjD9wA15fK0EoJG52KLKqofEopSqjKxPSeQTHbroTaIFP5H0wppoKLxmYozYgHrP_w-Dx86rFePn1FeOKq_Fcq2nDsL6lltAd22zkhvGn6sg-2wuIAXE-XeQ8lq5JhK1up4TjbZ67WNczLp3BchJABth6pr3D2bchaH4KX0XB2d4_aWgfIpYzOURScBZnH6J3xdCBiFRK54Nh7EoxQgUeVCF8g1kprjOCGE-UCwVZSSrygR6CzeF-EYwCl5JZ7Y6l3lIUEHWXR0JDuSEFxmHXB1few9UdjaaETFSgo6YKSZprojJJOKHXBbYbmp2n2oy4XUpR0GyX9V5RO_uMhp2AndyoLtbA6A53Vch3Owbb7XL3Vy4vyAXwB8zC01A |
| linkProvider | Directory of Open Access Journals |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Predicting+Diabetes+Mellitus+with+Machine+Learning+Techniques&rft.jtitle=Al-Iraqia+Journal+for+Scientific+Engineering+Research&rft.au=Heba+Ahmed+Jassim&rft.au=Omar+R.+Kadhim&rft.au=Zahraa+Khduair+Taha&rft.au=Johnny+Koh+Siaw+Paw&rft.date=2025-06-19&rft.pub=Al-Iraqia+University+-+College+of+Engineering&rft.eissn=2710-2165&rft.volume=4&rft.issue=2&rft_id=info:doi/10.58564%2FIJSER.4.2.2025.315&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_89a3097bfc9f4f9f9d2669790afcb166 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2710-2165&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2710-2165&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2710-2165&client=summon |