Machine Learning-Based Heart Disease Detection with ANOVA Feature Selection

Heart disease(HD) has emerged as one of the most critical health issues that significantly impact human existence. It has become one of the primary causes of mortality worldwide over the past decade. The World Health Organization announced in 2022 that heart disease was the cause of death for nearly...

Full description

Saved in:
Bibliographic Details
Published in:Journal of Al-Qadisiyah for Computer Science and Mathematics Vol. 17; no. 3
Main Authors: Shaker, Fatima, Raad Shaker Alnaily, Rana, Naeem Turky, Saja, Kareem Wanas, Elham, Sadiq Sadon, Saja
Format: Journal Article
Language:English
Published: 30.09.2025
ISSN:2074-0204, 2521-3504
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Heart disease(HD) has emerged as one of the most critical health issues that significantly impact human existence. It has become one of the primary causes of mortality worldwide over the past decade. The World Health Organization announced in 2022 that heart disease was the cause of death for nearly one million people, equivalent to 33% of global mortality. In the current century, there is an increase in the use of non-surgical medical technologies, including artificial intelligence methods in the medical field. Machine learning employs many widely utilized algorithms and techniques that are essential in the rapid and efficient diagnosis of heart issues. However, diagnosing heart disease is a difficult task. The vast and expanding scale of medical datasets has hindered professionals' ability to comprehend the intricate correlations among variables and generate precise predictions. Accordingly, the proposed research aims to examine the role of feature selection techniques in supporting machine learning algorithms and improving model accuracy. A medical database of heart diseases with different features was relied upon. In the first stage, data analysis was conducted to understand the nature of the data and ensure its balance before the classification. This encompassed displaying statistical distributions of the data, identifying missing values, and analyzing the relationships between the variables that are independent and the target variable. This step was followed by implementing feature selection techniques, specifically using the ANOVA algorithm to identify the most pertinent features for heart disease detection. Finally, the machine learning algorithms were used on both the complete and reduced datasets to perform the classification. Accuracy, precision, recall, and F1-score were used to evaluate the trained classifiers. The results also show that when the number of features is reduced, the accuracy of classification models improves slightly compared to models trained on the entire set of features
AbstractList Heart disease(HD) has emerged as one of the most critical health issues that significantly impact human existence. It has become one of the primary causes of mortality worldwide over the past decade. The World Health Organization announced in 2022 that heart disease was the cause of death for nearly one million people, equivalent to 33% of global mortality. In the current century, there is an increase in the use of non-surgical medical technologies, including artificial intelligence methods in the medical field. Machine learning employs many widely utilized algorithms and techniques that are essential in the rapid and efficient diagnosis of heart issues. However, diagnosing heart disease is a difficult task. The vast and expanding scale of medical datasets has hindered professionals' ability to comprehend the intricate correlations among variables and generate precise predictions. Accordingly, the proposed research aims to examine the role of feature selection techniques in supporting machine learning algorithms and improving model accuracy. A medical database of heart diseases with different features was relied upon. In the first stage, data analysis was conducted to understand the nature of the data and ensure its balance before the classification. This encompassed displaying statistical distributions of the data, identifying missing values, and analyzing the relationships between the variables that are independent and the target variable. This step was followed by implementing feature selection techniques, specifically using the ANOVA algorithm to identify the most pertinent features for heart disease detection. Finally, the machine learning algorithms were used on both the complete and reduced datasets to perform the classification. Accuracy, precision, recall, and F1-score were used to evaluate the trained classifiers. The results also show that when the number of features is reduced, the accuracy of classification models improves slightly compared to models trained on the entire set of features
Author Raad Shaker Alnaily, Rana
Naeem Turky, Saja
Kareem Wanas, Elham
Sadiq Sadon, Saja
Shaker, Fatima
Author_xml – sequence: 1
  givenname: Fatima
  surname: Shaker
  fullname: Shaker, Fatima
– sequence: 2
  givenname: Rana
  surname: Raad Shaker Alnaily
  fullname: Raad Shaker Alnaily, Rana
– sequence: 3
  givenname: Saja
  surname: Naeem Turky
  fullname: Naeem Turky, Saja
– sequence: 4
  givenname: Elham
  surname: Kareem Wanas
  fullname: Kareem Wanas, Elham
– sequence: 5
  givenname: Saja
  surname: Sadiq Sadon
  fullname: Sadiq Sadon, Saja
BookMark eNotkMtOwzAQRS1UJErpH7DwDyT4GcfL0FKKCHRBxdaynQk1al2IgxB_T0hZzZx7pZHmXKJJPEZA6JqSnGlOxM37p0-HnBEmc6pyzgRTZ2jKJKMZl0RMhp0okRFGxAWapxQcEUJJqgsyRY9P1u9CBFyD7WKIb9mtTdDg9YA9XoYEA-Il9OD7cIz4O_Q7XD1vXiu8Att_dYBfYH8qr9B5a_cJ5v9zhraru-1indWb-4dFVWdeU5UVygFvCs54KxtdOim1p0Qpp1tJm-GhohV6yMuSSS4dp5ZKELxgApRjtuQzJE5nfXdMqYPWfHThYLsfQ4kZlZhRiflTYqgyoxL-C5vrVds
ContentType Journal Article
DBID AAYXX
CITATION
DOI 10.29304/jqcsm.2025.17.32427
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList CrossRef
DeliveryMethod fulltext_linktorsrc
EISSN 2521-3504
ExternalDocumentID 10_29304_jqcsm_2025_17_32427
GroupedDBID AAYXX
ALMA_UNASSIGNED_HOLDINGS
CITATION
OK1
ID FETCH-LOGICAL-c917-67be3d6323f5d98b559c1077b9f51d3046f4998b882535b31a15e43624e7b2a83
ISSN 2074-0204
IngestDate Wed Nov 05 20:54:11 EST 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Issue 3
Language English
License http://creativecommons.org/licenses/by-nc-nd/4.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c917-67be3d6323f5d98b559c1077b9f51d3046f4998b882535b31a15e43624e7b2a83
OpenAccessLink https://jqcsm.qu.edu.iq/index.php/journalcm/article/download/2427/1121
ParticipantIDs crossref_primary_10_29304_jqcsm_2025_17_32427
PublicationCentury 2000
PublicationDate 2025-09-30
PublicationDateYYYYMMDD 2025-09-30
PublicationDate_xml – month: 09
  year: 2025
  text: 2025-09-30
  day: 30
PublicationDecade 2020
PublicationTitle Journal of Al-Qadisiyah for Computer Science and Mathematics
PublicationYear 2025
SSID ssib044751960
ssib016479590
ssib032177102
ssib046619541
Score 1.9235948
Snippet Heart disease(HD) has emerged as one of the most critical health issues that significantly impact human existence. It has become one of the primary causes of...
SourceID crossref
SourceType Index Database
Title Machine Learning-Based Heart Disease Detection with ANOVA Feature Selection
Volume 17
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2521-3504
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssib044751960
  issn: 2074-0204
  databaseCode: M~E
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3Nb9MwFLfK4MBlAgHiYyAfuFUeSRzPzrGgTUhoBdYKdoteYod26sJou2lc-Kv4A3m2YzegCrEDl6iyrddE76f34fdFyEuhMl03Sc20FoLlXBcMUDGwTBYmqbXWqazcsAk5HqvT0-LDYPAz1MJcLWTbquvr4uK_shrXkNm2dPYG7I5EcQF_I9PxiWzH5z8x_tilR5rQOfULe42KStt6o-XaNtu08RgUM2vjh4S7i9jR-P2n0dCagzaeMHGzcQLDtliuC_YR9Hw1_w4zl6YYRkNESeGzN0JD2Gi2T2bQJXEc4fpGI5wA6KHfROItzP346xNo45ExGHM-nF4u_V3vBM7ilq1nw73PeNrXVixmcN6_zchESL0IQi-z-aG2YNfrJ7-GJgbjolsLUlv20Mm3KQM0ZJLcaoNv9cr2HMjEfir3rQEpN8ovBPz_0IkxUxF9JEendFRKS6VMZemo3CK3MykKK0uPfxwGKWYbtBViE3vk6PTJXpM222ERxV7cz9FCKoQbsRo_3td4uj9-teX1ezZUzxia3iO7HRboyKPvPhmY9gF51yGP_o486pBHO-TRiDxqkUcd8miHPBqR95BMjw6nb96yblYHq9HhZweyMlwf8Iw3QheqQj-1ThMpq6IRqbbR9wZda1WhPye4qHgKqTA5Gk-5kVUGij8iO-3X1jwmVDRZ0lRQaa50zqGBlMtEgUok5FCr9Alh4ePLC9-Rpfwbl57e8PwzcncDzD2ys15emufkTn21nq-WLxyrfwHuhX9A
linkProvider ISSN International Centre
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Machine+Learning-Based+Heart+Disease+Detection+with+ANOVA+Feature+Selection&rft.jtitle=Journal+of+Al-Qadisiyah+for+Computer+Science+and+Mathematics&rft.au=Shaker%2C+Fatima&rft.au=Raad+Shaker+Alnaily%2C+Rana&rft.au=Naeem+Turky%2C+Saja&rft.au=Kareem+Wanas%2C+Elham&rft.date=2025-09-30&rft.issn=2074-0204&rft.eissn=2521-3504&rft.volume=17&rft.issue=3&rft_id=info:doi/10.29304%2Fjqcsm.2025.17.32427&rft.externalDBID=n%2Fa&rft.externalDocID=10_29304_jqcsm_2025_17_32427
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2074-0204&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2074-0204&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2074-0204&client=summon