A comprehensive evaluation of machine learning techniques for cancer class prediction based on microarray data

Prostate cancer is among the most common cancer in males and its heterogeneity is well known. The genomic level changes can be detected in gene expression data and those changes may serve as standard model for any random cancer data for class prediction. Various techniques were implied on prostate c...

Full description

Saved in:
Bibliographic Details
Published in:International journal of bioinformatics research and applications Vol. 11; no. 5; p. 397
Main Authors: Raza, Khalid, Hasan, Atif N
Format: Journal Article
Language:English
Published: Switzerland 2015
Subjects:
ISSN:1744-5485
Online Access:Get more information
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Prostate cancer is among the most common cancer in males and its heterogeneity is well known. The genomic level changes can be detected in gene expression data and those changes may serve as standard model for any random cancer data for class prediction. Various techniques were implied on prostate cancer data set in order to accurately predict cancer class including machine learning techniques. Large number of attributes but few numbers of samples in microarray data leads to poor training; therefore, the most challenging part is attribute reduction or non-significant gene reduction. In this work, a combination of interquartile range and t-test is used for attribute reduction. Further, a comprehensive evaluation of ten state-of-the-art machine learning techniques for their accuracy in class prediction of prostate cancer is done. Out of these techniques, Bayes Network outperformed with an accuracy of 94.11% followed by Naïve Bayes with an accuracy of 91.17%.
AbstractList Prostate cancer is among the most common cancer in males and its heterogeneity is well known. The genomic level changes can be detected in gene expression data and those changes may serve as standard model for any random cancer data for class prediction. Various techniques were implied on prostate cancer data set in order to accurately predict cancer class including machine learning techniques. Large number of attributes but few numbers of samples in microarray data leads to poor training; therefore, the most challenging part is attribute reduction or non-significant gene reduction. In this work, a combination of interquartile range and t-test is used for attribute reduction. Further, a comprehensive evaluation of ten state-of-the-art machine learning techniques for their accuracy in class prediction of prostate cancer is done. Out of these techniques, Bayes Network outperformed with an accuracy of 94.11% followed by Naïve Bayes with an accuracy of 91.17%.
Prostate cancer is among the most common cancer in males and its heterogeneity is well known. The genomic level changes can be detected in gene expression data and those changes may serve as standard model for any random cancer data for class prediction. Various techniques were implied on prostate cancer data set in order to accurately predict cancer class including machine learning techniques. Large number of attributes but few numbers of samples in microarray data leads to poor training; therefore, the most challenging part is attribute reduction or non-significant gene reduction. In this work, a combination of interquartile range and t-test is used for attribute reduction. Further, a comprehensive evaluation of ten state-of-the-art machine learning techniques for their accuracy in class prediction of prostate cancer is done. Out of these techniques, Bayes Network outperformed with an accuracy of 94.11% followed by Naïve Bayes with an accuracy of 91.17%.Prostate cancer is among the most common cancer in males and its heterogeneity is well known. The genomic level changes can be detected in gene expression data and those changes may serve as standard model for any random cancer data for class prediction. Various techniques were implied on prostate cancer data set in order to accurately predict cancer class including machine learning techniques. Large number of attributes but few numbers of samples in microarray data leads to poor training; therefore, the most challenging part is attribute reduction or non-significant gene reduction. In this work, a combination of interquartile range and t-test is used for attribute reduction. Further, a comprehensive evaluation of ten state-of-the-art machine learning techniques for their accuracy in class prediction of prostate cancer is done. Out of these techniques, Bayes Network outperformed with an accuracy of 94.11% followed by Naïve Bayes with an accuracy of 91.17%.
Author Hasan, Atif N
Raza, Khalid
Author_xml – sequence: 1
  givenname: Khalid
  surname: Raza
  fullname: Raza, Khalid
  organization: 1 Department of Computer Science, Jamia Millia Islamia (Central University), New Delhi 110025, India
– sequence: 2
  givenname: Atif N
  surname: Hasan
  fullname: Hasan, Atif N
  organization: 2 Department of Computer Science, Jamia Millia Islamia (Central University), New Delhi 110025, India
BackLink https://www.ncbi.nlm.nih.gov/pubmed/26558300$$D View this record in MEDLINE/PubMed
BookMark eNo1kD1PwzAYhD0U0Q_4ASzII0uKndiJM5aKj6JKSAjm6I39mholTrGTSv33BCjTc8Pd6XRzMvGdR0KuOFtyycTt5vnudbVMGZdLVvBSsAmZ8UKIRAolp2Qe4ydjIk-FPCfTNJdSZYzNiF9R3bX7gDv00R2Q4gGaAXrXedpZ2oLeOY-0QQje-Q_ao9559zVgpLYLVIPXOKKBGOnYYpz-jdYQ0dBRtE6HDkKAIzXQwwU5s9BEvDxxQd4f7t_WT8n25XGzXm0TnbG8T1CB4tqCRMFK4AatFdqWSkldA9SZzQUiR6uUKGqWS64ZM2XJOdQGUzTpgtz89e5D9zO2r1oXNTYNeOyGWPEiy7hSquSj9fpkHeoWTbUProVwrP4_Sr8BLSNsNg
CitedBy_id crossref_primary_10_1049_iet_syb_2018_5023
crossref_primary_10_1080_0952813X_2023_2196984
crossref_primary_10_1002_widm_70038
crossref_primary_10_1007_s11042_021_11653_x
crossref_primary_10_3390_pathogens14080788
crossref_primary_10_23736_S1120_4826_20_02687_7
crossref_primary_10_1016_j_sjbs_2021_04_022
crossref_primary_10_1016_j_prevetmed_2019_104869
crossref_primary_10_1016_j_biosystems_2021_104411
crossref_primary_10_7717_peerj_cs_1194
ContentType Journal Article
DBID NPM
7X8
DOI 10.1504/IJBRA.2015.071940
DatabaseName PubMed
MEDLINE - Academic
DatabaseTitle PubMed
MEDLINE - Academic
DatabaseTitleList PubMed
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Biology
ExternalDocumentID 26558300
Genre Journal Article
GroupedDBID ---
0R~
29J
4.4
53G
5GY
ABJNI
ACGFS
ACIWK
ACPRK
AFRAH
ALMA_UNASSIGNED_HOLDINGS
ALSBL
CS3
DU5
EBS
EJD
F5P
H13
HZ~
MET
MIE
NPM
O9-
P2P
RTD
7X8
ID FETCH-LOGICAL-c306t-e8a81cfa5e409a1deff4cf9885cbaab3f64ee1ef8847b0651c00d9911abde2ed2
IEDL.DBID 7X8
ISSN 1744-5485
IngestDate Fri Jul 11 09:27:42 EDT 2025
Sat Sep 28 08:37:14 EDT 2024
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords Bayes network
prostate cancer
cancer class prediction
microarray analysis
machine learning
bioinformatics
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c306t-e8a81cfa5e409a1deff4cf9885cbaab3f64ee1ef8847b0651c00d9911abde2ed2
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
PMID 26558300
PQID 1733188891
PQPubID 23479
ParticipantIDs proquest_miscellaneous_1733188891
pubmed_primary_26558300
PublicationCentury 2000
PublicationDate 2015-00-00
20150101
PublicationDateYYYYMMDD 2015-01-01
PublicationDate_xml – year: 2015
  text: 2015-00-00
PublicationDecade 2010
PublicationPlace Switzerland
PublicationPlace_xml – name: Switzerland
PublicationTitle International journal of bioinformatics research and applications
PublicationTitleAlternate Int J Bioinform Res Appl
PublicationYear 2015
SSID ssj0046245
Score 2.0886002
Snippet Prostate cancer is among the most common cancer in males and its heterogeneity is well known. The genomic level changes can be detected in gene expression data...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 397
Title A comprehensive evaluation of machine learning techniques for cancer class prediction based on microarray data
URI https://www.ncbi.nlm.nih.gov/pubmed/26558300
https://www.proquest.com/docview/1733188891
Volume 11
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07T8MwELaAgsTC-1FeMhKr1SSOU2dCFaJigKoDoG6RY5-hSE1KUpD67zknaTshIbEkWWJF9sX33d3n-wi5iaSNPBNIxrnPWcghYhJ3PYZgQ2rhDndWecjXx-5gIEejeNgk3MqGVrnYE6uN2uTa5cg7vhMXxHAt9m-nn8ypRrnqaiOhsU5a3I2O9twdLasIYRRUIsUIukOGyFw0VU3hhZ3xBwajjtklXIu-OPR-R5iVp-nv_vcb98hOgzFprzaKfbIG2QHZqlUn54ck61FHJC_gvSav01XHb5pbOqnolUAbPYk3umzzWlJEuFQ7O8Gbg90URzHj6mgEdf7QUHyYOI6fKgo1p45_ekRe-vfPdw-skV1gGuOHGQOppK-tEoCxn_INWBtqG0spdKpUym0UAvhgJTq2FBGMrz3PIMz0VWogABMck40sz-CUUCu1DTgYrbsy1KDTWIKWGJIizkjRM7bJ9WIiEzRrV6tQGeRfZbKayjY5qVcjmdb9N5IgEkJyzzv7w9vnZNstcp00uSAtiz81XJJN_T0bl8VVZS94HQyffgCi_MyN
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+comprehensive+evaluation+of+machine+learning+techniques+for+cancer+class+prediction+based+on+microarray+data&rft.jtitle=International+journal+of+bioinformatics+research+and+applications&rft.au=Raza%2C+Khalid&rft.au=Hasan%2C+Atif+N&rft.date=2015-01-01&rft.issn=1744-5485&rft.volume=11&rft.issue=5&rft.spage=397&rft_id=info:doi/10.1504%2Fijbra.2015.071940&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1744-5485&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1744-5485&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1744-5485&client=summon