Evaluation multi label feature selection for text classification using weighted borda count approach

Due to the existence of text data, multi-label (ML) text classification is an essential task in machine learning. Feature selection is an essential and effective preprocess to enhance the learning process. Choosing a Multi-Label Feature Selection (MLFS) algorithm is the most basic, critical, and sen...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Iranian Joint Congress on Fuzzy and Intelligent Systems (Online) S. 1 - 6
Hauptverfasser: Miri, Mohsen, Dowlatshahi, Mohammad Bagher, Hashemi, Amin
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 02.03.2022
Schlagworte:
ISSN:2771-1374
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Due to the existence of text data, multi-label (ML) text classification is an essential task in machine learning. Feature selection is an essential and effective preprocess to enhance the learning process. Choosing a Multi-Label Feature Selection (MLFS) algorithm is the most basic, critical, and sensitive choice in ML classification operations. If this choice is based on a criterion, it cannot be attributed to always being sound. Choosing the best algorithm must be evaluated using several different criteria to be examined from different aspects. In this article, we turn the issue into an election and use the Weighted Borda Count method for voting. We do the voting in three stages continuously so that a subset of different features does the voting. In the second stage, voting of different methods is done with six criteria, and each criterion selects the methods in order of priority from the beginning to the end. Voting steps 1 and 2 are performed on eighteen text datasets used. Finally, in the final voting stage, the methods are evaluated and voted on by different text datasets. The final result of the voting in the third stage shows the desired MLFS methods based on their performance from beginning to end. According to the experiments performed and the results obtained, it can be seen that the selection of the algorithm based on several different criteria and considering the overall performance of the algorithm will be better than the selection based on one criterion.
AbstractList Due to the existence of text data, multi-label (ML) text classification is an essential task in machine learning. Feature selection is an essential and effective preprocess to enhance the learning process. Choosing a Multi-Label Feature Selection (MLFS) algorithm is the most basic, critical, and sensitive choice in ML classification operations. If this choice is based on a criterion, it cannot be attributed to always being sound. Choosing the best algorithm must be evaluated using several different criteria to be examined from different aspects. In this article, we turn the issue into an election and use the Weighted Borda Count method for voting. We do the voting in three stages continuously so that a subset of different features does the voting. In the second stage, voting of different methods is done with six criteria, and each criterion selects the methods in order of priority from the beginning to the end. Voting steps 1 and 2 are performed on eighteen text datasets used. Finally, in the final voting stage, the methods are evaluated and voted on by different text datasets. The final result of the voting in the third stage shows the desired MLFS methods based on their performance from beginning to end. According to the experiments performed and the results obtained, it can be seen that the selection of the algorithm based on several different criteria and considering the overall performance of the algorithm will be better than the selection based on one criterion.
Author Hashemi, Amin
Miri, Mohsen
Dowlatshahi, Mohammad Bagher
Author_xml – sequence: 1
  givenname: Mohsen
  surname: Miri
  fullname: Miri, Mohsen
  email: miri.mo@fe.lu.ac.ir
  organization: Lorestan University,Faculty of Engineering,Department of Computer Engineering,Khorramabad,Iran
– sequence: 2
  givenname: Mohammad Bagher
  surname: Dowlatshahi
  fullname: Dowlatshahi, Mohammad Bagher
  email: dowlatshahi.mb@lu.ac.ir
  organization: Lorestan University,Faculty of Engineering,Department of Computer Engineering,Khorramabad,Iran
– sequence: 3
  givenname: Amin
  surname: Hashemi
  fullname: Hashemi, Amin
  email: hashemi.am@fe.lu.ac.ir
  organization: Lorestan University,Faculty of Engineering,Department of Computer Engineering,Khorramabad,Iran
BookMark eNotkMtKAzEYhaMo2NY-gSB5gRmTTG6zlNJqoeBCXZd_Mn_aSDpTJhkvb6_Yrg6cj_MtzpRcdX2HhNxzVnLO6ofFav2qpDGyFEyIsjZKS20uyJRr_ddbI_glmQhjeMErI2_IPKUPxlglmGRCTUi7_IQ4Qg59Rw9jzIFGaDBSj5DHAWnCiO6f-n6gGb8zdRFSCj6402pModvRLwy7fcaWNv3QAnX92GUKx-PQg9vfkmsPMeH8nDPyvlq-LZ6LzcvTevG4KQLnNhfWK4mmUaiFakQrtTAordLKV8xqz6AGLR0HXzWKN7bmEkRjlFOibaVStpqRu5M3IOL2OIQDDD_b8ynVLwLsWiE
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/CFIS54774.2022.9756467
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1665478721
9781665478724
EISSN 2771-1374
EndPage 6
ExternalDocumentID 9756467
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
OCL
RIE
RIL
ID FETCH-LOGICAL-i118t-8f54e7b5e625b2d4627e48565f3086f0a9a64c1af3b51b8914a2b75c52dd45583
IEDL.DBID RIE
IngestDate Wed Aug 27 02:24:19 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i118t-8f54e7b5e625b2d4627e48565f3086f0a9a64c1af3b51b8914a2b75c52dd45583
PageCount 6
ParticipantIDs ieee_primary_9756467
PublicationCentury 2000
PublicationDate 2022-March-2
PublicationDateYYYYMMDD 2022-03-02
PublicationDate_xml – month: 03
  year: 2022
  text: 2022-March-2
  day: 02
PublicationDecade 2020
PublicationTitle Iranian Joint Congress on Fuzzy and Intelligent Systems (Online)
PublicationTitleAbbrev CFIS
PublicationYear 2022
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003204025
Score 1.8287859
Snippet Due to the existence of text data, multi-label (ML) text classification is an essential task in machine learning. Feature selection is an essential and...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Classification algorithms
Feature extraction
Machine learning
Machine learning algorithms
Multi-label feature selection
Task analysis
Text categorization
Text classification
Voting
Weighted Borda Count
Title Evaluation multi label feature selection for text classification using weighted borda count approach
URI https://ieeexplore.ieee.org/document/9756467
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEB7a4sGTSiu-ycGjW7t5bHbPYlGQUvBBbyWPiRSklT707ztJ1xXBi5dl2WUIZCbMI998A3AZpBOWW5fZkNtM5qXPrDdFepDD0zmmMZ0vD3o0KieTatyCq6YXBhET-Az78TXd5fuF28RS2XWlVUEHuw1trYttr1ZTTxGczJGrugk4H1RkNfePSlJ4Q1kg5_1a-NcUleREhnv_W34fej_deGzc-JkDaOG8C_62oelmCRTISJ34xgImpk62SvNt4l-KSlmEdzAXA-WIDNpKRcT7K_tMpVH0jGzBG5ZGR7BvovEePA9vn27usnpiQjajRGGdlUFJ1FYhZTWWe1lwjbKkmC0ISl3CwFSmkC43QViV27LKpeFWK6e491KpUhxCZ76Y4xEwMTAYG-W9oRNrRGTVLyoeSM5Ip609hm7coen7lhRjWm_Oyd-fT2E3KiGBt_gZdNbLDZ7DjvtYz1bLi6TJL2XwoYg
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEB5qFfSk0opvc_Do1t1sso-ztLRYS8EqvZU8JiJIK33o33eS1hXBi5dl2WUgZCbMI998A3DthEk11ybSLtGRSAobaauy8CCHlycYxnQ-9_PBoBiPy2ENbqpeGEQM4DNs-ddwl29nZuVLZbdlLjM62FuwLYXg8bpbq6qopJwMkstNG3ASl2Q3vUcpKMChPJDz1kb81xyV4EY6-_9bwAE0f_rx2LDyNIdQw2kDbLsi6mYBFshIofjGHAauTrYIE278X4pLmQd4MONDZY8NWkt5zPsL-wzFUbSMrMEqFoZHsG-q8SY8ddqju260mZkQvVKqsIwKJwXmWiLlNZpbkfEcRUFRm0speXGxKlUmTKJcqmWiizIRiutcGsmtFVIW6RHUp7MpHgNLY4W-Vd4qOrMq9bz6WckdySlhcq1PoOF3aPK-psWYbDbn9O_PV7DbHT30J_3e4P4M9rxCApSLn0N9OV_hBeyYj-XrYn4ZtPoF-wSkzw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Iranian+Joint+Congress+on+Fuzzy+and+Intelligent+Systems+%28Online%29&rft.atitle=Evaluation+multi+label+feature+selection+for+text+classification+using+weighted+borda+count+approach&rft.au=Miri%2C+Mohsen&rft.au=Dowlatshahi%2C+Mohammad+Bagher&rft.au=Hashemi%2C+Amin&rft.date=2022-03-02&rft.pub=IEEE&rft.eissn=2771-1374&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FCFIS54774.2022.9756467&rft.externalDocID=9756467