Text Classification Using Enhanced Binary Wind Driven Optimization Algorithm

Bibliographic Details
Published in: International journal of advanced computer science & applications, Vol. 16, no. 6
Main Authors: Atwan, Jaffar; Wedyan, Mohammad; Hamadeen, Ahmad; Bsoul, Qusay; Alrosan, Ayat; Alturki, Ryan
Format: Journal Article
Language: English
Published: West Yorkshire: Science and Information (SAI) Organization Limited, 2025
ISSN: 2158-107X, 2156-5570
Description
Summary: Document classification using supervised machine learning is now widely used on the internet and in digital libraries. Several studies have focused on English-language document classification. Arabic text, however, has highly variable morphology, which yields a large number of extracted features and increases the dimensionality of the classification task. To reduce the curse of dimensionality in Arabic text classification, this study proposes a wrapper feature selection (FS) method. Specifically, a hybrid metaheuristic model based on Wind Driven (WD) optimization and Simulated Annealing, known as WDFS, is designed to solve the FS task for Arabic text. The WD method first optimizes the FS task in the exploration phase. WD is then hybridized with simulated annealing, which acts as a local search in the exploitation phase to improve the solutions found by WD. Three classifiers are used to evaluate the features selected by the proposed WDFS: K-Nearest Neighbor, Naïve Bayesian, and Decision Tree. The proposed WDFS method was assessed on four selected groups of files from a benchmark TREC Arabic newswire text dataset. Comparative results showed that WDFS outperforms other existing Arabic text classification methods in terms of accuracy. The results indicate that WDFS has high potential to search the feature space reliably for the optimal combination of features.
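The summary describes a wrapper approach in which candidate feature subsets are scored by a classifier and refined by simulated annealing. The sketch below illustrates only that general idea and is not the authors' WDFS implementation, whose details are not given in this record. Everything in it is an assumption made for illustration: the Wind Driven exploration phase is replaced by a random starting bit mask, a leave-one-out 1-nearest-neighbour classifier stands in for the evaluating classifier, the fitness penalty `alpha`, the cooling schedule, and the synthetic data are all hypothetical.

```python
import math
import random


def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier
    restricted to the features whose mask bit is 1."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    correct = 0
    for i, xi in enumerate(X):
        best_d, best_label = float("inf"), None
        for k, xk in enumerate(X):
            if k == i:
                continue
            d = sum((xi[j] - xk[j]) ** 2 for j in cols)
            if d < best_d:
                best_d, best_label = d, y[k]
        correct += best_label == y[i]
    return correct / len(X)


def fitness(X, y, mask, alpha=0.01):
    """Wrapper fitness: accuracy minus a small penalty on subset size.
    The penalty weight alpha is a hypothetical choice."""
    return loo_1nn_accuracy(X, y, mask) - alpha * sum(mask) / len(mask)


def simulated_annealing_fs(X, y, start_mask, rng, t0=0.1, cooling=0.95, steps=100):
    """Refine a binary feature mask by single-bit flips with SA acceptance."""
    current = list(start_mask)
    cur_fit = fitness(X, y, current)
    best, best_fit = list(current), cur_fit
    t = t0
    for _ in range(steps):
        cand = list(current)
        j = rng.randrange(len(cand))
        cand[j] = 1 - cand[j]  # flip one feature in or out
        cand_fit = fitness(X, y, cand)
        delta = cand_fit - cur_fit
        # Always accept improvements; accept worse moves with probability exp(delta / t).
        if delta >= 0 or rng.random() < math.exp(delta / t):
            current, cur_fit = cand, cand_fit
            if cur_fit > best_fit:
                best, best_fit = list(current), cur_fit
        t *= cooling  # geometric cooling
    return best, best_fit


# Synthetic stand-in data: feature 0 carries the label, features 1-7 are noise.
rng = random.Random(0)
y = [i % 2 for i in range(40)]
X = [[label + rng.gauss(0, 0.3)] + [rng.gauss(0, 1) for _ in range(7)] for label in y]

# A random mask stands in for the solution the exploration phase would supply.
start = [rng.randint(0, 1) for _ in range(8)]
best_mask, best_fit = simulated_annealing_fs(X, y, start, rng)
print("start fitness:", fitness(X, y, start))
print("best mask:", best_mask, "fitness:", best_fit)
```

Because the best mask seen so far is tracked separately from the current one, the returned fitness can never be lower than that of the starting mask, even when the annealing step accepts a worse candidate along the way.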
DOI: 10.14569/IJACSA.2025.01606107