The Improved Kurdish Dialect Classification Using Data Augmentation and ANOVA-Based Feature Selection

Analyzing dialects in the Kurdish language proves to be tough because of the tiny phonetic distinctions among the dialects. We applied advanced methods to enhance the precision of Kurdish dialect classification in this research. We examined the dataset’s stability and variation through the use of ti...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:ARO (Koya) Ročník 13; číslo 1; s. 94 - 103
Hlavní autori: Ghafoor, Karzan J., Taher, Sarkhel H., Hama Rawf, Karwan M., Abdulrahman, Ayub O.
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Koya University 07.03.2025
Predmet:
ISSN:2410-9355, 2307-549X
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Analyzing dialects in the Kurdish language proves to be tough because of the tiny phonetic distinctions among the dialects. We applied advanced methods to enhance the precision of Kurdish dialect classification in this research. We examined the dataset’s stability and variation through the use of time-stretching and noise-augmenting methods. Analysis of variance (ANOVA) filter approach is applied to improve feature selection (FS) more efficiently and highlight the most relevant features for dialect classification. The ANOVA filter method ranks features based on the means from different dialect groups, which made FS better. To make dialect classification work better, a 1D convolutional neural network model was given a dataset that had ANOVA FS added to it. The model showed a very strong performance, reaching a remarkable accuracy of 99.42%. This noteworthy increase in accuracy beat former research with an accuracy of 95.5%. The findings demonstrate how combining time stretch and FS methods can improve the accuracy of Kurdish dialect classification. This project improves our understanding and implementation of machine learning in the field of linguistic diversity and dialectology.
ISSN:2410-9355
2307-549X
DOI:10.14500/aro.11897