The Improved Kurdish Dialect Classification Using Data Augmentation and ANOVA-Based Feature Selection

Analyzing dialects in the Kurdish language proves to be tough because of the tiny phonetic distinctions among the dialects. We applied advanced methods to enhance the precision of Kurdish dialect classification in this research. We examined the dataset’s stability and variation through the use of ti...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:ARO (Koya) Ročník 13; číslo 1; s. 94 - 103
Hlavní autoři: Ghafoor, Karzan J., Taher, Sarkhel H., Hama Rawf, Karwan M., Abdulrahman, Ayub O.
Médium: Journal Article
Jazyk:angličtina
Vydáno: Koya University 07.03.2025
Témata:
ISSN:2410-9355, 2307-549X
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Analyzing dialects in the Kurdish language proves to be tough because of the tiny phonetic distinctions among the dialects. We applied advanced methods to enhance the precision of Kurdish dialect classification in this research. We examined the dataset’s stability and variation through the use of time-stretching and noise-augmenting methods. Analysis of variance (ANOVA) filter approach is applied to improve feature selection (FS) more efficiently and highlight the most relevant features for dialect classification. The ANOVA filter method ranks features based on the means from different dialect groups, which made FS better. To make dialect classification work better, a 1D convolutional neural network model was given a dataset that had ANOVA FS added to it. The model showed a very strong performance, reaching a remarkable accuracy of 99.42%. This noteworthy increase in accuracy beat former research with an accuracy of 95.5%. The findings demonstrate how combining time stretch and FS methods can improve the accuracy of Kurdish dialect classification. This project improves our understanding and implementation of machine learning in the field of linguistic diversity and dialectology.
ISSN:2410-9355
2307-549X
DOI:10.14500/aro.11897