An Improved Multi-Imputation Technique Based on Chained Equations and Decision Trees: Application to Wind Energy Conversion Systems

Missing data (MD) is a prevalent issue that researchers and data scientists frequently encounter. It can significantly impact the quality of analyzed data, affecting the relevance of the interpreted results and the inferred conclusions. In response to this challenge, a novel multiimputation techniqu...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Advances in Electrical and Computer Engineering Ročník 25; číslo 1; s. 71 - 78
Hlavní autoři: JAFFEL, I., GUERFEL, M., MESSAOUD, H.
Médium: Journal Article
Jazyk:angličtina
Vydáno: Suceava Stefan cel Mare University of Suceava 01.02.2025
Témata:
ISSN:1582-7445, 1844-7600
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Missing data (MD) is a prevalent issue that researchers and data scientists frequently encounter. It can significantly impact the quality of analyzed data, affecting the relevance of the interpreted results and the inferred conclusions. In response to this challenge, a novel multiimputation technique that combines Multivariate Imputation by Chained Equation (MICE) with Decision Tree (DT), namely (MICE-DT), is proposed. This developed method was evaluated against several established imputation techniques, including K-Nearest Neighbors (KNN), K-Means clustering, Decision Tree (DT), and MICE, under the assumption of Missing at Random (MAR). The performance of the MICE-DT algorithm, along with the comparative analysis of the studied techniques, was demonstrated on a Wind Energy Conversion System (WEC), yielding satisfactory results. Index Terms--data preprocessing, decision trees, multidimensional signal processing, statistical analysis, wind energy.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1582-7445
1844-7600
DOI:10.4316/AECE.2025.01008