An Improved Multi-Imputation Technique Based on Chained Equations and Decision Trees: Application to Wind Energy Conversion Systems

Missing data (MD) is a prevalent issue that researchers and data scientists frequently encounter. It can significantly impact the quality of analyzed data, affecting the relevance of the interpreted results and the inferred conclusions. In response to this challenge, a novel multiimputation techniqu...

Full description

Saved in:
Bibliographic Details
Published in:Advances in Electrical and Computer Engineering Vol. 25; no. 1; pp. 71 - 78
Main Authors: JAFFEL, I., GUERFEL, M., MESSAOUD, H.
Format: Journal Article
Language:English
Published: Suceava Stefan cel Mare University of Suceava 01.02.2025
Subjects:
ISSN:1582-7445, 1844-7600
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Missing data (MD) is a prevalent issue that researchers and data scientists frequently encounter. It can significantly impact the quality of analyzed data, affecting the relevance of the interpreted results and the inferred conclusions. In response to this challenge, a novel multiimputation technique that combines Multivariate Imputation by Chained Equation (MICE) with Decision Tree (DT), namely (MICE-DT), is proposed. This developed method was evaluated against several established imputation techniques, including K-Nearest Neighbors (KNN), K-Means clustering, Decision Tree (DT), and MICE, under the assumption of Missing at Random (MAR). The performance of the MICE-DT algorithm, along with the comparative analysis of the studied techniques, was demonstrated on a Wind Energy Conversion System (WEC), yielding satisfactory results. Index Terms--data preprocessing, decision trees, multidimensional signal processing, statistical analysis, wind energy.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1582-7445
1844-7600
DOI:10.4316/AECE.2025.01008