Evaluating the impact of data preprocessing to develop a robust MEP-based forecasting model for building integrated with PCM

Data quality is a crucial aspect to accurately predict the energy use of buildings utilizing machine learning methods. Data preprocessing can ensure data quality when a database does not match the criteria for evolving a robust prediction model. Regarding phase change material (PCM)-incorporated bui...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Energy (Oxford) Jg. 324; S. 135763
Hauptverfasser: Nazir, Kashif, Memon, Shazim Ali
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Elsevier Ltd 01.06.2025
Schlagworte:
ISSN:0360-5442
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Data quality is a crucial aspect to accurately predict the energy use of buildings utilizing machine learning methods. Data preprocessing can ensure data quality when a database does not match the criteria for evolving a robust prediction model. Regarding phase change material (PCM)-incorporated buildings, there was no study before this research evaluating the impact of data preprocessing for establishing a robust machine learning based model to forecast their energy consumption (EC). Therefore, for the first time, this research presents an application of the data preprocessing process to compare the results of the formulated multi-expression programming (MEP)-based prediction model's accuracy for predicting the EC of PCM-integrated buildings using processed with actual databases. Data cleaning, outlier detection and removal, and data smoothing were performed on the actual EC database during the data preprocessing process. Results of model evaluation and validation processes for the articulated prediction models showed that the data preprocessing improved the MEP-based prediction model by 33 % to predict the EC precisely. Conclusively, model interpretability (sensitivity, parametric, and energy saving analysis) demonstrated that the developed more reliable prediction model provides energy savings of approximately 20 % by integrating optimum PCM. •Evaluated impact of data preprocessing to develop robust prediction model.•Compared MEP's prediction model precision for actual and processed EC database.•Prediction model (MEPPP23) using processed database was the most reliable.•MEPPP23 developed for PCM-integrated building showed R2>95 % with CV-RMSE<10 %.•Performed sensitivity & parametric analysis on best prediction model.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0360-5442
DOI:10.1016/j.energy.2025.135763