Data Quality Improvement Method for Power Equipment Condition Based on Stacked Denoising Autoencoders Improved by Particle Swarm Optimization

Uložené v:
Podrobná bibliografia
Názov: Data Quality Improvement Method for Power Equipment Condition Based on Stacked Denoising Autoencoders Improved by Particle Swarm Optimization
Autori: JI Rong, HOU Huijuan, SHENG Gehao, ZHANG Lijing, SHU Bo, JIANG Xiuchen
Zdroj: Shanghai Jiaotong Daxue xuebao, Vol 59, Iss 6, Pp 780-788 (2025)
Informácie o vydavateľovi: Editorial Office of Journal of Shanghai Jiao Tong University, 2025.
Rok vydania: 2025
Zbierka: LCC:Engineering (General). Civil engineering (General)
LCC:Chemical engineering
LCC:Naval architecture. Shipbuilding. Marine engineering
Predmety: power equipment, status data, stacked denoising autoencoder, data cleaning, Engineering (General). Civil engineering (General), TA1-2040, Chemical engineering, TP155-156, Naval architecture. Shipbuilding. Marine engineering, VM1-989
Popis: Big data related to power equipment condition is experiencing explosive growth. However, equipment failures and personnel errors result in dirty data, having a negative effect on data quality and subsequent analysis results. Therefore, data cleaning is of great significance. Most existing research focuses on direct identification and elimination of abnormal data, which compromises the integrity of the data. In order to solve this problem, a data cleaning method based on improved stack noise reduction autoencoder is proposed in this paper. First, particle swarm optimization is used to optimize the hyperparameters of the stack noise reduction autoencoder. Then, the characteristics of the autoencoder is used to extract and restore the data features to clean the data. The method improves data quality of power equipment condition by repairing isolated data points and filling in missing data, which is simple and efficient for improving the accuracy and integrity of the data set. Finally, the historical operation data of power equipment is taken as an example. The simulation results show that the proposed method outperforms other classical methods providing good cleaning results for data sets with different abnormal degrees in different running states. The proposed method offers an effective solution for improving the quality of power equipment status data effectively.
Druh dokumentu: article
Popis súboru: electronic resource
Jazyk: Chinese
ISSN: 1006-2467
Relation: https://xuebao.sjtu.edu.cn/article/2025/1006-2467/1006-2467-59-6-780.shtml; https://doaj.org/toc/1006-2467
DOI: 10.16183/j.cnki.jsjtu.2023.388
Prístupová URL adresa: https://doaj.org/article/a9b9c95033014d53b01068f2d1ca54df
Prístupové číslo: edsdoj.9b9c95033014d53b01068f2d1ca54df
Databáza: Directory of Open Access Journals
Popis
Abstrakt:Big data related to power equipment condition is experiencing explosive growth. However, equipment failures and personnel errors result in dirty data, having a negative effect on data quality and subsequent analysis results. Therefore, data cleaning is of great significance. Most existing research focuses on direct identification and elimination of abnormal data, which compromises the integrity of the data. In order to solve this problem, a data cleaning method based on improved stack noise reduction autoencoder is proposed in this paper. First, particle swarm optimization is used to optimize the hyperparameters of the stack noise reduction autoencoder. Then, the characteristics of the autoencoder is used to extract and restore the data features to clean the data. The method improves data quality of power equipment condition by repairing isolated data points and filling in missing data, which is simple and efficient for improving the accuracy and integrity of the data set. Finally, the historical operation data of power equipment is taken as an example. The simulation results show that the proposed method outperforms other classical methods providing good cleaning results for data sets with different abnormal degrees in different running states. The proposed method offers an effective solution for improving the quality of power equipment status data effectively.
ISSN:10062467
DOI:10.16183/j.cnki.jsjtu.2023.388