Synthetic Time-Series Data Generation for Smart Grids Using 3D Autoencoder GAN

Given the growing significance of data-driven approaches in analysis and decision-making in smart grid, the availability of diverse and representative datasets is paramount. However, challenges such as privacy concerns, data size limitations, and data quality issues have constrained the usage of rea...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE transactions on industrial informatics Ročník 21; číslo 7; s. 5047 - 5058
Hlavní autoři: Zhang, Guihai, Sikdar, Biplab
Médium: Journal Article
Jazyk:angličtina
Vydáno: Piscataway IEEE 01.07.2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Témata:
ISSN:1551-3203, 1941-0050
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Given the growing significance of data-driven approaches in analysis and decision-making in smart grid, the availability of diverse and representative datasets is paramount. However, challenges such as privacy concerns, data size limitations, and data quality issues have constrained the usage of real-world data. In this paper, we introduce the 3D Autoencoder Generative Adversarial Network (3DAE GAN) as a solution to generate high-resolution and multivariate synthetic time-series data capable of representing various real power consumption patterns across different households and driving data for Electric Vehicles (EVs). Beyond the conventional GAN structure, the incorporation of both the Autoencoder and 3D-convolution processes enables a more comprehensive extraction of patterns in data, thereby addressing limitations present in existing data generation methods. Evaluation results using the Pecan Street and Emobpy simulated EV dataset demonstrate that the proposed method generates synthetic data with higher similarity scores compared to existing approaches. Furthermore, downstream prediction tasks are conducted to establish the comparability between using the original data and the synthetic data, revealing no significant differences. Moreover, the risk of possible information leakage from synthetic data about original data is evaluated by performing membership inference attacks and population attacks on the prediction models that are trained with synthetic data. The robustness of the synthetic data are examined when facing FGSM attacks.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1551-3203
1941-0050
DOI:10.1109/TII.2025.3538115