Semi-Supervised Deep Conditional Variational Autoencoder for Soft Sensor Modeling

Variational autoencoder (VAE) as an unsupervised deep generated model has been widely applied to process modeling for industrial processes due to its excellent ability in nonlinear and uncertain feature extraction. However, soft sensor based on VAE model faces three challenges. First, the constructe...

Full description

Saved in:
Bibliographic Details
Published in:IEEE sensors journal Vol. 24; no. 5; pp. 7153 - 7164
Main Authors: Tang, Xiaochu, Yan, Jiawei, Li, Yuan, Zhang, Xinmin, Song, Zhihuan
Format: Journal Article
Language:English
Published: New York IEEE 01.03.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects:
ISSN:1530-437X, 1558-1748
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Variational autoencoder (VAE) as an unsupervised deep generated model has been widely applied to process modeling for industrial processes due to its excellent ability in nonlinear and uncertain feature extraction. However, soft sensor based on VAE model faces three challenges. First, the constructed supervised VAE model makes it difficult to describe the correlation between input and output based on self-network. Second, the output of the VAE may suffer from instability and uncontrollability. In addition, the limited labeled data in industries are the third challenge. To solve the above problems, a semi-supervised deep conditional VAE (SS-DCVAE) is constructed for soft sensor based on a supervised DCVAE (S-DCVAE) and an unsupervised DCVAE (U-DCVAE). The S-DCVAE model is constructed by injecting unlabeled data, the actual labels, and estimated labels as constraint conditions from the preneural network. Based on such a conditional supervised structure, the input-output correlation can be strengthened and the generated data can be controlled toward the aim direction. Furthermore, the U-DCVAE model can be built by making the latent distribution as similar as possible to S-DCVAE, as well as only using unlabeled data with corresponding estimated labels. In this way, the unlabeled data can be fully utilized and online prediction can be achieved. Finally, combining the decoder of S-DCVAE model with the encoder of U-DCVAE, the SS-DCVAE model is constructed with both advantages. The effectiveness and superiority of the SS-DCVAE model are demonstrated by comparing the prediction results of the proposed model with other deep learning methods based on industrial cases.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1530-437X
1558-1748
DOI:10.1109/JSEN.2024.3351431