Convolutional autoencoder based on latent subspace projection for anomaly detection

Image anomaly detection (AD) is widely researched on many occasions in computer vision tasks. High-dimensional data, such as image data, with noise and complex background is still challenging to detect anomalies under the situation that imbalanced or incomplete data are available. Some deep learning...

Full description

Saved in:
Bibliographic Details
Published in:Methods (San Diego, Calif.) Vol. 214; pp. 48 - 59
Main Authors: Yu, Qien, Li, Chen, Zhu, Ye, Kurita, Takio
Format: Journal Article
Language:English
Published: United States Elsevier Inc 01.06.2023
Subjects:
ISSN:1046-2023, 1095-9130, 1095-9130
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Image anomaly detection (AD) is widely researched on many occasions in computer vision tasks. High-dimensional data, such as image data, with noise and complex background is still challenging to detect anomalies under the situation that imbalanced or incomplete data are available. Some deep learning methods can be trained in an unsupervised way and map the original input into low-dimensional manifolds to predict larger differences in anomalies according to normal ones by dimension reduction. However, training a single low-dimension latent space is limited to present the low-dimensional features due to the fact that the noise and irreverent features are mapped into this space, resulting in that the manifolds are not discriminative for detecting anomalies. To address this problem, a new autoencoder framework is proposed in this study with two trainable mutually orthogonal complementary subspaces in the latent space, by latent subspace projection (LSP) mechanism, which is named as LSP-CAE. Specifically, latent subspace projection is used to train the latent image subspace (LIS) and the latent kernel subspace (LKS) in the latent space of the autoencoder-like model respectively, which can enhance learning power of different features from the input instance. The features of normal data are projected into the latent image subspace, while the latent kernel subspace is trained to extract the irrelevant information according to normal features by end-to-end training. To verify the generality and effectiveness of the proposed method, we replace the convolutional network with the fully-connected network contucted in the real-world medical datasets. The anomaly score based on projection norms in two subspace is used to evaluate the anomalies in the testing. Consequently, our proposed method can achieve the best performance according to four public datasets in comparison of the state-of-the-art methods. •We propose a latent subspace projection (LSP) constraints by incorporating deep encoder-decoder structure.•Complementary orthogonal subspaces can be trained in the end-to-end fashion to learn discriminative latent manifolds.•The proposed method shows the best performance in comparison of the state-of-the-art methods on public datasets.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1046-2023
1095-9130
1095-9130
DOI:10.1016/j.ymeth.2023.04.007