Deep Clustering With Variational Autoencoder

Bibliographic Details
Published in: IEEE Signal Processing Letters, Vol. 27, pp. 231-235
Main Authors: Lim, Kart-Leong; Jiang, Xudong; Yi, Chenyu
Format: Journal Article
Language: English
Published: New York: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 2020
ISSN: 1070-9908, 1558-2361
Description
Summary: An autoencoder that learns a latent space in an unsupervised manner has many applications in signal processing. However, the latent space of an autoencoder does not pursue the same clustering goal as K-means or GMM. A recent work proposes to artificially re-align each point in the latent space of an autoencoder to its nearest class neighbors during training (Song et al. 2013). The resulting latent space is found to be much more suitable for clustering, since clustering information is used. Inspired by this previous work (Song et al. 2013), in this letter we propose several extensions to this technique. First, we propose a probabilistic approach that generalizes Song's approach, such that Euclidean distance in the latent space is replaced by KL divergence. Second, as a consequence of this generalization, we can now use probability distributions as inputs rather than points in the latent space. Third, we propose using a Bayesian Gaussian mixture model for clustering in the latent space. We demonstrate our proposed method on the digit recognition datasets MNIST, USPS, and SVHN, as well as the scene datasets Scene15 and MIT67, with interesting findings.
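The abstract names three ingredients: KL divergence in place of Euclidean distance, probability distributions rather than points as inputs, and a Bayesian Gaussian mixture model for clustering the latent space. Below is a minimal sketch of the latter two ideas, not the authors' implementation: it assumes synthetic diagonal-Gaussian latent posteriors standing in for a trained VAE encoder, uses scikit-learn's BayesianGaussianMixture, and the kl_diag_gaussians helper and all variable names are hypothetical.

```python
# Sketch (assumed, not the paper's code): cluster VAE-style latent posteriors with a
# Bayesian GMM and score each sample by KL divergence to its assigned cluster Gaussian.
import numpy as np
from sklearn.mixture import BayesianGaussianMixture

def kl_diag_gaussians(mu0, var0, mu1, var1):
    """KL( N(mu0, diag(var0)) || N(mu1, diag(var1)) ) for diagonal Gaussians."""
    return 0.5 * np.sum(np.log(var1 / var0) + (var0 + (mu0 - mu1) ** 2) / var1 - 1.0)

rng = np.random.default_rng(0)
n, d = 500, 10                                # stand-ins for encoder outputs q(z|x)
latent_mu = rng.normal(size=(n, d))           # posterior means
latent_var = 0.1 + rng.random((n, d))         # posterior variances (diagonal)

# Bayesian GMM clusters the latent means; superfluous components get near-zero weight.
bgmm = BayesianGaussianMixture(n_components=10, covariance_type="diag", random_state=0)
labels = bgmm.fit_predict(latent_mu)

# Clustering term: KL from each sample's posterior to its assigned cluster Gaussian.
# In the spirit of the abstract, such a term could augment the VAE objective so that
# posteriors are pulled toward their clusters instead of using Euclidean distance.
cluster_kl = np.mean([
    kl_diag_gaussians(latent_mu[i], latent_var[i],
                      bgmm.means_[labels[i]], bgmm.covariances_[labels[i]])
    for i in range(n)
])
print(f"{len(np.unique(labels))} active clusters, mean KL to cluster: {cluster_kl:.3f}")
```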
DOI: 10.1109/LSP.2020.2965328