Multi-task adversarial autoencoder network for face alignment in the wild

•Proposes a novel Multi-task Adversarial Autoencoder (MTAAE) network.•Combines face reconstruction, facial attribute prediction with heatmap regression.•Designs a dynamic weight network to assign a reasonable weight for each task.•Introduces a discriminator on the generated heatmaps.•The proposed wo...

Full description

Saved in:
Bibliographic Details
Published in:Neurocomputing (Amsterdam) Vol. 437; pp. 261 - 273
Main Authors: Yue, Xiaoqian, Li, Jing, Wu, Jia, Chang, Jun, Wan, Jun, Ma, Jinyan
Format: Journal Article
Language:English
Published: Elsevier B.V 21.05.2021
Subjects:
ISSN:0925-2312, 1872-8286
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Proposes a novel Multi-task Adversarial Autoencoder (MTAAE) network.•Combines face reconstruction, facial attribute prediction with heatmap regression.•Designs a dynamic weight network to assign a reasonable weight for each task.•Introduces a discriminator on the generated heatmaps.•The proposed work is competitive with the state-of-the-art methods. Face alignment has been applied widely in the field of computer vision, which is still a very challenging task for the existence of large pose, partial occlusion, and illumination, etc. The method based on deep regression neural network has achieved the most advanced performance in the field of face alignment in recent years, and how to learn more representative facial appearance is the key to face alignment. Based on the idea of Multi-task Learning, we propose a Multi-task Adversarial Autoencoder (MTAAE) network, which can learn more representative facial appearance for heatmap regression and improve the performance of face alignment in the wild. MTAAE is composed of three tasks. The main task uses the heatmap regression method to locate the position of landmarks and introduces a discriminator on the landmark heatmaps to generate more realistic heatmaps. Facial attribute estimation tasks and face reconstruction task based on Adversarial Autoencoder respectively extract discriminative and generative representations to improve the effect of heatmap regression. At the same time, the dynamic weight network is designed to assign a weight coefficient dynamically and reasonably for each auxiliary task. Extensive experiments on 300 W, MTFL, and WFLW datasets demonstrate that our method is more robust in complex environments and outperforms state-of-the-art methods.
ISSN:0925-2312
1872-8286
DOI:10.1016/j.neucom.2021.01.027