Multi‐scale feature fusion pyramid attention network for single image dehazing

Texture and color distortion are common in existing learning‐based dehazing algorithms, and it is argued that one of the major reasons is that the shallow features of fog images are underutilized, and the deep features of fog images are insufficient for single image dehazing. In order to provide mor...

Full description

Saved in:

Bibliographic Details
Published in:	IET image processing Vol. 17; no. 9; pp. 2726 - 2735
Main Authors:	Liu, Jianlei, Liu, Peng, Zhang, Yuanke
Format:	Journal Article
Language:	English
Published:	Wiley 01.07.2023
Subjects:	attention mechanism feature fusion image dehazing pyramid autoencoder
ISSN:	1751-9659, 1751-9667
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Texture and color distortion are common in existing learning‐based dehazing algorithms, and it is argued that one of the major reasons is that the shallow features of fog images are underutilized, and the deep features of fog images are insufficient for single image dehazing. In order to provide more texture and color information for image restoration, more shallow features need to be added in the process of image decoding. Therefore, a multi‐scale feature fusion pyramid attention network (PAN) for single image dehazing is proposed. In PAN, combined with the attention mechanism, a shallow and deep feature fusion (SDF) strategy is designed. SDF considers multi‐scale as well as channel‐level fusion to provide feature information under different receptive fields while also highlighting important channels, such as texture and color information. DC is designed as a latent space mapping module to learn a mapping relationship between the latent space representation of the hazy image at low resolution and the corresponding latent space representation of the haze‐free image. Additionally, network deconvolution (ND) and deformed convolution network (DCN) are introduced into PAN. The ND module can remove pixel‐wise and channel‐wise correlation of features, reduce data redundancy to obtain sparse representation of features, and speed up network convergence. The DCN module can use its adaptive receptive field to focus on the area of interest for calculation and play a role in texture feature enhancement. Finally, the perceptual loss is chosen as the regularization item of the loss function, which makes style features of the restored image closer to the real fog‐free image. Extensive experiments reveal that the proposed PAN outperforms other existing dehazing methods on real‐world and synthetic datasets. Texture and color distortion are common in existing learning‐based dehazing algorithms, and we argue that one of the major reasons is that the shallow features of fog images are underutilized, and the deep features of fog images are insufficient for single image dehazing. In order to provide more texture and color information for image restoration, more shallow features need to be added in the process of image decoding. Therefore, we propose a multi‐scale feature fusion pyramid attention network (PAN) for single image dehazing.
ISSN:	1751-9659 1751-9667
DOI:	10.1049/ipr2.12823