Visual saliency detection via invariant feature constrained stacked denoising autoencoder

Visual saliency detection is usually regarded as an image pre-processing method to predict and locate the position and shape of saliency regions. However, many existing saliency detection methods can only obtain the local or even incorrect position and shape of saliency regions, resulting in incompl...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Multimedia tools and applications Jg. 82; H. 18; S. 27451 - 27472
Hauptverfasser:	Ma, Yunpeng, Yu, Zhihong, Zhou, Yaqin, Xu, Chang, Yu, Dabing
Format:	Journal Article
Sprache:	Englisch
Veröffentlicht:	New York Springer US 01.07.2023 Springer Nature B.V
Schlagworte:	Accuracy Belief networks Bias Computer Communication Networks Computer Science Data Structures and Information Theory Deep learning Image reconstruction Image restoration Image retrieval Image segmentation Invariants Laboratories Methods Multimedia Multimedia Information Systems Neural networks Noise reduction Object recognition Parameters Salience Special Purpose and Application-Based Systems Visual saliency detection Scale invariant feature Saliency prediction Saliency object segmentation Stacked denoising autoencoder Reconstruction network
ISSN:	1380-7501, 1573-7721
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Visual saliency detection is usually regarded as an image pre-processing method to predict and locate the position and shape of saliency regions. However, many existing saliency detection methods can only obtain the local or even incorrect position and shape of saliency regions, resulting in incomplete detection and segmentation of the salient target region. In order to solve this problem, a visual saliency detection method based on scale invariant feature and stacked denoising autoencoder is proposed. Firstly, the deep belief network would be pretrained to initialize the parameters of stacked denoising autoencoder network. Secondly, different from traditional features, scale invariant feature is not limited to the size, resolution, and content of original images. At the same time, it can help the network to restore important features of original images more accurately in multi-scale space. So, scale invariant feature is adopted to design the loss function of the network to complete self-training and update the parameters. Finally, the difference between the final reconstructed image obtained by stacked denoising autoencoder and the original is regarded as the final saliency map. In the experiment, we test the performance of the proposed method in both saliency prediction and saliency object segmentation. The experimental results show that the proposed method has good ability in saliency prediction and has the best performance in saliency object segmentation than other comparison saliency prediction methods and saliency object detection methods.
Bibliographie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1380-7501 1573-7721
DOI:	10.1007/s11042-023-14525-8