A multiple‐attention refinement network for RGB‐D salient object detection.
Saved in:
| Title: | A multiple‐attention refinement network for RGB‐D salient object detection. |
|---|---|
| Authors: | Jiang, Zijian, Yu, Ling, Li, Junru, Niu, Fanglin |
| Source: | IET Image Processing (Wiley-Blackwell); 12/11/2024, Vol. 18 Issue 14, p4551-4562, 12p |
| Subject Terms: | OBJECT recognition (Computer vision), FEATURE extraction, COMPUTER vision, IMAGE processing, ALGORITHMS |
| Abstract: | Most existing RGB‐D salient object detection (SOD) algorithms, when using depth information for cross‐modal fusion, result in significant information redundancy due to issues with low‐quality depth information, ambiguity, and difficulty in discriminating complex scenes, ultimately leading to poor‐quality saliency maps. This article proposes a Multiple‐Attention Refinement Network (MARNet) to address the issues of insufficient cross‐modal fusion and poor quality of depth images in RGB‐D salient object detection. MARNet adopts an end‐to‐end structure and enables the fusion of cross‐modal features through multiple‐attention refinement and cross‐attention fusion with each other. This article, in particular, designs an Attention Interaction Module (AIM), which uses multiple‐attention and cross‐attention to refine and fuse the two modalities, reducing the information redundancy generated during cross‐modal interactions and background noise interference. This article designs a Multi‐Scale Compensation Module (MSCM) to guide the multi‐scale feature fusion step‐by‐step, enabling the fusion of local and global contexts of multi‐scale features. Extensive experimental results demonstrate that the MARNet in this article has significant advantages over 16 state‐of‐the‐art RGB‐D methods on five publicly available datasets. The codes can be found at https://github.com/wzxxmj/MARNet. [ABSTRACT FROM AUTHOR] |
| Copyright of IET Image Processing (Wiley-Blackwell) is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Biomedical Index |
Be the first to leave a comment!
Full Text Finder
Nájsť tento článok vo Web of Science