FSE-Net: Rethinking the up-sampling operation in encoder-decoder structure for retinal vessel segmentation
•We present an efficient medical image segmentation framework called FSE-Net for retinal vessel segmentation, which eliminates the up-sampling operation.•To enhance the feature extraction capability in the encoder stage, we introduce the Residual Feature Separable Block (RFSB) module, which effectiv...
Gespeichert in:
| Veröffentlicht in: | Biomedical signal processing and control Jg. 90; S. 105861 |
|---|---|
| Hauptverfasser: | , , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
Elsevier Ltd
01.04.2024
|
| Schlagworte: | |
| ISSN: | 1746-8094, 1746-8108 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | •We present an efficient medical image segmentation framework called FSE-Net for retinal vessel segmentation, which eliminates the up-sampling operation.•To enhance the feature extraction capability in the encoder stage, we introduce the Residual Feature Separable Block (RFSB) module, which effectively extracts and refines low-level features, increasing the receptive field of the network.•We introduce the Multi-Head Feature Fusion (MFF) module, which effectively merges low-level and high-level features, streamlining encoder-decoder structure.
Automatic retinal vessel segmentation plays a crucial role in the diagnosis and assessment of various ophthalmologic diseases. Currently, the primary retinal vessel segmentation algorithms are based on the encoder-decoder structure. However, these U-Net analogs suffer from the loss of both spatial and semantic information, caused by continuous up-sampling operations in the decoder structure. In this paper, we rethink the above problem and build a novel deep neural network for retinal vessel segmentation, called FSE-Net. Specifically, to address the issue of feature information loss and enhance the performance of retinal vessel segmentation, we eliminate the decoder structure. In particular, we introduced a multi-head feature fusion block (MFF) as a substitute for the continuous up-sampling operation. Additionally, the encoder stage of FSE-Net incorporates a residual feature separable block (RFSB) to further refine and distill features, thereby enhancing the capability of feature extraction. Subsequently, we employ a residual atrous spatial feature aggregate module (RASF) to expand the network's receptive field by incorporating multi-scale feature information. We conducted experiments on five widely recognized databases for retinal vessel segmentation, namely DRIVE, CHASEDB1, STARE, IOSTAR, and LES-AV. The results demonstrate that our proposed FSE-Net outperforms state-of-the-art approaches in terms of segmentation performance. Moreover, we demonstrate the feasibility of achieving superior segmentation performance without employing the traditional U-Net analog network structure. |
|---|---|
| ISSN: | 1746-8094 1746-8108 |
| DOI: | 10.1016/j.bspc.2023.105861 |