MHAED-Net: a lightweight multiscale hybrid attention encoder-decoder network for the efficient segmentation of industrial forging images
Accurate and efficient segmentation of the boundaries, shapes, and sizes of forgings is crucial for intelligent forging perception. Current image segmentation techniques frequently face challenges in achieving an effective balance between speed and accuracy. Moreover, these techniques often fail to...
Uloženo v:
| Vydáno v: | The Journal of supercomputing Ročník 81; číslo 8; s. 1001 |
|---|---|
| Hlavní autoři: | , , , , , , , , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
New York
Springer US
09.06.2025
Springer Nature B.V |
| Témata: | |
| ISSN: | 1573-0484, 0920-8542, 1573-0484 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | Accurate and efficient segmentation of the boundaries, shapes, and sizes of forgings is crucial for intelligent forging perception. Current image segmentation techniques frequently face challenges in achieving an effective balance between speed and accuracy. Moreover, these techniques often fail to adapt well to the complex working conditions and diverse scales of forgings. In this study, a lightweight multiscale hybrid attention encoder-decoder network (MHAED-Net) is designed for the efficient segmentation of industrial forging images. MHAED-Net is characterized by only 0.076 M parameters and 0.087 Giga Floating-point Operations Per Second. The model employs a novel multiscale hybrid attention block (MHAB) that integrates the convolution normalization activation block and the ShuffleNetV2 block to create an encoder-decoder network. The proposed MHAB integrates CNN-based and Transformer-based attention through a multi-branch fusion approach. It employs dilated convolutions for multi-scale feature learning and incorporates a Lightweight Transformer to capture long-range dependencies. MHAED-Net achieves a mean Intersection over Union of 94.82% and Dice Similarity Coefficient of 97.34% in the segmenting the FORSeg dataset. Extensive experimental results demonstrate that MHAED-Net achieves state-of-the-art performance under complex conditions and multi-scale scenarios, highlighting its significant potential for industrial applications. |
|---|---|
| Bibliografie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 1573-0484 0920-8542 1573-0484 |
| DOI: | 10.1007/s11227-025-07456-8 |