MHAED-Net: a lightweight multiscale hybrid attention encoder-decoder network for the efficient segmentation of industrial forging images

Accurate and efficient segmentation of the boundaries, shapes, and sizes of forgings is crucial for intelligent forging perception. Current image segmentation techniques frequently face challenges in achieving an effective balance between speed and accuracy. Moreover, these techniques often fail to...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of supercomputing Jg. 81; H. 8; S. 1001
Hauptverfasser: Wan, Miao, Lin, Y. C., Li, Shu-Xin, Wu, Gui-Cheng, Zeng, Ning-Fu, Zhang, Song, Chen, Ming-Song, Li, Chao, Zhan, Xiao-Dong, Qiu, Yu-Liang
Format: Journal Article
Sprache:Englisch
Veröffentlicht: New York Springer US 09.06.2025
Springer Nature B.V
Schlagworte:
ISSN:1573-0484, 0920-8542, 1573-0484
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Accurate and efficient segmentation of the boundaries, shapes, and sizes of forgings is crucial for intelligent forging perception. Current image segmentation techniques frequently face challenges in achieving an effective balance between speed and accuracy. Moreover, these techniques often fail to adapt well to the complex working conditions and diverse scales of forgings. In this study, a lightweight multiscale hybrid attention encoder-decoder network (MHAED-Net) is designed for the efficient segmentation of industrial forging images. MHAED-Net is characterized by only 0.076 M parameters and 0.087 Giga Floating-point Operations Per Second. The model employs a novel multiscale hybrid attention block (MHAB) that integrates the convolution normalization activation block and the ShuffleNetV2 block to create an encoder-decoder network. The proposed MHAB integrates CNN-based and Transformer-based attention through a multi-branch fusion approach. It employs dilated convolutions for multi-scale feature learning and incorporates a Lightweight Transformer to capture long-range dependencies. MHAED-Net achieves a mean Intersection over Union of 94.82% and Dice Similarity Coefficient of 97.34% in the segmenting the FORSeg dataset. Extensive experimental results demonstrate that MHAED-Net achieves state-of-the-art performance under complex conditions and multi-scale scenarios, highlighting its significant potential for industrial applications.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1573-0484
0920-8542
1573-0484
DOI:10.1007/s11227-025-07456-8