Semantic Segmentation of Sika Deer Antler Image by U-Net Based on Two-Dimensional Discrete Wavelet Transform Fusion and Multi-Attention Mechanism.

Saved in:
Bibliographic Details
Title: Semantic Segmentation of Sika Deer Antler Image by U-Net Based on Two-Dimensional Discrete Wavelet Transform Fusion and Multi-Attention Mechanism.
Authors: Gong, Haotian, Wei, Jinfan, Sun, Yu, Li, Zhipeng, Gong, He, Fan, Juanjuan
Source: Animals (2076-2615); May2025, Vol. 15 Issue 10, p1388, 24p
Subject Terms: SIKA deer, DISCRETE wavelet transforms, EVIDENCE gaps, ANTLERS, GENERALIZATION
Abstract: Simple Summary: Monitoring the antler growth status of sika deer is of great significance for sika deer antler grading and sika deer class identification, and it also has a role in promoting the process of intelligent breeding of sika deer. In this study, a new network model for segmentation of sika deer antlers was developed based on the U-Net by incorporating innovative modules and attention mechanisms. The method was evaluated using datasets of antler images from adult sika deer. It not only has high accuracy in segmentation tasks but is also very friendly to hardware resources. This provides the data and technological support for sika deer antler quality assessment and grading. At present, the monitoring technology of the growth status of sika deer antlers faces many challenges in a complex breeding environment (such as light change, object occlusion, etc.). More importantly, an effective method for the segmentation of sika deer antlers is still lacking, which hinders the development of subsequent quality classification of sika deer antlers. In order to fill the research gap and lay a foundation for future sika deer antler quality classification, this paper proposed an improved semantic segmentation model based on U-Net, named SDAS-Net. In order to improve the segmentation accuracy and generalization ability of the model in a complex environment, we introduced a two-dimensional discrete wavelet transform module (2D-DWT) in the encoder head to reduce noise interference and enhance the ability to capture features. In order to compensate for the loss of feature information caused by 2D-DWT, we embedded the Star Blocks module in the encoder. In addition, the efficient mixed channel attention (EMCA) module was introduced to adaptively enhance key feature channels in the decoder, and the dual cross-attention mechanism (DCA) module was used to fuse high-dimensional features in skip connections. To verify the validity of the model, we constructed a 1055-image sika deer antler dataset (SDR). The experimental results show that compared with the baseline model, the performance of the SDAS-Net model is significantly improved, reaching 92.12% in MIoU and 93.63% in the PA index, and the number of parameters is only increased by 6.9%. The results show that the SDAS-Net model can effectively deal with the task of sika deer antler segmentation in a complex breeding environment while maintaining high precision. [ABSTRACT FROM AUTHOR]
Copyright of Animals (2076-2615) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Biomedical Index
Description
Abstract:Simple Summary: Monitoring the antler growth status of sika deer is of great significance for sika deer antler grading and sika deer class identification, and it also has a role in promoting the process of intelligent breeding of sika deer. In this study, a new network model for segmentation of sika deer antlers was developed based on the U-Net by incorporating innovative modules and attention mechanisms. The method was evaluated using datasets of antler images from adult sika deer. It not only has high accuracy in segmentation tasks but is also very friendly to hardware resources. This provides the data and technological support for sika deer antler quality assessment and grading. At present, the monitoring technology of the growth status of sika deer antlers faces many challenges in a complex breeding environment (such as light change, object occlusion, etc.). More importantly, an effective method for the segmentation of sika deer antlers is still lacking, which hinders the development of subsequent quality classification of sika deer antlers. In order to fill the research gap and lay a foundation for future sika deer antler quality classification, this paper proposed an improved semantic segmentation model based on U-Net, named SDAS-Net. In order to improve the segmentation accuracy and generalization ability of the model in a complex environment, we introduced a two-dimensional discrete wavelet transform module (2D-DWT) in the encoder head to reduce noise interference and enhance the ability to capture features. In order to compensate for the loss of feature information caused by 2D-DWT, we embedded the Star Blocks module in the encoder. In addition, the efficient mixed channel attention (EMCA) module was introduced to adaptively enhance key feature channels in the decoder, and the dual cross-attention mechanism (DCA) module was used to fuse high-dimensional features in skip connections. To verify the validity of the model, we constructed a 1055-image sika deer antler dataset (SDR). The experimental results show that compared with the baseline model, the performance of the SDAS-Net model is significantly improved, reaching 92.12% in MIoU and 93.63% in the PA index, and the number of parameters is only increased by 6.9%. The results show that the SDAS-Net model can effectively deal with the task of sika deer antler segmentation in a complex breeding environment while maintaining high precision. [ABSTRACT FROM AUTHOR]
ISSN:20762615
DOI:10.3390/ani15101388