TrEnD: A transformer‐based encoder‐decoder model with adaptive patch embedding for mass segmentation in mammograms

Background Breast cancer is one of the most prevalent malignancies diagnosed in women. Mammogram inspection in the search and delineation of breast tumors is an essential prerequisite for a reliable diagnosis. However, analyzing mammograms by radiologists is time‐consuming and prone to errors. There...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Medical physics (Lancaster) Ročník 50; číslo 5; s. 2884 - 2899
Hlavní autori: Liu, Dongdong, Wu, Bo, Li, Changbo, Sun, Zheng, Zhang, Nan
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: United States 01.05.2023
Predmet:
ISSN:0094-2405, 2473-4209, 2473-4209
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Background Breast cancer is one of the most prevalent malignancies diagnosed in women. Mammogram inspection in the search and delineation of breast tumors is an essential prerequisite for a reliable diagnosis. However, analyzing mammograms by radiologists is time‐consuming and prone to errors. Therefore, the development of computer‐aided diagnostic (CAD) systems to automate the mass segmentation procedure is greatly expected. Purpose Accurate breast mass segmentation in mammograms remains challenging in CAD systems due to the low contrast, various shapes, and fuzzy boundaries of masses. In this paper, we propose a fully automatic and effective mass segmentation model based on deep learning for improving segmentation performance. Methods We propose an effective transformer‐based encoder‐decoder model (TrEnD). Firstly, we introduce a lightweight method for adaptive patch embedding (APE) of the transformer, which utilizes superpixels to adaptively adjust the size and position of each patch. Secondly, we introduce a hierarchical transformer‐encoder and attention‐gated‐decoder structure, which is beneficial for progressively suppressing interference feature activations in irrelevant background areas. Thirdly, a dual‐branch design is employed to extract and fuse globally coarse and locally fine features in parallel, which could capture the global contextual information and ensure the relevance and integrity of local information. The model is evaluated on two public datasets CBIS‐DDSM and INbreast. To further demonstrate the robustness of TrEnD, different cropping strategies are applied to these datasets, termed tight, loose, maximal, and mix‐frame. Finally, ablation analysis is performed to assess the individual contribution of each module to the model performance. Results The proposed segmentation model provides a high Dice coefficient and Intersection over Union (IoU) of 92.20% and 85.81% on the mix‐frame CBIS‐DDSM, while 91.83% and 85.29% for the mix‐frame INbreast, respectively. The segmentation performance outperforms the current state‐of‐the‐art approaches. By adding the APE and attention‐gated module, the Dice and IoU have improved by 6.54% and 10.07%. Conclusion According to extensive qualitative and quantitative assessments, the proposed network is effective for automatic breast mass segmentation, and has adequate potential to offer technical assistance for subsequent clinical diagnoses.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0094-2405
2473-4209
2473-4209
DOI:10.1002/mp.16216