Medical Image Segmentation via Sparse Coding Decoder

Transformers have achieved significant success in medical image segmentation, owing to their capability to capture long-range dependencies. Previous studies have employed either pure Transformer or hybrid CNN-Transformer architectures in the encoder module to enhance their ability to extract more co...

Full description

Saved in:

Bibliographic Details
Published in:	Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) pp. 1 - 5
Main Authors:	Zeng, Long, Zhu, Mingwei, Wu, Kaigui, Li, Zefang
Format:	Conference Proceeding
Language:	English
Published:	IEEE 06.04.2025
Subjects:	Biomedical imaging Convolution Convolutional codes Decoding Feature extraction Image coding Image segmentation Medical image segmentation Noise Sparse Coding Transformer Transformers Vectors
ISSN:	2379-190X
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Abstract	Transformers have achieved significant success in medical image segmentation, owing to their capability to capture long-range dependencies. Previous studies have employed either pure Transformer or hybrid CNN-Transformer architectures in the encoder module to enhance their ability to extract more complex features. However, these models still exhibit limitations in fine-grained local feature extraction and effectively suppressing irrelevant information. To address this issue, a convolution sparse vector coding-based decoder is proposed, namely the CAScaded multi-layer Convolutional Sparse vector Coding DEcoder (CASCSCDE), which suppresses noise by refining the feature representation to be more sparse and accurate through sparse coding and localized convolution, effectively minimizing less important, noisy components. To demonstrate the effectiveness and versatility of our CASCSCDE, we incorporate our decoder into both pure Transformer and hybrid CNN-Transformer models, such as SwinUNet and TransUNet. Our experiments demonstrate that integrating CASCSCDE into the models significantly enhances segmentation performance. The CASCSCDE opens new ways for constructing decoders based on convolutional sparse vector coding.
AbstractList	Transformers have achieved significant success in medical image segmentation, owing to their capability to capture long-range dependencies. Previous studies have employed either pure Transformer or hybrid CNN-Transformer architectures in the encoder module to enhance their ability to extract more complex features. However, these models still exhibit limitations in fine-grained local feature extraction and effectively suppressing irrelevant information. To address this issue, a convolution sparse vector coding-based decoder is proposed, namely the CAScaded multi-layer Convolutional Sparse vector Coding DEcoder (CASCSCDE), which suppresses noise by refining the feature representation to be more sparse and accurate through sparse coding and localized convolution, effectively minimizing less important, noisy components. To demonstrate the effectiveness and versatility of our CASCSCDE, we incorporate our decoder into both pure Transformer and hybrid CNN-Transformer models, such as SwinUNet and TransUNet. Our experiments demonstrate that integrating CASCSCDE into the models significantly enhances segmentation performance. The CASCSCDE opens new ways for constructing decoders based on convolutional sparse vector coding.
Author	Wu, Kaigui Zhu, Mingwei Zeng, Long Li, Zefang
Author_xml	– sequence: 1 givenname: Long surname: Zeng fullname: Zeng, Long email: longzeng@stu.cqu.edu.cn organization: Chongqing University,School of Computer Science,Chongqing,China – sequence: 2 givenname: Mingwei surname: Zhu fullname: Zhu, Mingwei email: mingweizhu@stu.cqu.edu.cn organization: Chongqing University,School of Energy and Power Engineering,Chongqing,China – sequence: 3 givenname: Kaigui surname: Wu fullname: Wu, Kaigui email: kaiguiwu@cqu.edu.cn organization: Chongqing University,School of Computer Science,Chongqing,China – sequence: 4 givenname: Zefang surname: Li fullname: Li, Zefang email: zefangli@cqu.edu.cn organization: Chongqing University Qianjiang Hospital,Chongqing,China
BookMark	eNo1z19LwzAUBfAoCm5z38CH-AFab3KTNnmU-m8w2aB78G2kyW2JrO1oi-C3t6A-nYcDh99Zsquu74ixewGpEGAfNsVjWe6VzTJIJUidCjDGygwu2Nrm1qAGzEyuxCVbSMxtIix83LDlOH4CwFyYBVPvFKJ3J75pXUO8pKalbnJT7Dv-FR0vz24YiRd9iF3Dn8j3gYZbdl2700jrv1yxw8vzoXhLtrvXGbVNosUp0cajrFQOBLOwRuVqb2si0JXCoEg7pY2tBRKArKrZSQo95WC8yiB4XLG739lIRMfzEFs3fB__T-IPYlpIJA
ContentType	Conference Proceeding
DBID	6IE 6IH CBEJK RIE RIO
DOI	10.1109/ICASSP49660.2025.10889260
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library (IEL) (UW System Shared) IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) (UW System Shared) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISBN	9798350368741
EISSN	2379-190X
EndPage	5
ExternalDocumentID	10889260
Genre	orig-research
GroupedDBID	23M 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ABLEC ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP IPLJI M43 OCL RIE RIL RIO RNS
ID	FETCH-LOGICAL-i93t-58c32b470e0660f34afc9fee05b43d4e5a4589f13e002bb741e43ce708c460dc3
IEDL.DBID	RIE
IngestDate	Wed Nov 19 08:27:13 EST 2025
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i93t-58c32b470e0660f34afc9fee05b43d4e5a4589f13e002bb741e43ce708c460dc3
PageCount	5
ParticipantIDs	ieee_primary_10889260
PublicationCentury	2000
PublicationDate	2025-April-6
PublicationDateYYYYMMDD	2025-04-06
PublicationDate_xml	– month: 04 year: 2025 text: 2025-April-6 day: 06
PublicationDecade	2020
PublicationTitle	Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998)
PublicationTitleAbbrev	ICASSP
PublicationYear	2025
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0008748
Score	2.294602
Snippet	Transformers have achieved significant success in medical image segmentation, owing to their capability to capture long-range dependencies. Previous studies...
SourceID	ieee
SourceType	Publisher
StartPage	1
SubjectTerms	Biomedical imaging Convolution Convolutional codes Decoding Feature extraction Image coding Image segmentation Medical image segmentation Noise Sparse Coding Transformer Transformers Vectors
Title	Medical Image Segmentation via Sparse Coding Decoder
URI	https://ieeexplore.ieee.org/document/10889260
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LTwMhEJ5oY4xefNX4DiZeqesCCxxNtbGJaZpsD701CzuYHrptatvfL9Bt1YMHb4TwyDBhHsD3AfBQ2lS7UmTUeGdDuXdR1BghqJMlz4wxmheRXf9d9npqONT9GqwesTCIGB-fYSsU411-ObXLcFTmd7hS2gfgu7ArZbYGa23NrpJc7cN9TaL52G0_53mfB_JJnwWmorXp_OsblehFOkf_nP8Ymt94PNLfepoT2MHqFA5_UAmeAa9vXEh34i0EyfFjUqOKKrIaFySf-QwWSXsaRiAvGKDs8yYMOq-D9hutf0SgY80WVCjLUsNlgj5QSBzjhbPaISbCcFZyFAUXSrsnht7OGeODBeTMokyU5VlSWnYOjWpa4QUQlXo1hMaJLrjTQkn0mYO2qrDS-aDuEppB_tFszXkx2oh-9Uf9NRyEVY5vWrIbaCzmS7yFPbtajD_nd1FTX2evkxM
linkProvider	IEEE
linkToHtml	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1JTwIxFH5RNC4XN4y7NfFaHGfamfZoUAIRCclw4EamnVczBwaCwO-3rQPqwYO3pmmbLulb-vp9D-A-16E0OY-pssqGMquiqFKcU5PkLFZKSZZ5dv1u0uuJ4VD2K7C6x8Igov98hg1X9LH8fKIX7qnM3nAhpDXAN2HLpc6q4FprwSsSJnbgrqLRfOg0n9K0zxz9pPUDQ95Ydf-VSMXrkdbBP2dwCPVvRB7pr3XNEWxgeQz7P8gET4BVMRfSGVsZQVJ8H1e4opIsi4ykU-vDImlO3AjkGR2YfVaHQetl0GzTKicCLWQ0p1zoKFQsCdCaCoGJWGa0NIgBVyzKGfKMcSHNY4RW0illzQVkkcYkEJrFQa6jU6iVkxLPgIjQHoRrHMiMGclFgtZ3kFpkOjHWrDuHulv_aPrFejFaLf3ij_pb2G0P3rqjbqf3egl7bsf9D5f4Cmrz2QKvYVsv58XH7Maf2idGDJZc
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+...+IEEE+International+Conference+on+Acoustics%2C+Speech+and+Signal+Processing+%281998%29&rft.atitle=Medical+Image+Segmentation+via+Sparse+Coding+Decoder&rft.au=Zeng%2C+Long&rft.au=Zhu%2C+Mingwei&rft.au=Wu%2C+Kaigui&rft.au=Li%2C+Zefang&rft.date=2025-04-06&rft.pub=IEEE&rft.eissn=2379-190X&rft.spage=1&rft.epage=5&rft_id=info:doi/10.1109%2FICASSP49660.2025.10889260&rft.externalDocID=10889260