MCPT: Mixed Convolutional Parallel Transformer for Polarimetric SAR Image Classification
Saved in:
| Title: | MCPT: Mixed Convolutional Parallel Transformer for Polarimetric SAR Image Classification |
|---|---|
| Authors: | Wenke Wang, Jianlong Wang, Bibo Lu, Boyuan Liu, Yake Zhang, Chunyang Wang |
| Source: | Remote Sensing, Vol 15, Iss 11, p 2936 (2023) |
| Publisher Information: | MDPI AG |
| Publication Year: | 2023 |
| Collection: | Directory of Open Access Journals: DOAJ Articles |
| Subject Terms: | polarimetric SAR, convolutional neural network, vision transformer, mixed depthwise convolution tokenization, parallel encoder, global average pooling, Science |
| Description: | Vision transformers (ViT) have the characteristics of massive training data and complex model, which cannot be directly applied to polarimetric synthetic aperture radar (PolSAR) image classification tasks. Therefore, a mixed convolutional parallel transformer (MCPT) model based on ViT is proposed for fast PolSAR image classification. First of all, a mixed depthwise convolution tokenization is introduced. It replaces the learnable linear projection in the original ViT to obtain patch embeddings. The process of tokenization can reduce computational and parameter complexity and extract features of different receptive fields as input to the encoder. Furthermore, combining the idea of shallow networks with lower latency and easier optimization, a parallel encoder is implemented by pairing the same modules and recombining to form parallel blocks, which can decrease the network depth and computing power requirement. In addition, the original class embedding and position embedding are removed during tokenization, and a global average pooling layer is added after the encoder for category feature extraction. Finally, the experimental results on AIRSAR Flevoland and RADARSAT-2 San Francisco datasets show that the proposed method achieves a significant improvement in training and prediction speed. Meanwhile, the overall accuracy achieved was 97.9% and 96.77%, respectively. |
| Document Type: | article in journal/newspaper |
| Language: | English |
| Relation: | https://www.mdpi.com/2072-4292/15/11/2936; https://doaj.org/toc/2072-4292; https://doaj.org/article/1d3f011a206e47dd9deda43a270647b0 |
| DOI: | 10.3390/rs15112936 |
| Availability: | https://doi.org/10.3390/rs15112936 https://doaj.org/article/1d3f011a206e47dd9deda43a270647b0 |
| Accession Number: | edsbas.FC53F6DB |
| Database: | BASE |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://doi.org/10.3390/rs15112936# Name: EDS - BASE (s4221598) Category: fullText Text: View record from BASE – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Wang%20W Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edsbas DbLabel: BASE An: edsbas.FC53F6DB RelevancyScore: 944 AccessLevel: 3 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 943.653564453125 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: MCPT: Mixed Convolutional Parallel Transformer for Polarimetric SAR Image Classification – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Wenke+Wang%22">Wenke Wang</searchLink><br /><searchLink fieldCode="AR" term="%22Jianlong+Wang%22">Jianlong Wang</searchLink><br /><searchLink fieldCode="AR" term="%22Bibo+Lu%22">Bibo Lu</searchLink><br /><searchLink fieldCode="AR" term="%22Boyuan+Liu%22">Boyuan Liu</searchLink><br /><searchLink fieldCode="AR" term="%22Yake+Zhang%22">Yake Zhang</searchLink><br /><searchLink fieldCode="AR" term="%22Chunyang+Wang%22">Chunyang Wang</searchLink> – Name: TitleSource Label: Source Group: Src Data: Remote Sensing, Vol 15, Iss 11, p 2936 (2023) – Name: Publisher Label: Publisher Information Group: PubInfo Data: MDPI AG – Name: DatePubCY Label: Publication Year Group: Date Data: 2023 – Name: Subset Label: Collection Group: HoldingsInfo Data: Directory of Open Access Journals: DOAJ Articles – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22polarimetric+SAR%22">polarimetric SAR</searchLink><br /><searchLink fieldCode="DE" term="%22convolutional+neural+network%22">convolutional neural network</searchLink><br /><searchLink fieldCode="DE" term="%22vision+transformer%22">vision transformer</searchLink><br /><searchLink fieldCode="DE" term="%22mixed+depthwise+convolution+tokenization%22">mixed depthwise convolution tokenization</searchLink><br /><searchLink fieldCode="DE" term="%22parallel+encoder%22">parallel encoder</searchLink><br /><searchLink fieldCode="DE" term="%22global+average+pooling%22">global average pooling</searchLink><br /><searchLink fieldCode="DE" term="%22Science%22">Science</searchLink> – Name: Abstract Label: Description Group: Ab Data: Vision transformers (ViT) have the characteristics of massive training data and complex model, which cannot be directly applied to polarimetric synthetic aperture radar (PolSAR) image classification tasks. Therefore, a mixed convolutional parallel transformer (MCPT) model based on ViT is proposed for fast PolSAR image classification. First of all, a mixed depthwise convolution tokenization is introduced. It replaces the learnable linear projection in the original ViT to obtain patch embeddings. The process of tokenization can reduce computational and parameter complexity and extract features of different receptive fields as input to the encoder. Furthermore, combining the idea of shallow networks with lower latency and easier optimization, a parallel encoder is implemented by pairing the same modules and recombining to form parallel blocks, which can decrease the network depth and computing power requirement. In addition, the original class embedding and position embedding are removed during tokenization, and a global average pooling layer is added after the encoder for category feature extraction. Finally, the experimental results on AIRSAR Flevoland and RADARSAT-2 San Francisco datasets show that the proposed method achieves a significant improvement in training and prediction speed. Meanwhile, the overall accuracy achieved was 97.9% and 96.77%, respectively. – Name: TypeDocument Label: Document Type Group: TypDoc Data: article in journal/newspaper – Name: Language Label: Language Group: Lang Data: English – Name: NoteTitleSource Label: Relation Group: SrcInfo Data: https://www.mdpi.com/2072-4292/15/11/2936; https://doaj.org/toc/2072-4292; https://doaj.org/article/1d3f011a206e47dd9deda43a270647b0 – Name: DOI Label: DOI Group: ID Data: 10.3390/rs15112936 – Name: URL Label: Availability Group: URL Data: https://doi.org/10.3390/rs15112936<br />https://doaj.org/article/1d3f011a206e47dd9deda43a270647b0 – Name: AN Label: Accession Number Group: ID Data: edsbas.FC53F6DB |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.FC53F6DB |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.3390/rs15112936 Languages: – Text: English Subjects: – SubjectFull: polarimetric SAR Type: general – SubjectFull: convolutional neural network Type: general – SubjectFull: vision transformer Type: general – SubjectFull: mixed depthwise convolution tokenization Type: general – SubjectFull: parallel encoder Type: general – SubjectFull: global average pooling Type: general – SubjectFull: Science Type: general Titles: – TitleFull: MCPT: Mixed Convolutional Parallel Transformer for Polarimetric SAR Image Classification Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Wenke Wang – PersonEntity: Name: NameFull: Jianlong Wang – PersonEntity: Name: NameFull: Bibo Lu – PersonEntity: Name: NameFull: Boyuan Liu – PersonEntity: Name: NameFull: Yake Zhang – PersonEntity: Name: NameFull: Chunyang Wang IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 01 Type: published Y: 2023 Identifiers: – Type: issn-locals Value: edsbas – Type: issn-locals Value: edsbas.oa Titles: – TitleFull: Remote Sensing, Vol 15, Iss 11, p 2936 (2023 Type: main |
| ResultId | 1 |
Nájsť tento článok vo Web of Science