Suchergebnisse - Transformer-based factorized encoder
Andere Suchmöglichkeiten:
- Transformer-based factorized encoder »
-
1
Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images
ISSN: 0010-4825, 1879-0534, 1879-0534Veröffentlicht: United States Elsevier Ltd 01.11.2022Veröffentlicht in Computers in biology and medicine (01.11.2022)“… ) typically provides more details of the lesions in the lung. Thus, a transformer-based factorized encoder (TBFE …”
Volltext
Journal Article -
2
DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation
ISSN: 1746-8094Veröffentlicht: Elsevier Ltd 01.03.2025Veröffentlicht in Biomedical signal processing and control (01.03.2025)“… In this paper, we combine the benefits of both methodologies. We propose DMFC-UFormer, an advanced fusion of Depthwise Multi-Scale Factorized Convolution-based transformers (DMFC-Transformer) with UNet …”
Volltext
Journal Article -
3
Two-stream vision transformer based multi-label recognition for TCM prescriptions construction
ISSN: 0010-4825, 1879-0534, 1879-0534Veröffentlicht: United States Elsevier Ltd 01.03.2024Veröffentlicht in Computers in biology and medicine (01.03.2024)“… and multi-label classification. The framework comprises three key components: image encoder, label embedding module, and cross-modal …”
Volltext
Journal Article -
4
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-Pose, and Facial Expression Features
ISSN: 1063-6919Veröffentlicht: IEEE 16.06.2024Veröffentlicht in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (16.06.2024)“… We propose a transformer-based encoder for computing a set-latent representation of the source image …”
Volltext
Tagungsbericht -
5
See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization
ISSN: 0950-7051, 1872-7409Veröffentlicht: Amsterdam Elsevier B.V 05.09.2021Veröffentlicht in Knowledge-based systems (05.09.2021)“… In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from …”
Volltext
Journal Article -
6
Multimodal Abstractive Summarization using bidirectional encoder representations from transformers with attention mechanism
ISSN: 2405-8440, 2405-8440Veröffentlicht: England Elsevier Ltd 29.02.2024Veröffentlicht in Heliyon (29.02.2024)“… To address the aforementioned issues, this research presented the Multimodal Abstractive Summarization using Bidirectional Encoder Representations from Transformers (MAS-BERT …”
Volltext
Journal Article -
7
DeHi: A Decoupled Hierarchical Architecture for Unaligned Ground-to-Aerial Geo-Localization
ISSN: 1051-8215, 1558-2205Veröffentlicht: New York IEEE 01.03.2024Veröffentlicht in IEEE transactions on circuits and systems for video technology (01.03.2024)“… Specifically, DeHi first leverages CNN to extract high-level semantic features, and then introduces a novel orthogonally factorized transformer model consisting of part-level and global transformer …”
Volltext
Journal Article -
8
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
ISSN: 2331-8422Veröffentlicht: Ithaca Cornell University Library, arXiv.org 10.06.2024Veröffentlicht in arXiv.org (10.06.2024)“… We propose a transformer-based encoder for computing a set-latent representation of the source image …”
Volltext
Paper -
9
Co-Scale Conv-Attentional Image Transformers
ISSN: 2380-7504Veröffentlicht: IEEE 01.10.2021Veröffentlicht in Proceedings / IEEE International Conference on Computer Vision (01.10.2021)“… In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms …”
Volltext
Tagungsbericht -
10
ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection
ISSN: 0924-669X, 1573-7497Veröffentlicht: New York Springer US 01.11.2024Veröffentlicht in Applied intelligence (Dordrecht, Netherlands) (01.11.2024)“… Recently transformer-based scene text detection methods have been gradually investigated …”
Volltext
Journal Article -
11
PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners
ISSN: 2331-8422Veröffentlicht: Ithaca Cornell University Library, arXiv.org 18.10.2024Veröffentlicht in arXiv.org (18.10.2024)“… mechanisms, including full-, factorized-, and interleaved-spatial-temporal attention. With its recurrent-free, transformer-based design, PredFormer is both simple and efficient, significantly outperforming previous …”
Volltext
Paper -
12
CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation
ISSN: 2331-8422Veröffentlicht: Ithaca Cornell University Library, arXiv.org 29.10.2024Veröffentlicht in arXiv.org (29.10.2024)“… Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation …”
Volltext
Paper -
13
CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation
ISSN: 2642-9381Veröffentlicht: IEEE 26.02.2025Veröffentlicht in Proceedings / IEEE Workshop on Applications of Computer Vision (26.02.2025)“… Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation …”
Volltext
Tagungsbericht -
14
See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization
ISSN: 2331-8422Veröffentlicht: Ithaca Cornell University Library, arXiv.org 15.09.2021Veröffentlicht in arXiv.org (15.09.2021)“… In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from …”
Volltext
Paper -
15
Efficient Deep Learning Models for Physics Simulation
ISBN: 9798288853104Veröffentlicht: ProQuest Dissertations & Theses 01.01.2025“… Many natural and engineered systems are governed by partial differential equations (PDEs), spanning atomic interactions in molecular systems to large-scale …”
Volltext
Dissertation -
16
Co-Scale Conv-Attentional Image Transformers
ISSN: 2331-8422Veröffentlicht: Ithaca Cornell University Library, arXiv.org 26.08.2021Veröffentlicht in arXiv.org (26.08.2021)“… In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms …”
Volltext
Paper

