Search Results - Transformer-based factorized encoder
-
1
Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images
ISSN: 0010-4825, 1879-0534, 1879-0534Published: United States Elsevier Ltd 01.11.2022Published in Computers in biology and medicine (01.11.2022)“…) typically provides more details of the lesions in the lung. Thus, a transformer-based factorized encoder (TBFE…”
Get full text
Journal Article -
2
DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation
ISSN: 1746-8094Published: Elsevier Ltd 01.03.2025Published in Biomedical signal processing and control (01.03.2025)“… In this paper, we combine the benefits of both methodologies. We propose DMFC-UFormer, an advanced fusion of Depthwise Multi-Scale Factorized Convolution-based transformers (DMFC-Transformer) with UNet…”
Get full text
Journal Article -
3
Two-stream vision transformer based multi-label recognition for TCM prescriptions construction
ISSN: 0010-4825, 1879-0534, 1879-0534Published: United States Elsevier Ltd 01.03.2024Published in Computers in biology and medicine (01.03.2024)“… and multi-label classification. The framework comprises three key components: image encoder, label embedding module, and cross-modal…”
Get full text
Journal Article -
4
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-Pose, and Facial Expression Features
ISSN: 1063-6919Published: IEEE 16.06.2024Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (16.06.2024)“… We propose a transformer-based encoder for computing a set-latent representation of the source image…”
Get full text
Conference Proceeding -
5
See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization
ISSN: 0950-7051, 1872-7409Published: Amsterdam Elsevier B.V 05.09.2021Published in Knowledge-based systems (05.09.2021)“…In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from…”
Get full text
Journal Article -
6
Multimodal Abstractive Summarization using bidirectional encoder representations from transformers with attention mechanism
ISSN: 2405-8440, 2405-8440Published: England Elsevier Ltd 29.02.2024Published in Heliyon (29.02.2024)“… To address the aforementioned issues, this research presented the Multimodal Abstractive Summarization using Bidirectional Encoder Representations from Transformers (MAS-BERT…”
Get full text
Journal Article -
7
DeHi: A Decoupled Hierarchical Architecture for Unaligned Ground-to-Aerial Geo-Localization
ISSN: 1051-8215, 1558-2205Published: New York IEEE 01.03.2024Published in IEEE transactions on circuits and systems for video technology (01.03.2024)“… Specifically, DeHi first leverages CNN to extract high-level semantic features, and then introduces a novel orthogonally factorized transformer model consisting of part-level and global transformer…”
Get full text
Journal Article -
8
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
ISSN: 2331-8422Published: Ithaca Cornell University Library, arXiv.org 10.06.2024Published in arXiv.org (10.06.2024)“… We propose a transformer-based encoder for computing a set-latent representation of the source image…”
Get full text
Paper -
9
Co-Scale Conv-Attentional Image Transformers
ISSN: 2380-7504Published: IEEE 01.10.2021Published in Proceedings / IEEE International Conference on Computer Vision (01.10.2021)“…In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms…”
Get full text
Conference Proceeding -
10
ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection
ISSN: 0924-669X, 1573-7497Published: New York Springer US 01.11.2024Published in Applied intelligence (Dordrecht, Netherlands) (01.11.2024)“…Recently transformer-based scene text detection methods have been gradually investigated…”
Get full text
Journal Article -
11
PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners
ISSN: 2331-8422Published: Ithaca Cornell University Library, arXiv.org 18.10.2024Published in arXiv.org (18.10.2024)“… mechanisms, including full-, factorized-, and interleaved-spatial-temporal attention. With its recurrent-free, transformer-based design, PredFormer is both simple and efficient, significantly outperforming previous…”
Get full text
Paper -
12
CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation
ISSN: 2331-8422Published: Ithaca Cornell University Library, arXiv.org 29.10.2024Published in arXiv.org (29.10.2024)“…Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation…”
Get full text
Paper -
13
CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation
ISSN: 2642-9381Published: IEEE 26.02.2025Published in Proceedings / IEEE Workshop on Applications of Computer Vision (26.02.2025)“…Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation…”
Get full text
Conference Proceeding -
14
See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization
ISSN: 2331-8422Published: Ithaca Cornell University Library, arXiv.org 15.09.2021Published in arXiv.org (15.09.2021)“…In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from…”
Get full text
Paper -
15
Efficient Deep Learning Models for Physics Simulation
ISBN: 9798288853104Published: ProQuest Dissertations & Theses 01.01.2025“…Many natural and engineered systems are governed by partial differential equations (PDEs), spanning atomic interactions in molecular systems to large-scale…”
Get full text
Dissertation -
16
Co-Scale Conv-Attentional Image Transformers
ISSN: 2331-8422Published: Ithaca Cornell University Library, arXiv.org 26.08.2021Published in arXiv.org (26.08.2021)“…In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms…”
Get full text
Paper

