Search Results - Transformer-based factorized encoder

1

Loading…

Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images by Huang, Yingying, Si, Yang, Hu, Bingliang, Zhang, Yan, Wu, Shuang, Wu, Dongsheng, Wang, Quan

ISSN: 0010-4825, 1879-0534, 1879-0534

Published: United States Elsevier Ltd 01.11.2022

Published in Computers in biology and medicine (01.11.2022)
“…) typically provides more details of the lesions in the lung. Thus, a transformer-based factorized encoder (TBFE…”

Get full text

Journal Article

Save to List

Saved in:
2

Loading…

DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation by Garbaz, Anass, Oukdach, Yassine, Charfi, Said, Ansari, Mohamed El, Koutti, Lahcen, Salihoun, Mouna

ISSN: 1746-8094

Published: Elsevier Ltd 01.03.2025

Published in Biomedical signal processing and control (01.03.2025)
“… In this paper, we combine the benefits of both methodologies. We propose DMFC-UFormer, an advanced fusion of Depthwise Multi-Scale Factorized Convolution-based transformers (DMFC-Transformer) with UNet…”

Get full text

Journal Article

Save to List

Saved in:
3

Loading…

Two-stream vision transformer based multi-label recognition for TCM prescriptions construction by Zhao, Zijuan, Qiang, Yan, Yang, Fenghao, Hou, Xiao, Zhao, Juanjuan, Song, Kai

ISSN: 0010-4825, 1879-0534, 1879-0534

Published: United States Elsevier Ltd 01.03.2024

Published in Computers in biology and medicine (01.03.2024)
“… and multi-label classification. The framework comprises three key components: image encoder, label embedding module, and cross-modal…”

Get full text

Journal Article

Save to List

Saved in:
4

Loading…

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-Pose, and Facial Expression Features by Rochow, Andre, Schwarz, Max, Behnke, Sven

ISSN: 1063-6919

Published: IEEE 16.06.2024

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (16.06.2024)
“… We propose a transformer-based encoder for computing a set-latent representation of the source image…”

Get full text

Conference Proceeding

Save to List

Saved in:
5

Loading…

See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization by Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

ISSN: 0950-7051, 1872-7409

Published: Amsterdam Elsevier B.V 05.09.2021

Published in Knowledge-based systems (05.09.2021)
“…In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from…”

Get full text

Journal Article

Save to List

Saved in:
6

Loading…

Multimodal Abstractive Summarization using bidirectional encoder representations from transformers with attention mechanism by Argade, Dakshata, Khairnar, Vaishali, Vora, Deepali, Patil, Shruti, Kotecha, Ketan, Alfarhood, Sultan

ISSN: 2405-8440, 2405-8440

Published: England Elsevier Ltd 29.02.2024

Published in Heliyon (29.02.2024)
“… To address the aforementioned issues, this research presented the Multimodal Abstractive Summarization using Bidirectional Encoder Representations from Transformers (MAS-BERT…”

Get full text

Journal Article

Save to List

Saved in:
7

Loading…

DeHi: A Decoupled Hierarchical Architecture for Unaligned Ground-to-Aerial Geo-Localization by Wang, Teng, Li, Jiawen, Sun, Changyin

ISSN: 1051-8215, 1558-2205

Published: New York IEEE 01.03.2024

Published in IEEE transactions on circuits and systems for video technology (01.03.2024)
“… Specifically, DeHi first leverages CNN to extract high-level semantic features, and then introduces a novel orthogonally factorized transformer model consisting of part-level and global transformer…”

Get full text

Journal Article

Save to List

Saved in:
8

Loading…

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features by Rochow, Andre, Schwarz, Max, Behnke, Sven

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 10.06.2024

Published in arXiv.org (10.06.2024)
“… We propose a transformer-based encoder for computing a set-latent representation of the source image…”

Get full text

Paper

Save to List

Saved in:
9

Loading…

Co-Scale Conv-Attentional Image Transformers by Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

ISSN: 2380-7504

Published: IEEE 01.10.2021

Published in Proceedings / IEEE International Conference on Computer Vision (01.10.2021)
“…In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms…”

Get full text

Conference Proceeding

Save to List

Saved in:
10

Loading…

ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection by Fan, Huageng, Lu, Tongwei

ISSN: 0924-669X, 1573-7497

Published: New York Springer US 01.11.2024

Published in Applied intelligence (Dordrecht, Netherlands) (01.11.2024)
“…Recently transformer-based scene text detection methods have been gradually investigated…”

Get full text

Journal Article

Save to List

Saved in:
11

Loading…

PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners by Tang, Yujin, Lu, Qi, Xie, Fei, Li, Xiangtai, Ma, Chao, Ming-Hsuan Yang

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 18.10.2024

Published in arXiv.org (18.10.2024)
“… mechanisms, including full-, factorized-, and interleaved-spatial-temporal attention. With its recurrent-free, transformer-based design, PredFormer is both simple and efficient, significantly outperforming previous…”

Get full text

Paper

Save to List

Saved in:
12

Loading…

CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation by Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 29.10.2024

Published in arXiv.org (29.10.2024)
“…Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation…”

Get full text

Paper

Save to List

Saved in:
13

Loading…

CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation by Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

ISSN: 2642-9381

Published: IEEE 26.02.2025

Published in Proceedings / IEEE Workshop on Applications of Computer Vision (26.02.2025)
“…Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation…”

Get full text

Conference Proceeding

Save to List

Saved in:
14

Loading…

See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization by Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 15.09.2021

Published in arXiv.org (15.09.2021)
“…In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from…”

Get full text

Paper

Save to List

Saved in:
15

Loading…

Efficient Deep Learning Models for Physics Simulation by Li, Zijie

ISBN: 9798288853104

Published: ProQuest Dissertations & Theses 01.01.2025

“…Many natural and engineered systems are governed by partial differential equations (PDEs), spanning atomic interactions in molecular systems to large-scale…”

Get full text

Dissertation

Save to List

Saved in:
16

Loading…

Co-Scale Conv-Attentional Image Transformers by Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 26.08.2021

Published in arXiv.org (26.08.2021)
“…In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms…”

Get full text

Paper

Save to List

Saved in:

Search Results - Transformer-based factorized encoder

Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images by Huang, Yingying, Si, Yang, Hu, Bingliang, Zhang, Yan, Wu, Shuang, Wu, Dongsheng, Wang, Quan

DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation by Garbaz, Anass, Oukdach, Yassine, Charfi, Said, Ansari, Mohamed El, Koutti, Lahcen, Salihoun, Mouna

Two-stream vision transformer based multi-label recognition for TCM prescriptions construction by Zhao, Zijuan, Qiang, Yan, Yang, Fenghao, Hou, Xiao, Zhao, Juanjuan, Song, Kai

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-Pose, and Facial Expression Features by Rochow, Andre, Schwarz, Max, Behnke, Sven

See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization by Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

Multimodal Abstractive Summarization using bidirectional encoder representations from transformers with attention mechanism by Argade, Dakshata, Khairnar, Vaishali, Vora, Deepali, Patil, Shruti, Kotecha, Ketan, Alfarhood, Sultan

DeHi: A Decoupled Hierarchical Architecture for Unaligned Ground-to-Aerial Geo-Localization by Wang, Teng, Li, Jiawen, Sun, Changyin

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features by Rochow, Andre, Schwarz, Max, Behnke, Sven

Co-Scale Conv-Attentional Image Transformers by Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection by Fan, Huageng, Lu, Tongwei

PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners by Tang, Yujin, Lu, Qi, Xie, Fei, Li, Xiangtai, Ma, Chao, Ming-Hsuan Yang

CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation by Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation by Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization by Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

Efficient Deep Learning Models for Physics Simulation by Li, Zijie

Co-Scale Conv-Attentional Image Transformers by Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication