Suchergebnisse - Transformer-based factorized encoder

Andere Suchmöglichkeiten:

Transformer-based factorized encoder »
- Transformer-based factories encoder

1

Wird geladen …

Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images von Huang, Yingying, Si, Yang, Hu, Bingliang, Zhang, Yan, Wu, Shuang, Wu, Dongsheng, Wang, Quan

ISSN: 0010-4825, 1879-0534, 1879-0534

Veröffentlicht: United States Elsevier Ltd 01.11.2022

Veröffentlicht in Computers in biology and medicine (01.11.2022)
“… ) typically provides more details of the lesions in the lung. Thus, a transformer-based factorized encoder (TBFE …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
2

Wird geladen …

DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation von Garbaz, Anass, Oukdach, Yassine, Charfi, Said, Ansari, Mohamed El, Koutti, Lahcen, Salihoun, Mouna

ISSN: 1746-8094

Veröffentlicht: Elsevier Ltd 01.03.2025

Veröffentlicht in Biomedical signal processing and control (01.03.2025)
“… In this paper, we combine the benefits of both methodologies. We propose DMFC-UFormer, an advanced fusion of Depthwise Multi-Scale Factorized Convolution-based transformers (DMFC-Transformer) with UNet …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
3

Wird geladen …

Two-stream vision transformer based multi-label recognition for TCM prescriptions construction von Zhao, Zijuan, Qiang, Yan, Yang, Fenghao, Hou, Xiao, Zhao, Juanjuan, Song, Kai

ISSN: 0010-4825, 1879-0534, 1879-0534

Veröffentlicht: United States Elsevier Ltd 01.03.2024

Veröffentlicht in Computers in biology and medicine (01.03.2024)
“… and multi-label classification. The framework comprises three key components: image encoder, label embedding module, and cross-modal …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
4

Wird geladen …

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-Pose, and Facial Expression Features von Rochow, Andre, Schwarz, Max, Behnke, Sven

ISSN: 1063-6919

Veröffentlicht: IEEE 16.06.2024

Veröffentlicht in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (16.06.2024)
“… We propose a transformer-based encoder for computing a set-latent representation of the source image …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
5

Wird geladen …

See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization von Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

ISSN: 0950-7051, 1872-7409

Veröffentlicht: Amsterdam Elsevier B.V 05.09.2021

Veröffentlicht in Knowledge-based systems (05.09.2021)
“… In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
6

Wird geladen …

Multimodal Abstractive Summarization using bidirectional encoder representations from transformers with attention mechanism von Argade, Dakshata, Khairnar, Vaishali, Vora, Deepali, Patil, Shruti, Kotecha, Ketan, Alfarhood, Sultan

ISSN: 2405-8440, 2405-8440

Veröffentlicht: England Elsevier Ltd 29.02.2024

Veröffentlicht in Heliyon (29.02.2024)
“… To address the aforementioned issues, this research presented the Multimodal Abstractive Summarization using Bidirectional Encoder Representations from Transformers (MAS-BERT …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
7

Wird geladen …

DeHi: A Decoupled Hierarchical Architecture for Unaligned Ground-to-Aerial Geo-Localization von Wang, Teng, Li, Jiawen, Sun, Changyin

ISSN: 1051-8215, 1558-2205

Veröffentlicht: New York IEEE 01.03.2024

Veröffentlicht in IEEE transactions on circuits and systems for video technology (01.03.2024)
“… Specifically, DeHi first leverages CNN to extract high-level semantic features, and then introduces a novel orthogonally factorized transformer model consisting of part-level and global transformer …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
8

Wird geladen …

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features von Rochow, Andre, Schwarz, Max, Behnke, Sven

ISSN: 2331-8422

Veröffentlicht: Ithaca Cornell University Library, arXiv.org 10.06.2024

Veröffentlicht in arXiv.org (10.06.2024)
“… We propose a transformer-based encoder for computing a set-latent representation of the source image …”

Volltext

Paper

Zu den Favoriten

Gespeichert in:
9

Wird geladen …

Co-Scale Conv-Attentional Image Transformers von Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

ISSN: 2380-7504

Veröffentlicht: IEEE 01.10.2021

Veröffentlicht in Proceedings / IEEE International Conference on Computer Vision (01.10.2021)
“… In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
10

Wird geladen …

ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection von Fan, Huageng, Lu, Tongwei

ISSN: 0924-669X, 1573-7497

Veröffentlicht: New York Springer US 01.11.2024

Veröffentlicht in Applied intelligence (Dordrecht, Netherlands) (01.11.2024)
“… Recently transformer-based scene text detection methods have been gradually investigated …”

Volltext

Journal Article

Zu den Favoriten

Gespeichert in:
11

Wird geladen …

PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners von Tang, Yujin, Lu, Qi, Xie, Fei, Li, Xiangtai, Ma, Chao, Ming-Hsuan Yang

ISSN: 2331-8422

Veröffentlicht: Ithaca Cornell University Library, arXiv.org 18.10.2024

Veröffentlicht in arXiv.org (18.10.2024)
“… mechanisms, including full-, factorized-, and interleaved-spatial-temporal attention. With its recurrent-free, transformer-based design, PredFormer is both simple and efficient, significantly outperforming previous …”

Volltext

Paper

Zu den Favoriten

Gespeichert in:
12

Wird geladen …

CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation von Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

ISSN: 2331-8422

Veröffentlicht: Ithaca Cornell University Library, arXiv.org 29.10.2024

Veröffentlicht in arXiv.org (29.10.2024)
“… Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation …”

Volltext

Paper

Zu den Favoriten

Gespeichert in:
13

Wird geladen …

CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation von Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

ISSN: 2642-9381

Veröffentlicht: IEEE 26.02.2025

Veröffentlicht in Proceedings / IEEE Workshop on Applications of Computer Vision (26.02.2025)
“… Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation …”

Volltext

Tagungsbericht

Zu den Favoriten

Gespeichert in:
14

Wird geladen …

See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization von Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

ISSN: 2331-8422

Veröffentlicht: Ithaca Cornell University Library, arXiv.org 15.09.2021

Veröffentlicht in arXiv.org (15.09.2021)
“… In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from …”

Volltext

Paper

Zu den Favoriten

Gespeichert in:
15

Wird geladen …

Efficient Deep Learning Models for Physics Simulation von Li, Zijie

ISBN: 9798288853104

Veröffentlicht: ProQuest Dissertations & Theses 01.01.2025

“… Many natural and engineered systems are governed by partial differential equations (PDEs), spanning atomic interactions in molecular systems to large-scale …”

Volltext

Dissertation

Zu den Favoriten

Gespeichert in:
16

Wird geladen …

Co-Scale Conv-Attentional Image Transformers von Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

ISSN: 2331-8422

Veröffentlicht: Ithaca Cornell University Library, arXiv.org 26.08.2021

Veröffentlicht in arXiv.org (26.08.2021)
“… In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms …”

Volltext

Paper

Zu den Favoriten

Gespeichert in:

Suchergebnisse - Transformer-based factorized encoder

Andere Suchmöglichkeiten:

Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images von Huang, Yingying, Si, Yang, Hu, Bingliang, Zhang, Yan, Wu, Shuang, Wu, Dongsheng, Wang, Quan

DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation von Garbaz, Anass, Oukdach, Yassine, Charfi, Said, Ansari, Mohamed El, Koutti, Lahcen, Salihoun, Mouna

Two-stream vision transformer based multi-label recognition for TCM prescriptions construction von Zhao, Zijuan, Qiang, Yan, Yang, Fenghao, Hou, Xiao, Zhao, Juanjuan, Song, Kai

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-Pose, and Facial Expression Features von Rochow, Andre, Schwarz, Max, Behnke, Sven

See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization von Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

Multimodal Abstractive Summarization using bidirectional encoder representations from transformers with attention mechanism von Argade, Dakshata, Khairnar, Vaishali, Vora, Deepali, Patil, Shruti, Kotecha, Ketan, Alfarhood, Sultan

DeHi: A Decoupled Hierarchical Architecture for Unaligned Ground-to-Aerial Geo-Localization von Wang, Teng, Li, Jiawen, Sun, Changyin

FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features von Rochow, Andre, Schwarz, Max, Behnke, Sven

Co-Scale Conv-Attentional Image Transformers von Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection von Fan, Huageng, Lu, Tongwei

PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners von Tang, Yujin, Lu, Qi, Xie, Fei, Li, Xiangtai, Ma, Chao, Ming-Hsuan Yang

CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation von Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation von Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization von Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

Efficient Deep Learning Models for Physics Simulation von Li, Zijie

Co-Scale Conv-Attentional Image Transformers von Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

Suchwerkzeuge:

Treffer weiter einschränken

Format

Schlagwortumfeld

Thema

Sprache

Erscheinungsjahr