Search Results - Transformer-based factorized encoder

  • Showing 1 - 16 of 16 results
  1.

    Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images by Huang, Yingying, Si, Yang, Hu, Bingliang, Zhang, Yan, Wu, Shuang, Wu, Dongsheng, Wang, Quan

    ISSN: 0010-4825, 1879-0534
    Published: United States Elsevier Ltd 01.11.2022
    Published in Computers in biology and medicine (01.11.2022)
    “…) typically provides more details of the lesions in the lung. Thus, a transformer-based factorized encoder (TBFE…”
    Journal Article
  2.

    DMFC-UFormer: Depthwise multi-scale factorized convolution transformer-based UNet for medical image segmentation by Garbaz, Anass, Oukdach, Yassine, Charfi, Said, Ansari, Mohamed El, Koutti, Lahcen, Salihoun, Mouna

    ISSN: 1746-8094
    Published: Elsevier Ltd 01.03.2025
    Published in Biomedical signal processing and control (01.03.2025)
    “… In this paper, we combine the benefits of both methodologies. We propose DMFC-UFormer, an advanced fusion of Depthwise Multi-Scale Factorized Convolution-based transformers (DMFC-Transformer) with UNet…”
    Journal Article
  3.

    Two-stream vision transformer based multi-label recognition for TCM prescriptions construction by Zhao, Zijuan, Qiang, Yan, Yang, Fenghao, Hou, Xiao, Zhao, Juanjuan, Song, Kai

    ISSN: 0010-4825, 1879-0534
    Published: United States Elsevier Ltd 01.03.2024
    Published in Computers in biology and medicine (01.03.2024)
    “… and multi-label classification. The framework comprises three key components: image encoder, label embedding module, and cross-modal…”
    Journal Article
  4.

    FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-Pose, and Facial Expression Features by Rochow, Andre, Schwarz, Max, Behnke, Sven

    ISSN: 1063-6919
    Published: IEEE 16.06.2024
    “… We propose a transformer-based encoder for computing a set-latent representation of the source image…”
    Conference Proceeding
  5.

    See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization by Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

    ISSN: 0950-7051, 1872-7409
    Published: Amsterdam Elsevier B.V. 05.09.2021
    Published in Knowledge-based systems (05.09.2021)
    “…In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from…”
    Journal Article
  6.

    Multimodal Abstractive Summarization using bidirectional encoder representations from transformers with attention mechanism by Argade, Dakshata, Khairnar, Vaishali, Vora, Deepali, Patil, Shruti, Kotecha, Ketan, Alfarhood, Sultan

    ISSN: 2405-8440
    Published: England Elsevier Ltd 29.02.2024
    Published in Heliyon (29.02.2024)
    “… To address the aforementioned issues, this research presented the Multimodal Abstractive Summarization using Bidirectional Encoder Representations from Transformers (MAS-BERT…”
    Journal Article
  7.

    DeHi: A Decoupled Hierarchical Architecture for Unaligned Ground-to-Aerial Geo-Localization by Wang, Teng, Li, Jiawen, Sun, Changyin

    ISSN: 1051-8215, 1558-2205
    Published: New York IEEE 01.03.2024
    “… Specifically, DeHi first leverages CNN to extract high-level semantic features, and then introduces a novel orthogonally factorized transformer model consisting of part-level and global transformer…”
    Journal Article
  8.

    FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features by Rochow, Andre, Schwarz, Max, Behnke, Sven

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 10.06.2024
    Published in arXiv.org (10.06.2024)
    “… We propose a transformer-based encoder for computing a set-latent representation of the source image…”
    Paper
  9.

    Co-Scale Conv-Attentional Image Transformers by Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

    ISSN: 2380-7504
    Published: IEEE 01.10.2021
    “…In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms…”
    Conference Proceeding
  10.

    ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection by Fan, Huageng, Lu, Tongwei

    ISSN: 0924-669X, 1573-7497
    Published: New York Springer US 01.11.2024
    “…Recently transformer-based scene text detection methods have been gradually investigated…”
    Journal Article
  11.

    PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners by Tang, Yujin, Lu, Qi, Xie, Fei, Li, Xiangtai, Ma, Chao, Yang, Ming-Hsuan

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 18.10.2024
    Published in arXiv.org (18.10.2024)
    “… mechanisms, including full-, factorized-, and interleaved-spatial-temporal attention. With its recurrent-free, transformer-based design, PredFormer is both simple and efficient, significantly outperforming previous…”
    Paper
  12.

    CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation by Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 29.10.2024
    Published in arXiv.org (29.10.2024)
    “…Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation…”
    Paper
  13.

    CAMS: Convolution and Attention-Free Mamba-Based Cardiac Image Segmentation by Khan, Abbas, Asad, Muhammad, Benning, Martin, Roney, Caroline, Slabaugh, Gregory

    ISSN: 2642-9381
    Published: IEEE 26.02.2025
    “…Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation…”
    Conference Proceeding
  14.

    See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization by Atri, Yash Kumar, Pramanick, Shraman, Goyal, Vikram, Chakraborty, Tanmoy

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 15.09.2021
    Published in arXiv.org (15.09.2021)
    “…In recent years, abstractive text summarization with multimodal inputs has started drawing attention due to its ability to accumulate information from…”
    Paper
  15.

    Efficient Deep Learning Models for Physics Simulation by Li, Zijie

    ISBN: 9798288853104
    Published: ProQuest Dissertations & Theses 01.01.2025
    “…Many natural and engineered systems are governed by partial differential equations (PDEs), spanning atomic interactions in molecular systems to large-scale…”
    Dissertation
  16.

    Co-Scale Conv-Attentional Image Transformers by Xu, Weijian, Xu, Yifan, Chang, Tyler, Tu, Zhuowen

    ISSN: 2331-8422
    Published: Ithaca Cornell University Library, arXiv.org 26.08.2021
    Published in arXiv.org (26.08.2021)
    “…In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms…”
    Paper