Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning

Transformer-based approaches have shown good results in image captioning tasks. However, current approaches are limited in that they generate text from global features of the entire image. Therefore, we propose novel methods for generating better image captions as follows: (1) The Global-Local Visual...

Bibliographic Details
Published in:	Sensors (Basel, Switzerland) Vol. 22; no. 4; p. 1429
Main Authors:	Lee, Hojun, Cho, Hyunjun, Park, Jieun, Chae, Jinyeong, Kim, Jihie
Format:	Journal Article
Language:	English
Published:	Switzerland MDPI AG 01.02.2022
Subjects:	Computational linguistics; Crop diseases; deep learning; Electric Power Supplies; Electric transformers; Evaluation; Language processing; medical image captioning; Medical imaging equipment; Natural language interfaces; Neural networks; Noise; transformer; South Korea
ISSN:	1424-8220