Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning
Transformer-based approaches have shown good results in image captioning tasks. However, current approaches have a limitation in generating text from global features of an entire image. Therefore, we propose novel methods for generating better image captions as follows: (1) The Global-Local Visual...
| Published in: | Sensors (Basel, Switzerland) Vol. 22; no. 4; p. 1429 |
|---|---|
| Format: | Journal Article |
| Language: | English |
| Published: | Switzerland: MDPI AG, 01.02.2022 |
| ISSN: | 1424-8220 |