From Show to Tell: A Survey on Deep Learning-Based Image Captioning

Connecting Vision and Language plays an essential role in Generative Intelligence. For this reason, large research efforts have been devoted to image captioning, i.e. describing images with syntactically and semantically meaningful sentences. Starting from 2015 the task has generally been addressed...

Bibliographic Details
Published in:	IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 45, no. 1, pp. 539-559
Main Authors:	Stefanini, Matteo; Cornia, Marcella; Baraldi, Lorenzo; Cascianelli, Silvia; Fiameni, Giuseppe; Cucchiara, Rita
Format:	Journal Article
Language:	English
Published:	United States IEEE 01.01.2023 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects:	Additives; Algorithms; Benchmarking; Coders; Computer vision; Convolutional neural networks; Deep Learning; Feature extraction; Image captioning; Image coding; Language; Natural Language Processing; Sentences; survey; Task analysis; Training; vision-and-language; Visualization
ISSN:	0162-8828, 1939-3539, 2160-9292