Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive tasks such as image description and question answering. Cognition is core to tasks that involve not just recognizing, but reasoning about our visual world. However, models used to tackle th...
Saved in:
| Published in: | International journal of computer vision Vol. 123; no. 1; pp. 32 - 73 |
|---|---|
| Main Authors: | , , , , , , , , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
New York
Springer US
01.05.2017
Springer Springer Nature B.V |
| Subjects: | |
| ISSN: | 0920-5691, 1573-1405 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!