Interactive Image Caption Generation Reflecting User Intent from Trace Using a Diffusion Language Model
This study proposes an image captioning method designed to incorporate user-specific explanatory intentions into the generated text, as signaled by the user’s trace on the image. We extract areas of interest from dense sections of the trace, determine the order of explanations by tracking changes in...
| Published in: | Journal of Advanced Computational Intelligence and Intelligent Informatics, Vol. 29, no. 6, pp. 1417-1426 |
|---|---|
| Main Authors: | , |
| Format: | Journal Article |
| Language: | English |
| Published: | Tokyo: Fuji Technology Press Co. Ltd, 20.11.2025 |
| Subjects: | |
| ISSN: | 1343-0130, 1883-8014 |