A segmentation-free method for image retrieval and pattern spotting in historical documents using convolutional features

This paper describes a new segmentation-free method for retrieving images and spotting patterns in historical document image collections. The proposed method needs no training on the target domain, characterizing a problem-independent approach. For this purpose, the query and the document image repr...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Multimedia tools and applications Ročník 84; číslo 32; s. 39635 - 39665
Hlavní autori: Curi, Zacarias, Nicolas, Stéphane, Tranouez, Pierrick, M. Saavedra, Jose, de Souza Britto Jr, Alceu, Heutte, Laurent
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: New York Springer US 01.09.2025
Springer Nature B.V
Springer Verlag
Predmet:
ISSN:1573-7721, 1380-7501, 1573-7721
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:This paper describes a new segmentation-free method for retrieving images and spotting patterns in historical document image collections. The proposed method needs no training on the target domain, characterizing a problem-independent approach. For this purpose, the query and the document image represented by feature maps extracted using intermediate layers of a pre-trained Fully Convolutional Network are submitted to a cross-correlation process. The produced similarity heatmap is used to locate the query occurrences on the document page. A robust experimental protocol using three datasets shows promising results on image retrieval and pattern spotting. The experiments conducted on the public DocExplore dataset demonstrated that the proposed method could improve the mAP by 69.1% for the spotting task and 21% for the image retrieval task compared to the state-of-the-art. Additional experiments on image retrieval using the Horae dataset and logo spotting using the Tobacco800 dataset confirmed the high generalization capability of the proposed method.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1573-7721
1380-7501
1573-7721
DOI:10.1007/s11042-025-20766-6