Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published research

Handwritten Text Recognition (HTR) technology is now a mature machine learning tool, becoming integrated in the digitisation processes of libraries and archives, speeding up the transcription of primary sources and facilitating full text searching and analysis of historic texts at scale. However, re...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Archival science Ročník 22; číslo 3; s. 367 - 392
Hlavní autoři: Nockels, Joe, Gooding, Paul, Ames, Sarah, Terras, Melissa
Médium: Journal Article
Jazyk:angličtina
Vydáno: Dordrecht Springer Netherlands 01.09.2022
Springer Nature B.V
Témata:
ISSN:1389-0166, 1573-7500, 1573-7500, 1573-7519
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Handwritten Text Recognition (HTR) technology is now a mature machine learning tool, becoming integrated in the digitisation processes of libraries and archives, speeding up the transcription of primary sources and facilitating full text searching and analysis of historic texts at scale. However, research into how HTR is changing our information environment is scant. This paper presents a systematic literature review regarding how researchers are using one particular HTR platform, Transkribus, to indicate the domains where HTR is applied, the approach taken, and how the technology is understood. 381 papers from 2015 to 2020 were gathered from Google Scholar, Scopus, and Web of Science, then grouped and coded into categories using quantitative and qualitative approaches. Published research that mentions Transkribus is international and rapidly growing. Transkribus features primarily in archival and library science publications, while a long tail of broad and eclectic disciplines, including history, computer science, citizen science, law and education, demonstrate the wider applicability of the tool. The most common paper categories were humanities applications (67%), technological (25%), users (5%) and tutorials (3%) . This paper presents the first overarching review of HTR as featured in published research, while also elucidating how HTR is affecting the information environment.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1389-0166
1573-7500
1573-7500
1573-7519
DOI:10.1007/s10502-022-09397-0