Artificial colorization of digitized microfilms: a preliminary study

Many of the digitized manuscripts available online are in fact digitized microfilms, a technology dating back to the 1930s. Given recent progress in artificial colorization, we hypothesize that microfilms could be colorized with these techniques, and we test InstColorization. We train a model...

Bibliographic details
Published in: Journal of data mining and digital humanities, Volume 2022; Issue Towards a Digital Ecosystem:...
Main authors: Clérice, Thibault; Pinche, Ariane
Format: Journal Article
Language: English
Published: INRIA; Nicolas Turenne, 12 April 2023
ISSN: 2416-5999
Abstract Many of the digitized manuscripts available online are in fact digitized microfilms, a technology dating back to the 1930s. Given recent progress in artificial colorization, we hypothesize that microfilms could be colorized with these techniques, and we test InstColorization. We train a model on an ad hoc dataset of 18,788 color images that are artificially grayscaled for this purpose. The colorization results are promising but show clear limitations, owing to the difference between artificially grayscaled images and "naturally" grayscaled microfilms. We then evaluate the impact of this artificial colorization on two downstream tasks using Kraken: layout analysis and text recognition. Unfortunately, the results show little to no improvement, which limits the interest of artificial colorization of manuscripts in the computer vision domain.
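The data preparation step mentioned in the abstract (color images artificially grayscaled to serve as training input) can be illustrated with a minimal sketch. This is not the authors' pipeline: the abstract does not say which grayscale conversion was used, so the ITU-R BT.601 luma weights below are an assumption, and the helper names `to_gray`, `grayscale_image` and `make_training_pair` are hypothetical. Images are represented as plain nested lists of RGB tuples to keep the example self-contained.

```python
from typing import List, Tuple

RGB = Tuple[int, int, int]

# ITU-R BT.601 luma weights. Assumption: the abstract does not state
# which grayscale conversion was applied to the color images.
LUMA_WEIGHTS = (0.299, 0.587, 0.114)


def to_gray(pixel: RGB) -> int:
    """Convert one RGB pixel (0-255 per channel) to a gray level (0-255)."""
    r, g, b = pixel
    wr, wg, wb = LUMA_WEIGHTS
    return min(255, round(wr * r + wg * g + wb * b))


def grayscale_image(image: List[List[RGB]]) -> List[List[int]]:
    """Grayscale an image given as rows of RGB tuples."""
    return [[to_gray(pixel) for pixel in row] for row in image]


def make_training_pair(color_image: List[List[RGB]]):
    """Return (model input, target): the grayscaled image and its color original."""
    return grayscale_image(color_image), color_image
```

A pair built this way gives a colorization model an input whose gray levels are derived directly from the target colors. A real microfilm scan has no such guarantee, which is the mismatch between artificially and "naturally" grayscaled images that the abstract reports as a limitation.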
Author Pinche, Ariane
Clérice, Thibault
Author_xml – sequence: 1
  givenname: Thibault
  orcidid: 0000-0003-1852-9204
  surname: Clérice
  fullname: Clérice, Thibault
– sequence: 2
  givenname: Ariane
  orcidid: 0000-0002-7843-5050
  surname: Pinche
  fullname: Pinche, Ariane
BackLink https://hal.science/hal-03335326 (View record in HAL)
ContentType Journal Article
Copyright Attribution
Copyright_xml – notice: Attribution
DOI 10.46298/jdmdh.8454
DatabaseName CrossRef
Hyper Article en Ligne (HAL)
HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société
HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société (Open Access)
Hyper Article en Ligne (HAL) (Open Access)
DOAJ Directory of Open Access Journals
Discipline Computer Science
EISSN 2416-5999
ISSN 2416-5999
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue Towards a Digital Ecosystem:...
Language English
License https://creativecommons.org/licenses/by/4.0
Attribution: http://creativecommons.org/licenses/by
OpenAccessLink https://doaj.org/article/6283f44c6eb24a8b930bee1e092d4fd2
PublicationDate 2023-04-12
PublicationTitle Journal of data mining and digital humanities
PublicationYear 2023
Publisher INRIA
Nicolas Turenne
SubjectTerms [scco.comp]cognitive science/computer science
[shs.litt]humanities and social sciences/literature
Cognitive science
Computer science
Humanities and Social Sciences
Literature
Title Artificial colorization of digitized microfilms: a preliminary study
URI https://hal.science/hal-03335326
https://doaj.org/article/6283f44c6eb24a8b930bee1e092d4fd2
Volume 2022