Character Segmentation in Asian Collector's Seal Imprints: An Attempt to Retrieval Based on Ancient Character Typeface
Collector's seals provide important clues about the ownership of a book. They contain much information pertaining to the essential elements of ancient materials and also show the details of possession, its relation to the book, the identity of the collectors and their social status and wealth,...
Uložené v:
| Vydané v: | Journal of data mining and digital humanities Ročník HistoInformatics; číslo HistoInformatics |
|---|---|
| Hlavní autori: | , , |
| Médium: | Journal Article |
| Jazyk: | English |
| Vydavateľské údaje: |
INRIA
11.01.2021
Nicolas Turenne |
| Predmet: | |
| ISSN: | 2416-5999, 2416-5999 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | Collector's seals provide important clues about the ownership of a book. They contain much information pertaining to the essential elements of ancient materials and also show the details of possession, its relation to the book, the identity of the collectors and their social status and wealth, amongst others. Asian collectors have typically used artistic ancient characters rather than modern ones to make their seals. In addition to the owner's name, several other words are used to express more profound meanings. A system that automatically recognizes these characters can help enthusiasts and professionals better understand the background information of these seals. However, there is a lack of training data and labelled images, as samples of some seals are scarce and most of them are degraded images. It is necessary to find new ways to make full use of such scarce data. While these data are available online, they do not contain information on the characters' position. The goal of this research is to assist in obtaining more labelled data through user interaction and provide retrieval tools that use only standard character typefaces extracted from font files. In this paper, a character segmentation method is proposed to predict the candidate characters' area without any labelled training data that contain character coordinate information. A retrieval-based recognition system that focuses on a single character is also proposed to support seal retrieval and matching. The experimental results demonstrate that the proposed character segmentation method performs well on Asian collector's seals, with 85% of the test data being correctly segmented. |
|---|---|
| AbstractList | Collector's seals provide important clues about the ownership of a book. They contain much information pertaining to the essential elements of ancient materials and also show the details of possession, its relation to the book, the identity of the collectors and their social status and wealth, amongst others. Asian collectors have typically used artistic ancient characters rather than modern ones to make their seals. In addition to the owner's name, several other words are used to express more profound meanings. A system that automatically recognizes these characters can help enthusiasts and professionals better understand the background information of these seals. However, there is a lack of training data and labelled images, as samples of some seals are scarce and most of them are degraded images. It is necessary to find new ways to make full use of such scarce data. While these data are available online, they do not contain information on the characters' position. The goal of this research is to assist in obtaining more labelled data through user interaction and provide retrieval tools that use only standard character typefaces extracted from font files. In this paper, a character segmentation method is proposed to predict the candidate characters' area without any labelled training data that contain character coordinate information. A retrieval-based recognition system that focuses on a single character is also proposed to support seal retrieval and matching. The experimental results demonstrate that the proposed character segmentation method performs well on Asian collector's seals, with 85% of the test data being correctly segmented. |
| Author | Li, Kangying Maeda, Akira Batjargal, Biligsaikhan |
| Author_xml | – sequence: 1 givenname: Kangying surname: Li fullname: Li, Kangying organization: Graduate School of Information Science and Engineering, Ritsumeikan University, Japan – sequence: 2 givenname: Biligsaikhan orcidid: 0000-0002-3068-2634 surname: Batjargal fullname: Batjargal, Biligsaikhan organization: Kinugasa Research Organization, Ritsumeikan University, Japan – sequence: 3 givenname: Akira surname: Maeda fullname: Maeda, Akira organization: College of Information Science and Engineering, Ritsumeikan University, Japan |
| BackLink | https://inria.hal.science/hal-02476910$$DView record in HAL |
| BookMark | eNpVkVFrFDEQx4NUsNY--QXyJiJXk2yS3fi2HmoPDgStz2E2mfT22N0cSTjot296V6o-zTD85jcD_7fkYokLEvKesxuphek-7_3sdzeaM_GKXArJ9UoZYy7-6d-Q65z3jDGuZKeUuiTH9Q4SuIKJ_sb7GZcCZYwLHRfa5xEWuo7ThK7E9CFXAia6mQ9pXEr-QvvKlILzodAS6S8sacRjJb5CRk-rpF_cWI307427hwMGcPiOvA4wZbx-rlfkz_dvd-vb1fbnj826364cbxuxMpIr4Qc5SDZ4lIoHBtBwF5gJGqXnnWG-C61sgwl86IQyqLnnoCrSdKK5Ipuz10fY2_r4DOnBRhjtaRDTvYVURjehhcHrRng2hNBIJ3QnQ2eU8wxEy5jS1fXx7NrB9J_qtt_apxkTstWGs6Oq7Kcz61LMOWF4WeDMntKyp7TsU1rNI7y5iV8 |
| ContentType | Journal Article |
| Copyright | Distributed under a Creative Commons Attribution 4.0 International License |
| Copyright_xml | – notice: Distributed under a Creative Commons Attribution 4.0 International License |
| DBID | AAYXX CITATION 1XC VOOES DOA |
| DOI | 10.46298/jdmdh.6102 |
| DatabaseName | CrossRef Hyper Article en Ligne (HAL) Hyper Article en Ligne (HAL) (Open Access) DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 2416-5999 |
| ExternalDocumentID | oai_doaj_org_article_abd632d0bff34c2684f895cd0a270056 oai:HAL:hal-02476910v5 10_46298_jdmdh_6102 |
| GroupedDBID | 5VS AAFWJ AAYXX ADBBV ADQAK AFPKN ALMA_UNASSIGNED_HOLDINGS BCNDV CITATION FRP GROUPED_DOAJ KQ8 M~E OK1 1XC VOOES |
| ID | FETCH-LOGICAL-c1732-94152db4b40bde451f0aa31cf09f6e4d1890d8f747f9f1b8259e61d1a5cf03823 |
| IEDL.DBID | DOA |
| ISSN | 2416-5999 |
| IngestDate | Fri Oct 03 12:52:34 EDT 2025 Tue Oct 14 20:24:30 EDT 2025 Sat Nov 29 04:10:29 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | HistoInformatics |
| Keywords | Ancient document image processing Asian seal imprint Character segmentation |
| Language | English |
| License | Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c1732-94152db4b40bde451f0aa31cf09f6e4d1890d8f747f9f1b8259e61d1a5cf03823 |
| ORCID | 0000-0002-3068-2634 |
| OpenAccessLink | https://doaj.org/article/abd632d0bff34c2684f895cd0a270056 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_abd632d0bff34c2684f895cd0a270056 hal_primary_oai_HAL_hal_02476910v5 crossref_primary_10_46298_jdmdh_6102 |
| PublicationCentury | 2000 |
| PublicationDate | 2021-01-11 |
| PublicationDateYYYYMMDD | 2021-01-11 |
| PublicationDate_xml | – month: 01 year: 2021 text: 2021-01-11 day: 11 |
| PublicationDecade | 2020 |
| PublicationTitle | Journal of data mining and digital humanities |
| PublicationYear | 2021 |
| Publisher | INRIA Nicolas Turenne |
| Publisher_xml | – name: INRIA – name: Nicolas Turenne |
| SSID | ssj0001548555 |
| Score | 2.1303644 |
| Snippet | Collector's seals provide important clues about the ownership of a book. They contain much information pertaining to the essential elements of ancient... |
| SourceID | doaj hal crossref |
| SourceType | Open Website Open Access Repository Index Database |
| SubjectTerms | [info.info-dl]computer science [cs]/digital libraries [cs.dl] [info]computer science [cs] ancient document image processing asian seal imprint character segmentation Computer Science Digital Libraries |
| Title | Character Segmentation in Asian Collector's Seal Imprints: An Attempt to Retrieval Based on Ancient Character Typeface |
| URI | https://inria.hal.science/hal-02476910 https://doaj.org/article/abd632d0bff34c2684f895cd0a270056 |
| Volume | HistoInformatics |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: DOA dateStart: 20140101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: M~E dateStart: 20140101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1JSyUxEA6DeJiL4zKDb1wIInjKmHTS6Y63VhQPKsMs4K3JOj7BdvC17-hvtyrdz-U0l6Ehh1AkTVVCLan6ipD9WGsZjONMVdYyVVSGOe0VC0p4LtGE1rlQ-KK6uqqvr833N62-MCdsgAceGHdoXdCyCNylJJVHbJJUm9IHbvHJtMxg22D1vHGmhvpgBD3B_EXQUJqVYAYNxXlKF6Y-vA134eYbWA7FO3WUUftBydwsgqpZyZytkpXROqTN8Fdr5EPs1smnRecFOl7EDTI_WeAs05_xz91YP9TRaUcbrIqkOR6A8fiDGVDAkhg8mHb97Ig2QNMjIlVP-3v6I3fUguNGj0GfBQqLNF0ukqSve6CzmqyPn8nvs9NfJ-ds7KDAvKhkwQyq5-CUU9yFqEqRuLVS-MRN0lEFURse6gQuRTJJOPAWTdQiCFsCCb4QfiFL3X0XNwmVUiVfWBi1VfBZp0wtuYNLL-DSiwnZXzCy_TsAZbTgYGR-t5nfLfJ7Qo6RyS8kiG6dJ0Dm7Sjz9l8yn5A9ENG7Nc6bixbnwOSoNJhA8_Lr_9hpi3wsMImFCybENlnqHx7jDln28346e9jNZw7Gy6fTZ8qp2qg |
| linkProvider | Directory of Open Access Journals |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Character+Segmentation+in+Asian+Collector%27s+Seal+Imprints%3A+An+Attempt+to+Retrieval+Based+on+Ancient+Character+Typeface&rft.jtitle=Journal+of+data+mining+and+digital+humanities&rft.au=Li%2C+Kangying&rft.au=Batjargal%2C+Biligsaikhan&rft.au=Maeda%2C+Akira&rft.date=2021-01-11&rft.issn=2416-5999&rft.eissn=2416-5999&rft.volume=HistoInformatics&rft.issue=HistoInformatics&rft_id=info:doi/10.46298%2Fjdmdh.6102&rft.externalDBID=n%2Fa&rft.externalDocID=10_46298_jdmdh_6102 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2416-5999&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2416-5999&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2416-5999&client=summon |