Optical Recognition Assisted Transcription with Transkribus: The Experiment concerning Eugène Wilhelm's Personal Diary (1885-1951)
This article reports on a "user experience" of the Transkribus software in a French-speaking context. It is based on the semi-automated transcription of the diary of the jurist Eugène Wilhelm (1866-1951). This diary presents two main challenges. The first is related...
| Published in: | Journal of data mining and digital humanities, Volume Atelier Digit_Hum; Issue Digital humanities in... |
|---|---|
| Main author: | Schlagdenhauffen, Régis |
| Format: | Journal Article |
| Language: | English |
| Published: | INRIA; Nicolas Turenne, 28.08.2020 |
| Subjects: | |
| ISSN: | 2416-5999 |
| Online access: | Get full text |
| Abstract | This article reports on a "user experience" of the Transkribus software in a French-speaking context. It is based on the semi-automated transcription of the diary of the jurist Eugène Wilhelm (1866-1951). The diary presents two main challenges. The first is the time span of the writing, 66 years, which leads to variations in the handwriting; it becomes increasingly "unreadable" over time. The second is the concomitant use of two alphabets: Roman for everyday matters and Greek for private ones. The account is structured around two aspects. First, after presenting the project and the specifics of using the tool, I summarise the main obstacles encountered and the solutions adopted to overcome them. Second, I return to the collaborative transcription experiment carried out with students in the classroom, presenting the difficulties observed and the solutions found. In conclusion, I offer an assessment of the use of this HTR (Handwritten Text Recognition) software in a French-speaking context and in a teaching situation. |
|---|---|
| Author | Schlagdenhauffen, Régis |
| Author_xml | – sequence: 1 givenname: Régis orcidid: 0000-0002-0185-1435 surname: Schlagdenhauffen fullname: Schlagdenhauffen, Régis organization: Institut de Recherche Interdisciplinaire sur les enjeux Sociaux - sciences sociales, politique, santé |
| BackLink | https://hal.science/hal-02520508$$DView record in HAL |
| ContentType | Journal Article |
| Copyright | Distributed under a Creative Commons Attribution 4.0 International License |
| Copyright_xml | – notice: Distributed under a Creative Commons Attribution 4.0 International License |
| DBID | AAYXX CITATION 1XC VOOES DOA |
| DOI | 10.46298/jdmdh.6249 |
| DatabaseName | CrossRef; Hyper Article en Ligne (HAL); Hyper Article en Ligne (HAL) (Open Access); DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 2416-5999 |
| ExternalDocumentID | oai_doaj_org_article_7726de6e3621456f97f7741357094bbf oai:HAL:hal-02520508v3 10_46298_jdmdh_6249 |
| IEDL.DBID | DOA |
| ISSN | 2416-5999 |
| IngestDate | Fri Oct 03 12:44:35 EDT 2025 Tue Oct 14 20:43:32 EDT 2025 Sat Nov 29 04:10:29 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | Digital humanities in... |
| Keywords | Learning process; Human Text Recognition; User Experience; TEI; OCR |
| Language | English |
| License | Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0 |
| LinkModel | DirectLink |
| ORCID | 0000-0002-0185-1435 |
| OpenAccessLink | https://doaj.org/article/7726de6e3621456f97f7741357094bbf |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_7726de6e3621456f97f7741357094bbf hal_primary_oai_HAL_hal_02520508v3 crossref_primary_10_46298_jdmdh_6249 |
| PublicationCentury | 2000 |
| PublicationDate | 2020-08-28 |
| PublicationDateYYYYMMDD | 2020-08-28 |
| PublicationDate_xml | – month: 08 year: 2020 text: 2020-08-28 day: 28 |
| PublicationDecade | 2020 |
| PublicationTitle | Journal of data mining and digital humanities |
| PublicationYear | 2020 |
| Publisher | INRIA Nicolas Turenne |
| Publisher_xml | – name: INRIA – name: Nicolas Turenne |
| Snippet | This article reports on a "user experience" of the Transkribus software in a French-speaking context. It is based on the semi-automated... |
| SourceID | doaj hal crossref |
| SourceType | Open Website Open Access Repository Index Database |
| SubjectTerms | [info.eiah] computer science [cs]/technology for human learning; Computer Science; human text recognition; learning process; ocr; Technology for Human Learning; tei; user experience |
| Title | Optical Recognition Assisted Transcription with Transkribus: The Experiment concerning Eugène Wilhelm's Personal Diary (1885-1951) |
| URI | https://hal.science/hal-02520508 https://doaj.org/article/7726de6e3621456f97f7741357094bbf |
| Volume | Atelier Digit_Hum |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: DOA dateStart: 20140101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: M~E dateStart: 20140101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre |
| linkProvider | Directory of Open Access Journals |