Leveraging Concepts in Open Access Publications
This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital platforms and considers its potential impact on both research and scholarly publishing. The software powering this service, called entity-fishing...
Saved in:
| Published in: | Journal of data mining and digital humanities Vol. 2019 |
|---|---|
| Main Authors: | , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
INRIA
15.06.2020
Nicolas Turenne |
| Subjects: | |
| ISSN: | 2416-5999, 2416-5999 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital platforms and considers its potential impact on both research and scholarly publishing. The software powering this service, called entity-fishing, was initially developed by Inria in the context of the EU FP7 project CENDARI and provides automatic entity recognition and disambiguation using the Wikipedia and Wikidata data sets. The application is distributed with an open-source licence, and it has been deployed as a web service in DARIAH's infrastructure hosted by the French HumaNum. In the paper, we focus on the specific issues related to its integration on five OA platforms specialized in the publication of scholarly monographs in the social sciences and humanities (SSH), as part of the work carried out within the EU H2020 project HIRMEOS (High Integration of Research Monographs in the European Open Science infrastructure). In the first section, we give a brief overview of the current status and evolution of OA publications, considering specifically the challenges that OA monographs are encountering. In the second part, we show how the HIRMEOS project aims to face these challenges by optimizing five OA digital platforms for the publication of monographs from the SSH and ensuring their interoperability. In sections three and four we give a comprehensive description of the entity-fishing service, focusing on its concrete applications in real use cases together with some further possible ideas on how to exploit the annotations generated. We show that entity-fishing annotations can improve both research and publishing process. In the last chapter, we briefly present further possible application scenarios that could be made available through infrastructural projects. |
|---|---|
| AbstractList | This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital platforms and considers its potential impact on both research and scholarly publishing. The software powering this service, called entity-fishing, was initially developed by Inria in the context of the EU FP7 project CENDARI and provides automatic entity recognition and disambiguation using the Wikipedia and Wikidata data sets. The application is distributed with an open-source licence, and it has been deployed as a web service in DARIAH's infrastructure hosted by the French HumaNum. In the paper, we focus on the specific issues related to its integration on five OA platforms specialized in the publication of scholarly monographs in the social sciences and humanities (SSH), as part of the work carried out within the EU H2020 project HIRMEOS (High Integration of Research Monographs in the European Open Science infrastructure). In the first section, we give a brief overview of the current status and evolution of OA publications, considering specifically the challenges that OA monographs are encountering. In the second part, we show how the HIRMEOS project aims to face these challenges by optimizing five OA digital platforms for the publication of monographs from the SSH and ensuring their interoperability. In sections three and four we give a comprehensive description of the entity-fishing service, focusing on its concrete applications in real use cases together with some further possible ideas on how to exploit the annotations generated. We show that entity-fishing annotations can improve both research and publishing process. In the last chapter, we briefly present further possible application scenarios that could be made available through infrastructural projects. |
| Author | Bertino, Andrea Romary, Laurent Mounier, Pierre Foppiano, Luca |
| Author_xml | – sequence: 1 givenname: Andrea orcidid: 0000-0002-5080-036X surname: Bertino fullname: Bertino, Andrea organization: Göttingen State and University Library – sequence: 2 givenname: Luca orcidid: 0000-0002-6114-6164 surname: Foppiano fullname: Foppiano, Luca organization: Automatic Language Modelling and ANAlysis & Computational Humanities – sequence: 3 givenname: Laurent orcidid: 0000-0002-0756-0508 surname: Romary fullname: Romary, Laurent organization: Automatic Language Modelling and ANAlysis & Computational Humanities – sequence: 4 givenname: Pierre orcidid: 0000-0003-0691-6063 surname: Mounier fullname: Mounier, Pierre organization: Centre pour l'édition électronique ouverte, École des hautes études en sciences sociales |
| BackLink | https://inria.hal.science/hal-01981922$$DView record in HAL |
| BookMark | eNpVkMtqwzAQRUVJoWmaVX_A21KczMiSLC1D6CNgSBftWsjSOHFw7GClgf598yilXc3lcucszi0btF1LjN0jTITiRk83YRvWEwkar9iQC1SpNMYM_uQbNo5xAwAohZZSDtm0oAP1blW3q2TetZ52-5jUbbLcUZvMvKcYk7fPsqm929ddG-_YdeWaSOOfO2Ifz0_v89e0WL4s5rMi9ZhnmGrghrIKDGokEEqVWuSePFchOCBdohYSc88JwctAmQRFSiDPFQcQPhuxxYUbOrexu77euv7Ldq6256LrV9b1-9o3ZI0LQYtQOlcJUeW-LNEJQ0qGY5ZVdmQ9XFhr1_xDvc4Ke-oAjUbD-eG0fbxsfd_F2FP1-4Bgz5rtWbM9ac6-ASj-cAo |
| ContentType | Journal Article |
| Copyright | Distributed under a Creative Commons Attribution 4.0 International License |
| Copyright_xml | – notice: Distributed under a Creative Commons Attribution 4.0 International License |
| DBID | AAYXX CITATION 1XC VOOES DOA |
| DOI | 10.46298/jdmdh.5081 |
| DatabaseName | CrossRef Hyper Article en Ligne (HAL) Hyper Article en Ligne (HAL) (Open Access) DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 2416-5999 |
| ExternalDocumentID | oai_doaj_org_article_9add84dbaaf44f7cbb1a49e65d7cb5f3 oai:HAL:hal-01981922v3 10_46298_jdmdh_5081 |
| GroupedDBID | 5VS AAFWJ AAYXX ADBBV ADQAK AFPKN ALMA_UNASSIGNED_HOLDINGS BCNDV CITATION FRP GROUPED_DOAJ KQ8 M~E OK1 1XC VOOES |
| ID | FETCH-LOGICAL-c1731-8029e3f09181e0466b847cec26dda0e8b184517c2e10c5de3506e6412762004c3 |
| IEDL.DBID | DOA |
| ISSN | 2416-5999 |
| IngestDate | Fri Oct 03 12:32:28 EDT 2025 Tue Oct 14 21:00:10 EDT 2025 Sat Nov 29 04:10:29 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Keywords | Named Entity Recognition and Disambiguation (NERD) Digital Publishing Platforms Open Access Entity-Fishing Monographs |
| Language | English |
| License | Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c1731-8029e3f09181e0466b847cec26dda0e8b184517c2e10c5de3506e6412762004c3 |
| ORCID | 0000-0002-0756-0508 0000-0003-0691-6063 0000-0002-5080-036X 0000-0002-6114-6164 |
| OpenAccessLink | https://doaj.org/article/9add84dbaaf44f7cbb1a49e65d7cb5f3 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_9add84dbaaf44f7cbb1a49e65d7cb5f3 hal_primary_oai_HAL_hal_01981922v3 crossref_primary_10_46298_jdmdh_5081 |
| PublicationCentury | 2000 |
| PublicationDate | 2020-06-15 |
| PublicationDateYYYYMMDD | 2020-06-15 |
| PublicationDate_xml | – month: 06 year: 2020 text: 2020-06-15 day: 15 |
| PublicationDecade | 2020 |
| PublicationTitle | Journal of data mining and digital humanities |
| PublicationYear | 2020 |
| Publisher | INRIA Nicolas Turenne |
| Publisher_xml | – name: INRIA – name: Nicolas Turenne |
| SSID | ssj0001548555 |
| Score | 2.1093469 |
| Snippet | This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital... |
| SourceID | doaj hal crossref |
| SourceType | Open Website Open Access Repository Index Database |
| SubjectTerms | [ info.info-tt ] computer science [cs]/document and text processing Computer Science digital publishing platforms Document and Text Processing entity-fishing named entity recognition and disambiguation (nerd) open access |
| Title | Leveraging Concepts in Open Access Publications |
| URI | https://inria.hal.science/hal-01981922 https://doaj.org/article/9add84dbaaf44f7cbb1a49e65d7cb5f3 |
| Volume | 2019 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: DOA dateStart: 20140101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: M~E dateStart: 20140101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV27TsMwFLVQxcDCG1FeslDX0PgZZywVVYdSMQDqZjm2oxaJgNrSkW_n2klRO7GwRJEVJdHx49xr-Z6DUIcYSySscoktUp5wyrPEUCaS4LRNpXFKRfu211E2HqvJJH_asPoKZ8JqeeAauG4OE1BxVxhTcl5mtiiI4bmXwsG9KKPOJ0Q9G8lUXR8cRE9EXZDHJc1V9829u-kdxCNki4KiUj8Qy3S9kRqJZXCI9puIEPfqPzlCO746RgdrtwXcTL4T1B15GHbRVAj362LDBZ5VOBwJwb3oe4g3N-FO0cvg4bk_TBq7g8SSjBHgCpp7VgKBK-IhbZUFMIf1lkrnTOpVAcmYIJmlnqRWOM9EKr3khMJ6BkPdsjPUqj4qf46wUzAXnWGGG85TC1gA_q6QzBqgn0y1UWeNgP6sVS00ZAMRKB2B0gGoNroP6Pw-EqSoYwN0kG46SP_VQW10C9huvWPYG-nQBuFl0GOjK3bxH1-6RHs0ZMPBWUhcodZy_uWv0a5dLWeL-U0cIHB9_H74AazzwTo |
| linkProvider | Directory of Open Access Journals |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Leveraging+Concepts+in+Open+Access+Publications&rft.jtitle=Journal+of+data+mining+and+digital+humanities&rft.au=Bertino%2C+Andrea&rft.au=Foppiano%2C+Luca&rft.au=Romary%2C+Laurent&rft.au=Mounier%2C+Pierre&rft.date=2020-06-15&rft.issn=2416-5999&rft.eissn=2416-5999&rft.volume=2019&rft_id=info:doi/10.46298%2Fjdmdh.5081&rft.externalDBID=n%2Fa&rft.externalDocID=10_46298_jdmdh_5081 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2416-5999&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2416-5999&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2416-5999&client=summon |