Leveraging Concepts in Open Access Publications

This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital platforms and considers its potential impact on both research and scholarly publishing. The software powering this service, called entity-fishing...

Full description

Saved in:
Bibliographic Details
Published in:Journal of data mining and digital humanities Vol. 2019
Main Authors: Bertino, Andrea, Foppiano, Luca, Romary, Laurent, Mounier, Pierre
Format: Journal Article
Language:English
Published: INRIA 15.06.2020
Nicolas Turenne
Subjects:
ISSN:2416-5999, 2416-5999
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital platforms and considers its potential impact on both research and scholarly publishing. The software powering this service, called entity-fishing, was initially developed by Inria in the context of the EU FP7 project CENDARI and provides automatic entity recognition and disambiguation using the Wikipedia and Wikidata data sets. The application is distributed with an open-source licence, and it has been deployed as a web service in DARIAH's infrastructure hosted by the French HumaNum. In the paper, we focus on the specific issues related to its integration on five OA platforms specialized in the publication of scholarly monographs in the social sciences and humanities (SSH), as part of the work carried out within the EU H2020 project HIRMEOS (High Integration of Research Monographs in the European Open Science infrastructure). In the first section, we give a brief overview of the current status and evolution of OA publications, considering specifically the challenges that OA monographs are encountering. In the second part, we show how the HIRMEOS project aims to face these challenges by optimizing five OA digital platforms for the publication of monographs from the SSH and ensuring their interoperability. In sections three and four we give a comprehensive description of the entity-fishing service, focusing on its concrete applications in real use cases together with some further possible ideas on how to exploit the annotations generated. We show that entity-fishing annotations can improve both research and publishing process. In the last chapter, we briefly present further possible application scenarios that could be made available through infrastructural projects.
AbstractList This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital platforms and considers its potential impact on both research and scholarly publishing. The software powering this service, called entity-fishing, was initially developed by Inria in the context of the EU FP7 project CENDARI and provides automatic entity recognition and disambiguation using the Wikipedia and Wikidata data sets. The application is distributed with an open-source licence, and it has been deployed as a web service in DARIAH's infrastructure hosted by the French HumaNum. In the paper, we focus on the specific issues related to its integration on five OA platforms specialized in the publication of scholarly monographs in the social sciences and humanities (SSH), as part of the work carried out within the EU H2020 project HIRMEOS (High Integration of Research Monographs in the European Open Science infrastructure). In the first section, we give a brief overview of the current status and evolution of OA publications, considering specifically the challenges that OA monographs are encountering. In the second part, we show how the HIRMEOS project aims to face these challenges by optimizing five OA digital platforms for the publication of monographs from the SSH and ensuring their interoperability. In sections three and four we give a comprehensive description of the entity-fishing service, focusing on its concrete applications in real use cases together with some further possible ideas on how to exploit the annotations generated. We show that entity-fishing annotations can improve both research and publishing process. In the last chapter, we briefly present further possible application scenarios that could be made available through infrastructural projects.
Author Bertino, Andrea
Romary, Laurent
Mounier, Pierre
Foppiano, Luca
Author_xml – sequence: 1
  givenname: Andrea
  orcidid: 0000-0002-5080-036X
  surname: Bertino
  fullname: Bertino, Andrea
  organization: Göttingen State and University Library
– sequence: 2
  givenname: Luca
  orcidid: 0000-0002-6114-6164
  surname: Foppiano
  fullname: Foppiano, Luca
  organization: Automatic Language Modelling and ANAlysis & Computational Humanities
– sequence: 3
  givenname: Laurent
  orcidid: 0000-0002-0756-0508
  surname: Romary
  fullname: Romary, Laurent
  organization: Automatic Language Modelling and ANAlysis & Computational Humanities
– sequence: 4
  givenname: Pierre
  orcidid: 0000-0003-0691-6063
  surname: Mounier
  fullname: Mounier, Pierre
  organization: Centre pour l'édition électronique ouverte, École des hautes études en sciences sociales
BackLink https://inria.hal.science/hal-01981922$$DView record in HAL
BookMark eNpVkMtqwzAQRUVJoWmaVX_A21KczMiSLC1D6CNgSBftWsjSOHFw7GClgf598yilXc3lcucszi0btF1LjN0jTITiRk83YRvWEwkar9iQC1SpNMYM_uQbNo5xAwAohZZSDtm0oAP1blW3q2TetZ52-5jUbbLcUZvMvKcYk7fPsqm929ddG-_YdeWaSOOfO2Ifz0_v89e0WL4s5rMi9ZhnmGrghrIKDGokEEqVWuSePFchOCBdohYSc88JwctAmQRFSiDPFQcQPhuxxYUbOrexu77euv7Ldq6256LrV9b1-9o3ZI0LQYtQOlcJUeW-LNEJQ0qGY5ZVdmQ9XFhr1_xDvc4Ke-oAjUbD-eG0fbxsfd_F2FP1-4Bgz5rtWbM9ac6-ASj-cAo
ContentType Journal Article
Copyright Distributed under a Creative Commons Attribution 4.0 International License
Copyright_xml – notice: Distributed under a Creative Commons Attribution 4.0 International License
DBID AAYXX
CITATION
1XC
VOOES
DOA
DOI 10.46298/jdmdh.5081
DatabaseName CrossRef
Hyper Article en Ligne (HAL)
Hyper Article en Ligne (HAL) (Open Access)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList CrossRef


Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 2416-5999
ExternalDocumentID oai_doaj_org_article_9add84dbaaf44f7cbb1a49e65d7cb5f3
oai:HAL:hal-01981922v3
10_46298_jdmdh_5081
GroupedDBID 5VS
AAFWJ
AAYXX
ADBBV
ADQAK
AFPKN
ALMA_UNASSIGNED_HOLDINGS
BCNDV
CITATION
FRP
GROUPED_DOAJ
KQ8
M~E
OK1
1XC
VOOES
ID FETCH-LOGICAL-c1731-8029e3f09181e0466b847cec26dda0e8b184517c2e10c5de3506e6412762004c3
IEDL.DBID DOA
ISSN 2416-5999
IngestDate Fri Oct 03 12:32:28 EDT 2025
Tue Oct 14 21:00:10 EDT 2025
Sat Nov 29 04:10:29 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords Named Entity Recognition and Disambiguation (NERD)
Digital Publishing Platforms
Open Access
Entity-Fishing
Monographs
Language English
License Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1731-8029e3f09181e0466b847cec26dda0e8b184517c2e10c5de3506e6412762004c3
ORCID 0000-0002-0756-0508
0000-0003-0691-6063
0000-0002-5080-036X
0000-0002-6114-6164
OpenAccessLink https://doaj.org/article/9add84dbaaf44f7cbb1a49e65d7cb5f3
ParticipantIDs doaj_primary_oai_doaj_org_article_9add84dbaaf44f7cbb1a49e65d7cb5f3
hal_primary_oai_HAL_hal_01981922v3
crossref_primary_10_46298_jdmdh_5081
PublicationCentury 2000
PublicationDate 2020-06-15
PublicationDateYYYYMMDD 2020-06-15
PublicationDate_xml – month: 06
  year: 2020
  text: 2020-06-15
  day: 15
PublicationDecade 2020
PublicationTitle Journal of data mining and digital humanities
PublicationYear 2020
Publisher INRIA
Nicolas Turenne
Publisher_xml – name: INRIA
– name: Nicolas Turenne
SSID ssj0001548555
Score 2.1093469
Snippet This paper addresses the integration of a Named Entity Recognition and Disambiguation (NERD) service within a group of open access (OA) publishing digital...
SourceID doaj
hal
crossref
SourceType Open Website
Open Access Repository
Index Database
SubjectTerms [ info.info-tt ] computer science [cs]/document and text processing
Computer Science
digital publishing platforms
Document and Text Processing
entity-fishing
named entity recognition and disambiguation (nerd)
open access
Title Leveraging Concepts in Open Access Publications
URI https://inria.hal.science/hal-01981922
https://doaj.org/article/9add84dbaaf44f7cbb1a49e65d7cb5f3
Volume 2019
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 2416-5999
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0001548555
  issn: 2416-5999
  databaseCode: DOA
  dateStart: 20140101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2416-5999
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0001548555
  issn: 2416-5999
  databaseCode: M~E
  dateStart: 20140101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV27TsMwFLVQxcDCG1FeslDX0PgZZywVVYdSMQDqZjm2oxaJgNrSkW_n2klRO7GwRJEVJdHx49xr-Z6DUIcYSySscoktUp5wyrPEUCaS4LRNpXFKRfu211E2HqvJJH_asPoKZ8JqeeAauG4OE1BxVxhTcl5mtiiI4bmXwsG9KKPOJ0Q9G8lUXR8cRE9EXZDHJc1V9829u-kdxCNki4KiUj8Qy3S9kRqJZXCI9puIEPfqPzlCO746RgdrtwXcTL4T1B15GHbRVAj362LDBZ5VOBwJwb3oe4g3N-FO0cvg4bk_TBq7g8SSjBHgCpp7VgKBK-IhbZUFMIf1lkrnTOpVAcmYIJmlnqRWOM9EKr3khMJ6BkPdsjPUqj4qf46wUzAXnWGGG85TC1gA_q6QzBqgn0y1UWeNgP6sVS00ZAMRKB2B0gGoNroP6Pw-EqSoYwN0kG46SP_VQW10C9huvWPYG-nQBuFl0GOjK3bxH1-6RHs0ZMPBWUhcodZy_uWv0a5dLWeL-U0cIHB9_H74AazzwTo
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Leveraging+Concepts+in+Open+Access+Publications&rft.jtitle=Journal+of+data+mining+and+digital+humanities&rft.au=Bertino%2C+Andrea&rft.au=Foppiano%2C+Luca&rft.au=Romary%2C+Laurent&rft.au=Mounier%2C+Pierre&rft.date=2020-06-15&rft.issn=2416-5999&rft.eissn=2416-5999&rft.volume=2019&rft_id=info:doi/10.46298%2Fjdmdh.5081&rft.externalDBID=n%2Fa&rft.externalDocID=10_46298_jdmdh_5081
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2416-5999&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2416-5999&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2416-5999&client=summon