UNESCO’s Proceedings, 1945–2017: A Bilingual Digital Text Corpus

Gespeichert in:
Bibliographische Detailangaben
Titel: UNESCO’s Proceedings, 1945–2017: A Bilingual Digital Text Corpus
Autoren: Martin, Benjamin G., Mohammadi Norén, Fredrik, 1984, Mähler, Roger, Marklund, Andreas, Martin, Oriane
Quelle: Journal of Open Humanities Data. 11:1-5
Schlagwörter: digital text analysis, international organizations, transnational history, global humanities, text corpus, corpus design
Beschreibung: The record of the meetings of UNESCO’s General Conference offers a valuable resource for research in the global humanities. We present a digital text corpus, including metadata and supplementary material, that makes the complete record of these meetings from 1946 to 2017 in English and/or French accessible in a machine-readable form that is suitable for digital text analysis. The corpus is stored on Zenodo; relevant code is available on GitHub. The corpus offers reuse potential for scholars interested in any of the countless issues that have been discussed and debated in UNESCO’s General Conference over more than seventy years, as well as to Natural Language Processing (NLP) developers interested in the challenges of language recognition and automated segmentation.
Dateibeschreibung: electronic
Zugangs-URL: https://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-75744
https://doi.org/10.5334/johd.314
Datenbank: SwePub
Beschreibung
Abstract:The record of the meetings of UNESCO’s General Conference offers a valuable resource for research in the global humanities. We present a digital text corpus, including metadata and supplementary material, that makes the complete record of these meetings from 1946 to 2017 in English and/or French accessible in a machine-readable form that is suitable for digital text analysis. The corpus is stored on Zenodo; relevant code is available on GitHub. The corpus offers reuse potential for scholars interested in any of the countless issues that have been discussed and debated in UNESCO’s General Conference over more than seventy years, as well as to Natural Language Processing (NLP) developers interested in the challenges of language recognition and automated segmentation.
ISSN:2059481X
DOI:10.5334/johd.314