Mining the History Sections of Wikipedia Articles on Science and Technology

Priority conflicts and the attribution of contributions to important scientific breakthroughs to individuals and groups play an important role in science, its governance, and evaluation. Debates and dynamics around these processes are analyzed by science studies. Our objective is to transform Wikipe...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE/ACM Joint Conference on Digital Libraries (Online) s. 200 - 204
Hlavní autoři: Kircheis, Wolfgang, Schmidt, Marion, Simons, Arno, Stein, Benno, Potthast, Martin
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.06.2023
Témata:
ISSN:2575-8152
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Priority conflicts and the attribution of contributions to important scientific breakthroughs to individuals and groups play an important role in science, its governance, and evaluation. Debates and dynamics around these processes are analyzed by science studies. Our objective is to transform Wikipedia into an accessible, traceable primary source for analyzing such debates. In this paper, we introduce Webis-WikiSciTech-23, a new corpus consisting of science and technology Wikipedia articles, focusing on the identification of their history sections. We extract such articles from Wikipedia dumps through iterative filtering of the category network. The identification of passages covering the historical development of innovations is achieved by combining heuristics for section heading analysis and classifiers trained on a ground truth of articles with designated history sections.
ISSN:2575-8152
DOI:10.1109/JCDL57899.2023.00037