Ten Years of Open Wikipedia Ranking

Bibliographic details
Title: Ten Years of Open Wikipedia Ranking
Authors: Boldi, Paolo; Furia, Flavio; Vigna, Sebastiano
Source: Companion Proceedings of the ACM on Web Conference 2025, pp. 883-887
Publisher information: ACM, 2025.
Publication year: 2025
Subjects: wikipedia, ranking, open data
Description: The Open Wikipedia Ranking is an open dataset published yearly, containing the ranking of Wikipedia pages with respect to centrality measures computed on the whole Wikipedia graph for that year. In this paper, ten years after its start, we report some details, results and anecdotal observations on this dataset. The goal of the Open Wikipedia Ranking is to provide a completely open and reproducible ranking of Wikipedia pages based on indegree, PageRank, harmonic centrality, and page views; the Wikipedia graphs themselves are also made available by the Laboratory of Web Algorithmics. What characterizes the Open Wikipedia Ranking is that the whole graph construction and ranking process is meticulously documented and reproducible. All computations are based on open-source Java software and algorithms from the literature. Thus, the reason for a page's centrality score can be traced back exactly to structural graph properties.
Document type: Article
Conference object
File description: application/pdf
DOI: 10.1145/3701716.3715510
Access URL: https://hdl.handle.net/2434/1166397
https://doi.org/10.1145/3701716.3715510
Rights: CC BY SA
Accession number: edsair.doi.dedup.....1a8fde3750a742fb73ba0a5474fc291b
Database: OpenAIRE