The penumbra of open source: projects outside of centralized platforms are longer maintained, more academic and more collaborative.

Saved in:
Bibliographic Details
Title: The penumbra of open source: projects outside of centralized platforms are longer maintained, more academic and more collaborative.
Authors: Trujillo, Milo Z., Hébert-Dufresne, Laurent, Bagrow, James
Source: EPJ Data Science; 7/5/2022, Vol. 11 Issue 1, p1-19, 19p
Subject Terms: ONLINE social networks, CONVENIENCE sampling (Statistics), SOURCE code
Abstract: GitHub has become the central online platform for much of open source, hosting most open source code repositories. With this popularity, the public digital traces of GitHub are now a valuable means to study teamwork and collaboration. In many ways, however, GitHub is a convenience sample, and may not be representative of open source development off the platform. Here we develop a novel, extensive sample of public open source project repositories outside of centralized platforms. We characterized these projects along a number of dimensions, and compare to a time-matched sample of corresponding GitHub projects. Our sample projects tend to have more collaborators, are maintained for longer periods, and tend to be more focused on academic and scientific problems. [ABSTRACT FROM AUTHOR]
Copyright of EPJ Data Science is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
Description
Abstract:GitHub has become the central online platform for much of open source, hosting most open source code repositories. With this popularity, the public digital traces of GitHub are now a valuable means to study teamwork and collaboration. In many ways, however, GitHub is a convenience sample, and may not be representative of open source development off the platform. Here we develop a novel, extensive sample of public open source project repositories outside of centralized platforms. We characterized these projects along a number of dimensions, and compare to a time-matched sample of corresponding GitHub projects. Our sample projects tend to have more collaborators, are maintained for longer periods, and tend to be more focused on academic and scientific problems. [ABSTRACT FROM AUTHOR]
ISSN:21931127
DOI:10.1140/epjds/s13688-022-00345-7