DATABOOK: a standardised framework for dynamic documentation of algorithm design during Data Science projects

Bibliographic details
Published in: IASSIST Quarterly, Volume 45, Issue 2
Main author: Nesvijevskaia, Anna
Format: Journal Article
Language: English
Published: International Association for Social Science Information Service and Technology, 26.09.2021
ISSN: 0739-1137, 2331-4141
Description
Summary: This paper proposes a standard documentation framework for Data Science projects, called Databook. It is the result of five years of action research on multiple projects across several sectors of activity in France, and of confronting standard theoretical Data Science processes, such as CRISP-DM, with the reality of the field. As a vector for knowledge sharing and capitalisation, the Databook has been identified as one of the main facilitators of Human Data Mediation. Transformed into an operational prototype of simple, minimalist documentation, it has since been tested on about a hundred Data Science projects and has proven its benefits for their internal and external efficiency. It can be turned into a more ambitious standard framework for data patrimony valorisation and data quality governance.
DOI: 10.29173/iq989