DATABOOK: a standardised framework for dynamic documentation of algorithm design during Data Science projects

Bibliographic details
Published in: IASSIST Quarterly, Volume 45, Issue 2
Main author: Nesvijevskaia, Anna
Format: Journal Article
Language: English
Published: International Association for Social Science Information Service and Technology, 26.09.2021
ISSN: 0739-1137, 2331-4141
Description
Summary: This paper proposes a standard documentation framework for Data Science projects, called Databook. It is the result of five years of action research on multiple projects in several sectors of activity in France, and of a confrontation of standard theoretical Data Science processes, such as CRISP-DM, with the reality of the field. As a vector for knowledge sharing and capitalisation, the Databook has been identified as one of the main facilitators of Human Data Mediation. Transformed into an operational prototype of simple and minimalist documentation, it has since been tested on about a hundred Data Science projects, has proven its benefits for the internal and external efficiency of those projects, and can be turned into a more ambitious standard framework for data patrimony valorisation and data quality governance.
DOI: 10.29173/iq989