Rethink reporting of evaluation results in AI

Aggregate metrics and lack of access to results limit understanding.

Bibliographic Details
Published in: Science (American Association for the Advancement of Science), Vol. 380, no. 6641, p. 136
Main Authors: Burnell, Ryan; Schellaert, Wout; Burden, John; Ullman, Tomer D; Martinez-Plumed, Fernando; Tenenbaum, Joshua B; Rutar, Danaja; Cheke, Lucy G; Sohl-Dickstein, Jascha; Mitchell, Melanie; Kiela, Douwe; Shanahan, Murray; Voorhees, Ellen M; Cohn, Anthony G; Leibo, Joel Z; Hernandez-Orallo, Jose
Format: Journal Article
Language: English
Published: United States 14.04.2023
ISSN: 1095-9203
Description
Summary: Aggregate metrics and lack of access to results limit understanding.
DOI: 10.1126/science.adf6369