Cadec: A corpus of adverse drug event annotations

[Display omitted] •Introduction of CADEC an annotated corpus of consumer reviews in pharmacovigilance.•A review and comparison of available relevant resources.•Challenges and lessons from the process of creating such resources. CSIRO Adverse Drug Event Corpus (Cadec) is a new rich annotated corpus o...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of biomedical informatics Ročník 55; s. 73 - 81
Hlavní autoři: Karimi, Sarvnaz, Metke-Jimenez, Alejandro, Kemp, Madonna, Wang, Chen
Médium: Journal Article
Jazyk:angličtina
Vydáno: United States Elsevier Inc 01.06.2015
Témata:
ISSN:1532-0464, 1532-0480
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:[Display omitted] •Introduction of CADEC an annotated corpus of consumer reviews in pharmacovigilance.•A review and comparison of available relevant resources.•Challenges and lessons from the process of creating such resources. CSIRO Adverse Drug Event Corpus (Cadec) is a new rich annotated corpus of medical forum posts on patient-reported Adverse Drug Events (ADEs). The corpus is sourced from posts on social media, and contains text that is largely written in colloquial language and often deviates from formal English grammar and punctuation rules. Annotations contain mentions of concepts such as drugs, adverse effects, symptoms, and diseases linked to their corresponding concepts in controlled vocabularies, i.e., SNOMED Clinical Terms and MedDRA. The quality of the annotations is ensured by annotation guidelines, multi-stage annotations, measuring inter-annotator agreement, and final review of the annotations by a clinical terminologist. This corpus is useful for studies in the area of information extraction, or more generally text mining, from social media to detect possible adverse drug reactions from direct patient reports. The corpus is publicly available at https://data.csiro.au.1The data can be used for research purposes only, under the CSIRO data licence.1
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2015.03.010