Update on the predictability of tall stature from DNA markers in Europeans

•Current forensic DNA phenotyping focuses on pigmentation traits.•Update on DNA predictability of tall stature in Europeans is presented.•689 SNPs provided AUC of 0.79, while a subset of 412 SNPs achieved 0.76.•New models improved prediction accuracy compared to previous ones. Predicting adult heigh...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Forensic science international : genetics Jg. 42; S. 8 - 13
Hauptverfasser: Liu, Fan, Zhong, Kaiyin, Jing, Xiaoxi, Uitterlinden, André G., Hendriks, A. Emile J., Drop, Stenvert L.S., Kayser, Manfred
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Netherlands Elsevier B.V 01.09.2019
Schlagworte:
ISSN:1872-4973, 1878-0326, 1878-0326
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Current forensic DNA phenotyping focuses on pigmentation traits.•Update on DNA predictability of tall stature in Europeans is presented.•689 SNPs provided AUC of 0.79, while a subset of 412 SNPs achieved 0.76.•New models improved prediction accuracy compared to previous ones. Predicting adult height from DNA has important implications in forensic DNA phenotyping. In 2014, we introduced a prediction model consisting of 180 height-associated SNPs based on data from 10,361 Northwestern Europeans enriched with tall individuals (770 > 1.88 standard deviation), which yielded a mid-ranged accuracy (AUC = 0.75 for binary prediction of tall stature and R2 = 0.12 for quantitative prediction of adult height). Here, we provide an update on DNA-based height predictability considering an enlarged list of subsequently-published height-associated SNPs using data from the same set of 10,361 Europeans. A prediction model based on the full set of 689 SNPs showed an improved accuracy relative to previous models for both tall stature (AUC = 0.79) and quantitative height (R2 = 0.21). A feature selection analysis revealed a subset of 412 most informative SNPs while the corresponding prediction model retained most of the accuracy (AUC = 0.76 and R2 = 0.19) achieved with the full model. Over all, our study empirically exemplifies that the accuracy for predicting human appearance phenotypes with very complex underlying genetic architectures, such as adult height, can be improved by increasing the number of phenotype-associated DNA variants. Our work also demonstrates that a careful sub-selection allows for a considerable reduction of the number of DNA predictors that achieve similar prediction accuracy as provided by the full set. This is forensically relevant due to restrictions in the number of SNPs simultaneously analyzable with forensically suitable DNA technologies in the current days of targeted massively parallel sequencing in forensic genetics.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1872-4973
1878-0326
1878-0326
DOI:10.1016/j.fsigen.2019.05.006