Evaluation of Source-wise Missing Data Techniques for the Prediction of Parkinson's Disease Using Smartphones

Multi-source datasets often present the challenge of source-wise missing data which can render large portions of the dataset inaccessible. The applicability of traditional missing data techniques on multi-source datasets is poorly understood. We present the first quantitative evaluation of the state...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) s. 3927 - 3930
Hlavní autori: Prince, John, Andreotti, Fernando, De Vos, Maarten
Médium: Konferenčný príspevok..
Jazyk:English
Vydavateľské údaje: IEEE 01.05.2019
Predmet:
ISSN:2379-190X
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Multi-source datasets often present the challenge of source-wise missing data which can render large portions of the dataset inaccessible. The applicability of traditional missing data techniques on multi-source datasets is poorly understood. We present the first quantitative evaluation of the state-of-the-art missing data techniques as applied to a freely available dataset of smart-phone recordings from Parkinsonian patients wherein source-wise missing data is simulated. The classification accuracy and imputation error of five missing data techniques, including a multi-modal autoencoder and multi-source ensemble learning, are compared at varying levels of missingness. These results demonstrate the relative applicability of each technique under different conditions and subsequently highlight the challenges of source-wise missing on remotely collected datasets. Specifically, multi-source ensemble learning proves to be a highly successful alternative to the traditional imputation techniques when a majority of observations possess missing data.
ISSN:2379-190X
DOI:10.1109/ICASSP.2019.8682660