Evaluation of Source-wise Missing Data Techniques for the Prediction of Parkinson's Disease Using Smartphones

Multi-source datasets often present the challenge of source-wise missing data which can render large portions of the dataset inaccessible. The applicability of traditional missing data techniques on multi-source datasets is poorly understood. We present the first quantitative evaluation of the state...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) s. 3927 - 3930
Hlavní autoři: Prince, John, Andreotti, Fernando, De Vos, Maarten
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.05.2019
Témata:
ISSN:2379-190X
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Multi-source datasets often present the challenge of source-wise missing data which can render large portions of the dataset inaccessible. The applicability of traditional missing data techniques on multi-source datasets is poorly understood. We present the first quantitative evaluation of the state-of-the-art missing data techniques as applied to a freely available dataset of smart-phone recordings from Parkinsonian patients wherein source-wise missing data is simulated. The classification accuracy and imputation error of five missing data techniques, including a multi-modal autoencoder and multi-source ensemble learning, are compared at varying levels of missingness. These results demonstrate the relative applicability of each technique under different conditions and subsequently highlight the challenges of source-wise missing on remotely collected datasets. Specifically, multi-source ensemble learning proves to be a highly successful alternative to the traditional imputation techniques when a majority of observations possess missing data.
ISSN:2379-190X
DOI:10.1109/ICASSP.2019.8682660