The fundamental role of density functions in the binary classification problem

In biomedicine, binary classification problems are involved in diagnostic but also, for instance, in personalized medicine. The objective is to use information for correctly allocating subjects in groups. Frequently, this information implies high-dimensional data. An adequate classification rule is...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of statistical computation and simulation Ročník 92; číslo 13; s. 2846 - 2861
Hlavní autor: Martínez-Camblor, Pablo
Médium: Journal Article
Jazyk:angličtina
Vydáno: Abingdon Taylor & Francis 02.09.2022
Taylor & Francis Ltd
Témata:
ISSN:0094-9655, 1563-5163
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:In biomedicine, binary classification problems are involved in diagnostic but also, for instance, in personalized medicine. The objective is to use information for correctly allocating subjects in groups. Frequently, this information implies high-dimensional data. An adequate classification rule is a trade-off between the sensitivity and the specificity. The ROC curve helps to understand, evaluate and compare the accuracy of classification processes. We propose a procedure for estimating the optimal classification rules based on a penalized estimator of the underlying probability distribution functions. We study its asymptotic properties. Through Monte Carlo simulations, we compare our proposal with a support vector machine-based ROC curve. We illustrate its practical use in a real-world problem. Results suggest that, despite some techniques promise to improve the results provided by traditional methods, in the binary classification problem, the limit is the actual relationship among the density functions.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0094-9655
1563-5163
DOI:10.1080/00949655.2022.2051026