Biomarker Combination Based on the Youden Index With and Without Gold Standard.

Uložené v:
Podrobná bibliografia
Názov: Biomarker Combination Based on the Youden Index With and Without Gold Standard.
Autori: Sun A; Center of Data Science, Peking University, Beijing, China., Li Y; Rheumatology Department, Guang'anmen Hospital, Beijing, China., Zhou XH; Department of Biostatistics and Beijing International Center for Mathematical Research, Peking University, Beijing, China.
Zdroj: Statistics in medicine [Stat Med] 2025 Aug; Vol. 44 (18-19), pp. e70189.
Spôsob vydávania: Journal Article
Jazyk: English
Informácie o časopise: Publisher: Wiley Country of Publication: England NLM ID: 8215016 Publication Model: Print Cited Medium: Internet ISSN: 1097-0258 (Electronic) Linking ISSN: 02776715 NLM ISO Abbreviation: Stat Med Subsets: MEDLINE
Imprint Name(s): Original Publication: Chichester ; New York : Wiley, c1982-
Výrazy zo slovníka MeSH: Biomarkers*/analysis , ROC Curve*, Humans ; Area Under Curve ; Computer Simulation ; Models, Statistical ; Medicine, Chinese Traditional ; Reference Standards
Abstrakt: In clinical practice, multiple biomarkers are often measured on the same subject for disease diagnosis, and combining them can improve diagnostic accuracy. Existing studies typically combine multiple biomarkers by maximizing the area under the ROC curve (AUC), assuming a gold standard exists or that biomarkers follow a multivariate normal distribution. However, practical diagnostic settings require both optimal combination coefficients and an effective cutoff value, and the reference test may be imperfect. In this article, we propose a two-stage method for identifying the optimal linear combination and cutoff value based on the Youden index. First, it maximizes an approximation of the empirical AUC to estimate the optimal linear coefficients for combining multiple biomarkers. Then, it maximizes the empirical Youden index to determine the optimal cutoff point for disease classification. Under the semiparametric single index model and regularity conditions, the estimators for the linear coefficients, cutoff point, and Youden index are consistent. This method is also applicable when the reference standard is imperfect. We demonstrate the performance of our method through simulations and apply it to construct a diagnostic scale for Chinese medicine.
(© 2025 John Wiley & Sons Ltd.)
References: S. E. Schindler and R. J. Bateman, “Combining Blood‐Based Biomarkers to Predict Risk for Alzheimer's Disease Dementia,” Nature Aging 1, no. 1 (2021): 26–28.
M. S. Pepe, T. Cai, and G. Longton, “Combining Predictors for Classification Using the Area Under the Receiver Operating Characteristic Curve,” Biometrics 62, no. 1 (2006): 221–229.
X. H. Zhou, N. A. Obuchowski, and D. K. McClish, Statistical Methods in Diagnostic Medicine, vol. 712 (John Wiley & Sons, 2011).
J. Q. Su and J. S. Liu, “Linear Combinations of Multiple Diagnostic Markers,” Journal of the American Statistical Association 88, no. 424 (1993): 1350–1355.
B. Reiser and D. Faraggi, “Confidence Intervals for the Generalized ROC Criterion,” Biometrics 53 (1997): 644–652.
A. Liu, E. F. Schisterman, and Y. Zhu, “On Linear Combinations of Biomarkers to Improve Diagnostic Accuracy,” Statistics in Medicine 24, no. 1 (2005): 37–47.
C. Liu, A. Liu, and S. Halabi, “A Min–Max Combination of Biomarkers to Improve Diagnostic Accuracy,” Statistics in Medicine 30, no. 16 (2011): 2005–2014.
L. Kang, C. Xiong, P. Crane, and L. Tian, “Linear Combinations of Biomarkers to Improve Diagnostic Accuracy With Three Ordinal Diagnostic Categories,” Statistics in Medicine 32, no. 4 (2013): 631–643.
S. Ma and J. Huang, “Combining Multiple Markers for Classification Using ROC,” Biometrics 63, no. 3 (2007): 751–757.
X. Huang, G. Qin, and Y. Fang, “Optimal Combinations of Diagnostic Tests Based on AUC,” Biometrics 67, no. 2 (2011): 568–576.
F. Gao, C. Xiong, Y. Yan, K. Yu, and Z. Zhang, “Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity With Receiver Operating Characteristic Curves,” Journal of Data Science 6, no. 1 (2008): 105–123.
A. Meisner, M. Carone, M. S. Pepe, and K. F. Kerr, “Combining Biomarkers by Maximizing the True Positive Rate for a Fixed False Positive Rate,” Biometrical Journal 63, no. 6 (2021): 1223–1240.
H. Ma, S. Halabi, and A. Liu, “On the Use of Min‐Max Combination of Biomarkers to Maximize the Partial Area Under the ROC Curve,” Journal of Probability and Statistics 2019, no. 1 (2019): 8953530.
Z. Zhang, Y. Lu, and L. Tian, “On Feature Ensemble Optimizing the Sensitivity and Partial ROC Curve,” Statistica Sinica 29, no. 3 (2019): 1395–1418.
Q. Yan, L. E. Bantis, J. L. Stanford, and Z. Feng, “Combining Multiple Biomarkers Linearly to Maximize the Partial Area Under the ROC Curve,” Statistics in Medicine 37, no. 4 (2018): 627–642.
W. J. Youden, “Index for Rating Diagnostic Tests,” Cancer 3, no. 1 (1950): 32–35.
N. J. Perkins and E. F. Schisterman, “The Inconsistency of “Optimal” Cutpoints Obtained Using Two Criteria Based on the Receiver Operating Characteristic Curve,” American Journal of Epidemiology 163, no. 7 (2006): 670–675.
R. Aznar‐Gimeno, L. M. Esteban, G. Sanz, d R. Hoyo‐Alonso, and R. Savirón‐Cornudella, “Incorporating a New Summary Statistic Into the Min–Max Approach: A Min–Max–Median, Min–Max–IQR Combination of Biomarkers for Maximising the Youden Index,” Mathematics 9, no. 19 (2021): 2497.
J. Yin and L. Tian, “Optimal Linear Combinations of Multiple Diagnostic Biomarkers Based on Youden Index,” Statistics in Medicine 33, no. 8 (2014): 1426–1440.
R. Aznar‐Gimeno, L. M. Esteban, d R. Hoyo‐Alonso, Á. Borque‐Fernando, and G. Sanz, “A Stepwise Algorithm for Linearly Combining Biomarkers Under Youden Index Maximization,” Mathematics 10, no. 8 (2022): 1221.
R. Aznar‐Gimeno, L. M. Esteban, G. Sanz, and d R. Hoyo‐Alonso, “Comparing the min–Max–Median/IQR Approach With the min–Max Approach, Logistic Regression and XGBoost, Maximising the Youden Index,” Symmetry 15, no. 3 (2023): 756.
Y. Lu, N. Dendukuri, I. Schiller, and L. Joseph, “A Bayesian Approach to Simultaneously Adjusting for Verification and Reference Standard Bias in Diagnostic Test Studies,” Statistics in Medicine 29, no. 24 (2010): 2532–2543.
B. Yu, C. Zhou, and S. Bandinelli, “Combining Multiple Continuous Tests for the Diagnosis of Kidney Impairment in the Absence of a Gold Standard,” Statistics in Medicine 30, no. 14 (2011): 1712–1721.
L. García Barrado, E. Coart, T. Burzykowski, and Initiative ADN, “Development of a Diagnostic Test Based on Multiple Continuous Biomarkers With an Imperfect Reference Test,” Statistics in Medicine 35, no. 4 (2016): 595–608.
L. García Barrado, E. Coart, and T. Burzykowski, “Estimation of Diagnostic Accuracy of a Combination of Continuous Biomarkers Allowing for Conditional Dependence Between the Biomarkers and the Imperfect Reference‐Test,” Biometrics 73, no. 2 (2017): 646–655.
J. Neyman and E. S. Pearson, “IX. On the Problem of the Most Efficient Tests of Statistical Hypotheses,” Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character 231, no. 694–706 (1933): 289–337.
E. F. Schisterman, N. J. Perkins, A. Liu, and H. Bondell, “Optimal Cut‐Point and Its Corresponding Youden Index to Discriminate Individuals Using Pooled Blood Samples,” Epidemiology 16, no. 1 (2005): 73–81.
J. Li and J. P. Fine, “ROC Analysis With Multiple Classes and Multiple Tests: Methodology and Its Application in Microarray Studies,” Biostatistics 9, no. 3 (2008): 566–576.
E. F. Schisterman and N. Perkins, “Confidence Intervals for the Youden Index and Corresponding Optimal Cut‐Point,” Communications in Statistics: Simulation and Computation 36, no. 3 (2007): 549–563.
J. Nocedal and S. J. Wright, Numerical Optimization, 2nd ed. (Springer, 2006).
H. Lin, L. Zhou, H. Peng, and X. H. Zhou, “Selection and Combination of Biomarkers Using ROC Method for Disease Classification and Prediction,” Canadian Journal of Statistics 39, no. 2 (2011): 324–343.
A. Vexler, A. Liu, E. F. Schisterman, and C. Wu, “Note on Distribution‐Free Estimation of Maximum Linear Separation of Two Multivariate Distributions,” Journal of Nonparametric Statistics 18, no. 2 (2006): 145–158.
J. Zhu and T. Hastie, “Kernel Logistic Regression and the Import Vector Machine,” Journal of Computational and Graphical Statistics 14, no. 1 (2005): 185–205.
W. Stute and L. X. Zhu, “Nonparametric Checks for Single‐Index Models,” Annals of Statistics 33, no. 3 (2005): 1048–1083.
J. Li, B. C. Tai, and D. J. Nott, “Confidence Interval for the Bootstrap P‐Value and Sample Size Calculation of the Bootstrap Test,” Journal of Nonparametric Statistics 21, no. 5 (2009): 649–661.
A. Agresti and B. A. Coull, “Approximate Is Better Than “Exact” for Interval Estimation of Binomial Proportions,” American Statistician 52, no. 2 (1998): 119–126.
E. B. Wilson, “Probable Inference, the Law of Succession, and Statistical Inference,” Journal of the American Statistical Association 22, no. 158 (1927): 209–212.
G. Shan, “Improved Confidence Intervals for the Youden Index,” PLoS One 10, no. 7 (2015): e0127272.
A. P. Dawid and A. M. Skene, “Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm,” Journal of the Royal Statistical Society: Series C: Applied Statistics 28, no. 1 (1979): 20–28.
S. D. Walter and L. M. Irwig, “Estimation of Test Error Rates, Disease Prevalence and Relative Risk From Misclassified Data: A Review,” Journal of Clinical Epidemiology 41, no. 9 (1988): 923–937.
A. Sun and X. H. Zhou, “Estimation of Diagnostic Test Accuracy Without Gold Standards,” Statistics in Medicine 44, no. 3–4 (2025): e10315.
E. M. Gravallese and G. S. Firestein, “Rheumatoid Arthritis—Common Origins, Divergent Mechanisms,” New England Journal of Medicine 388, no. 6 (2023): 529–542.
J. S. Smolen, R. B. Landewé, S. A. Bergstra, et al., “EULAR Recommendations for the Management of Rheumatoid Arthritis With Synthetic and Biological Disease‐Modifying Antirheumatic Drugs: 2022 Update,” Annals of the Rheumatic Diseases 82, no. 1 (2023): 3–18.
G. Nagy, N. M. Roodenrijs, P. M. Welsing, et al., “EULAR Definition of Difficult‐To‐Treat Rheumatoid Arthritis,” Annals of the Rheumatic Diseases 80, no. 1 (2021): 31–35.
Q. Xing, L. Fu, Z. Yu, and X. Zhou, “Efficacy and Safety of Integrated Traditional Chinese Medicine and Western Medicine on the Treatment of Rheumatoid Arthritis: A Meta‐Analysis,” Evidence‐Based Complementary and Alternative Medicine 2020, no. 1 (2020): 4348709.
G. G. Zhang, B. Singh, W. Lee, B. Handwerger, L. Lao, and B. Berman, “Improvement of Agreement in TCM Diagnosis Among TCM Practitioners for Persons With the Conventional Diagnosis of Rheumatoid Arthritis: Effect of Training,” Journal of Alternative and Complementary Medicine 14, no. 4 (2008): 381–386.
J. Wang, X. Gong, X. Tang, H. Liu, J. Liu, and D. He, “Multi‐Center Cross‐Sectional Survey of Characteristics of Chinese Medicine Syndromes in 1602 Rheumatoid Arthritis Patients,” Journal of Traditional Chinese Medicine 59, no. 11 (2018): 963–967.
S. Li, D. Liu, Z. Chen, et al., “Comparative Efficacy and Safety of Four Classical Prescriptions for Clearing Damp‐Heat Recommended by Clinical Guidelines in Treating Rheumatoid Arthritis: A Network Meta‐Analysis,” Annals of Palliative Medicine 10, no. 7 (2021): 7297328–7298328.
Q. Jiang, H. Wang, X. Gong, and C. Luo, “Guidelines of Diagnosis and Treatment of Rheumatoid Arthritis Disease and Syndrome Combination,” Journal of Traditional Chinese Medicine 59, no. 20 (2018): 1794–1800.
X. Gong, W. Liu, D. Li, et al., “China Rheumatoid Arthritis Registry of Patients With Chinese Medicine (CERTAIN): Rationale, Design, and Baseline Characteristics of the First 11,764 Enrollees,” Phytomedicine 104 (2022): 154236.
X. N. Wang, V. Zhou, Q. Liu, Y. Gao, and X. H. Zhou, “Evaluation of the Accuracy of Diagnostic Scales for a Syndrome in Chinese Medicine in the Absence of a Gold Standard,” Chinese Medicine 11 (2016): 1–7.
Grant Information: 82173623 National Natural Science Foundation of China
Contributed Indexing: Keywords: Chinese medicine scales; Youden index; imperfect reference test; linear combination
Substance Nomenclature: 0 (Biomarkers)
Entry Date(s): Date Created: 20250819 Date Completed: 20250826 Latest Revision: 20250826
Update Code: 20250826
DOI: 10.1002/sim.70189
PMID: 40827373
Databáza: MEDLINE
Popis
Abstrakt:In clinical practice, multiple biomarkers are often measured on the same subject for disease diagnosis, and combining them can improve diagnostic accuracy. Existing studies typically combine multiple biomarkers by maximizing the area under the ROC curve (AUC), assuming a gold standard exists or that biomarkers follow a multivariate normal distribution. However, practical diagnostic settings require both optimal combination coefficients and an effective cutoff value, and the reference test may be imperfect. In this article, we propose a two-stage method for identifying the optimal linear combination and cutoff value based on the Youden index. First, it maximizes an approximation of the empirical AUC to estimate the optimal linear coefficients for combining multiple biomarkers. Then, it maximizes the empirical Youden index to determine the optimal cutoff point for disease classification. Under the semiparametric single index model and regularity conditions, the estimators for the linear coefficients, cutoff point, and Youden index are consistent. This method is also applicable when the reference standard is imperfect. We demonstrate the performance of our method through simulations and apply it to construct a diagnostic scale for Chinese medicine.<br /> (© 2025 John Wiley & Sons Ltd.)
ISSN:1097-0258
DOI:10.1002/sim.70189