TY - JOUR
T1 - Statistical Considerations in the Evaluation of Continuous Biomarkers
AU - Polley, Mei Yin C.
AU - Dignam, James J.
N1 - Funding Information:
Financial Disclosure: This work was supported by the National Cancer Institute of the National Institutes of Health under awards U10 CA180822 (Statistics and Data Management Center, NRG Oncology) and P50 CA116201 (Mayo Clinic Breast Cancer Specialized Program of Research Excellence). The authors of this article have indicated no other relevant relationships that could be perceived as a real or apparent conflict of interest.
Publisher Copyright:
© 2021 by the Society of Nuclear Medicine and Molecular Imaging.
PY - 2021/5/10
Y1 - 2021/5/10
N2 - Discovery of biomarkers has been steadily increasing over the past decade. Although a plethora of biomarkers has been reported in the biomedical literature, few have been sufficiently validated for broader clinical applications. One particular challenge that may have hindered the adoption of biomarkers into practice is the lack of reproducible biomarker cut points. In this article, we attempt to identify some common statistical issues related to biomarker cut point identification and provide guidance on proper evaluation, interpretation, and validation of such cut points. First, we illustrate how discretization of a continuous biomarker using sample percentiles results in significant information loss and should be avoided. Second, we review the popular "minimal-P-value"approach for cut point identification and show that this method results in highly unstable P values and unduly increases the chance of significant findings when the biomarker is not associated with outcome. Third, we critically review a common analysis strategy by which the selected biomarker cut point is used to categorize patients into different risk categories and then the difference in survival curves among these risk groups in the same dataset is claimed as the evidence supporting the biomarker's prognostic strength. We show that this method yields an exaggerated P value and overestimates the prognostic impact of the biomarker. We illustrate that the degree of the optimistic bias increases with the number of variables being considered in a risk model. Finally, we discuss methods to appropriately ascertain the additional prognostic contribution of the new biomarker in disease settings where standard prognostic factors already exist. Throughout the article, we use real examples in oncology to highlight relevant methodologic issues, and when appropriate, we use simulations to illustrate more abstract statistical concepts.
AB - Discovery of biomarkers has been steadily increasing over the past decade. Although a plethora of biomarkers has been reported in the biomedical literature, few have been sufficiently validated for broader clinical applications. One particular challenge that may have hindered the adoption of biomarkers into practice is the lack of reproducible biomarker cut points. In this article, we attempt to identify some common statistical issues related to biomarker cut point identification and provide guidance on proper evaluation, interpretation, and validation of such cut points. First, we illustrate how discretization of a continuous biomarker using sample percentiles results in significant information loss and should be avoided. Second, we review the popular "minimal-P-value"approach for cut point identification and show that this method results in highly unstable P values and unduly increases the chance of significant findings when the biomarker is not associated with outcome. Third, we critically review a common analysis strategy by which the selected biomarker cut point is used to categorize patients into different risk categories and then the difference in survival curves among these risk groups in the same dataset is claimed as the evidence supporting the biomarker's prognostic strength. We show that this method yields an exaggerated P value and overestimates the prognostic impact of the biomarker. We illustrate that the degree of the optimistic bias increases with the number of variables being considered in a risk model. Finally, we discuss methods to appropriately ascertain the additional prognostic contribution of the new biomarker in disease settings where standard prognostic factors already exist. Throughout the article, we use real examples in oncology to highlight relevant methodologic issues, and when appropriate, we use simulations to illustrate more abstract statistical concepts.
KW - area under the ROC curve
KW - biomarker cut point
KW - biomarker discretization
KW - prognostic biomarker
KW - resubstitution statistics
KW - statistics
UR - http://www.scopus.com/inward/record.url?scp=85106540736&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85106540736&partnerID=8YFLogxK
U2 - 10.2967/jnumed.120.251520
DO - 10.2967/jnumed.120.251520
M3 - Article
C2 - 33579807
AN - SCOPUS:85106540736
SN - 0161-5505
VL - 62
SP - 605
EP - 611
JO - Journal of Nuclear Medicine
JF - Journal of Nuclear Medicine
IS - 5
ER -