Calibration of quality-adjusted life years for oncology clinical trials

Jeff A. Sloan; Daniel J. Sargent; Paul J. Novotny; Paul A. Decker; Randolph S. Marks; Heidi Nelson

doi:10.1016/j.jpainsymman.2013.07.016

Calibration of quality-adjusted life years for oncology clinical trials

Jeff A. Sloan, Daniel J. Sargent, Paul J. Novotny, Paul A. Decker, Randolph S. Marks, Heidi Nelson

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Context Quality-adjusted life year (QALY) estimation is a well-known but little used technique to compare survival adjusted for complications. Lack of calibration and interpretation guidance hinders implementation of QALY analyses. Objectives We conducted simulation studies to assess the impact of differences in survival, toxicity rates, and utility values on QALY results. Methods Survival comparisons used both log-rank and Wilcoxon testing. We examined power considerations for a North Central Cancer Treatment Group Phase III lung cancer clinical trial (89-20-52). Results Sample sizes of 100 events per treatment have low power to generate a statistically significant difference in QALYs unless the toxicity rate is 44% higher in one arm. For sample sizes of 200 per arm and equal survival times, toxicity needs to be at least 38% more in one arm for the result to be statistically significant, using a utility of 0.3 for days with toxicity. Sample sizes of 300 (500)/arm provide 80% power if there is a 31% (25%) toxicity difference. If the overall survival hazard ratio between the two treatment arms is 1.25, then samples of at least 150 patients and 13% increased toxicity are necessary to have 80% power to detect QALY differences. In study 89-20-52, there was only 56% power to determine the statistical significance of the observed QALY differences, clarifying the enigmatic conclusion of no statistically significant difference in QALY despite an observed 14.5% increase in toxicity between treatments. Conclusion This calibration allows researchers to interpret the clinical significance of QALY analyses and facilitates QALY inclusion in clinical trials through improved study design.

Original language	English (US)
Pages (from-to)	1091-1099.e3
Journal	Journal of pain and symptom management
Volume	47
Issue number	6
DOIs	https://doi.org/10.1016/j.jpainsymman.2013.07.016
State	Published - Jun 2014

Keywords

Q-TWiST
QALY
QOL
quality of life
quality-adjusted life year
simulation

ASJC Scopus subject areas

General Nursing
Clinical Neurology
Anesthesiology and Pain Medicine

Access to Document

10.1016/j.jpainsymman.2013.07.016

Cite this

@article{089adba344014b9d9e8405d360ad5f35,

title = "Calibration of quality-adjusted life years for oncology clinical trials",

abstract = "Context Quality-adjusted life year (QALY) estimation is a well-known but little used technique to compare survival adjusted for complications. Lack of calibration and interpretation guidance hinders implementation of QALY analyses. Objectives We conducted simulation studies to assess the impact of differences in survival, toxicity rates, and utility values on QALY results. Methods Survival comparisons used both log-rank and Wilcoxon testing. We examined power considerations for a North Central Cancer Treatment Group Phase III lung cancer clinical trial (89-20-52). Results Sample sizes of 100 events per treatment have low power to generate a statistically significant difference in QALYs unless the toxicity rate is 44% higher in one arm. For sample sizes of 200 per arm and equal survival times, toxicity needs to be at least 38% more in one arm for the result to be statistically significant, using a utility of 0.3 for days with toxicity. Sample sizes of 300 (500)/arm provide 80% power if there is a 31% (25%) toxicity difference. If the overall survival hazard ratio between the two treatment arms is 1.25, then samples of at least 150 patients and 13% increased toxicity are necessary to have 80% power to detect QALY differences. In study 89-20-52, there was only 56% power to determine the statistical significance of the observed QALY differences, clarifying the enigmatic conclusion of no statistically significant difference in QALY despite an observed 14.5% increase in toxicity between treatments. Conclusion This calibration allows researchers to interpret the clinical significance of QALY analyses and facilitates QALY inclusion in clinical trials through improved study design.",

keywords = "Q-TWiST, QALY, QOL, quality of life, quality-adjusted life year, simulation",

author = "Sloan, {Jeff A.} and Sargent, {Daniel J.} and Novotny, {Paul J.} and Decker, {Paul A.} and Marks, {Randolph S.} and Heidi Nelson",

note = "Funding Information: This study was supported in part by Public Health Service grants CA-25224 and 5U10CA 149950-02 . The authors report no conflicts of interest in this work.",

year = "2014",

month = jun,

doi = "10.1016/j.jpainsymman.2013.07.016",

language = "English (US)",

volume = "47",

pages = "1091--1099.e3",

journal = "Journal of pain and symptom management",

issn = "0885-3924",

publisher = "Elsevier Inc.",

number = "6",

}

TY - JOUR

T1 - Calibration of quality-adjusted life years for oncology clinical trials

AU - Sloan, Jeff A.

AU - Sargent, Daniel J.

AU - Novotny, Paul J.

AU - Decker, Paul A.

AU - Marks, Randolph S.

AU - Nelson, Heidi

N1 - Funding Information: This study was supported in part by Public Health Service grants CA-25224 and 5U10CA 149950-02 . The authors report no conflicts of interest in this work.

PY - 2014/6

Y1 - 2014/6

N2 - Context Quality-adjusted life year (QALY) estimation is a well-known but little used technique to compare survival adjusted for complications. Lack of calibration and interpretation guidance hinders implementation of QALY analyses. Objectives We conducted simulation studies to assess the impact of differences in survival, toxicity rates, and utility values on QALY results. Methods Survival comparisons used both log-rank and Wilcoxon testing. We examined power considerations for a North Central Cancer Treatment Group Phase III lung cancer clinical trial (89-20-52). Results Sample sizes of 100 events per treatment have low power to generate a statistically significant difference in QALYs unless the toxicity rate is 44% higher in one arm. For sample sizes of 200 per arm and equal survival times, toxicity needs to be at least 38% more in one arm for the result to be statistically significant, using a utility of 0.3 for days with toxicity. Sample sizes of 300 (500)/arm provide 80% power if there is a 31% (25%) toxicity difference. If the overall survival hazard ratio between the two treatment arms is 1.25, then samples of at least 150 patients and 13% increased toxicity are necessary to have 80% power to detect QALY differences. In study 89-20-52, there was only 56% power to determine the statistical significance of the observed QALY differences, clarifying the enigmatic conclusion of no statistically significant difference in QALY despite an observed 14.5% increase in toxicity between treatments. Conclusion This calibration allows researchers to interpret the clinical significance of QALY analyses and facilitates QALY inclusion in clinical trials through improved study design.

AB - Context Quality-adjusted life year (QALY) estimation is a well-known but little used technique to compare survival adjusted for complications. Lack of calibration and interpretation guidance hinders implementation of QALY analyses. Objectives We conducted simulation studies to assess the impact of differences in survival, toxicity rates, and utility values on QALY results. Methods Survival comparisons used both log-rank and Wilcoxon testing. We examined power considerations for a North Central Cancer Treatment Group Phase III lung cancer clinical trial (89-20-52). Results Sample sizes of 100 events per treatment have low power to generate a statistically significant difference in QALYs unless the toxicity rate is 44% higher in one arm. For sample sizes of 200 per arm and equal survival times, toxicity needs to be at least 38% more in one arm for the result to be statistically significant, using a utility of 0.3 for days with toxicity. Sample sizes of 300 (500)/arm provide 80% power if there is a 31% (25%) toxicity difference. If the overall survival hazard ratio between the two treatment arms is 1.25, then samples of at least 150 patients and 13% increased toxicity are necessary to have 80% power to detect QALY differences. In study 89-20-52, there was only 56% power to determine the statistical significance of the observed QALY differences, clarifying the enigmatic conclusion of no statistically significant difference in QALY despite an observed 14.5% increase in toxicity between treatments. Conclusion This calibration allows researchers to interpret the clinical significance of QALY analyses and facilitates QALY inclusion in clinical trials through improved study design.

KW - Q-TWiST

KW - QALY

KW - QOL

KW - quality of life

KW - quality-adjusted life year

KW - simulation

UR - http://www.scopus.com/inward/record.url?scp=84902463199&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84902463199&partnerID=8YFLogxK

U2 - 10.1016/j.jpainsymman.2013.07.016

DO - 10.1016/j.jpainsymman.2013.07.016

M3 - Article

C2 - 24246787

AN - SCOPUS:84902463199

SN - 0885-3924

VL - 47

SP - 1091-1099.e3

JO - Journal of pain and symptom management

JF - Journal of pain and symptom management

IS - 6

ER -

Calibration of quality-adjusted life years for oncology clinical trials

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this