Fully specified bootstrap confidence intervals for the difference of two independent binomial proportions based on the median unbiased estimator

Yan Lin, Robert G. Newcombe, Stuart Lipsitz, Rickey E. Carter

Research output: Contribution to journalArticle

13 Citations (Scopus)

Abstract

In studies in which a binary response for each subject is observed, the success probability and functions of this quantity are of interest. The use of confidence intervals has been increasingly encouraged as complementary to, and indeed preferable to, p-values as the primary expression of the impact of sampling uncertainty on the findings. The asymptotic confidence interval, based on a normal approximation, is often considered, but this interval can have poor statistical properties when the sample size is small and/or when the success probability is near 0 or 1. In this paper, an estimate of the risk difference based on median unbiased estimates (MUEs) of the two group probabilities is proposed. A corresponding confidence interval is derived using a fully specified bootstrap sample space. The proposed method is compared with Chen's quasi-exact method, Wald intervals and Agresti and Caffo's method with regard to mean square error and coverage probability. For a variety of settings, the MUE-based estimate of risk difference has mean square error uniformly smaller than maximum likelihood estimate within a certain range of risk difference. The fully specified bootstrap had better coverage probability in the tail area than Chen's quasi-exact method, Wald intervals and Agresti and Caffo's intervals.

Original languageEnglish (US)
Pages (from-to)2876-2890
Number of pages15
JournalStatistics in Medicine
Volume28
Issue number23
DOIs
StatePublished - Oct 15 2009
Externally publishedYes

Fingerprint

Bootstrap Confidence Intervals
Unbiased estimator
Risk Difference
Proportion
Confidence Intervals
Confidence interval
Interval
Exact Method
Coverage Probability
Mean square error
Estimate
Bootstrap
Binary Response
Sample space
Normal Approximation
Likelihood Functions
Error Probability
p-Value
Maximum Likelihood Estimate
Statistical property

Keywords

  • Bootstrap
  • Confidence interval
  • Coverage probability
  • Median unbiased estimate

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability

Cite this

Fully specified bootstrap confidence intervals for the difference of two independent binomial proportions based on the median unbiased estimator. / Lin, Yan; Newcombe, Robert G.; Lipsitz, Stuart; Carter, Rickey E.

In: Statistics in Medicine, Vol. 28, No. 23, 15.10.2009, p. 2876-2890.

Research output: Contribution to journalArticle

@article{2ff0716d8a1047d4b8c8df7af61610b8,
title = "Fully specified bootstrap confidence intervals for the difference of two independent binomial proportions based on the median unbiased estimator",
abstract = "In studies in which a binary response for each subject is observed, the success probability and functions of this quantity are of interest. The use of confidence intervals has been increasingly encouraged as complementary to, and indeed preferable to, p-values as the primary expression of the impact of sampling uncertainty on the findings. The asymptotic confidence interval, based on a normal approximation, is often considered, but this interval can have poor statistical properties when the sample size is small and/or when the success probability is near 0 or 1. In this paper, an estimate of the risk difference based on median unbiased estimates (MUEs) of the two group probabilities is proposed. A corresponding confidence interval is derived using a fully specified bootstrap sample space. The proposed method is compared with Chen's quasi-exact method, Wald intervals and Agresti and Caffo's method with regard to mean square error and coverage probability. For a variety of settings, the MUE-based estimate of risk difference has mean square error uniformly smaller than maximum likelihood estimate within a certain range of risk difference. The fully specified bootstrap had better coverage probability in the tail area than Chen's quasi-exact method, Wald intervals and Agresti and Caffo's intervals.",
keywords = "Bootstrap, Confidence interval, Coverage probability, Median unbiased estimate",
author = "Yan Lin and Newcombe, {Robert G.} and Stuart Lipsitz and Carter, {Rickey E.}",
year = "2009",
month = "10",
day = "15",
doi = "10.1002/sim.3670",
language = "English (US)",
volume = "28",
pages = "2876--2890",
journal = "Statistics in Medicine",
issn = "0277-6715",
publisher = "John Wiley and Sons Ltd",
number = "23",

}

TY - JOUR

T1 - Fully specified bootstrap confidence intervals for the difference of two independent binomial proportions based on the median unbiased estimator

AU - Lin, Yan

AU - Newcombe, Robert G.

AU - Lipsitz, Stuart

AU - Carter, Rickey E.

PY - 2009/10/15

Y1 - 2009/10/15

N2 - In studies in which a binary response for each subject is observed, the success probability and functions of this quantity are of interest. The use of confidence intervals has been increasingly encouraged as complementary to, and indeed preferable to, p-values as the primary expression of the impact of sampling uncertainty on the findings. The asymptotic confidence interval, based on a normal approximation, is often considered, but this interval can have poor statistical properties when the sample size is small and/or when the success probability is near 0 or 1. In this paper, an estimate of the risk difference based on median unbiased estimates (MUEs) of the two group probabilities is proposed. A corresponding confidence interval is derived using a fully specified bootstrap sample space. The proposed method is compared with Chen's quasi-exact method, Wald intervals and Agresti and Caffo's method with regard to mean square error and coverage probability. For a variety of settings, the MUE-based estimate of risk difference has mean square error uniformly smaller than maximum likelihood estimate within a certain range of risk difference. The fully specified bootstrap had better coverage probability in the tail area than Chen's quasi-exact method, Wald intervals and Agresti and Caffo's intervals.

AB - In studies in which a binary response for each subject is observed, the success probability and functions of this quantity are of interest. The use of confidence intervals has been increasingly encouraged as complementary to, and indeed preferable to, p-values as the primary expression of the impact of sampling uncertainty on the findings. The asymptotic confidence interval, based on a normal approximation, is often considered, but this interval can have poor statistical properties when the sample size is small and/or when the success probability is near 0 or 1. In this paper, an estimate of the risk difference based on median unbiased estimates (MUEs) of the two group probabilities is proposed. A corresponding confidence interval is derived using a fully specified bootstrap sample space. The proposed method is compared with Chen's quasi-exact method, Wald intervals and Agresti and Caffo's method with regard to mean square error and coverage probability. For a variety of settings, the MUE-based estimate of risk difference has mean square error uniformly smaller than maximum likelihood estimate within a certain range of risk difference. The fully specified bootstrap had better coverage probability in the tail area than Chen's quasi-exact method, Wald intervals and Agresti and Caffo's intervals.

KW - Bootstrap

KW - Confidence interval

KW - Coverage probability

KW - Median unbiased estimate

UR - http://www.scopus.com/inward/record.url?scp=70449380622&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70449380622&partnerID=8YFLogxK

U2 - 10.1002/sim.3670

DO - 10.1002/sim.3670

M3 - Article

C2 - 19691015

AN - SCOPUS:70449380622

VL - 28

SP - 2876

EP - 2890

JO - Statistics in Medicine

JF - Statistics in Medicine

SN - 0277-6715

IS - 23

ER -