Got power? A systematic review of sample size adequacy in health professions education research

David A. Cook; Rose Hatala

doi:10.1007/s10459-014-9509-5

Got power? A systematic review of sample size adequacy in health professions education research

David A. Cook, Rose Hatala

General Internal Medicine

Research output: Contribution to journal › Article › peer-review

23 Scopus citations

Abstract

Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011, and included all studies evaluating simulation-based education for health professionals in comparison with no intervention or another simulation intervention. Reviewers working in duplicate abstracted information to calculate standardized mean differences (SMD’s). We included 897 original research studies. Among the 627 no-intervention-comparison studies the median sample size was 25. Only two studies (0.3 %) had ≥80 % power to detect a small difference (SMD > 0.2 standard deviations) and 136 (22 %) had power to detect a large difference (SMD > 0.8). 110 no-intervention-comparison studies failed to find a statistically significant difference, but none excluded a small difference and only 47 (43 %) excluded a large difference. Among 297 studies comparing alternate simulation approaches the median sample size was 30. Only one study (0.3 %) had ≥80 % power to detect a small difference and 79 (27 %) had power to detect a large difference. Of the 128 studies that did not detect a statistically significant effect, 4 (3 %) excluded a small difference and 91 (71 %) excluded a large difference. In conclusion, most education research studies are powered only to detect effects of large magnitude. For most studies that do not reach statistical significance, the possibility of large and important differences still exists.

Original language	English (US)
Pages (from-to)	73-83
Number of pages	11
Journal	Advances in Health Sciences Education
Volume	20
Issue number	1
DOIs	https://doi.org/10.1007/s10459-014-9509-5
State	Published - Mar 2014

Keywords

Cohen’s d
Comparative effectiveness research
Data interpretation, statistical
Medical education
Noninferiority trials
Research design

ASJC Scopus subject areas

Education

Access to Document

10.1007/s10459-014-9509-5

Cite this

@article{a746b936595b4399908a8668d03bb5c4,

title = "Got power? A systematic review of sample size adequacy in health professions education research",

abstract = "Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011, and included all studies evaluating simulation-based education for health professionals in comparison with no intervention or another simulation intervention. Reviewers working in duplicate abstracted information to calculate standardized mean differences (SMD{\textquoteright}s). We included 897 original research studies. Among the 627 no-intervention-comparison studies the median sample size was 25. Only two studies (0.3 %) had ≥80 % power to detect a small difference (SMD > 0.2 standard deviations) and 136 (22 %) had power to detect a large difference (SMD > 0.8). 110 no-intervention-comparison studies failed to find a statistically significant difference, but none excluded a small difference and only 47 (43 %) excluded a large difference. Among 297 studies comparing alternate simulation approaches the median sample size was 30. Only one study (0.3 %) had ≥80 % power to detect a small difference and 79 (27 %) had power to detect a large difference. Of the 128 studies that did not detect a statistically significant effect, 4 (3 %) excluded a small difference and 91 (71 %) excluded a large difference. In conclusion, most education research studies are powered only to detect effects of large magnitude. For most studies that do not reach statistical significance, the possibility of large and important differences still exists.",

keywords = "Cohen{\textquoteright}s d, Comparative effectiveness research, Data interpretation, statistical, Medical education, Noninferiority trials, Research design",

author = "Cook, {David A.} and Rose Hatala",

note = "Publisher Copyright: {\textcopyright} 2014, Springer Science+Business Media Dordrecht.",

year = "2014",

month = mar,

doi = "10.1007/s10459-014-9509-5",

language = "English (US)",

volume = "20",

pages = "73--83",

journal = "Advances in Health Sciences Education",

issn = "1382-4996",

publisher = "Springer Netherlands",

number = "1",

}

TY - JOUR

T1 - Got power? A systematic review of sample size adequacy in health professions education research

AU - Cook, David A.

AU - Hatala, Rose

PY - 2014/3

Y1 - 2014/3

N2 - Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011, and included all studies evaluating simulation-based education for health professionals in comparison with no intervention or another simulation intervention. Reviewers working in duplicate abstracted information to calculate standardized mean differences (SMD’s). We included 897 original research studies. Among the 627 no-intervention-comparison studies the median sample size was 25. Only two studies (0.3 %) had ≥80 % power to detect a small difference (SMD > 0.2 standard deviations) and 136 (22 %) had power to detect a large difference (SMD > 0.8). 110 no-intervention-comparison studies failed to find a statistically significant difference, but none excluded a small difference and only 47 (43 %) excluded a large difference. Among 297 studies comparing alternate simulation approaches the median sample size was 30. Only one study (0.3 %) had ≥80 % power to detect a small difference and 79 (27 %) had power to detect a large difference. Of the 128 studies that did not detect a statistically significant effect, 4 (3 %) excluded a small difference and 91 (71 %) excluded a large difference. In conclusion, most education research studies are powered only to detect effects of large magnitude. For most studies that do not reach statistical significance, the possibility of large and important differences still exists.

AB - Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011, and included all studies evaluating simulation-based education for health professionals in comparison with no intervention or another simulation intervention. Reviewers working in duplicate abstracted information to calculate standardized mean differences (SMD’s). We included 897 original research studies. Among the 627 no-intervention-comparison studies the median sample size was 25. Only two studies (0.3 %) had ≥80 % power to detect a small difference (SMD > 0.2 standard deviations) and 136 (22 %) had power to detect a large difference (SMD > 0.8). 110 no-intervention-comparison studies failed to find a statistically significant difference, but none excluded a small difference and only 47 (43 %) excluded a large difference. Among 297 studies comparing alternate simulation approaches the median sample size was 30. Only one study (0.3 %) had ≥80 % power to detect a small difference and 79 (27 %) had power to detect a large difference. Of the 128 studies that did not detect a statistically significant effect, 4 (3 %) excluded a small difference and 91 (71 %) excluded a large difference. In conclusion, most education research studies are powered only to detect effects of large magnitude. For most studies that do not reach statistical significance, the possibility of large and important differences still exists.

KW - Cohen’s d

KW - Comparative effectiveness research

KW - Data interpretation, statistical

KW - Medical education

KW - Noninferiority trials

KW - Research design

UR - http://www.scopus.com/inward/record.url?scp=84939896120&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84939896120&partnerID=8YFLogxK

U2 - 10.1007/s10459-014-9509-5

DO - 10.1007/s10459-014-9509-5

M3 - Article

C2 - 24819405

AN - SCOPUS:84939896120

SN - 1382-4996

VL - 20

SP - 73

EP - 83

JO - Advances in Health Sciences Education

JF - Advances in Health Sciences Education

IS - 1

ER -

Got power? A systematic review of sample size adequacy in health professions education research

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this