Much ado about differences: why expert-novice comparisons add little to the validity argument

David A. Cook

doi:10.1007/s10459-014-9551-3

General Internal Medicine

Research output: Contribution to journal › Article › peer-review

42 Scopus citations

Abstract

One approach to validating assessment scores involves evaluating the ability of scores to discriminate among groups who differ in a specific characteristic, such as training status (in education) or disease state (in clinical applications). Such known-groups comparison studies provide validity evidence of “relationships with other variables.” The typical education research study might compare scores between staff physicians and postgraduate trainees with the hypothesis that those with more advanced training (the “experts”) will have higher scores than those less advanced (the “novices”). However, such comparisons are too nonspecific to support clear conclusions, and expert-novice comparisons (and known-groups comparisons in general) thus contribute little to the validity argument. The major flaw is the problem of confounding: there are multiple plausible explanations for any observed between-group differences. The absence of hypothesized differences would suggest a serious flaw in the validity argument, but the confirmation of such differences adds little. As such, accurate known-groups discrimination may be necessary, but will never be sufficient, to support the validity of scores. This article elaborates on this and other problems with the known-groups comparison that limit its utility as a source of validity evidence.

Original language	English (US)
Pages (from-to)	829-834
Number of pages	6
Journal	Advances in Health Sciences Education
Volume	20
Issue number	3
DOIs	https://doi.org/10.1007/s10459-014-9551-3
State	Published - Aug 22 2015

Keywords

Assessment
Data collection
Data interpretation, statistical
Evaluation
Medical education
Reliability
Validation Studies

ASJC Scopus subject areas

Education

Cite this

@article{b97fc05b633a4d3d90fb0debc1ce9b35,
title = "Much ado about differences: why expert-novice comparisons add little to the validity argument",
abstract = "One approach to validating assessment scores involves evaluating the ability of scores to discriminate among groups who differ in a specific characteristic, such as training status (in education) or disease state (in clinical applications). Such known-groups comparison studies provide validity evidence of “relationships with other variables.” The typical education research study might compare scores between staff physicians and postgraduate trainees with the hypothesis that those with more advanced training (the “experts”) will have higher scores than those less advanced (the “novices”). However, such comparisons are too nonspecific to support clear conclusions, and expert-novice comparisons (and known-groups comparisons in general) thus contribute little to the validity argument. The major flaw is the problem of confounding: there are multiple plausible explanations for any observed between-group differences. The absence of hypothesized differences would suggest a serious flaw in the validity argument, but the confirmation of such differences adds little. As such, accurate known-groups discrimination may be necessary, but will never be sufficient, to support the validity of scores. This article elaborates on this and other problems with the known-groups comparison that limit its utility as a source of validity evidence.",
keywords = "Assessment, Data collection, Data interpretation, statistical, Evaluation, Medical education, Reliability, Validation Studies",
author = "Cook, {David A.}",
note = "Publisher Copyright: {\textcopyright} 2014, Springer Science+Business Media Dordrecht.",
year = "2015",
month = aug,
day = "22",
doi = "10.1007/s10459-014-9551-3",
language = "English (US)",
volume = "20",
pages = "829--834",
journal = "Advances in Health Sciences Education",
issn = "1382-4996",
publisher = "Springer Netherlands",
number = "3",
}

TY  - JOUR
T1  - Much ado about differences
T2  - why expert-novice comparisons add little to the validity argument
AU  - Cook, David A.
PY  - 2015/8/22
Y1  - 2015/8/22
N2  - One approach to validating assessment scores involves evaluating the ability of scores to discriminate among groups who differ in a specific characteristic, such as training status (in education) or disease state (in clinical applications). Such known-groups comparison studies provide validity evidence of “relationships with other variables.” The typical education research study might compare scores between staff physicians and postgraduate trainees with the hypothesis that those with more advanced training (the “experts”) will have higher scores than those less advanced (the “novices”). However, such comparisons are too nonspecific to support clear conclusions, and expert-novice comparisons (and known-groups comparisons in general) thus contribute little to the validity argument. The major flaw is the problem of confounding: there are multiple plausible explanations for any observed between-group differences. The absence of hypothesized differences would suggest a serious flaw in the validity argument, but the confirmation of such differences adds little. As such, accurate known-groups discrimination may be necessary, but will never be sufficient, to support the validity of scores. This article elaborates on this and other problems with the known-groups comparison that limit its utility as a source of validity evidence.
AB  - One approach to validating assessment scores involves evaluating the ability of scores to discriminate among groups who differ in a specific characteristic, such as training status (in education) or disease state (in clinical applications). Such known-groups comparison studies provide validity evidence of “relationships with other variables.” The typical education research study might compare scores between staff physicians and postgraduate trainees with the hypothesis that those with more advanced training (the “experts”) will have higher scores than those less advanced (the “novices”). However, such comparisons are too nonspecific to support clear conclusions, and expert-novice comparisons (and known-groups comparisons in general) thus contribute little to the validity argument. The major flaw is the problem of confounding: there are multiple plausible explanations for any observed between-group differences. The absence of hypothesized differences would suggest a serious flaw in the validity argument, but the confirmation of such differences adds little. As such, accurate known-groups discrimination may be necessary, but will never be sufficient, to support the validity of scores. This article elaborates on this and other problems with the known-groups comparison that limit its utility as a source of validity evidence.
KW  - Assessment
KW  - Data collection
KW  - Data interpretation, statistical
KW  - Evaluation
KW  - Medical education
KW  - Reliability
KW  - Validation Studies
UR  - http://www.scopus.com/inward/record.url?scp=84937635245&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84937635245&partnerID=8YFLogxK
U2  - 10.1007/s10459-014-9551-3
DO  - 10.1007/s10459-014-9551-3
M3  - Article
C2  - 25260974
AN  - SCOPUS:84937635245
SN  - 1382-4996
VL  - 20
SP  - 829
EP  - 834
JO  - Advances in Health Sciences Education
JF  - Advances in Health Sciences Education
IS  - 3
ER  -
