When assessment data are words: Validity evidence for qualitative educational assessments

David A. Cook; Ayelet Kuper; Rose Hatala; Shiphra Ginsburg

doi:10.1097/ACM.0000000000001175

When assessment data are words: Validity evidence for qualitative educational assessments

David A. Cook, Ayelet Kuper, Rose Hatala, Shiphra Ginsburg

General Internal Medicine

Research output: Contribution to journal › Article › peer-review

49 Scopus citations

Abstract

Quantitative scores fail to capture all important features of learner performance. This awareness has led to increased use of qualitative data when assessing health professionals. Yet the use of qualitative assessments is hampered by incomplete understanding of their role in forming judgments, and lack of consensus in how to appraise the rigor of judgments therein derived. The authors articulate the role of qualitative assessment as part of a comprehensive program of assessment, and translate the concept of validity to apply to judgments arising from qualitative assessments. They first identify standards for rigor in qualitative research, and then use two contemporary assessment validity frameworks to reorganize these standards for application to qualitative assessment. Standards for rigor in qualitative research include responsiveness, reflexivity, purposive sampling, thick description, triangulation, transparency, and transferability. These standards can be reframed using Messick's five sources of validity evidence (content, response process, internal structure, relationships with other variables, and consequences) and Kane's four inferences in validation (scoring, generalization, extrapolation, and implications). Evidence can be collected and evaluated for each evidence source or inference. The authors illustrate this approach using published research on learning portfolios. The authors advocate a "methods-neutral" approach to assessment, in which a clearly stated purpose determines the nature of and approach to data collection and analysis. Increased use of qualitative assessments will necessitate more rigorous judgments of the defensibility (validity) of inferences and decisions. Evidence should be strategically sought to inform a coherent validity argument.

Original language	English (US)
Pages (from-to)	1359-1369
Number of pages	11
Journal	Academic Medicine
Volume	91
Issue number	10
DOIs	https://doi.org/10.1097/ACM.0000000000001175
State	Published - Oct 1 2016

ASJC Scopus subject areas

Education

Access to Document

10.1097/ACM.0000000000001175

Cite this

@article{a4bca12446b24d7c92bc2ab007217158,

title = "When assessment data are words: Validity evidence for qualitative educational assessments",

abstract = "Quantitative scores fail to capture all important features of learner performance. This awareness has led to increased use of qualitative data when assessing health professionals. Yet the use of qualitative assessments is hampered by incomplete understanding of their role in forming judgments, and lack of consensus in how to appraise the rigor of judgments therein derived. The authors articulate the role of qualitative assessment as part of a comprehensive program of assessment, and translate the concept of validity to apply to judgments arising from qualitative assessments. They first identify standards for rigor in qualitative research, and then use two contemporary assessment validity frameworks to reorganize these standards for application to qualitative assessment. Standards for rigor in qualitative research include responsiveness, reflexivity, purposive sampling, thick description, triangulation, transparency, and transferability. These standards can be reframed using Messick's five sources of validity evidence (content, response process, internal structure, relationships with other variables, and consequences) and Kane's four inferences in validation (scoring, generalization, extrapolation, and implications). Evidence can be collected and evaluated for each evidence source or inference. The authors illustrate this approach using published research on learning portfolios. The authors advocate a {"}methods-neutral{"} approach to assessment, in which a clearly stated purpose determines the nature of and approach to data collection and analysis. Increased use of qualitative assessments will necessitate more rigorous judgments of the defensibility (validity) of inferences and decisions. Evidence should be strategically sought to inform a coherent validity argument.",

author = "Cook, {David A.} and Ayelet Kuper and Rose Hatala and Shiphra Ginsburg",

note = "Publisher Copyright: {\textcopyright} 2016 by the Association of American Medical Colleges.",

year = "2016",

month = oct,

day = "1",

doi = "10.1097/ACM.0000000000001175",

language = "English (US)",

volume = "91",

pages = "1359--1369",

journal = "Academic Medicine",

issn = "1040-2446",

publisher = "Lippincott Williams and Wilkins",

number = "10",

}

TY - JOUR

T1 - When assessment data are words

T2 - Validity evidence for qualitative educational assessments

AU - Cook, David A.

AU - Kuper, Ayelet

AU - Hatala, Rose

AU - Ginsburg, Shiphra

PY - 2016/10/1

Y1 - 2016/10/1

N2 - Quantitative scores fail to capture all important features of learner performance. This awareness has led to increased use of qualitative data when assessing health professionals. Yet the use of qualitative assessments is hampered by incomplete understanding of their role in forming judgments, and lack of consensus in how to appraise the rigor of judgments therein derived. The authors articulate the role of qualitative assessment as part of a comprehensive program of assessment, and translate the concept of validity to apply to judgments arising from qualitative assessments. They first identify standards for rigor in qualitative research, and then use two contemporary assessment validity frameworks to reorganize these standards for application to qualitative assessment. Standards for rigor in qualitative research include responsiveness, reflexivity, purposive sampling, thick description, triangulation, transparency, and transferability. These standards can be reframed using Messick's five sources of validity evidence (content, response process, internal structure, relationships with other variables, and consequences) and Kane's four inferences in validation (scoring, generalization, extrapolation, and implications). Evidence can be collected and evaluated for each evidence source or inference. The authors illustrate this approach using published research on learning portfolios. The authors advocate a "methods-neutral" approach to assessment, in which a clearly stated purpose determines the nature of and approach to data collection and analysis. Increased use of qualitative assessments will necessitate more rigorous judgments of the defensibility (validity) of inferences and decisions. Evidence should be strategically sought to inform a coherent validity argument.

AB - Quantitative scores fail to capture all important features of learner performance. This awareness has led to increased use of qualitative data when assessing health professionals. Yet the use of qualitative assessments is hampered by incomplete understanding of their role in forming judgments, and lack of consensus in how to appraise the rigor of judgments therein derived. The authors articulate the role of qualitative assessment as part of a comprehensive program of assessment, and translate the concept of validity to apply to judgments arising from qualitative assessments. They first identify standards for rigor in qualitative research, and then use two contemporary assessment validity frameworks to reorganize these standards for application to qualitative assessment. Standards for rigor in qualitative research include responsiveness, reflexivity, purposive sampling, thick description, triangulation, transparency, and transferability. These standards can be reframed using Messick's five sources of validity evidence (content, response process, internal structure, relationships with other variables, and consequences) and Kane's four inferences in validation (scoring, generalization, extrapolation, and implications). Evidence can be collected and evaluated for each evidence source or inference. The authors illustrate this approach using published research on learning portfolios. The authors advocate a "methods-neutral" approach to assessment, in which a clearly stated purpose determines the nature of and approach to data collection and analysis. Increased use of qualitative assessments will necessitate more rigorous judgments of the defensibility (validity) of inferences and decisions. Evidence should be strategically sought to inform a coherent validity argument.

UR - http://www.scopus.com/inward/record.url?scp=84962338979&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84962338979&partnerID=8YFLogxK

U2 - 10.1097/ACM.0000000000001175

DO - 10.1097/ACM.0000000000001175

M3 - Article

AN - SCOPUS:84962338979

SN - 1040-2446

VL - 91

SP - 1359

EP - 1369

JO - Academic Medicine

JF - Academic Medicine

IS - 10

ER -

When assessment data are words: Validity evidence for qualitative educational assessments

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this