Interreader reliability of LI-RADS version 2014 algorithm and imaging features for diagnosis of hepatocellular carcinoma: A large international multireader study

Kathryn J. Fowler; An Tang; Cynthia Santillan; Mythreyi Bhargavan-Chatfield; Jay Heiken; Reena C. Jha; Jeffrey Weinreb; Hero Hussain; Donald G. Mitchell; Mustafa R. Bashi; Eduardo A.C. Costa; Guilherme M. Cunha; Laura Coombs; Tanya Wolfson; Anthony C. Gamst; Giuseppe Brancatelli; Benjamin Yeh; Claude B. Sirlin

doi:10.1148/radiol.2017170376

Interreader reliability of LI-RADS version 2014 algorithm and imaging features for diagnosis of hepatocellular carcinoma: A large international multireader study

Kathryn J. Fowler, An Tang, Cynthia Santillan, Mythreyi Bhargavan-Chatfield, Jay Heiken, Reena C. Jha, Jeffrey Weinreb, Hero Hussain, Donald G. Mitchell, Mustafa R. Bashi, Eduardo A.C. Costa, Guilherme M. Cunha, Laura Coombs, Tanya Wolfson, Anthony C. Gamst, Giuseppe Brancatelli, Benjamin Yeh, Claude B. Sirlin

Radiology

Research output: Contribution to journal › Article › peer-review

44 Scopus citations

Abstract

Purpose: To determine in a large multicenter multireader setting the interreader reliability of Liver Imaging Reporting and Data System (LI-RADS) version 2014 categories, the major imaging features seen with computed tomography (CT) and magnetic resonance (MR) imaging, and the potential effect of reader demographics on agreement with a preselected nonconsecutive image set. Materials and Methods: Institutional review board approval was obtained, and patient consent was waived for this retrospective study. Ten image sets, comprising 38-40 unique studies (equal number of CT and MR imaging studies, uniformly distributed LI-RADS categories), were randomly allocated to readers. Images were acquired in unenhanced and standard contrast material-enhanced phases, with observation diameter and growth data provided. Readers completed a demographic survey, assigned LI-RADS version 2014 categories, and assessed major features. Intraclass correlation coefficient (ICC) assessed with mixed-model regression analyses was the metric for interreader reliability of assigning categories and major features. Results: A total of 113 readers evaluated 380 image sets. ICC of final LI-RADS category assignment was 0.67 (95% confidence interval [CI]: 0.61, 0.71) for CT and 0.73 (95% CI: 0.68, 0.77) for MR imaging. ICC was 0.87 (95% CI: 0.84, 0.90) for arterial phase hyperenhancement, 0.85 (95% CI: 0.81, 0.88) for washout appearance, and 0.84 (95% CI: 0.80, 0.87) for capsule appearance. ICC was not significantly affected by liver expertise, LI-RADS familiarity, or years of postresidency practice (ICC range, 0.69-0.70; ICC difference, 0.003-0.01 [95% CI: 20.003 to 20.01, 0.004-0.02]. ICC was borderline higher for private practice readers than for academic readers (ICC difference, 0.009; 95% CI: 0.000, 0.021). Conclusion: ICC is good for final LI-RADS categorization and high for major feature characterization, with minimal reader demographic effect. Of note, our results using selected image sets from nonconsecutive examinations are not necessarily comparable with those of prior studies that used consecutive examination series.

Original language	English (US)
Pages (from-to)	173-185
Number of pages	13
Journal	Radiology
Volume	286
Issue number	1
DOIs	https://doi.org/10.1148/radiol.2017170376
State	Published - Jan 2018

ASJC Scopus subject areas

Radiology Nuclear Medicine and imaging

Access to Document

10.1148/radiol.2017170376

Cite this

Fowler, K. J., Tang, A., Santillan, C., Bhargavan-Chatfield, M., Heiken, J., Jha, R. C., Weinreb, J., Hussain, H., Mitchell, D. G., Bashi, M. R., Costa, E. A. C., Cunha, G. M., Coombs, L., Wolfson, T., Gamst, A. C., Brancatelli, G., Yeh, B., & Sirlin, C. B. (2018). Interreader reliability of LI-RADS version 2014 algorithm and imaging features for diagnosis of hepatocellular carcinoma: A large international multireader study. Radiology, 286(1), 173-185. https://doi.org/10.1148/radiol.2017170376

Fowler, KJ, Tang, A, Santillan, C, Bhargavan-Chatfield, M, Heiken, J, Jha, RC, Weinreb, J, Hussain, H, Mitchell, DG, Bashi, MR, Costa, EAC, Cunha, GM, Coombs, L, Wolfson, T, Gamst, AC, Brancatelli, G, Yeh, B & Sirlin, CB 2018, 'Interreader reliability of LI-RADS version 2014 algorithm and imaging features for diagnosis of hepatocellular carcinoma: A large international multireader study', Radiology, vol. 286, no. 1, pp. 173-185. https://doi.org/10.1148/radiol.2017170376

@article{7976ae6ee4864456b479553999976744,

title = "Interreader reliability of LI-RADS version 2014 algorithm and imaging features for diagnosis of hepatocellular carcinoma: A large international multireader study",

abstract = "Purpose: To determine in a large multicenter multireader setting the interreader reliability of Liver Imaging Reporting and Data System (LI-RADS) version 2014 categories, the major imaging features seen with computed tomography (CT) and magnetic resonance (MR) imaging, and the potential effect of reader demographics on agreement with a preselected nonconsecutive image set. Materials and Methods: Institutional review board approval was obtained, and patient consent was waived for this retrospective study. Ten image sets, comprising 38-40 unique studies (equal number of CT and MR imaging studies, uniformly distributed LI-RADS categories), were randomly allocated to readers. Images were acquired in unenhanced and standard contrast material-enhanced phases, with observation diameter and growth data provided. Readers completed a demographic survey, assigned LI-RADS version 2014 categories, and assessed major features. Intraclass correlation coefficient (ICC) assessed with mixed-model regression analyses was the metric for interreader reliability of assigning categories and major features. Results: A total of 113 readers evaluated 380 image sets. ICC of final LI-RADS category assignment was 0.67 (95% confidence interval [CI]: 0.61, 0.71) for CT and 0.73 (95% CI: 0.68, 0.77) for MR imaging. ICC was 0.87 (95% CI: 0.84, 0.90) for arterial phase hyperenhancement, 0.85 (95% CI: 0.81, 0.88) for washout appearance, and 0.84 (95% CI: 0.80, 0.87) for capsule appearance. ICC was not significantly affected by liver expertise, LI-RADS familiarity, or years of postresidency practice (ICC range, 0.69-0.70; ICC difference, 0.003-0.01 [95% CI: 20.003 to 20.01, 0.004-0.02]. ICC was borderline higher for private practice readers than for academic readers (ICC difference, 0.009; 95% CI: 0.000, 0.021). Conclusion: ICC is good for final LI-RADS categorization and high for major feature characterization, with minimal reader demographic effect. Of note, our results using selected image sets from nonconsecutive examinations are not necessarily comparable with those of prior studies that used consecutive examination series.",

author = "Fowler, {Kathryn J.} and An Tang and Cynthia Santillan and Mythreyi Bhargavan-Chatfield and Jay Heiken and Jha, {Reena C.} and Jeffrey Weinreb and Hero Hussain and Mitchell, {Donald G.} and Bashi, {Mustafa R.} and Costa, {Eduardo A.C.} and Cunha, {Guilherme M.} and Laura Coombs and Tanya Wolfson and Gamst, {Anthony C.} and Giuseppe Brancatelli and Benjamin Yeh and Sirlin, {Claude B.}",

note = "Funding Information: Supported by a research scholarship from the Fonds de Recherche du Qu{\'e}bec–Sant{\'e} and Fondation de l{\textquoteright}association des radiologistes du Qu{\'e}bec (FRQS-ARQ #26993).",

year = "2018",

month = jan,

doi = "10.1148/radiol.2017170376",

language = "English (US)",

volume = "286",

pages = "173--185",

journal = "Radiology",

issn = "0033-8419",

publisher = "Radiological Society of North America Inc.",

number = "1",

}

TY - JOUR

T1 - Interreader reliability of LI-RADS version 2014 algorithm and imaging features for diagnosis of hepatocellular carcinoma

T2 - A large international multireader study

AU - Fowler, Kathryn J.

AU - Tang, An

AU - Santillan, Cynthia

AU - Bhargavan-Chatfield, Mythreyi

AU - Heiken, Jay

AU - Jha, Reena C.

AU - Weinreb, Jeffrey

AU - Hussain, Hero

AU - Mitchell, Donald G.

AU - Bashi, Mustafa R.

AU - Costa, Eduardo A.C.

AU - Cunha, Guilherme M.

AU - Coombs, Laura

AU - Wolfson, Tanya

AU - Gamst, Anthony C.

AU - Brancatelli, Giuseppe

AU - Yeh, Benjamin

AU - Sirlin, Claude B.

N1 - Funding Information: Supported by a research scholarship from the Fonds de Recherche du Québec–Santé and Fondation de l’association des radiologistes du Québec (FRQS-ARQ #26993).

PY - 2018/1

Y1 - 2018/1

N2 - Purpose: To determine in a large multicenter multireader setting the interreader reliability of Liver Imaging Reporting and Data System (LI-RADS) version 2014 categories, the major imaging features seen with computed tomography (CT) and magnetic resonance (MR) imaging, and the potential effect of reader demographics on agreement with a preselected nonconsecutive image set. Materials and Methods: Institutional review board approval was obtained, and patient consent was waived for this retrospective study. Ten image sets, comprising 38-40 unique studies (equal number of CT and MR imaging studies, uniformly distributed LI-RADS categories), were randomly allocated to readers. Images were acquired in unenhanced and standard contrast material-enhanced phases, with observation diameter and growth data provided. Readers completed a demographic survey, assigned LI-RADS version 2014 categories, and assessed major features. Intraclass correlation coefficient (ICC) assessed with mixed-model regression analyses was the metric for interreader reliability of assigning categories and major features. Results: A total of 113 readers evaluated 380 image sets. ICC of final LI-RADS category assignment was 0.67 (95% confidence interval [CI]: 0.61, 0.71) for CT and 0.73 (95% CI: 0.68, 0.77) for MR imaging. ICC was 0.87 (95% CI: 0.84, 0.90) for arterial phase hyperenhancement, 0.85 (95% CI: 0.81, 0.88) for washout appearance, and 0.84 (95% CI: 0.80, 0.87) for capsule appearance. ICC was not significantly affected by liver expertise, LI-RADS familiarity, or years of postresidency practice (ICC range, 0.69-0.70; ICC difference, 0.003-0.01 [95% CI: 20.003 to 20.01, 0.004-0.02]. ICC was borderline higher for private practice readers than for academic readers (ICC difference, 0.009; 95% CI: 0.000, 0.021). Conclusion: ICC is good for final LI-RADS categorization and high for major feature characterization, with minimal reader demographic effect. Of note, our results using selected image sets from nonconsecutive examinations are not necessarily comparable with those of prior studies that used consecutive examination series.

AB - Purpose: To determine in a large multicenter multireader setting the interreader reliability of Liver Imaging Reporting and Data System (LI-RADS) version 2014 categories, the major imaging features seen with computed tomography (CT) and magnetic resonance (MR) imaging, and the potential effect of reader demographics on agreement with a preselected nonconsecutive image set. Materials and Methods: Institutional review board approval was obtained, and patient consent was waived for this retrospective study. Ten image sets, comprising 38-40 unique studies (equal number of CT and MR imaging studies, uniformly distributed LI-RADS categories), were randomly allocated to readers. Images were acquired in unenhanced and standard contrast material-enhanced phases, with observation diameter and growth data provided. Readers completed a demographic survey, assigned LI-RADS version 2014 categories, and assessed major features. Intraclass correlation coefficient (ICC) assessed with mixed-model regression analyses was the metric for interreader reliability of assigning categories and major features. Results: A total of 113 readers evaluated 380 image sets. ICC of final LI-RADS category assignment was 0.67 (95% confidence interval [CI]: 0.61, 0.71) for CT and 0.73 (95% CI: 0.68, 0.77) for MR imaging. ICC was 0.87 (95% CI: 0.84, 0.90) for arterial phase hyperenhancement, 0.85 (95% CI: 0.81, 0.88) for washout appearance, and 0.84 (95% CI: 0.80, 0.87) for capsule appearance. ICC was not significantly affected by liver expertise, LI-RADS familiarity, or years of postresidency practice (ICC range, 0.69-0.70; ICC difference, 0.003-0.01 [95% CI: 20.003 to 20.01, 0.004-0.02]. ICC was borderline higher for private practice readers than for academic readers (ICC difference, 0.009; 95% CI: 0.000, 0.021). Conclusion: ICC is good for final LI-RADS categorization and high for major feature characterization, with minimal reader demographic effect. Of note, our results using selected image sets from nonconsecutive examinations are not necessarily comparable with those of prior studies that used consecutive examination series.

UR - http://www.scopus.com/inward/record.url?scp=85038910820&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85038910820&partnerID=8YFLogxK

U2 - 10.1148/radiol.2017170376

DO - 10.1148/radiol.2017170376

M3 - Article

C2 - 29091751

AN - SCOPUS:85038910820

SN - 0033-8419

VL - 286

SP - 173

EP - 185

JO - Radiology

JF - Radiology

IS - 1

ER -

Interreader reliability of LI-RADS version 2014 algorithm and imaging features for diagnosis of hepatocellular carcinoma: A large international multireader study

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this