Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists

Hala Khasawneh; Anurima Patra; Naveen Rajamohan; Garima Suman; Jason Klug; Shounak Majumder; Suresh T. Chari; Panagiotis Korfiatis; Ajit Harishkumar Goenka

doi:10.1097/RCT.0000000000001374

Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists

Hala Khasawneh, Anurima Patra, Naveen Rajamohan, Garima Suman, Jason Klug, Shounak Majumder, Suresh T. Chari, Panagiotis Korfiatis, Ajit Harishkumar Goenka

Research output: Contribution to journal › Article › peer-review

Abstract

Purpose This study aimed to compare accuracy and efficiency of a convolutional neural network (CNN)-enhanced workflow for pancreas segmentation versus radiologists in the context of interreader reliability. Methods Volumetric pancreas segmentations on a data set of 294 portal venous computed tomographies were performed by 3 radiologists (R1, R2, and R3) and by a CNN. Convolutional neural network segmentations were reviewed and, if needed, corrected ("corrected CNN [c-CNN]"segmentations) by radiologists. Ground truth was obtained from radiologists' manual segmentations using simultaneous truth and performance level estimation algorithm. Interreader reliability and model's accuracy were evaluated with Dice-Sorenson coefficient (DSC) and Jaccard coefficient (JC). Equivalence was determined using a two 1-sided test. Convolutional neural network segmentations below the 25th percentile DSC were reviewed to evaluate segmentation errors. Time for manual segmentation and c-CNN was compared. Results Pancreas volumes from 3 sets of segmentations (manual, CNN, and c-CNN) were noninferior to simultaneous truth and performance level estimation-derived volumes [76.6 cm3 (20.2 cm3), P < 0.05]. Interreader reliability was high (mean [SD] DSC between R2-R1, 0.87 [0.04]; R3-R1, 0.90 [0.05]; R2-R3, 0.87 [0.04]). Convolutional neural network segmentations were highly accurate (DSC, 0.88 [0.05]; JC, 0.79 [0.07]) and required minimal-to-no corrections (c-CNN: DSC, 0.89 [0.04]; JC, 0.81 [0.06]; equivalence, P < 0.05). Undersegmentation (n = 47 [64%]) was common in the 73 CNN segmentations below 25th percentile DSC, but there were no major errors. Total inference time (minutes) for CNN was 1.2 (0.3). Average time (minutes) taken by radiologists for c-CNN (0.6 [0.97]) was substantially lower compared with manual segmentation (3.37 [1.47]; savings of 77.9%-87% [P < 0.0001]). Conclusions Convolutional neural network-enhanced workflow provides high accuracy and efficiency for volumetric pancreas segmentation on computed tomography.

Original language	English (US)
Pages (from-to)	841-847
Number of pages	7
Journal	Journal of computer assisted tomography
Volume	46
Issue number	6
DOIs	https://doi.org/10.1097/RCT.0000000000001374
State	Published - Nov 1 2022

Keywords

artificial intelligence
deep learning
pancreas
pancreatic neoplasms
workflow

ASJC Scopus subject areas

Radiology Nuclear Medicine and imaging

Access to Document

10.1097/RCT.0000000000001374

Fingerprint

Dive into the research topics of 'Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists'. Together they form a unique fingerprint.

Cite this

Khasawneh, H., Patra, A., Rajamohan, N., Suman, G., Klug, J., Majumder, S., Chari, S. T., Korfiatis, P., & Goenka, A. H. (2022). Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists. Journal of computer assisted tomography, 46(6), 841-847. https://doi.org/10.1097/RCT.0000000000001374

Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists. / Khasawneh, Hala; Patra, Anurima; Rajamohan, Naveen et al.
In: Journal of computer assisted tomography, Vol. 46, No. 6, 01.11.2022, p. 841-847.

Research output: Contribution to journal › Article › peer-review

Khasawneh, H, Patra, A, Rajamohan, N, Suman, G, Klug, J, Majumder, S, Chari, ST, Korfiatis, P & Goenka, AH 2022, 'Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists', Journal of computer assisted tomography, vol. 46, no. 6, pp. 841-847. https://doi.org/10.1097/RCT.0000000000001374

Khasawneh H, Patra A, Rajamohan N, Suman G, Klug J, Majumder S et al. Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists. Journal of computer assisted tomography. 2022 Nov 1;46(6):841-847. doi: 10.1097/RCT.0000000000001374

@article{fc1bf6f45c774d72af390230b42f9892,

title = "Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists",

abstract = "Purpose This study aimed to compare accuracy and efficiency of a convolutional neural network (CNN)-enhanced workflow for pancreas segmentation versus radiologists in the context of interreader reliability. Methods Volumetric pancreas segmentations on a data set of 294 portal venous computed tomographies were performed by 3 radiologists (R1, R2, and R3) and by a CNN. Convolutional neural network segmentations were reviewed and, if needed, corrected ({"}corrected CNN [c-CNN]{"}segmentations) by radiologists. Ground truth was obtained from radiologists' manual segmentations using simultaneous truth and performance level estimation algorithm. Interreader reliability and model's accuracy were evaluated with Dice-Sorenson coefficient (DSC) and Jaccard coefficient (JC). Equivalence was determined using a two 1-sided test. Convolutional neural network segmentations below the 25th percentile DSC were reviewed to evaluate segmentation errors. Time for manual segmentation and c-CNN was compared. Results Pancreas volumes from 3 sets of segmentations (manual, CNN, and c-CNN) were noninferior to simultaneous truth and performance level estimation-derived volumes [76.6 cm3 (20.2 cm3), P < 0.05]. Interreader reliability was high (mean [SD] DSC between R2-R1, 0.87 [0.04]; R3-R1, 0.90 [0.05]; R2-R3, 0.87 [0.04]). Convolutional neural network segmentations were highly accurate (DSC, 0.88 [0.05]; JC, 0.79 [0.07]) and required minimal-to-no corrections (c-CNN: DSC, 0.89 [0.04]; JC, 0.81 [0.06]; equivalence, P < 0.05). Undersegmentation (n = 47 [64%]) was common in the 73 CNN segmentations below 25th percentile DSC, but there were no major errors. Total inference time (minutes) for CNN was 1.2 (0.3). Average time (minutes) taken by radiologists for c-CNN (0.6 [0.97]) was substantially lower compared with manual segmentation (3.37 [1.47]; savings of 77.9%-87% [P < 0.0001]). Conclusions Convolutional neural network-enhanced workflow provides high accuracy and efficiency for volumetric pancreas segmentation on computed tomography.",

keywords = "artificial intelligence, deep learning, pancreas, pancreatic neoplasms, workflow",

author = "Hala Khasawneh and Anurima Patra and Naveen Rajamohan and Garima Suman and Jason Klug and Shounak Majumder and Chari, {Suresh T.} and Panagiotis Korfiatis and Goenka, {Ajit Harishkumar}",

year = "2022",

month = nov,

day = "1",

doi = "10.1097/RCT.0000000000001374",

language = "English (US)",

volume = "46",

pages = "841--847",

journal = "Journal of computer assisted tomography",

issn = "0363-8715",

publisher = "Lippincott Williams and Wilkins",

number = "6",

}

TY - JOUR

T1 - Volumetric Pancreas Segmentation on Computed Tomography

T2 - Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists

AU - Khasawneh, Hala

AU - Patra, Anurima

AU - Rajamohan, Naveen

AU - Suman, Garima

AU - Klug, Jason

AU - Majumder, Shounak

AU - Chari, Suresh T.

AU - Korfiatis, Panagiotis

AU - Goenka, Ajit Harishkumar

PY - 2022/11/1

Y1 - 2022/11/1

N2 - Purpose This study aimed to compare accuracy and efficiency of a convolutional neural network (CNN)-enhanced workflow for pancreas segmentation versus radiologists in the context of interreader reliability. Methods Volumetric pancreas segmentations on a data set of 294 portal venous computed tomographies were performed by 3 radiologists (R1, R2, and R3) and by a CNN. Convolutional neural network segmentations were reviewed and, if needed, corrected ("corrected CNN [c-CNN]"segmentations) by radiologists. Ground truth was obtained from radiologists' manual segmentations using simultaneous truth and performance level estimation algorithm. Interreader reliability and model's accuracy were evaluated with Dice-Sorenson coefficient (DSC) and Jaccard coefficient (JC). Equivalence was determined using a two 1-sided test. Convolutional neural network segmentations below the 25th percentile DSC were reviewed to evaluate segmentation errors. Time for manual segmentation and c-CNN was compared. Results Pancreas volumes from 3 sets of segmentations (manual, CNN, and c-CNN) were noninferior to simultaneous truth and performance level estimation-derived volumes [76.6 cm3 (20.2 cm3), P < 0.05]. Interreader reliability was high (mean [SD] DSC between R2-R1, 0.87 [0.04]; R3-R1, 0.90 [0.05]; R2-R3, 0.87 [0.04]). Convolutional neural network segmentations were highly accurate (DSC, 0.88 [0.05]; JC, 0.79 [0.07]) and required minimal-to-no corrections (c-CNN: DSC, 0.89 [0.04]; JC, 0.81 [0.06]; equivalence, P < 0.05). Undersegmentation (n = 47 [64%]) was common in the 73 CNN segmentations below 25th percentile DSC, but there were no major errors. Total inference time (minutes) for CNN was 1.2 (0.3). Average time (minutes) taken by radiologists for c-CNN (0.6 [0.97]) was substantially lower compared with manual segmentation (3.37 [1.47]; savings of 77.9%-87% [P < 0.0001]). Conclusions Convolutional neural network-enhanced workflow provides high accuracy and efficiency for volumetric pancreas segmentation on computed tomography.

AB - Purpose This study aimed to compare accuracy and efficiency of a convolutional neural network (CNN)-enhanced workflow for pancreas segmentation versus radiologists in the context of interreader reliability. Methods Volumetric pancreas segmentations on a data set of 294 portal venous computed tomographies were performed by 3 radiologists (R1, R2, and R3) and by a CNN. Convolutional neural network segmentations were reviewed and, if needed, corrected ("corrected CNN [c-CNN]"segmentations) by radiologists. Ground truth was obtained from radiologists' manual segmentations using simultaneous truth and performance level estimation algorithm. Interreader reliability and model's accuracy were evaluated with Dice-Sorenson coefficient (DSC) and Jaccard coefficient (JC). Equivalence was determined using a two 1-sided test. Convolutional neural network segmentations below the 25th percentile DSC were reviewed to evaluate segmentation errors. Time for manual segmentation and c-CNN was compared. Results Pancreas volumes from 3 sets of segmentations (manual, CNN, and c-CNN) were noninferior to simultaneous truth and performance level estimation-derived volumes [76.6 cm3 (20.2 cm3), P < 0.05]. Interreader reliability was high (mean [SD] DSC between R2-R1, 0.87 [0.04]; R3-R1, 0.90 [0.05]; R2-R3, 0.87 [0.04]). Convolutional neural network segmentations were highly accurate (DSC, 0.88 [0.05]; JC, 0.79 [0.07]) and required minimal-to-no corrections (c-CNN: DSC, 0.89 [0.04]; JC, 0.81 [0.06]; equivalence, P < 0.05). Undersegmentation (n = 47 [64%]) was common in the 73 CNN segmentations below 25th percentile DSC, but there were no major errors. Total inference time (minutes) for CNN was 1.2 (0.3). Average time (minutes) taken by radiologists for c-CNN (0.6 [0.97]) was substantially lower compared with manual segmentation (3.37 [1.47]; savings of 77.9%-87% [P < 0.0001]). Conclusions Convolutional neural network-enhanced workflow provides high accuracy and efficiency for volumetric pancreas segmentation on computed tomography.

KW - artificial intelligence

KW - deep learning

KW - pancreas

KW - pancreatic neoplasms

KW - workflow

UR - http://www.scopus.com/inward/record.url?scp=85142403534&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85142403534&partnerID=8YFLogxK

U2 - 10.1097/RCT.0000000000001374

DO - 10.1097/RCT.0000000000001374

M3 - Article

C2 - 36055122

AN - SCOPUS:85142403534

SN - 0363-8715

VL - 46

SP - 841

EP - 847

JO - Journal of computer assisted tomography

JF - Journal of computer assisted tomography

IS - 6

ER -

Volumetric Pancreas Segmentation on Computed Tomography: Accuracy and Efficiency of a Convolutional Neural Network Versus Manual Segmentation in 3D Slicer in the Context of Interreader Variability of Expert Radiologists

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this