Billing code algorithms to identify cases of peripheral artery disease from administrative data

Jin Fan, Adelaide M Arruda-Olson, Cynthia L. Leibson, Carin Smith, Guanghui Liu, Kent R Bailey, Iftikhar Jan Kullo

Research output: Contribution to journalArticle

43 Citations (Scopus)

Abstract

Objective: To construct and validate billing code algorithms for identifying patients with peripheral arterial disease (PAD). Methods: We extracted all encounters and line item details including PAD-related billing codes at Mayo Clinic Rochester, Minnesota, between July 1, 1997 and June 30, 2008; 22 712 patients evaluated in the vascular laboratory were divided into training and validation sets. Multiple logistic regression analysis was used to create an integer code score from the training dataset, and this was tested in the validation set. We applied a model-based code algorithm to patients evaluated in the vascular laboratory and compared this with a simpler algorithm ( presence of at least one of the ICD-9 PAD codes 440.20-440.29). We also applied both algorithms to a community-based sample (n=4420), followed by a manual review. Results: The logistic regression model performed well in both training and validation datasets (c statistic=0.91). In patients evaluated in the vascular laboratory, the modelbased code algorithm provided better negative predictive value. The simpler algorithm was reasonably accurate for identification of PAD status, with lesser sensitivity and greater specificity. In the community-based sample, the sensitivity (38.7% vs 68.0%) of the simpler algorithm was much lower, whereas the specificity (92.0% vs 87.6%) was higher than the model-based algorithm. Conclusions: A model-based billing code algorithm had reasonable accuracy in identifying PAD cases from the community, and in patients referred to the non-invasive vascular laboratory. The simpler algorithm had reasonable accuracy for identification of PAD in patients referred to the vascular laboratory but was significantly less sensitive in a community-based sample.

Original languageEnglish (US)
JournalJournal of the American Medical Informatics Association
Volume20
Issue numberE2
DOIs
StatePublished - 2013

Fingerprint

Peripheral Arterial Disease
Blood Vessels
Logistic Models
International Classification of Diseases
Regression Analysis
Sensitivity and Specificity

ASJC Scopus subject areas

  • Health Informatics

Cite this

Billing code algorithms to identify cases of peripheral artery disease from administrative data. / Fan, Jin; Arruda-Olson, Adelaide M; Leibson, Cynthia L.; Smith, Carin; Liu, Guanghui; Bailey, Kent R; Kullo, Iftikhar Jan.

In: Journal of the American Medical Informatics Association, Vol. 20, No. E2, 2013.

Research output: Contribution to journalArticle

@article{3a083f19b20e43ffb43bd6d4a9dc9095,
title = "Billing code algorithms to identify cases of peripheral artery disease from administrative data",
abstract = "Objective: To construct and validate billing code algorithms for identifying patients with peripheral arterial disease (PAD). Methods: We extracted all encounters and line item details including PAD-related billing codes at Mayo Clinic Rochester, Minnesota, between July 1, 1997 and June 30, 2008; 22 712 patients evaluated in the vascular laboratory were divided into training and validation sets. Multiple logistic regression analysis was used to create an integer code score from the training dataset, and this was tested in the validation set. We applied a model-based code algorithm to patients evaluated in the vascular laboratory and compared this with a simpler algorithm ( presence of at least one of the ICD-9 PAD codes 440.20-440.29). We also applied both algorithms to a community-based sample (n=4420), followed by a manual review. Results: The logistic regression model performed well in both training and validation datasets (c statistic=0.91). In patients evaluated in the vascular laboratory, the modelbased code algorithm provided better negative predictive value. The simpler algorithm was reasonably accurate for identification of PAD status, with lesser sensitivity and greater specificity. In the community-based sample, the sensitivity (38.7{\%} vs 68.0{\%}) of the simpler algorithm was much lower, whereas the specificity (92.0{\%} vs 87.6{\%}) was higher than the model-based algorithm. Conclusions: A model-based billing code algorithm had reasonable accuracy in identifying PAD cases from the community, and in patients referred to the non-invasive vascular laboratory. The simpler algorithm had reasonable accuracy for identification of PAD in patients referred to the vascular laboratory but was significantly less sensitive in a community-based sample.",
author = "Jin Fan and Arruda-Olson, {Adelaide M} and Leibson, {Cynthia L.} and Carin Smith and Guanghui Liu and Bailey, {Kent R} and Kullo, {Iftikhar Jan}",
year = "2013",
doi = "10.1136/amiajnl-2013-001827",
language = "English (US)",
volume = "20",
journal = "Journal of the American Medical Informatics Association : JAMIA",
issn = "1067-5027",
publisher = "Oxford University Press",
number = "E2",

}

TY - JOUR

T1 - Billing code algorithms to identify cases of peripheral artery disease from administrative data

AU - Fan, Jin

AU - Arruda-Olson, Adelaide M

AU - Leibson, Cynthia L.

AU - Smith, Carin

AU - Liu, Guanghui

AU - Bailey, Kent R

AU - Kullo, Iftikhar Jan

PY - 2013

Y1 - 2013

N2 - Objective: To construct and validate billing code algorithms for identifying patients with peripheral arterial disease (PAD). Methods: We extracted all encounters and line item details including PAD-related billing codes at Mayo Clinic Rochester, Minnesota, between July 1, 1997 and June 30, 2008; 22 712 patients evaluated in the vascular laboratory were divided into training and validation sets. Multiple logistic regression analysis was used to create an integer code score from the training dataset, and this was tested in the validation set. We applied a model-based code algorithm to patients evaluated in the vascular laboratory and compared this with a simpler algorithm ( presence of at least one of the ICD-9 PAD codes 440.20-440.29). We also applied both algorithms to a community-based sample (n=4420), followed by a manual review. Results: The logistic regression model performed well in both training and validation datasets (c statistic=0.91). In patients evaluated in the vascular laboratory, the modelbased code algorithm provided better negative predictive value. The simpler algorithm was reasonably accurate for identification of PAD status, with lesser sensitivity and greater specificity. In the community-based sample, the sensitivity (38.7% vs 68.0%) of the simpler algorithm was much lower, whereas the specificity (92.0% vs 87.6%) was higher than the model-based algorithm. Conclusions: A model-based billing code algorithm had reasonable accuracy in identifying PAD cases from the community, and in patients referred to the non-invasive vascular laboratory. The simpler algorithm had reasonable accuracy for identification of PAD in patients referred to the vascular laboratory but was significantly less sensitive in a community-based sample.

AB - Objective: To construct and validate billing code algorithms for identifying patients with peripheral arterial disease (PAD). Methods: We extracted all encounters and line item details including PAD-related billing codes at Mayo Clinic Rochester, Minnesota, between July 1, 1997 and June 30, 2008; 22 712 patients evaluated in the vascular laboratory were divided into training and validation sets. Multiple logistic regression analysis was used to create an integer code score from the training dataset, and this was tested in the validation set. We applied a model-based code algorithm to patients evaluated in the vascular laboratory and compared this with a simpler algorithm ( presence of at least one of the ICD-9 PAD codes 440.20-440.29). We also applied both algorithms to a community-based sample (n=4420), followed by a manual review. Results: The logistic regression model performed well in both training and validation datasets (c statistic=0.91). In patients evaluated in the vascular laboratory, the modelbased code algorithm provided better negative predictive value. The simpler algorithm was reasonably accurate for identification of PAD status, with lesser sensitivity and greater specificity. In the community-based sample, the sensitivity (38.7% vs 68.0%) of the simpler algorithm was much lower, whereas the specificity (92.0% vs 87.6%) was higher than the model-based algorithm. Conclusions: A model-based billing code algorithm had reasonable accuracy in identifying PAD cases from the community, and in patients referred to the non-invasive vascular laboratory. The simpler algorithm had reasonable accuracy for identification of PAD in patients referred to the vascular laboratory but was significantly less sensitive in a community-based sample.

UR - http://www.scopus.com/inward/record.url?scp=84890486487&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84890486487&partnerID=8YFLogxK

U2 - 10.1136/amiajnl-2013-001827

DO - 10.1136/amiajnl-2013-001827

M3 - Article

VL - 20

JO - Journal of the American Medical Informatics Association : JAMIA

JF - Journal of the American Medical Informatics Association : JAMIA

SN - 1067-5027

IS - E2

ER -