Performance of an electronic health record-based phenotype algorithm to identify community associated methicillin-resistant Staphylococcus aureus cases and controls for genetic association studies

Kathryn L. Jackson, Michael Mbagwu, Jennifer A. Pacheco, Abigail S. Baldridge, Daniel J. Viox, James G. Linneman, Sanjay K. Shukla, Peggy L. Peissig, Kenneth M. Borthwick, David A. Carrell, Suzette J. Bielinski, Jacqueline C. Kirby, Joshua C. Denny, Frank D. Mentch, Lyam M. Vazquez, Laura J. Rasmussen-Torvik, Abel N. Kho

Research output: Contribution to journalArticle

6 Scopus citations

Abstract

Background: Community associated methicillin-resistant Staphylococcus aureus (CA-MRSA) is one of the most common causes of skin and soft tissue infections in the United States, and a variety of genetic host factors are suspected to be risk factors for recurrent infection. Based on the CDC definition, we have developed and validated an electronic health record (EHR) based CA-MRSA phenotype algorithm utilizing both structured and unstructured data. Methods: The algorithm was validated at three eMERGE consortium sites, and positive predictive value, negative predictive value and sensitivity, were calculated. The algorithm was then run and data collected across seven total sites. The resulting data was used in GWAS analysis. Results: Across seven sites, the CA-MRSA phenotype algorithm identified a total of 349 cases and 7761 controls among the genotyped European and African American biobank populations. PPV ranged from 68 to 100% for cases and 96 to 100% for controls; sensitivity ranged from 94 to 100% for cases and 75 to 100% for controls. Frequency of cases in the populations varied widely by site. There were no plausible GWAS-significant (p<5 E -8) findings. Conclusions: Differences in EHR data representation and screening patterns across sites may have affected identification of cases and controls and accounted for varying frequencies across sites. Future work identifying these patterns is necessary.

Original languageEnglish (US)
Article number684
JournalBMC Infectious Diseases
Volume16
Issue number1
DOIs
StatePublished - Nov 17 2016

Keywords

  • Ca-MRSA Phenotype
  • Ca_MRSA
  • Electronic Health Record
  • GWAS
  • Phenotyping

ASJC Scopus subject areas

  • Infectious Diseases

Fingerprint Dive into the research topics of 'Performance of an electronic health record-based phenotype algorithm to identify community associated methicillin-resistant Staphylococcus aureus cases and controls for genetic association studies'. Together they form a unique fingerprint.

  • Cite this

    Jackson, K. L., Mbagwu, M., Pacheco, J. A., Baldridge, A. S., Viox, D. J., Linneman, J. G., Shukla, S. K., Peissig, P. L., Borthwick, K. M., Carrell, D. A., Bielinski, S. J., Kirby, J. C., Denny, J. C., Mentch, F. D., Vazquez, L. M., Rasmussen-Torvik, L. J., & Kho, A. N. (2016). Performance of an electronic health record-based phenotype algorithm to identify community associated methicillin-resistant Staphylococcus aureus cases and controls for genetic association studies. BMC Infectious Diseases, 16(1), [684]. https://doi.org/10.1186/s12879-016-2020-2