EMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants

Anurag Verma, Shefali S. Verma, Sarah A. Pendergrass, Dana C. Crawford, David R. Crosslin, Helena Kuivaniemi, William S. Bush, Yuki Bradford, Iftikhar Jan Kullo, Suzette J Bielinski, Rongling Li, Joshua C. Denny, Peggy Peissig, Scott Hebbring, Mariza De Andrade, Marylyn D. Ritchie, Gerard Tromp

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

Background: We explored premature stop-gain variants to test the hypothesis that variants, which are likely to have a consequence on protein structure and function, will reveal important insights with respect to the phenotypes associated with them. We performed a phenome-wide association study (PheWAS) exploring the association between a selected list of functional stop-gain genetic variants (variation resulting in truncated proteins or in nonsense-mediated decay) and an extensive group of diagnoses to identify novel associations and uncover potential pleiotropy. Results: In this study, we selected 25 stop-gain variants: 5 stop-gain variants with previously reported phenotypic associations, and a set of 20 putative stop-gain variants identified using dbSNP. For the PheWAS, we used data from the electronic MEdical Records and GEnomics (eMERGE) Network across 9 sites with a total of 41,057 unrelated patients. We divided all these samples into two datasets by equal proportion of eMERGE site, sex, race, and genotyping platform. We calculated single effect associations between these 25 stop-gain variants and ICD-9 defined case-control diagnoses. We also performed stratified analyses for samples of European and African ancestry. Associations were adjusted for sex, site, genotyping platform and the first three principal components to account for global ancestry. We identified previously known associations, such as variants in LPL associated with hyperglyceridemia indicating that our approach was robust. We also found a total of three significant associations with p < 0.01 in both datasets, with the most significant replicating result being LPL SNP rs328 and ICD-9 code 272.1 "Disorder of Lipoid metabolism" (pdiscovery = 2.59x10-6, preplicating = 2.7x10-4). The other two significant replicated associations identified by this study are: variant rs1137617 in KCNH2 gene associated with ICD-9 code category 244 "Acquired Hypothyroidism" (pdiscovery = 5.31x103, preplicating = 1.15x10-3) and variant rs12060879 in DPT gene associated with ICD-9 code category 996 "Complications peculiar to certain specified procedures" (pdiscovery = 8.65x103, preplicating = 4.16x10-3). Conclusion: In conclusion, this PheWAS revealed novel associations of stop-gained variants with interesting phenotypes (ICD-9 codes) along with pleiotropic effects.

Original languageEnglish (US)
Article number32
JournalBMC Medical Genomics
Volume9
DOIs
StatePublished - Aug 12 2016

Fingerprint

International Classification of Diseases
Electronic Health Records
Genomics
Phenotype
Clinical Studies
Hypothyroidism
Genes
Single Nucleotide Polymorphism
Proteins

ASJC Scopus subject areas

  • Genetics(clinical)
  • Genetics

Cite this

Verma, A., Verma, S. S., Pendergrass, S. A., Crawford, D. C., Crosslin, D. R., Kuivaniemi, H., ... Tromp, G. (2016). EMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants. BMC Medical Genomics, 9, [32]. https://doi.org/10.1186/s12920-016-0191-8

EMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants. / Verma, Anurag; Verma, Shefali S.; Pendergrass, Sarah A.; Crawford, Dana C.; Crosslin, David R.; Kuivaniemi, Helena; Bush, William S.; Bradford, Yuki; Kullo, Iftikhar Jan; Bielinski, Suzette J; Li, Rongling; Denny, Joshua C.; Peissig, Peggy; Hebbring, Scott; De Andrade, Mariza; Ritchie, Marylyn D.; Tromp, Gerard.

In: BMC Medical Genomics, Vol. 9, 32, 12.08.2016.

Research output: Contribution to journalArticle

Verma, A, Verma, SS, Pendergrass, SA, Crawford, DC, Crosslin, DR, Kuivaniemi, H, Bush, WS, Bradford, Y, Kullo, IJ, Bielinski, SJ, Li, R, Denny, JC, Peissig, P, Hebbring, S, De Andrade, M, Ritchie, MD & Tromp, G 2016, 'EMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants', BMC Medical Genomics, vol. 9, 32. https://doi.org/10.1186/s12920-016-0191-8
Verma, Anurag ; Verma, Shefali S. ; Pendergrass, Sarah A. ; Crawford, Dana C. ; Crosslin, David R. ; Kuivaniemi, Helena ; Bush, William S. ; Bradford, Yuki ; Kullo, Iftikhar Jan ; Bielinski, Suzette J ; Li, Rongling ; Denny, Joshua C. ; Peissig, Peggy ; Hebbring, Scott ; De Andrade, Mariza ; Ritchie, Marylyn D. ; Tromp, Gerard. / EMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants. In: BMC Medical Genomics. 2016 ; Vol. 9.
@article{07b07ed66dd04fa3997f2f15ea3df273,
title = "EMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants",
abstract = "Background: We explored premature stop-gain variants to test the hypothesis that variants, which are likely to have a consequence on protein structure and function, will reveal important insights with respect to the phenotypes associated with them. We performed a phenome-wide association study (PheWAS) exploring the association between a selected list of functional stop-gain genetic variants (variation resulting in truncated proteins or in nonsense-mediated decay) and an extensive group of diagnoses to identify novel associations and uncover potential pleiotropy. Results: In this study, we selected 25 stop-gain variants: 5 stop-gain variants with previously reported phenotypic associations, and a set of 20 putative stop-gain variants identified using dbSNP. For the PheWAS, we used data from the electronic MEdical Records and GEnomics (eMERGE) Network across 9 sites with a total of 41,057 unrelated patients. We divided all these samples into two datasets by equal proportion of eMERGE site, sex, race, and genotyping platform. We calculated single effect associations between these 25 stop-gain variants and ICD-9 defined case-control diagnoses. We also performed stratified analyses for samples of European and African ancestry. Associations were adjusted for sex, site, genotyping platform and the first three principal components to account for global ancestry. We identified previously known associations, such as variants in LPL associated with hyperglyceridemia indicating that our approach was robust. We also found a total of three significant associations with p < 0.01 in both datasets, with the most significant replicating result being LPL SNP rs328 and ICD-9 code 272.1 {"}Disorder of Lipoid metabolism{"} (pdiscovery = 2.59x10-6, preplicating = 2.7x10-4). The other two significant replicated associations identified by this study are: variant rs1137617 in KCNH2 gene associated with ICD-9 code category 244 {"}Acquired Hypothyroidism{"} (pdiscovery = 5.31x103, preplicating = 1.15x10-3) and variant rs12060879 in DPT gene associated with ICD-9 code category 996 {"}Complications peculiar to certain specified procedures{"} (pdiscovery = 8.65x103, preplicating = 4.16x10-3). Conclusion: In conclusion, this PheWAS revealed novel associations of stop-gained variants with interesting phenotypes (ICD-9 codes) along with pleiotropic effects.",
author = "Anurag Verma and Verma, {Shefali S.} and Pendergrass, {Sarah A.} and Crawford, {Dana C.} and Crosslin, {David R.} and Helena Kuivaniemi and Bush, {William S.} and Yuki Bradford and Kullo, {Iftikhar Jan} and Bielinski, {Suzette J} and Rongling Li and Denny, {Joshua C.} and Peggy Peissig and Scott Hebbring and {De Andrade}, Mariza and Ritchie, {Marylyn D.} and Gerard Tromp",
year = "2016",
month = "8",
day = "12",
doi = "10.1186/s12920-016-0191-8",
language = "English (US)",
volume = "9",
journal = "BMC Medical Genomics",
issn = "1755-8794",
publisher = "BioMed Central",

}

TY - JOUR

T1 - EMERGE Phenome-Wide Association Study (PheWAS) identifies clinical associations and pleiotropy for stop-gain variants

AU - Verma, Anurag

AU - Verma, Shefali S.

AU - Pendergrass, Sarah A.

AU - Crawford, Dana C.

AU - Crosslin, David R.

AU - Kuivaniemi, Helena

AU - Bush, William S.

AU - Bradford, Yuki

AU - Kullo, Iftikhar Jan

AU - Bielinski, Suzette J

AU - Li, Rongling

AU - Denny, Joshua C.

AU - Peissig, Peggy

AU - Hebbring, Scott

AU - De Andrade, Mariza

AU - Ritchie, Marylyn D.

AU - Tromp, Gerard

PY - 2016/8/12

Y1 - 2016/8/12

N2 - Background: We explored premature stop-gain variants to test the hypothesis that variants, which are likely to have a consequence on protein structure and function, will reveal important insights with respect to the phenotypes associated with them. We performed a phenome-wide association study (PheWAS) exploring the association between a selected list of functional stop-gain genetic variants (variation resulting in truncated proteins or in nonsense-mediated decay) and an extensive group of diagnoses to identify novel associations and uncover potential pleiotropy. Results: In this study, we selected 25 stop-gain variants: 5 stop-gain variants with previously reported phenotypic associations, and a set of 20 putative stop-gain variants identified using dbSNP. For the PheWAS, we used data from the electronic MEdical Records and GEnomics (eMERGE) Network across 9 sites with a total of 41,057 unrelated patients. We divided all these samples into two datasets by equal proportion of eMERGE site, sex, race, and genotyping platform. We calculated single effect associations between these 25 stop-gain variants and ICD-9 defined case-control diagnoses. We also performed stratified analyses for samples of European and African ancestry. Associations were adjusted for sex, site, genotyping platform and the first three principal components to account for global ancestry. We identified previously known associations, such as variants in LPL associated with hyperglyceridemia indicating that our approach was robust. We also found a total of three significant associations with p < 0.01 in both datasets, with the most significant replicating result being LPL SNP rs328 and ICD-9 code 272.1 "Disorder of Lipoid metabolism" (pdiscovery = 2.59x10-6, preplicating = 2.7x10-4). The other two significant replicated associations identified by this study are: variant rs1137617 in KCNH2 gene associated with ICD-9 code category 244 "Acquired Hypothyroidism" (pdiscovery = 5.31x103, preplicating = 1.15x10-3) and variant rs12060879 in DPT gene associated with ICD-9 code category 996 "Complications peculiar to certain specified procedures" (pdiscovery = 8.65x103, preplicating = 4.16x10-3). Conclusion: In conclusion, this PheWAS revealed novel associations of stop-gained variants with interesting phenotypes (ICD-9 codes) along with pleiotropic effects.

AB - Background: We explored premature stop-gain variants to test the hypothesis that variants, which are likely to have a consequence on protein structure and function, will reveal important insights with respect to the phenotypes associated with them. We performed a phenome-wide association study (PheWAS) exploring the association between a selected list of functional stop-gain genetic variants (variation resulting in truncated proteins or in nonsense-mediated decay) and an extensive group of diagnoses to identify novel associations and uncover potential pleiotropy. Results: In this study, we selected 25 stop-gain variants: 5 stop-gain variants with previously reported phenotypic associations, and a set of 20 putative stop-gain variants identified using dbSNP. For the PheWAS, we used data from the electronic MEdical Records and GEnomics (eMERGE) Network across 9 sites with a total of 41,057 unrelated patients. We divided all these samples into two datasets by equal proportion of eMERGE site, sex, race, and genotyping platform. We calculated single effect associations between these 25 stop-gain variants and ICD-9 defined case-control diagnoses. We also performed stratified analyses for samples of European and African ancestry. Associations were adjusted for sex, site, genotyping platform and the first three principal components to account for global ancestry. We identified previously known associations, such as variants in LPL associated with hyperglyceridemia indicating that our approach was robust. We also found a total of three significant associations with p < 0.01 in both datasets, with the most significant replicating result being LPL SNP rs328 and ICD-9 code 272.1 "Disorder of Lipoid metabolism" (pdiscovery = 2.59x10-6, preplicating = 2.7x10-4). The other two significant replicated associations identified by this study are: variant rs1137617 in KCNH2 gene associated with ICD-9 code category 244 "Acquired Hypothyroidism" (pdiscovery = 5.31x103, preplicating = 1.15x10-3) and variant rs12060879 in DPT gene associated with ICD-9 code category 996 "Complications peculiar to certain specified procedures" (pdiscovery = 8.65x103, preplicating = 4.16x10-3). Conclusion: In conclusion, this PheWAS revealed novel associations of stop-gained variants with interesting phenotypes (ICD-9 codes) along with pleiotropic effects.

UR - http://www.scopus.com/inward/record.url?scp=84982153592&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84982153592&partnerID=8YFLogxK

U2 - 10.1186/s12920-016-0191-8

DO - 10.1186/s12920-016-0191-8

M3 - Article

C2 - 27535653

AN - SCOPUS:84982153592

VL - 9

JO - BMC Medical Genomics

JF - BMC Medical Genomics

SN - 1755-8794

M1 - 32

ER -