TY - JOUR
T1 - BinomiRare
T2 - A robust test for association of a rare genetic variant with a binary outcome for mixed models and any case-control proportion
AU - NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
AU - Sofer, Tamar
AU - Lee, Jiwon
AU - Kurniansyah, Nuzulul
AU - Jain, Deepti
AU - Laurie, Cecelia A.
AU - Gogarten, Stephanie M.
AU - Conomos, Matthew P.
AU - Heavner, Ben
AU - Hu, Yao
AU - Kooperberg, Charles
AU - Haessler, Jeffrey
AU - Vasan, Ramachandran S.
AU - Cupples, L. Adrienne
AU - Coombes, Brandon J.
AU - Seyerle, Amanda
AU - Gharib, Sina A.
AU - Chen, Han
AU - O'Connell, Jeffrey R.
AU - Zhang, Man
AU - Gottlieb, Daniel J.
AU - Psaty, Bruce M.
AU - Longstreth, W. T.
AU - Rotter, Jerome I.
AU - Taylor, Kent D.
AU - Rich, Stephen S.
AU - Guo, Xiuqing
AU - Boerwinkle, Eric
AU - Morrison, Alanna C.
AU - Pankow, James S.
AU - Johnson, Andrew D.
AU - Pankratz, Nathan
AU - Reiner, Alex P.
AU - Redline, Susan
AU - Smith, Nicholas L.
AU - Rice, Kenneth M.
AU - Schifano, Elizabeth D.
N1 - Funding Information:
This work was supported by the National Heart, Lung, and Blood Institute (NHLBI) grant R35HL135818 . Study-specific acknowledgments are available in the Supplemental information. Whole-genome sequencing (WGS) for the Trans-Omics in Precision Medicine (TOPMed) program was supported by the NHLBI. See Supplemental information for sequencing center support information. Centralized read mapping and genotype calling, along with variant quality metrics and filtering, were provided by the TOPMed Informatics Research Center ( 3R01HL-117626-02S1 ; contract HHSN268201800002I ). Phenotype harmonization, data management, sample-identity QC, and general study coordination were provided by the TOPMed Data Coordinating Center ( R01HL-120393 ; U01HL-120393 ; contract HHSN268201800001I ). We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed. The views expressed in this manuscript are those of the authors and do not necessarily represent the views of the National Heart, Lung, and Blood Institute; the National Institutes of Health; or the US Department of Health and Human Services.
Funding Information:
This work was supported by the National Heart, Lung, and Blood Institute (NHLBI) grant R35HL135818. Study-specific acknowledgments are available in the Supplemental information. Whole-genome sequencing (WGS) for the Trans-Omics in Precision Medicine (TOPMed) program was supported by the NHLBI. See Supplemental information for sequencing center support information. Centralized read mapping and genotype calling, along with variant quality metrics and filtering, were provided by the TOPMed Informatics Research Center (3R01HL-117626-02S1; contract HHSN268201800002I). Phenotype harmonization, data management, sample-identity QC, and general study coordination were provided by the TOPMed Data Coordinating Center (R01HL-120393; U01HL-120393; contract HHSN268201800001I). We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed. The views expressed in this manuscript are those of the authors and do not necessarily represent the views of the National Heart, Lung, and Blood Institute; the National Institutes of Health; or the US Department of Health and Human Services. B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. All other authors declare no competing interests.
Publisher Copyright:
© 2021 The Author(s)
PY - 2021/7/8
Y1 - 2021/7/8
N2 - Whole-genome sequencing (WGS) and whole-exome sequencing studies have become increasingly available and are being used to identify rare genetic variants associated with health and disease outcomes. Investigators routinely use mixed models to account for genetic relatedness or other clustering variables (e.g., family or household) when testing genetic associations. However, no existing tests of the association of a rare variant with a binary outcome in the presence of correlated data control the type 1 error where there are (1) few individuals harboring the rare allele, (2) a small proportion of cases relative to controls, and (3) covariates to adjust for. Here, we address all three issues in developing a framework for testing rare variant association with a binary trait in individuals harboring at least one risk allele. In this framework, we estimate outcome probabilities under the null hypothesis and then use them, within the individuals with at least one risk allele, to test variant associations. We extend the BinomiRare test, which was previously proposed for independent observations, and develop the Conway-Maxwell-Poisson (CMP) test and study their properties in simulations. We show that the BinomiRare test always controls the type 1 error, while the CMP test sometimes does not. We then use the BinomiRare test to test the association of rare genetic variants in target genes with small-vessel disease (SVD) stroke, short sleep, and venous thromboembolism (VTE), in whole-genome sequence data from the Trans-Omics for Precision Medicine (TOPMed) program.
AB - Whole-genome sequencing (WGS) and whole-exome sequencing studies have become increasingly available and are being used to identify rare genetic variants associated with health and disease outcomes. Investigators routinely use mixed models to account for genetic relatedness or other clustering variables (e.g., family or household) when testing genetic associations. However, no existing tests of the association of a rare variant with a binary outcome in the presence of correlated data control the type 1 error where there are (1) few individuals harboring the rare allele, (2) a small proportion of cases relative to controls, and (3) covariates to adjust for. Here, we address all three issues in developing a framework for testing rare variant association with a binary trait in individuals harboring at least one risk allele. In this framework, we estimate outcome probabilities under the null hypothesis and then use them, within the individuals with at least one risk allele, to test variant associations. We extend the BinomiRare test, which was previously proposed for independent observations, and develop the Conway-Maxwell-Poisson (CMP) test and study their properties in simulations. We show that the BinomiRare test always controls the type 1 error, while the CMP test sometimes does not. We then use the BinomiRare test to test the association of rare genetic variants in target genes with small-vessel disease (SVD) stroke, short sleep, and venous thromboembolism (VTE), in whole-genome sequence data from the Trans-Omics for Precision Medicine (TOPMed) program.
KW - diverse populations
KW - genome sequencing
KW - mixed models
KW - rare variant assocaition testing
KW - targeted gene analysis
UR - http://www.scopus.com/inward/record.url?scp=85123070300&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123070300&partnerID=8YFLogxK
U2 - 10.1016/j.xhgg.2021.100040
DO - 10.1016/j.xhgg.2021.100040
M3 - Article
AN - SCOPUS:85123070300
SN - 2666-2477
VL - 2
JO - Human Genetics and Genomics Advances
JF - Human Genetics and Genomics Advances
IS - 3
M1 - 100040
ER -