A rare variant analysis framework using public genotype summary counts to prioritize disease-predisposition genes

Wenan Chen, Shuoguo Wang, Saima Sultana Tithi, David W. Ellison, Daniel J. Schaid, Gang Wu

Research output: Contribution to journalArticlepeer-review

Abstract

Sequencing cases without matched healthy controls hinders prioritization of germline disease-predisposition genes. To circumvent this problem, genotype summary counts from public data sets can serve as controls. However, systematic inflation and false positives can arise if confounding factors are not controlled. We propose a framework, consistent summary counts based rare variant burden test (CoCoRV), to address these challenges. CoCoRV implements consistent variant quality control and filtering, ethnicity-stratified rare variant association test, accurate estimation of inflation factors, powerful FDR control, and detection of rare variant pairs in high linkage disequilibrium. When we applied CoCoRV to pediatric cancer cohorts, the top genes identified were cancer-predisposition genes. We also applied CoCoRV to identify disease-predisposition genes in adult brain tumors and amyotrophic lateral sclerosis. Given that potential confounding factors were well controlled after applying the framework, CoCoRV provides a cost-effective solution to prioritizing disease-risk genes enriched with rare pathogenic variants.

Original languageEnglish (US)
Article number2592
JournalNature communications
Volume13
Issue number1
DOIs
StatePublished - Dec 2022

ASJC Scopus subject areas

  • Chemistry(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Physics and Astronomy(all)

Fingerprint

Dive into the research topics of 'A rare variant analysis framework using public genotype summary counts to prioritize disease-predisposition genes'. Together they form a unique fingerprint.

Cite this