With new technologies, multiple types of genomic data are commonly collected on a single set of samples. However, standard analysis methods concentrate on a single data type at a time and ignore the relationships between genes, proteins, and biochemical reactions that give rise to complex phenotypes. In this paper, we propose a novel integrative model that incorporates multiple types of genomic data into an association analysis with a complex phenotype. The method combines path analysis and stochastic search variable selection into a Bayesian hierarchical model that simultaneously identifies both direct and indirect genomic effects on the phenotype. Results from a simulation study and from an application of the Bayesian model to a pharmacogenomic study of the drug gemcitabine demonstrate greater sensitivity to detect genomic effects in some simulation scenarios, compared with the standard single-data-type analysis. Further research is required to extend and modify this integrative modeling framework, in particular to increase its computational efficiency, so that it can best capitalize on the wealth of information available across multiple genomic data types.
- Cell lines
- Genetic association
- mRNA expression
- Markov chain Monte Carlo (MCMC)
- Single nucleotide polymorphisms (SNPs)
- Stochastic search variable selection