TY - JOUR
T1 - Fast implementation of a scan statistic for identifying chromosomal patterns of genome wide association studies
AU - Sun, Yan V.
AU - Jacobsen, Douglas M.
AU - Turner, Stephen T.
AU - Boerwinkle, Eric
AU - Kardia, Sharon L.R.
N1 - Funding Information:
This work was supported by National Institute of Health grant HL087660, HL68737, HL 74735 and HL 53335.
PY - 2009/3/15
Y1 - 2009/3/15
N2 - In order to take into account the complex genomic distribution of SNP variations when identifying chromosomal regions with significant SNP effects, a single nucleotide polymorphism (SNP) association scan statistic was developed. To address the computational needs of genome wide association (GWA) studies, a fast Java application, which combines single-locus SNP tests and a scan statistic for identifying chromosomal regions with significant clusters of significant SNP effects, was developed and implemented. To illustrate this application, SNP associations were analyzed in a pharmacogenomic study of the blood pressure lowering effect of thiazide-diuretics (N = 195) using the Affymetrix Human Mapping 100 K Set. 55,335 tagSNPs (pair-wise linkage disequilibrium R2 < 0.5) were selected to reduce the frequency correlation between SNPs. A typical workstation can complete the whole genome scan including 10,000 permutation tests within 3 h. The most significant regions locate on chromosome 3, 6, 13 and 16, two of which contain candidate genes that may be involved in the underlying drug response mechanism. The computational performance of ChromoScan-GWA and its scalability were tested with up to 1,000,000 SNPs and up to 4000 subjects. Using 10,000 permutations, the computation time grew linearly in these datasets. This scan statistic application provides a robust statistical and computational foundation for identifying genomic regions associated with disease and provides a method to compare GWA results even across different platforms.
AB - In order to take into account the complex genomic distribution of SNP variations when identifying chromosomal regions with significant SNP effects, a single nucleotide polymorphism (SNP) association scan statistic was developed. To address the computational needs of genome wide association (GWA) studies, a fast Java application, which combines single-locus SNP tests and a scan statistic for identifying chromosomal regions with significant clusters of significant SNP effects, was developed and implemented. To illustrate this application, SNP associations were analyzed in a pharmacogenomic study of the blood pressure lowering effect of thiazide-diuretics (N = 195) using the Affymetrix Human Mapping 100 K Set. 55,335 tagSNPs (pair-wise linkage disequilibrium R2 < 0.5) were selected to reduce the frequency correlation between SNPs. A typical workstation can complete the whole genome scan including 10,000 permutation tests within 3 h. The most significant regions locate on chromosome 3, 6, 13 and 16, two of which contain candidate genes that may be involved in the underlying drug response mechanism. The computational performance of ChromoScan-GWA and its scalability were tested with up to 1,000,000 SNPs and up to 4000 subjects. Using 10,000 permutations, the computation time grew linearly in these datasets. This scan statistic application provides a robust statistical and computational foundation for identifying genomic regions associated with disease and provides a method to compare GWA results even across different platforms.
UR - http://www.scopus.com/inward/record.url?scp=60349123180&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=60349123180&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2008.04.013
DO - 10.1016/j.csda.2008.04.013
M3 - Article
AN - SCOPUS:60349123180
SN - 0167-9473
VL - 53
SP - 1794
EP - 1801
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
IS - 5
ER -