TY - JOUR
T1 - Genetic association studies of copy-number variation
T2 - Should assignment of copy number states precede testing?
AU - Breheny, Patrick
AU - Chalise, Prabhakar
AU - Batzler, Anthony
AU - Wang, Liewei
AU - Fridley, Brooke L.
PY - 2012/4/6
Y1 - 2012/4/6
N2 - Recently, structural variation in the genome has been implicated in many complex diseases. Using genomewide single nucleotide polymorphism (SNP) arrays, researchers are able to investigate the impact not only of SNP variation, but also of copy-number variants (CNVs) on the phenotype. The most common analytic approach involves estimating, at the level of the individual genome, the underlying number of copies present at each location. Once this is completed, tests are performed to determine the association between copy number state and phenotype. An alternative approach is to carry out association testing first, between phenotype and raw intensities from the SNP array at the level of the individual marker, and then aggregate neighboring test results to identify CNVs associated with the phenotype. Here, we explore the strengths and weaknesses of these two approaches using both simulations and real data from a pharmacogenomic study of the chemotherapeutic agent gemcitabine. Our results indicate that pooled marker-level testing is capable of offering a dramatic increase in power (> 12-fold) over CNV-level testing, particularly for small CNVs. However, CNV-level testing is superior when CNVs are large and rare; understanding these tradeoffs is an important consideration in conducting association studies of structural variation.
AB - Recently, structural variation in the genome has been implicated in many complex diseases. Using genomewide single nucleotide polymorphism (SNP) arrays, researchers are able to investigate the impact not only of SNP variation, but also of copy-number variants (CNVs) on the phenotype. The most common analytic approach involves estimating, at the level of the individual genome, the underlying number of copies present at each location. Once this is completed, tests are performed to determine the association between copy number state and phenotype. An alternative approach is to carry out association testing first, between phenotype and raw intensities from the SNP array at the level of the individual marker, and then aggregate neighboring test results to identify CNVs associated with the phenotype. Here, we explore the strengths and weaknesses of these two approaches using both simulations and real data from a pharmacogenomic study of the chemotherapeutic agent gemcitabine. Our results indicate that pooled marker-level testing is capable of offering a dramatic increase in power (> 12-fold) over CNV-level testing, particularly for small CNVs. However, CNV-level testing is superior when CNVs are large and rare; understanding these tradeoffs is an important consideration in conducting association studies of structural variation.
UR - http://www.scopus.com/inward/record.url?scp=84859512085&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84859512085&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0034262
DO - 10.1371/journal.pone.0034262
M3 - Article
C2 - 22493684
AN - SCOPUS:84859512085
SN - 1932-6203
VL - 7
JO - PLoS One
JF - PLoS One
IS - 4
M1 - e34262
ER -