TY - JOUR
T1 - Methods for the selection of tagging SNPs
T2 - A comparison of tagging efficiency and performance
AU - Ding, Keyue
AU - Kullo, Iftikhar J.
N1 - Funding Information:
This work was supported in part by NIH grant RR-17720 (I.J.K). We acknowledge the Supercomputing Institute of University of Minnesota, Minneapolis for technical support.
PY - 2007/2
Y1 - 2007/2
N2 - There is great interest in the use of tagging single nucleotide polymorphisms (tSNPs) to facilitate association studies of complex diseases. This is based on the premise that a minimum set of tSNPs may be sufficient to capture most of the variation in certain regions of the human genome. Several methods have been described to select tSNPs, based on either haplotype-block structure or independent of the underlying block structure. In this paper, we compare eight methods for choosing tSNPs in 10 representative resequenced candidate genes (a total of 194.2kb) with different levels of linkage disequilibrium (LD) in a sample of European-Americans. We compared tagging efficiency (TE) and prediction accuracy of tSNPs identified by these methods, as a function of several factors, including LD level, minor allele frequency, and tagging criteria. We also assessed tagging consistency between each method. We found that tSNPs selected based on the methods Haplotype Diversity and Haplotype r2 provided the highest TE, whereas the prediction accuracy was comparable among different methods. Tagging consistency between different methods of tSNPs selection was poor. This work demonstrates that when tSNPs-based association studies are undertaken, the choice of method for selecting tSNPs requires careful consideration.
AB - There is great interest in the use of tagging single nucleotide polymorphisms (tSNPs) to facilitate association studies of complex diseases. This is based on the premise that a minimum set of tSNPs may be sufficient to capture most of the variation in certain regions of the human genome. Several methods have been described to select tSNPs, based on either haplotype-block structure or independent of the underlying block structure. In this paper, we compare eight methods for choosing tSNPs in 10 representative resequenced candidate genes (a total of 194.2kb) with different levels of linkage disequilibrium (LD) in a sample of European-Americans. We compared tagging efficiency (TE) and prediction accuracy of tSNPs identified by these methods, as a function of several factors, including LD level, minor allele frequency, and tagging criteria. We also assessed tagging consistency between each method. We found that tSNPs selected based on the methods Haplotype Diversity and Haplotype r2 provided the highest TE, whereas the prediction accuracy was comparable among different methods. Tagging consistency between different methods of tSNPs selection was poor. This work demonstrates that when tSNPs-based association studies are undertaken, the choice of method for selecting tSNPs requires careful consideration.
UR - http://www.scopus.com/inward/record.url?scp=33846283432&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33846283432&partnerID=8YFLogxK
U2 - 10.1038/sj.ejhg.5201755
DO - 10.1038/sj.ejhg.5201755
M3 - Article
C2 - 17164795
AN - SCOPUS:33846283432
SN - 1018-4813
VL - 15
SP - 228
EP - 236
JO - European Journal of Human Genetics
JF - European Journal of Human Genetics
IS - 2
ER -