Sequential haplotype scan methods for association analysis

Zhaoxia Yu, Daniel J Schaid

Research output: Contribution to journal, Article

22 Citations (Scopus)

Abstract

Multi-locus association analyses, including haplotype-based analyses, can sometimes provide greater power than single-locus analyses for detecting disease susceptibility loci. This potential gain, however, can be compromised by the large number of degrees of freedom caused by irrelevant markers. Exhaustive search for the optimal set of markers might be possible for a small number of markers, yet it is computationally inefficient. In this paper, we present a sequential haplotype scan method to search for combinations of adjacent markers that are jointly associated with disease status. When evaluating each marker, we add markers close to it in a sequential manner: a marker is added if its contribution to the haplotype association with disease is warranted, conditional on current haplotypes. This conditional evaluation is based on the well-known Mantel-Haenszel statistic. We propose two permutation-based methods to evaluate the growing haplotypes: a haplotype method for the combined markers, and a summary method that sums conditional statistics. We compared our proposed methods, the single-locus method, and a sliding window method using simulated data. We also applied our sequential haplotype scan algorithm to experimental data for CYP2D6. The results indicate that the sequential scan procedure can identify a set of adjacent markers whose haplotypes might have strong genetic effects or be in linkage disequilibrium with disease-predisposing variants. As a result, our methods can achieve greater power than the single-locus method, yet are much more computationally efficient than sliding window methods.
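
As a rough illustration of the conditional evaluation mentioned in the abstract, the following Python sketch computes a textbook Mantel-Haenszel chi-square statistic over a set of 2x2 tables. It is not the authors' implementation: the function name `mantel_haenszel_chi2`, the table layout, and the idea of using one stratum per current haplotype are assumptions made for this example, and the paper's statistic may be more general than this 2x2 form.

```python
from typing import Iterable, Tuple

# Hypothetical layout for one stratum (e.g. one current haplotype):
# (a, b, c, d) = (cases with allele, cases without,
#                 controls with allele, controls without)
Table = Tuple[int, int, int, int]


def mantel_haenszel_chi2(tables: Iterable[Table]) -> float:
    """Textbook Mantel-Haenszel chi-square (1 df, no continuity correction)."""
    observed = 0.0
    expected = 0.0
    variance = 0.0
    for a, b, c, d in tables:
        n = a + b + c + d
        if n < 2:
            # A stratum with fewer than two observations carries no information.
            continue
        row1, row2 = a + b, c + d
        col1, col2 = a + c, b + d
        observed += a
        # Mean and variance of the top-left cell under the hypergeometric null.
        expected += row1 * col1 / n
        variance += row1 * row2 * col1 * col2 / (n * n * (n - 1))
    if variance == 0:
        return 0.0
    return (observed - expected) ** 2 / variance


if __name__ == "__main__":
    # Two illustrative strata with made-up counts.
    print(mantel_haenszel_chi2([(10, 5, 5, 10), (8, 4, 4, 8)]))
```

In a scan of the kind the abstract describes, a candidate marker would presumably be added when a statistic like this, stratified on the current haplotypes, is large enough; the threshold and the permutation-based evaluation are left out of this sketch.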

Original language: English (US)
Pages (from-to): 553-564
Number of pages: 12
Journal: Genetic Epidemiology
Volume: 31
Issue number: 6
DOI: 10.1002/gepi.20228
State: Published - Sep 2007

Fingerprint

Haplotypes
Cytochrome P-450 CYP2D6
Disease Susceptibility
Linkage Disequilibrium

Keywords

  • Case-control
  • Gene-gene interaction
  • Mantel-Haenszel statistic
  • Single nucleotide polymorphism (SNP)
  • Single-locus analysis

ASJC Scopus subject areas

  • Epidemiology
  • Genetics (clinical)

Cite this

Sequential haplotype scan methods for association analysis. / Yu, Zhaoxia; Schaid, Daniel J.

In: Genetic Epidemiology, Vol. 31, No. 6, 09.2007, p. 553-564.

@article{207b32a0cfd8493685859f4c4a345dea,
title = "Sequential haplotype scan methods for association analysis",
abstract = "Multi-locus association analyses, including haplotype-based analyses, can sometimes provide greater power than single-locus analyses for detecting disease susceptibility loci. This potential gain, however, can be compromised by the large number of degrees of freedom caused by irrelevant markers. Exhaustive search for the optimal set of markers might be possible for a small number of markers, yet it is computationally inefficient. In this paper, we present a sequential haplotype scan method to search for combinations of adjacent markers that are jointly associated with disease status. When evaluating each marker, we add markers close to it in a sequential manner: a marker is added if its contribution to the haplotype association with disease is warranted, conditional on current haplotypes. This conditional evaluation is based on the well-known Mantel-Haenszel statistic. We propose two permutation-based methods to evaluate the growing haplotypes: a haplotype method for the combined markers, and a summary method that sums conditional statistics. We compared our proposed methods, the single-locus method, and a sliding window method using simulated data. We also applied our sequential haplotype scan algorithm to experimental data for CYP2D6. The results indicate that the sequential scan procedure can identify a set of adjacent markers whose haplotypes might have strong genetic effects or be in linkage disequilibrium with disease-predisposing variants. As a result, our methods can achieve greater power than the single-locus method, yet are much more computationally efficient than sliding window methods.",
keywords = "Case-control, Gene-gene interaction, Mantel-Haenszel statistic, Single nucleotide polymorphism (SNP), Single-locus analysis",
author = "Yu, Zhaoxia and Schaid, {Daniel J}",
year = "2007",
month = "9",
doi = "10.1002/gepi.20228",
language = "English (US)",
volume = "31",
pages = "553--564",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "6",
}

TY - JOUR

T1 - Sequential haplotype scan methods for association analysis

AU - Yu, Zhaoxia

AU - Schaid, Daniel J

PY - 2007/9

Y1 - 2007/9

AB - Multi-locus association analyses, including haplotype-based analyses, can sometimes provide greater power than single-locus analyses for detecting disease susceptibility loci. This potential gain, however, can be compromised by the large number of degrees of freedom caused by irrelevant markers. Exhaustive search for the optimal set of markers might be possible for a small number of markers, yet it is computationally inefficient. In this paper, we present a sequential haplotype scan method to search for combinations of adjacent markers that are jointly associated with disease status. When evaluating each marker, we add markers close to it in a sequential manner: a marker is added if its contribution to the haplotype association with disease is warranted, conditional on current haplotypes. This conditional evaluation is based on the well-known Mantel-Haenszel statistic. We propose two permutation-based methods to evaluate the growing haplotypes: a haplotype method for the combined markers, and a summary method that sums conditional statistics. We compared our proposed methods, the single-locus method, and a sliding window method using simulated data. We also applied our sequential haplotype scan algorithm to experimental data for CYP2D6. The results indicate that the sequential scan procedure can identify a set of adjacent markers whose haplotypes might have strong genetic effects or be in linkage disequilibrium with disease-predisposing variants. As a result, our methods can achieve greater power than the single-locus method, yet are much more computationally efficient than sliding window methods.

KW - Case-control

KW - Gene-gene interaction

KW - Mantel-Haenszel statistic

KW - Single nucleotide polymorphism (SNP)

KW - Single-locus analysis

UR - http://www.scopus.com/inward/record.url?scp=35248885678&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35248885678&partnerID=8YFLogxK

U2 - 10.1002/gepi.20228

DO - 10.1002/gepi.20228

M3 - Article

C2 - 17487883

AN - SCOPUS:35248885678

VL - 31

SP - 553

EP - 564

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - 6

ER -