Genomic similarity and kernel methods ii: Methods for genomic information

Research output: Contribution to journalArticle

54 Citations (Scopus)

Abstract

Measures of genomic similarity are often the basis of flexible statistical analyses, and when based on kernel methods, they provide a powerful platform to take advantage of a broad and deep statistical theory, and a wide range of existing software; see the companion paper for a review of this material [1]. The kernel method converts information-perhaps complex or high-dimensional information-for a pair of subjects to a quantitative value representing either similarity or dissimilarity, with the requirement that it must create a positive semidefinite matrix when applied to all pairs of subjects. This approach provides enormous opportunities to enhance genetic analyses by including a wide range of publically-available data as structured kernel 'prior' information. Kernel methods are appealing for their generality, yet this generality can make it challenging to formulate measures of similarity that directly address a specific scientific aim, or that are most powerful to detect a specific genetic mechanism. Although it is difficult to create a cook book of kernels for genetic studies, useful guidelines can be gleaned from a variety of novel published approaches. We review some novel developments of kernels for specific analyses and speculate on how to build kernels for complex genomic attributes based on publically available data. The creativity of analysts, with rigorous evaluations by applications to real and simulated data, will ultimately provide a much stronger array of kernel 'tools' for genetic analyses.

Original languageEnglish (US)
Pages (from-to)132-140
Number of pages9
JournalHuman Heredity
Volume70
Issue number2
DOIs
StatePublished - Jul 2010

Fingerprint

Creativity
Software
Guidelines

Keywords

  • Genomic pathways
  • Kernel
  • Networks

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Genomic similarity and kernel methods ii : Methods for genomic information. / Schaid, Daniel J.

In: Human Heredity, Vol. 70, No. 2, 07.2010, p. 132-140.

Research output: Contribution to journalArticle

@article{ca872dfc91974d889250895bfa9f8f10,
title = "Genomic similarity and kernel methods ii: Methods for genomic information",
abstract = "Measures of genomic similarity are often the basis of flexible statistical analyses, and when based on kernel methods, they provide a powerful platform to take advantage of a broad and deep statistical theory, and a wide range of existing software; see the companion paper for a review of this material [1]. The kernel method converts information-perhaps complex or high-dimensional information-for a pair of subjects to a quantitative value representing either similarity or dissimilarity, with the requirement that it must create a positive semidefinite matrix when applied to all pairs of subjects. This approach provides enormous opportunities to enhance genetic analyses by including a wide range of publically-available data as structured kernel 'prior' information. Kernel methods are appealing for their generality, yet this generality can make it challenging to formulate measures of similarity that directly address a specific scientific aim, or that are most powerful to detect a specific genetic mechanism. Although it is difficult to create a cook book of kernels for genetic studies, useful guidelines can be gleaned from a variety of novel published approaches. We review some novel developments of kernels for specific analyses and speculate on how to build kernels for complex genomic attributes based on publically available data. The creativity of analysts, with rigorous evaluations by applications to real and simulated data, will ultimately provide a much stronger array of kernel 'tools' for genetic analyses.",
keywords = "Genomic pathways, Kernel, Networks",
author = "Schaid, {Daniel J}",
year = "2010",
month = "7",
doi = "10.1159/000312643",
language = "English (US)",
volume = "70",
pages = "132--140",
journal = "Human Heredity",
issn = "0001-5652",
publisher = "S. Karger AG",
number = "2",

}

TY - JOUR

T1 - Genomic similarity and kernel methods ii

T2 - Methods for genomic information

AU - Schaid, Daniel J

PY - 2010/7

Y1 - 2010/7

N2 - Measures of genomic similarity are often the basis of flexible statistical analyses, and when based on kernel methods, they provide a powerful platform to take advantage of a broad and deep statistical theory, and a wide range of existing software; see the companion paper for a review of this material [1]. The kernel method converts information-perhaps complex or high-dimensional information-for a pair of subjects to a quantitative value representing either similarity or dissimilarity, with the requirement that it must create a positive semidefinite matrix when applied to all pairs of subjects. This approach provides enormous opportunities to enhance genetic analyses by including a wide range of publically-available data as structured kernel 'prior' information. Kernel methods are appealing for their generality, yet this generality can make it challenging to formulate measures of similarity that directly address a specific scientific aim, or that are most powerful to detect a specific genetic mechanism. Although it is difficult to create a cook book of kernels for genetic studies, useful guidelines can be gleaned from a variety of novel published approaches. We review some novel developments of kernels for specific analyses and speculate on how to build kernels for complex genomic attributes based on publically available data. The creativity of analysts, with rigorous evaluations by applications to real and simulated data, will ultimately provide a much stronger array of kernel 'tools' for genetic analyses.

AB - Measures of genomic similarity are often the basis of flexible statistical analyses, and when based on kernel methods, they provide a powerful platform to take advantage of a broad and deep statistical theory, and a wide range of existing software; see the companion paper for a review of this material [1]. The kernel method converts information-perhaps complex or high-dimensional information-for a pair of subjects to a quantitative value representing either similarity or dissimilarity, with the requirement that it must create a positive semidefinite matrix when applied to all pairs of subjects. This approach provides enormous opportunities to enhance genetic analyses by including a wide range of publically-available data as structured kernel 'prior' information. Kernel methods are appealing for their generality, yet this generality can make it challenging to formulate measures of similarity that directly address a specific scientific aim, or that are most powerful to detect a specific genetic mechanism. Although it is difficult to create a cook book of kernels for genetic studies, useful guidelines can be gleaned from a variety of novel published approaches. We review some novel developments of kernels for specific analyses and speculate on how to build kernels for complex genomic attributes based on publically available data. The creativity of analysts, with rigorous evaluations by applications to real and simulated data, will ultimately provide a much stronger array of kernel 'tools' for genetic analyses.

KW - Genomic pathways

KW - Kernel

KW - Networks

UR - http://www.scopus.com/inward/record.url?scp=77954187925&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77954187925&partnerID=8YFLogxK

U2 - 10.1159/000312643

DO - 10.1159/000312643

M3 - Article

C2 - 20606458

AN - SCOPUS:77954187925

VL - 70

SP - 132

EP - 140

JO - Human Heredity

JF - Human Heredity

SN - 0001-5652

IS - 2

ER -