An R package implementation of multifactor dimensionality reduction

Stacey J Winham, Alison A. Motsinger-Reif

Research output: Contribution to journalArticle

13 Citations (Scopus)

Abstract

Background: A breadth of high-dimensional data is now available with unprecedented numbers of genetic markers and data-mining approaches to variable selection are increasingly being utilized to uncover associations, including potential gene-gene and gene-environment interactions. One of the most commonly used data-mining methods for case-control data is Multifactor Dimensionality Reduction (MDR), which has displayed success in both simulations and real data applications. Additional software applications in alternative programming languages can improve the availability and usefulness of the method for a broader range of users. Results: We introduce a package for the R statistical language to implement the Multifactor Dimensionality Reduction (MDR) method for nonparametric variable selection of interactions. This package is designed to provide an alternative implementation for R users, with great flexibility and utility for both data analysis and research. The 'MDR' package is freely available online at http://www.r-project.org/. We also provide data examples to illustrate the use and functionality of the package. Conclusions: MDR is a frequently-used data-mining method to identify potential gene-gene interactions, and alternative implementations will further increase this usage. We introduce a flexible software package for R users.

Original languageEnglish (US)
Article number24
JournalBioData Mining
Volume4
Issue number1
DOIs
StatePublished - 2011

Fingerprint

Multifactor Dimensionality Reduction
Dimensionality Reduction
Data Mining
Genes
Gene
Data mining
Alternatives
Software
Gene-environment Interaction
Case-control Data
Programming Languages
Selection of Variables
Gene-Environment Interaction
Breadth
High-dimensional Data
Variable Selection
Reduction Method
Genetic Markers
Interaction
Software Package

ASJC Scopus subject areas

  • Genetics
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this

An R package implementation of multifactor dimensionality reduction. / Winham, Stacey J; Motsinger-Reif, Alison A.

In: BioData Mining, Vol. 4, No. 1, 24, 2011.

Research output: Contribution to journalArticle

@article{4eab117486c94cdab1e7117f5b731b17,
title = "An R package implementation of multifactor dimensionality reduction",
abstract = "Background: A breadth of high-dimensional data is now available with unprecedented numbers of genetic markers and data-mining approaches to variable selection are increasingly being utilized to uncover associations, including potential gene-gene and gene-environment interactions. One of the most commonly used data-mining methods for case-control data is Multifactor Dimensionality Reduction (MDR), which has displayed success in both simulations and real data applications. Additional software applications in alternative programming languages can improve the availability and usefulness of the method for a broader range of users. Results: We introduce a package for the R statistical language to implement the Multifactor Dimensionality Reduction (MDR) method for nonparametric variable selection of interactions. This package is designed to provide an alternative implementation for R users, with great flexibility and utility for both data analysis and research. The 'MDR' package is freely available online at http://www.r-project.org/. We also provide data examples to illustrate the use and functionality of the package. Conclusions: MDR is a frequently-used data-mining method to identify potential gene-gene interactions, and alternative implementations will further increase this usage. We introduce a flexible software package for R users.",
author = "Winham, {Stacey J} and Motsinger-Reif, {Alison A.}",
year = "2011",
doi = "10.1186/1756-0381-4-24",
language = "English (US)",
volume = "4",
journal = "BioData Mining",
issn = "1756-0381",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - An R package implementation of multifactor dimensionality reduction

AU - Winham, Stacey J

AU - Motsinger-Reif, Alison A.

PY - 2011

Y1 - 2011

N2 - Background: A breadth of high-dimensional data is now available with unprecedented numbers of genetic markers and data-mining approaches to variable selection are increasingly being utilized to uncover associations, including potential gene-gene and gene-environment interactions. One of the most commonly used data-mining methods for case-control data is Multifactor Dimensionality Reduction (MDR), which has displayed success in both simulations and real data applications. Additional software applications in alternative programming languages can improve the availability and usefulness of the method for a broader range of users. Results: We introduce a package for the R statistical language to implement the Multifactor Dimensionality Reduction (MDR) method for nonparametric variable selection of interactions. This package is designed to provide an alternative implementation for R users, with great flexibility and utility for both data analysis and research. The 'MDR' package is freely available online at http://www.r-project.org/. We also provide data examples to illustrate the use and functionality of the package. Conclusions: MDR is a frequently-used data-mining method to identify potential gene-gene interactions, and alternative implementations will further increase this usage. We introduce a flexible software package for R users.

AB - Background: A breadth of high-dimensional data is now available with unprecedented numbers of genetic markers and data-mining approaches to variable selection are increasingly being utilized to uncover associations, including potential gene-gene and gene-environment interactions. One of the most commonly used data-mining methods for case-control data is Multifactor Dimensionality Reduction (MDR), which has displayed success in both simulations and real data applications. Additional software applications in alternative programming languages can improve the availability and usefulness of the method for a broader range of users. Results: We introduce a package for the R statistical language to implement the Multifactor Dimensionality Reduction (MDR) method for nonparametric variable selection of interactions. This package is designed to provide an alternative implementation for R users, with great flexibility and utility for both data analysis and research. The 'MDR' package is freely available online at http://www.r-project.org/. We also provide data examples to illustrate the use and functionality of the package. Conclusions: MDR is a frequently-used data-mining method to identify potential gene-gene interactions, and alternative implementations will further increase this usage. We introduce a flexible software package for R users.

UR - http://www.scopus.com/inward/record.url?scp=80051659014&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80051659014&partnerID=8YFLogxK

U2 - 10.1186/1756-0381-4-24

DO - 10.1186/1756-0381-4-24

M3 - Article

VL - 4

JO - BioData Mining

JF - BioData Mining

SN - 1756-0381

IS - 1

M1 - 24

ER -