Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction

Taehyun Hwang; Ze Tian; Rui Kuang; Jean Pierre Kocher

doi:10.1109/ICDM.2008.37

Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction

Taehyun Hwang, Ze Tian, Rui Kuang, Jean Pierre Kocher

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

32 Scopus citations

Abstract

Building reliable predictive models from multiple complementary genomic data for cancer study is a crucial step towards successful cancer treatment and a full understanding of the underlying biological principles. To tackle this challenging data integration problem, we propose a hypergraph-based learning algorithm called HyperGene to integrate microarray gene expressions and protein-protein interactions for cancer outcome prediction and biomarker identification. HyperGene is a robust two-step iterative method that alternatively finds the optimal outcome prediction and the optimal weighting of the marker genes guided by a protein-protein interaction network. Under the hypothesis that cancer-related genes tend to interact with each other, the HyperGene algorithm uses a protein-protein interaction network as prior knowledge by imposing a consistent weighting of interacting genes. Our experimental results on two large-scale breast cancer gene expression datasets show that HyperGene utilizing a curated roteinprotein interaction network achieves significantly improved cancer outcome prediction. Moreover, HyperGene can also retrieve many known cancer genes as highly weighted marker genes.

Original language	English (US)
Title of host publication	Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008
Pages	293-302
Number of pages	10
DOIs	https://doi.org/10.1109/ICDM.2008.37
State	Published - 2008
Event	8th IEEE International Conference on Data Mining, ICDM 2008 - Pisa, Italy Duration: Dec 15 2008 → Dec 19 2008

Publication series

Name	Proceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)	1550-4786

Other

Other	8th IEEE International Conference on Data Mining, ICDM 2008
Country/Territory	Italy
City	Pisa
Period	12/15/08 → 12/19/08

ASJC Scopus subject areas

General Engineering

Access to Document

10.1109/ICDM.2008.37

Cite this

Hwang, T., Tian, Z., Kuang, R., & Kocher, J. P. (2008). Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction. In Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008 (pp. 293-302). Article 4781124 (Proceedings - IEEE International Conference on Data Mining, ICDM). https://doi.org/10.1109/ICDM.2008.37

Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction. / Hwang, Taehyun; Tian, Ze; Kuang, Rui et al.
Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008. 2008. p. 293-302 4781124 (Proceedings - IEEE International Conference on Data Mining, ICDM).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Hwang, T, Tian, Z, Kuang, R & Kocher, JP 2008, Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction. in Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008., 4781124, Proceedings - IEEE International Conference on Data Mining, ICDM, pp. 293-302, 8th IEEE International Conference on Data Mining, ICDM 2008, Pisa, Italy, 12/15/08. https://doi.org/10.1109/ICDM.2008.37

@inproceedings{fa5bd59b6e0b424490a98b35c898c719,

title = "Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction",

abstract = "Building reliable predictive models from multiple complementary genomic data for cancer study is a crucial step towards successful cancer treatment and a full understanding of the underlying biological principles. To tackle this challenging data integration problem, we propose a hypergraph-based learning algorithm called HyperGene to integrate microarray gene expressions and protein-protein interactions for cancer outcome prediction and biomarker identification. HyperGene is a robust two-step iterative method that alternatively finds the optimal outcome prediction and the optimal weighting of the marker genes guided by a protein-protein interaction network. Under the hypothesis that cancer-related genes tend to interact with each other, the HyperGene algorithm uses a protein-protein interaction network as prior knowledge by imposing a consistent weighting of interacting genes. Our experimental results on two large-scale breast cancer gene expression datasets show that HyperGene utilizing a curated roteinprotein interaction network achieves significantly improved cancer outcome prediction. Moreover, HyperGene can also retrieve many known cancer genes as highly weighted marker genes.",

author = "Taehyun Hwang and Ze Tian and Rui Kuang and Kocher, {Jean Pierre}",

year = "2008",

doi = "10.1109/ICDM.2008.37",

language = "English (US)",

isbn = "9780769535029",

series = "Proceedings - IEEE International Conference on Data Mining, ICDM",

pages = "293--302",

booktitle = "Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008",

note = "8th IEEE International Conference on Data Mining, ICDM 2008 ; Conference date: 15-12-2008 Through 19-12-2008",

}

TY - GEN

T1 - Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction

AU - Hwang, Taehyun

AU - Tian, Ze

AU - Kuang, Rui

AU - Kocher, Jean Pierre

PY - 2008

Y1 - 2008

N2 - Building reliable predictive models from multiple complementary genomic data for cancer study is a crucial step towards successful cancer treatment and a full understanding of the underlying biological principles. To tackle this challenging data integration problem, we propose a hypergraph-based learning algorithm called HyperGene to integrate microarray gene expressions and protein-protein interactions for cancer outcome prediction and biomarker identification. HyperGene is a robust two-step iterative method that alternatively finds the optimal outcome prediction and the optimal weighting of the marker genes guided by a protein-protein interaction network. Under the hypothesis that cancer-related genes tend to interact with each other, the HyperGene algorithm uses a protein-protein interaction network as prior knowledge by imposing a consistent weighting of interacting genes. Our experimental results on two large-scale breast cancer gene expression datasets show that HyperGene utilizing a curated roteinprotein interaction network achieves significantly improved cancer outcome prediction. Moreover, HyperGene can also retrieve many known cancer genes as highly weighted marker genes.

AB - Building reliable predictive models from multiple complementary genomic data for cancer study is a crucial step towards successful cancer treatment and a full understanding of the underlying biological principles. To tackle this challenging data integration problem, we propose a hypergraph-based learning algorithm called HyperGene to integrate microarray gene expressions and protein-protein interactions for cancer outcome prediction and biomarker identification. HyperGene is a robust two-step iterative method that alternatively finds the optimal outcome prediction and the optimal weighting of the marker genes guided by a protein-protein interaction network. Under the hypothesis that cancer-related genes tend to interact with each other, the HyperGene algorithm uses a protein-protein interaction network as prior knowledge by imposing a consistent weighting of interacting genes. Our experimental results on two large-scale breast cancer gene expression datasets show that HyperGene utilizing a curated roteinprotein interaction network achieves significantly improved cancer outcome prediction. Moreover, HyperGene can also retrieve many known cancer genes as highly weighted marker genes.

UR - http://www.scopus.com/inward/record.url?scp=67049160152&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67049160152&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2008.37

DO - 10.1109/ICDM.2008.37

M3 - Conference contribution

AN - SCOPUS:67049160152

SN - 9780769535029

T3 - Proceedings - IEEE International Conference on Data Mining, ICDM

SP - 293

EP - 302

BT - Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008

T2 - 8th IEEE International Conference on Data Mining, ICDM 2008

Y2 - 15 December 2008 through 19 December 2008

ER -

Learning on weighted hypergraphs to integrate protein interactions and gene expressions for cancer outcome prediction

Abstract

Publication series

Other

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this