TY - GEN
T1 - Polyadic regression and its application to chemogenomics
AU - Perros, Ioakeim
AU - Wang, Fei
AU - Zhang, Ping
AU - Walker, Peter
AU - Vuduc, Richard
AU - Pathak, Jyotishman
AU - Sun, Jimeng
N1 - Funding Information:
This work was supported by NSF grants under number IIS-1418511 and CCF-1533768, research partnership between Children’s Healthcare of Atlanta and the Georgia Institute of Technology, Google Faculty Award and UCB. Portions of this research were supported by Defense Medical Research Program Grant DM130137. The opinions of the authors do not necessarily reflect those of the United States Navy. PB Walker is a military service member. This work was prepared as part of their official duties. Title 17 U.S.C. 101 defines U.S. Government work as a work prepared by a military service member or employee of the U.S. Government as part of that person’s official duties. The work of Fei Wang is partially supported by NSF grant under number IIS-1650723. The first author would like to thank Mohammad Taha Bahadori for his valuable feedback which improved the paper’s clarity.
Publisher Copyright:
Copyright © by SIAM.
PY - 2017
Y1 - 2017
N2 - We study the problem of Polyadic Prediction, where the input consists of an ordered tuple of objects, and the goal is to predict a measurement associated with them. Many tasks can be naturally framed as Polyadic Prediction problems. In drug discovery, for instance, it is important to estimate the treatment effect of a drug on various tissue-specific diseases, as it is expressed over the available genes. Thus, we essentially predict the expression value measurements for several (drug, gene, tissue) triads. To tackle Polyadic Prediction problems, we propose a general framework, called Polyadic Regression, predicting measurements associated with multiple objects. Our framework is inductive, in the sense of enabling predictions for new objects, unseen during training. Our model is expressive, exploring high-order, polyadic interactions in an efficient manner. An alternating Proximal Gradient Descent procedure is proposed to fit our model. We perform an extensive evaluation using real-world chemogenomics data, where we illustrate the superior performance of Polyadic Regression over the prior art. Our method achieves an increase of 0:06 and 0:1 in Spearman correlation between the predicted and the actual measurement vectors, for predicting missing polyadic data and predicting polyadic data for new drugs, respectively.
AB - We study the problem of Polyadic Prediction, where the input consists of an ordered tuple of objects, and the goal is to predict a measurement associated with them. Many tasks can be naturally framed as Polyadic Prediction problems. In drug discovery, for instance, it is important to estimate the treatment effect of a drug on various tissue-specific diseases, as it is expressed over the available genes. Thus, we essentially predict the expression value measurements for several (drug, gene, tissue) triads. To tackle Polyadic Prediction problems, we propose a general framework, called Polyadic Regression, predicting measurements associated with multiple objects. Our framework is inductive, in the sense of enabling predictions for new objects, unseen during training. Our model is expressive, exploring high-order, polyadic interactions in an efficient manner. An alternating Proximal Gradient Descent procedure is proposed to fit our model. We perform an extensive evaluation using real-world chemogenomics data, where we illustrate the superior performance of Polyadic Regression over the prior art. Our method achieves an increase of 0:06 and 0:1 in Spearman correlation between the predicted and the actual measurement vectors, for predicting missing polyadic data and predicting polyadic data for new drugs, respectively.
KW - Chemogenomics
KW - Polyadic prediction
KW - Tensors
UR - http://www.scopus.com/inward/record.url?scp=85027854385&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85027854385&partnerID=8YFLogxK
U2 - 10.1137/1.9781611974973.9
DO - 10.1137/1.9781611974973.9
M3 - Conference contribution
AN - SCOPUS:85027854385
T3 - Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017
SP - 72
EP - 80
BT - Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017
A2 - Chawla, Nitesh
A2 - Wang, Wei
PB - Society for Industrial and Applied Mathematics Publications
T2 - 17th SIAM International Conference on Data Mining, SDM 2017
Y2 - 27 April 2017 through 29 April 2017
ER -