Convex optimization for binary classifier aggregation in multiclass problems

Sunho Park; Tae Hyun Hwang; Seungjin Choi

doi:10.1137/1.9781611973440.32

Artificial Intelligence and Informatics

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Multiclass problems are often decomposed into multiple binary problems, each solved by an individual binary classifier, whose results are then integrated into a final answer. Various methods, including all-pairs (APs), one-versus-all (OVA), and error-correcting output codes (ECOC), have been studied for decomposing multiclass problems into binary problems. However, little work has addressed how to optimally aggregate the binary classifiers' outputs to determine the final answer to the multiclass problem. In this paper we present a convex optimization method for optimally aggregating binary classifiers to estimate class membership probabilities in multiclass problems. We model the class membership probability as a softmax function that takes as input a conic combination of the discrepancies induced by the individual binary classifiers. With this model, we formulate regularized maximum likelihood estimation as a convex optimization problem, which is solved by a primal-dual interior point method. We also present connections between our method and large margin classifiers, showing that the large margin formulation can be considered a limiting case of our convex formulation. In experiments on human disease classification, we demonstrate that our method outperforms existing aggregation methods as well as direct methods in terms of classification accuracy and F-score.
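The model described in the abstract can be illustrated with a small sketch. It is not the authors' implementation, and several pieces are assumptions made only for illustration: the discrepancy is taken to be a hinge-type loss between each binary classifier's output and each class codeword, the code matrix is one-versus-all, the data are synthetic, the regularizer is a squared L2 penalty, and the non-negativity (conic) constraint is handled by projected gradient descent rather than the primal-dual interior point method the paper uses. The names `class_probabilities` and `fit_weights` are hypothetical.

```python
import numpy as np


def class_probabilities(weights, discrepancies):
    """Softmax over negated conic combinations of discrepancies.

    discrepancies: array (n_samples, n_classes, n_classifiers), non-negative.
    weights: array (n_classifiers,), assumed to be >= 0.
    """
    scores = -(discrepancies @ weights)                  # (n_samples, n_classes)
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum(axis=1, keepdims=True)


def fit_weights(discrepancies, labels, reg=0.1, step=0.1, n_iter=500):
    """Regularized maximum likelihood under the constraint weights >= 0.

    Assumption: projected gradient descent stands in for the interior
    point solver; the objective is convex in the weights either way.
    """
    n_samples, n_classes, n_classifiers = discrepancies.shape
    weights = np.ones(n_classifiers)
    one_hot = np.eye(n_classes)[labels]
    for _ in range(n_iter):
        probs = class_probabilities(weights, discrepancies)
        # Gradient of the mean negative log-likelihood plus 0.5 * reg * ||w||^2.
        grad = np.einsum("nk,nkj->j", one_hot - probs, discrepancies) / n_samples
        grad += reg * weights
        # Projection onto the non-negative orthant keeps the combination conic.
        weights = np.maximum(weights - step * grad, 0.0)
    return weights


# Synthetic one-versus-all setup (illustrative assumption, not the paper's data).
rng = np.random.default_rng(0)
n_samples, n_classes = 300, 3
code = 2.0 * np.eye(n_classes) - 1.0  # rows: classes, columns: binary classifiers
labels = rng.integers(0, n_classes, size=n_samples)
outputs = code[labels] + 0.5 * rng.standard_normal((n_samples, n_classes))

# Hinge-type discrepancy between classifier j's output and class k's code bit.
discrepancies = np.maximum(0.0, 1.0 - code[None, :, :] * outputs[:, None, :])

weights = fit_weights(discrepancies, labels)
probs = class_probabilities(weights, discrepancies)
accuracy = float((probs.argmax(axis=1) == labels).mean())
print("learned weights:", weights)
print("training accuracy:", accuracy)
```

Because the score for each class is linear in the weights, the negative log-likelihood is a log-sum-exp of affine functions and therefore convex, which is what makes a generic convex solver applicable in this sketch.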

Original language	English (US)
Title of host publication	SIAM International Conference on Data Mining 2014, SDM 2014
Editors	Mohammed J. Zaki, Arindam Banerjee, Srinivasan Parthasarathy, Pang Ning-Tan, Zoran Obradovic, Chandrika Kamath
Publisher	Society for Industrial and Applied Mathematics Publications
Pages	280-288
Number of pages	9
ISBN (Electronic)	9781510811515
DOIs	https://doi.org/10.1137/1.9781611973440.32
State	Published - 2014
Event	14th SIAM International Conference on Data Mining, SDM 2014 - Philadelphia, United States
Duration	Apr 24 2014 → Apr 26 2014

Publication series

Name	SIAM International Conference on Data Mining 2014, SDM 2014
Volume	1

Conference

Conference	14th SIAM International Conference on Data Mining, SDM 2014
Country/Territory	United States
City	Philadelphia
Period	4/24/14 → 4/26/14

Keywords

Binary classifier aggregation
Convex optimization
Human disease classification
Large margin learning
Multiclass learning

ASJC Scopus subject areas

Computer Science Applications
Software

Cite this

Park, S., Hwang, T. H., & Choi, S. (2014). Convex optimization for binary classifier aggregation in multiclass problems. In M. J. Zaki, A. Banerjee, S. Parthasarathy, P. Ning-Tan, Z. Obradovic, & C. Kamath (Eds.), SIAM International Conference on Data Mining 2014, SDM 2014 (pp. 280-288). (SIAM International Conference on Data Mining 2014, SDM 2014; Vol. 1). Society for Industrial and Applied Mathematics Publications. https://doi.org/10.1137/1.9781611973440.32

Convex optimization for binary classifier aggregation in multiclass problems. / Park, Sunho; Hwang, Tae Hyun; Choi, Seungjin.
SIAM International Conference on Data Mining 2014, SDM 2014. ed. / Mohammed J. Zaki; Arindam Banerjee; Srinivasan Parthasarathy; Pang Ning-Tan; Zoran Obradovic; Chandrika Kamath. Society for Industrial and Applied Mathematics Publications, 2014. p. 280-288 (SIAM International Conference on Data Mining 2014, SDM 2014; Vol. 1).

Park, S, Hwang, TH & Choi, S 2014, Convex optimization for binary classifier aggregation in multiclass problems. in MJ Zaki, A Banerjee, S Parthasarathy, P Ning-Tan, Z Obradovic & C Kamath (eds), SIAM International Conference on Data Mining 2014, SDM 2014. SIAM International Conference on Data Mining 2014, SDM 2014, vol. 1, Society for Industrial and Applied Mathematics Publications, pp. 280-288, 14th SIAM International Conference on Data Mining, SDM 2014, Philadelphia, United States, 4/24/14. https://doi.org/10.1137/1.9781611973440.32

Park S, Hwang TH, Choi S. Convex optimization for binary classifier aggregation in multiclass problems. In Zaki MJ, Banerjee A, Parthasarathy S, Ning-Tan P, Obradovic Z, Kamath C, editors, SIAM International Conference on Data Mining 2014, SDM 2014. Society for Industrial and Applied Mathematics Publications. 2014. p. 280-288. (SIAM International Conference on Data Mining 2014, SDM 2014). doi: 10.1137/1.9781611973440.32

Park, Sunho ; Hwang, Tae Hyun ; Choi, Seungjin. / Convex optimization for binary classifier aggregation in multiclass problems. SIAM International Conference on Data Mining 2014, SDM 2014. editor / Mohammed J. Zaki ; Arindam Banerjee ; Srinivasan Parthasarathy ; Pang Ning-Tan ; Zoran Obradovic ; Chandrika Kamath. Society for Industrial and Applied Mathematics Publications, 2014. pp. 280-288 (SIAM International Conference on Data Mining 2014, SDM 2014).

@inproceedings{d1281271701c4536b2a697edffb323cb,
  title = "Convex optimization for binary classifier aggregation in multiclass problems",
  abstract = "Multiclass problems are often decomposed into multiple binary problems that are solved by individual binary classifiers whose results are integrated into a final answer. Various methods, including all-pairs (APs), one-versus-all (OVA), and error correcting output code (ECOC), have been studied, to decompose multiclass problems into binary problems. However, little study has been made to optimally aggregate binary problems to determine a final answer to the multiclass problem. In this paper we present a convex optimization method for an optimal aggregation of binary classifiers to estimate class membership probabilities in multiclass problems. We model the class membership probability as a softmax function which takes a conic combination of discrepancies induced by individual binary classifiers, as an input. With this model, we formulate the regularized maximum likelihood estimation as a convex optimization problem, which is solved by the primal-dual interior point method. Connections of our method to large margin classifiers are presented, showing that the large margin formulation can be considered as a limiting case of our convex formulation. In the experiments on human disease classification, we demonstrate that our method outperforms existing aggregation methods as well as direct methods, in terms of the classification accuracy and F-score.",
  keywords = "Binary classifier aggregation, Convex optimization, Human disease classification, Large margin learning, Multiclass learning",
  author = "Sunho Park and Hwang, {Tae Hyun} and Seungjin Choi",
  note = "Publisher Copyright: Copyright {\textcopyright} SIAM.; 14th SIAM International Conference on Data Mining, SDM 2014 ; Conference date: 24-04-2014 Through 26-04-2014",
  year = "2014",
  doi = "10.1137/1.9781611973440.32",
  language = "English (US)",
  series = "SIAM International Conference on Data Mining 2014, SDM 2014",
  publisher = "Society for Industrial and Applied Mathematics Publications",
  pages = "280--288",
  editor = "Zaki, {Mohammed J.} and Arindam Banerjee and Srinivasan Parthasarathy and Pang Ning-Tan and Zoran Obradovic and Chandrika Kamath",
  booktitle = "SIAM International Conference on Data Mining 2014, SDM 2014",
  address = "United States",
}

TY - GEN
T1 - Convex optimization for binary classifier aggregation in multiclass problems
AU - Park, Sunho
AU - Hwang, Tae Hyun
AU - Choi, Seungjin
PY - 2014
Y1 - 2014
N2 - Multiclass problems are often decomposed into multiple binary problems that are solved by individual binary classifiers whose results are integrated into a final answer. Various methods, including all-pairs (APs), one-versus-all (OVA), and error correcting output code (ECOC), have been studied, to decompose multiclass problems into binary problems. However, little study has been made to optimally aggregate binary problems to determine a final answer to the multiclass problem. In this paper we present a convex optimization method for an optimal aggregation of binary classifiers to estimate class membership probabilities in multiclass problems. We model the class membership probability as a softmax function which takes a conic combination of discrepancies induced by individual binary classifiers, as an input. With this model, we formulate the regularized maximum likelihood estimation as a convex optimization problem, which is solved by the primal-dual interior point method. Connections of our method to large margin classifiers are presented, showing that the large margin formulation can be considered as a limiting case of our convex formulation. In the experiments on human disease classification, we demonstrate that our method outperforms existing aggregation methods as well as direct methods, in terms of the classification accuracy and F-score.
AB - Multiclass problems are often decomposed into multiple binary problems that are solved by individual binary classifiers whose results are integrated into a final answer. Various methods, including all-pairs (APs), one-versus-all (OVA), and error correcting output code (ECOC), have been studied, to decompose multiclass problems into binary problems. However, little study has been made to optimally aggregate binary problems to determine a final answer to the multiclass problem. In this paper we present a convex optimization method for an optimal aggregation of binary classifiers to estimate class membership probabilities in multiclass problems. We model the class membership probability as a softmax function which takes a conic combination of discrepancies induced by individual binary classifiers, as an input. With this model, we formulate the regularized maximum likelihood estimation as a convex optimization problem, which is solved by the primal-dual interior point method. Connections of our method to large margin classifiers are presented, showing that the large margin formulation can be considered as a limiting case of our convex formulation. In the experiments on human disease classification, we demonstrate that our method outperforms existing aggregation methods as well as direct methods, in terms of the classification accuracy and F-score.
KW - Binary classifier aggregation
KW - Convex optimization
KW - Human disease classification
KW - Large margin learning
KW - Multiclass learning
UR - http://www.scopus.com/inward/record.url?scp=84959867863&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84959867863&partnerID=8YFLogxK
U2 - 10.1137/1.9781611973440.32
DO - 10.1137/1.9781611973440.32
M3 - Conference contribution
AN - SCOPUS:84959867863
T3 - SIAM International Conference on Data Mining 2014, SDM 2014
SP - 280
EP - 288
BT - SIAM International Conference on Data Mining 2014, SDM 2014
A2 - Zaki, Mohammed J.
A2 - Banerjee, Arindam
A2 - Parthasarathy, Srinivasan
A2 - Ning-Tan, Pang
A2 - Obradovic, Zoran
A2 - Kamath, Chandrika
PB - Society for Industrial and Applied Mathematics Publications
T2 - 14th SIAM International Conference on Data Mining, SDM 2014
Y2 - 24 April 2014 through 26 April 2014
ER -
