A Topic-modeling Based Framework for Drug-drug Interaction Classification from Biomedical Text

Dingcheng Li; Sijia Liu; Majid Rastegar-Mojarad; Yanshan Wang; Vipin Chaudhary; Terry Therneau; Hongfang Liu

A Topic-modeling Based Framework for Drug-drug Interaction Classification from Biomedical Text

Dingcheng Li, Sijia Liu, Majid Rastegar-Mojarad, Yanshan Wang, Vipin Chaudhary, Terry Therneau, Hongfang Liu

Research output: Contribution to journal › Article › peer-review

Abstract

Classification of drug-drug interaction (DDI) from medical literatures is significant in preventing medication-related errors. Most of the existing machine learning approaches are based on supervised learning methods. However, the dynamic nature of drug knowledge, combined with the enormity and rapidly growing of the biomedical literatures make supervised DDI classification methods easily overfit the corpora and may not meet the needs of real-world applications. In this paper, we proposed a relation classification framework based on topic modeling (RelTM) augmented with distant supervision for the task of DDI from biomedical text. The uniqueness of RelTM lies in its two-level sampling from both DDI and drug entities. Through this design, RelTM take both relation features and drug mention features into considerations. An efficient inference algorithm for the model using Gibbs sampling is also proposed. Compared to the previous supervised models, our approach does not require human efforts such as annotation and labeling, which is its advantage in trending big data applications. Meanwhile, the distant supervision combination allows RelTM to incorporate rich existing knowledge resources provided by domain experts. The experimental results on the 2013 DDI challenge corpus reach 48% in F1 score, showing the effectiveness of RelTM.

Original language	English (US)
Pages (from-to)	789-798
Number of pages	10
Journal	AMIA ... Annual Symposium proceedings. AMIA Symposium
Volume	2016
State	Published - 2016

ASJC Scopus subject areas

General Medicine

Cite this

@article{70716d769afc406cb1ebf181293f0cb4,

title = "A Topic-modeling Based Framework for Drug-drug Interaction Classification from Biomedical Text",

abstract = "Classification of drug-drug interaction (DDI) from medical literatures is significant in preventing medication-related errors. Most of the existing machine learning approaches are based on supervised learning methods. However, the dynamic nature of drug knowledge, combined with the enormity and rapidly growing of the biomedical literatures make supervised DDI classification methods easily overfit the corpora and may not meet the needs of real-world applications. In this paper, we proposed a relation classification framework based on topic modeling (RelTM) augmented with distant supervision for the task of DDI from biomedical text. The uniqueness of RelTM lies in its two-level sampling from both DDI and drug entities. Through this design, RelTM take both relation features and drug mention features into considerations. An efficient inference algorithm for the model using Gibbs sampling is also proposed. Compared to the previous supervised models, our approach does not require human efforts such as annotation and labeling, which is its advantage in trending big data applications. Meanwhile, the distant supervision combination allows RelTM to incorporate rich existing knowledge resources provided by domain experts. The experimental results on the 2013 DDI challenge corpus reach 48% in F1 score, showing the effectiveness of RelTM.",

author = "Dingcheng Li and Sijia Liu and Majid Rastegar-Mojarad and Yanshan Wang and Vipin Chaudhary and Terry Therneau and Hongfang Liu",

year = "2016",

language = "English (US)",

volume = "2016",

pages = "789--798",

journal = "AMIA ... Annual Symposium proceedings. AMIA Symposium",

issn = "1559-4076",

publisher = "American Medical Informatics Association",

}

TY - JOUR

T1 - A Topic-modeling Based Framework for Drug-drug Interaction Classification from Biomedical Text

AU - Li, Dingcheng

AU - Liu, Sijia

AU - Rastegar-Mojarad, Majid

AU - Wang, Yanshan

AU - Chaudhary, Vipin

AU - Therneau, Terry

AU - Liu, Hongfang

PY - 2016

Y1 - 2016

N2 - Classification of drug-drug interaction (DDI) from medical literatures is significant in preventing medication-related errors. Most of the existing machine learning approaches are based on supervised learning methods. However, the dynamic nature of drug knowledge, combined with the enormity and rapidly growing of the biomedical literatures make supervised DDI classification methods easily overfit the corpora and may not meet the needs of real-world applications. In this paper, we proposed a relation classification framework based on topic modeling (RelTM) augmented with distant supervision for the task of DDI from biomedical text. The uniqueness of RelTM lies in its two-level sampling from both DDI and drug entities. Through this design, RelTM take both relation features and drug mention features into considerations. An efficient inference algorithm for the model using Gibbs sampling is also proposed. Compared to the previous supervised models, our approach does not require human efforts such as annotation and labeling, which is its advantage in trending big data applications. Meanwhile, the distant supervision combination allows RelTM to incorporate rich existing knowledge resources provided by domain experts. The experimental results on the 2013 DDI challenge corpus reach 48% in F1 score, showing the effectiveness of RelTM.

AB - Classification of drug-drug interaction (DDI) from medical literatures is significant in preventing medication-related errors. Most of the existing machine learning approaches are based on supervised learning methods. However, the dynamic nature of drug knowledge, combined with the enormity and rapidly growing of the biomedical literatures make supervised DDI classification methods easily overfit the corpora and may not meet the needs of real-world applications. In this paper, we proposed a relation classification framework based on topic modeling (RelTM) augmented with distant supervision for the task of DDI from biomedical text. The uniqueness of RelTM lies in its two-level sampling from both DDI and drug entities. Through this design, RelTM take both relation features and drug mention features into considerations. An efficient inference algorithm for the model using Gibbs sampling is also proposed. Compared to the previous supervised models, our approach does not require human efforts such as annotation and labeling, which is its advantage in trending big data applications. Meanwhile, the distant supervision combination allows RelTM to incorporate rich existing knowledge resources provided by domain experts. The experimental results on the 2013 DDI challenge corpus reach 48% in F1 score, showing the effectiveness of RelTM.

UR - http://www.scopus.com/inward/record.url?scp=85027488840&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85027488840&partnerID=8YFLogxK

M3 - Article

C2 - 28269875

AN - SCOPUS:85027488840

SN - 1559-4076

VL - 2016

SP - 789

EP - 798

JO - AMIA ... Annual Symposium proceedings. AMIA Symposium

JF - AMIA ... Annual Symposium proceedings. AMIA Symposium

ER -

A Topic-modeling Based Framework for Drug-drug Interaction Classification from Biomedical Text

Abstract

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this