Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature

Majid Rastegar-Mojarad, Ravikumar Komandur Elayavilli, Dingcheng Li, Hongfang Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the significance of the need of discourse-level analysis in extracting the relations from biomedical literature.

Original languageEnglish (US)
Title of host publicationMEDINFO 2015
Subtitle of host publicationeHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics
EditorsAndrew Georgiou, Indra Neil Sarkar, Paulo Mazzoncini de Azevedo Marques
PublisherIOS Press
Pages539-543
Number of pages5
ISBN (Electronic)9781614995630
DOIs
StatePublished - 2015
Event15th World Congress on Health and Biomedical Informatics, MEDINFO 2015 - Sao Paulo, Brazil
Duration: Aug 19 2015Aug 23 2015

Publication series

NameStudies in Health Technology and Informatics
Volume216
ISSN (Print)0926-9630
ISSN (Electronic)1879-8365

Other

Other15th World Congress on Health and Biomedical Informatics, MEDINFO 2015
CountryBrazil
CitySao Paulo
Period8/19/158/23/15

Keywords

  • Discourse-level analysis
  • Literature-based discovery
  • Relation extraction
  • Semantic Medline

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics
  • Health Information Management

Fingerprint Dive into the research topics of 'Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature'. Together they form a unique fingerprint.

  • Cite this

    Rastegar-Mojarad, M., Komandur Elayavilli, R., Li, D., & Liu, H. (2015). Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature. In A. Georgiou, I. N. Sarkar, & P. M. de Azevedo Marques (Eds.), MEDINFO 2015: eHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics (pp. 539-543). (Studies in Health Technology and Informatics; Vol. 216). IOS Press. https://doi.org/10.3233/978-1-61499-564-7-539