Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature

Majid Rastegar-Mojarad; Ravikumar Komandur Elayavilli; Dingcheng Li; Hongfang Liu

doi:10.3233/978-1-61499-564-7-539

Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature

Majid Rastegar-Mojarad, Ravikumar Komandur Elayavilli, Dingcheng Li, Hongfang Liu

Digital Health Sciences

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citations

Abstract

Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the significance of the need of discourse-level analysis in extracting the relations from biomedical literature.

Original language	English (US)
Title of host publication	MEDINFO 2015
Subtitle of host publication	eHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics
Editors	Andrew Georgiou, Indra Neil Sarkar, Paulo Mazzoncini de Azevedo Marques
Publisher	IOS Press
Pages	539-543
Number of pages	5
ISBN (Electronic)	9781614995630
DOIs	https://doi.org/10.3233/978-1-61499-564-7-539
State	Published - 2015
Event	15th World Congress on Health and Biomedical Informatics, MEDINFO 2015 - Sao Paulo, Brazil Duration: Aug 19 2015 → Aug 23 2015

Publication series

Name	Studies in Health Technology and Informatics
Volume	216
ISSN (Print)	0926-9630
ISSN (Electronic)	1879-8365

Other

Other	15th World Congress on Health and Biomedical Informatics, MEDINFO 2015
Country/Territory	Brazil
City	Sao Paulo
Period	8/19/15 → 8/23/15

Keywords

Discourse-level analysis
Literature-based discovery
Relation extraction
Semantic Medline

ASJC Scopus subject areas

Biomedical Engineering
Health Informatics
Health Information Management

Access to Document

10.3233/978-1-61499-564-7-539

Cite this

Rastegar-Mojarad, M., Komandur Elayavilli, R., Li, D., & Liu, H. (2015). Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature. In A. Georgiou, I. N. Sarkar, & P. M. de Azevedo Marques (Eds.), MEDINFO 2015: eHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics (pp. 539-543). (Studies in Health Technology and Informatics; Vol. 216). IOS Press. https://doi.org/10.3233/978-1-61499-564-7-539

Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature. / Rastegar-Mojarad, Majid; Komandur Elayavilli, Ravikumar; Li, Dingcheng et al.
MEDINFO 2015: eHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics. ed. / Andrew Georgiou; Indra Neil Sarkar; Paulo Mazzoncini de Azevedo Marques. IOS Press, 2015. p. 539-543 (Studies in Health Technology and Informatics; Vol. 216).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Rastegar-Mojarad, M, Komandur Elayavilli, R, Li, D & Liu, H 2015, Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature. in A Georgiou, IN Sarkar & PM de Azevedo Marques (eds), MEDINFO 2015: eHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics. Studies in Health Technology and Informatics, vol. 216, IOS Press, pp. 539-543, 15th World Congress on Health and Biomedical Informatics, MEDINFO 2015, Sao Paulo, Brazil, 8/19/15. https://doi.org/10.3233/978-1-61499-564-7-539

Rastegar-Mojarad M, Komandur Elayavilli R, Li D, Liu H. Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature. In Georgiou A, Sarkar IN, de Azevedo Marques PM, editors, MEDINFO 2015: eHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics. IOS Press. 2015. p. 539-543. (Studies in Health Technology and Informatics). doi: 10.3233/978-1-61499-564-7-539

Rastegar-Mojarad, Majid ; Komandur Elayavilli, Ravikumar ; Li, Dingcheng et al. / Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature. MEDINFO 2015: eHealth-Enabled Health - Proceedings of the 15th World Congress on Health and Biomedical Informatics. editor / Andrew Georgiou ; Indra Neil Sarkar ; Paulo Mazzoncini de Azevedo Marques. IOS Press, 2015. pp. 539-543 (Studies in Health Technology and Informatics).

@inproceedings{40fedfc08b2644918508872f701b9d3b,

title = "Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature",

abstract = "Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the significance of the need of discourse-level analysis in extracting the relations from biomedical literature.",

keywords = "Discourse-level analysis, Literature-based discovery, Relation extraction, Semantic Medline",

author = "Majid Rastegar-Mojarad and {Komandur Elayavilli}, Ravikumar and Dingcheng Li and Hongfang Liu",

note = "Publisher Copyright: {\textcopyright} 2015 IMIA and IOS Press.; 15th World Congress on Health and Biomedical Informatics, MEDINFO 2015 ; Conference date: 19-08-2015 Through 23-08-2015",

year = "2015",

doi = "10.3233/978-1-61499-564-7-539",

language = "English (US)",

series = "Studies in Health Technology and Informatics",

publisher = "IOS Press",

pages = "539--543",

editor = "Andrew Georgiou and Sarkar, {Indra Neil} and {de Azevedo Marques}, {Paulo Mazzoncini}",

booktitle = "MEDINFO 2015",

}

TY - GEN

T1 - Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature

AU - Rastegar-Mojarad, Majid

AU - Komandur Elayavilli, Ravikumar

AU - Li, Dingcheng

AU - Liu, Hongfang

PY - 2015

Y1 - 2015

N2 - Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the significance of the need of discourse-level analysis in extracting the relations from biomedical literature.

AB - Relation extraction typically involves the extraction of relations between two or more entities occurring within a single or multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences specifically in the context of drug-disease relation discovery. We used multiple resources such as Semantic Medline, a literature based resource, and Medline search (for filtering spurious results) and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, nearly 75% (407) of the drug-disease pairs occur in multiple sentences. Our analysis revealed that the drug-disease pairs inferred from Semantic Medline or retrieved from CTD could be extracted from multiple sentences in the literature. This highlights the significance of the need of discourse-level analysis in extracting the relations from biomedical literature.

KW - Discourse-level analysis

KW - Literature-based discovery

KW - Relation extraction

KW - Semantic Medline

UR - http://www.scopus.com/inward/record.url?scp=84951950549&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84951950549&partnerID=8YFLogxK

U2 - 10.3233/978-1-61499-564-7-539

DO - 10.3233/978-1-61499-564-7-539

M3 - Conference contribution

C2 - 26262109

AN - SCOPUS:84951950549

T3 - Studies in Health Technology and Informatics

SP - 539

EP - 543

BT - MEDINFO 2015

A2 - Georgiou, Andrew

A2 - Sarkar, Indra Neil

A2 - de Azevedo Marques, Paulo Mazzoncini

PB - IOS Press

T2 - 15th World Congress on Health and Biomedical Informatics, MEDINFO 2015

Y2 - 19 August 2015 through 23 August 2015

ER -

Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this