Semantator: Semantic annotator for converting biomedical text to linked data

Cui Tao, Dezhao Song, Deepak Sharma, Christopher G. Chute

Research output: Contribution to journal (Article)

15 Citations (Scopus)

Abstract

More than 80% of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it difficult to browse and query the data of interest in them. One approach to facilitating browsing and querying of biomedical text is to convert the plain text to a linked web of data, i.e., to convert data originally in free text to structured formats with defined meta-level semantics. In this paper, we introduce Semantator (Semantic Annotator), a semantic-web-based environment for annotating data of interest in biomedical documents, browsing and querying the annotated data, and interactively refining annotation results if needed. Through Semantator, information of interest can be annotated either manually or semi-automatically using plug-in information extraction tools. The annotated results are stored in RDF and can be queried using the SPARQL query language. In addition, semantic reasoners can be applied directly to the annotated data for consistency checking and knowledge inference. Semantator has been released online and has been used by the biomedical ontology community, which provided positive feedback. Our evaluation results indicated that (1) Semantator can perform the annotation functionalities as designed; (2) Semantator can be adopted in real applications in clinical and transactional research; and (3) the results annotated using Semantator can be easily used in semantic-web-based reasoning tools for further inference.
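
The abstract's idea of storing annotations as RDF-style triples and querying them can be illustrated with a minimal sketch. This is not Semantator's implementation or API: the example sentence, the `ex:` identifiers, and the helper functions `match` and `texts_of_type` are all hypothetical, and a tiny in-memory pattern matcher stands in for a real RDF store and SPARQL engine.

```python
from typing import Iterable, Iterator, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical annotations for the sentence
# "Patient was prescribed aspirin for chest pain."
# Every ex:* identifier below is invented for illustration only.
TRIPLES: List[Triple] = [
    ("ex:ann1", "rdf:type", "ex:Medication"),
    ("ex:ann1", "ex:coversText", "aspirin"),
    ("ex:ann2", "rdf:type", "ex:Symptom"),
    ("ex:ann2", "ex:coversText", "chest pain"),
    ("ex:ann1", "ex:treats", "ex:ann2"),
]


def match(
    triples: Iterable[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> Iterator[Triple]:
    """Yield triples matching a single pattern; None acts as a wildcard."""
    for t in triples:
        if (s is None or t[0] == s) and (p is None or t[1] == p) and (
            o is None or t[2] == o
        ):
            yield t


def texts_of_type(triples: List[Triple], type_name: str) -> List[str]:
    """Join two patterns, roughly what this SPARQL query would express:

    SELECT ?text WHERE { ?a rdf:type <type_name> . ?a ex:coversText ?text }
    """
    result = []
    for subj, _, _ in match(triples, p="rdf:type", o=type_name):
        for _, _, text in match(triples, s=subj, p="ex:coversText"):
            result.append(text)
    return result


if __name__ == "__main__":
    print(texts_of_type(TRIPLES, "ex:Medication"))
```

Once free text has been turned into typed, linked statements like these, a question such as "which text spans were annotated as medications" becomes a structured query rather than a string search. A production setup would use an RDF library and a SPARQL endpoint instead of this toy matcher.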

Original language: English (US)
Pages (from-to): 882-893
Number of pages: 12
Journal: Journal of Biomedical Informatics
Volume: 46
Issue number: 5
DOI: 10.1016/j.jbi.2013.07.003
State: Published - Oct 2013

Fingerprint

Semantics
Semantic Web
Biological Ontologies
Query languages
Information Storage and Retrieval
Refining
Ontology
Feedback
Language
Research

Keywords

  • Clinical narratives
  • Protege plugin
  • Semantator
  • Semantic Annotation
  • Semantic web

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Cite this

Semantator : Semantic annotator for converting biomedical text to linked data. / Tao, Cui; Song, Dezhao; Sharma, Deepak; Chute, Christopher G.

In: Journal of Biomedical Informatics, Vol. 46, No. 5, 10.2013, p. 882-893.

Tao, Cui ; Song, Dezhao ; Sharma, Deepak ; Chute, Christopher G. / Semantator : Semantic annotator for converting biomedical text to linked data. In: Journal of Biomedical Informatics. 2013 ; Vol. 46, No. 5. pp. 882-893.
@article{5e2bd8fc16a44ad9b8000a872377e9c6,
title = "Semantator: Semantic annotator for converting biomedical text to linked data",
abstract = "More than 80{\%} of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it difficult to browse and query the data of interest in them. One approach to facilitating browsing and querying of biomedical text is to convert the plain text to a linked web of data, i.e., to convert data originally in free text to structured formats with defined meta-level semantics. In this paper, we introduce Semantator (Semantic Annotator), a semantic-web-based environment for annotating data of interest in biomedical documents, browsing and querying the annotated data, and interactively refining annotation results if needed. Through Semantator, information of interest can be annotated either manually or semi-automatically using plug-in information extraction tools. The annotated results are stored in RDF and can be queried using the SPARQL query language. In addition, semantic reasoners can be applied directly to the annotated data for consistency checking and knowledge inference. Semantator has been released online and has been used by the biomedical ontology community, which provided positive feedback. Our evaluation results indicated that (1) Semantator can perform the annotation functionalities as designed; (2) Semantator can be adopted in real applications in clinical and transactional research; and (3) the results annotated using Semantator can be easily used in semantic-web-based reasoning tools for further inference.",
keywords = "Clinical narratives, Protege plugin, Semantator, Semantic Annotation, Semantic web",
author = "Cui Tao and Dezhao Song and Deepak Sharma and Chute, {Christopher G.}",
year = "2013",
month = "10",
doi = "10.1016/j.jbi.2013.07.003",
language = "English (US)",
volume = "46",
pages = "882--893",
journal = "Journal of Biomedical Informatics",
issn = "1532-0464",
publisher = "Academic Press Inc.",
number = "5",

}

TY - JOUR

T1 - Semantator

T2 - Semantic annotator for converting biomedical text to linked data

AU - Tao, Cui

AU - Song, Dezhao

AU - Sharma, Deepak

AU - Chute, Christopher G.

PY - 2013/10

Y1 - 2013/10

N2 - More than 80% of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it difficult to browse and query the data of interest in them. One approach to facilitating browsing and querying of biomedical text is to convert the plain text to a linked web of data, i.e., to convert data originally in free text to structured formats with defined meta-level semantics. In this paper, we introduce Semantator (Semantic Annotator), a semantic-web-based environment for annotating data of interest in biomedical documents, browsing and querying the annotated data, and interactively refining annotation results if needed. Through Semantator, information of interest can be annotated either manually or semi-automatically using plug-in information extraction tools. The annotated results are stored in RDF and can be queried using the SPARQL query language. In addition, semantic reasoners can be applied directly to the annotated data for consistency checking and knowledge inference. Semantator has been released online and has been used by the biomedical ontology community, which provided positive feedback. Our evaluation results indicated that (1) Semantator can perform the annotation functionalities as designed; (2) Semantator can be adopted in real applications in clinical and transactional research; and (3) the results annotated using Semantator can be easily used in semantic-web-based reasoning tools for further inference.

KW - Clinical narratives

KW - Protege plugin

KW - Semantator

KW - Semantic Annotation

KW - Semantic web

UR - http://www.scopus.com/inward/record.url?scp=84883821995&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84883821995&partnerID=8YFLogxK

U2 - 10.1016/j.jbi.2013.07.003

DO - 10.1016/j.jbi.2013.07.003

M3 - Article

C2 - 23867104

AN - SCOPUS:84883821995

VL - 46

SP - 882

EP - 893

JO - Journal of Biomedical Informatics

JF - Journal of Biomedical Informatics

SN - 1532-0464

IS - 5

ER -