BmQGen: Biomedical query generator for knowledge discovery

Feichen Shen, Hongfang D Liu, Sunghwan Sohn, David Larson, Yugyung Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

A large number of structured and unstructured data (e.g., EHRs, ontologies, reports) have been introduced by the biomedical community. Cross-domain data integration is identified as an important research problem for translational research. From an application perspective, identifying related concepts among medical ontologies is an important goal of life science research. It is essential to analyze how relations are specified to connect concepts in a single ontology or across multiple ontologies. With the explosion of cross domain datasets, it is extremely hard for researchers to discover knowledge from current infrastructures of ontologies. It is mainly a lack of the connectivity between the ontologies' cross domains and ontologies to unstructured data; even if they have specific biomedical knowledge in a more general and comprehensive level. Therefore, there is a need for a mechanism to do semantic partition and query generation for cross domain biomedical knowledge discovery. In this paper, we present such a model that clusters integrated data based on semantic closeness of predicates into different groups and produces meaningful queries to fully discover knowledge over a set of interlinked data sources. We have implemented a prototype of the BmQGen system and evaluated the proposed query model based on the predicate oriented clustering with colorectal surgical cohort from the Mayo Clinic.

Original languageEnglish (US)
Title of host publicationProceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1092-1097
Number of pages6
ISBN (Print)9781467367981
DOIs
StatePublished - Dec 16 2015
EventIEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 - Washington, United States
Duration: Nov 9 2015Nov 12 2015

Other

OtherIEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015
CountryUnited States
CityWashington
Period11/9/1511/12/15

Fingerprint

Semantics
Data mining
Ontology
Translational Medical Research
Biological Science Disciplines
Explosions
Information Storage and Retrieval
Research
Cluster Analysis
Research Personnel
Data integration
Datasets

Keywords

  • biomedical cross domain
  • knowledge discovery
  • query generation
  • semantic

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Health Informatics
  • Biomedical Engineering

Cite this

Shen, F., Liu, H. D., Sohn, S., Larson, D., & Lee, Y. (2015). BmQGen: Biomedical query generator for knowledge discovery. In Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 (pp. 1092-1097). [7359833] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BIBM.2015.7359833

BmQGen : Biomedical query generator for knowledge discovery. / Shen, Feichen; Liu, Hongfang D; Sohn, Sunghwan; Larson, David; Lee, Yugyung.

Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015. Institute of Electrical and Electronics Engineers Inc., 2015. p. 1092-1097 7359833.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Shen, F, Liu, HD, Sohn, S, Larson, D & Lee, Y 2015, BmQGen: Biomedical query generator for knowledge discovery. in Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015., 7359833, Institute of Electrical and Electronics Engineers Inc., pp. 1092-1097, IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015, Washington, United States, 11/9/15. https://doi.org/10.1109/BIBM.2015.7359833
Shen F, Liu HD, Sohn S, Larson D, Lee Y. BmQGen: Biomedical query generator for knowledge discovery. In Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015. Institute of Electrical and Electronics Engineers Inc. 2015. p. 1092-1097. 7359833 https://doi.org/10.1109/BIBM.2015.7359833
Shen, Feichen ; Liu, Hongfang D ; Sohn, Sunghwan ; Larson, David ; Lee, Yugyung. / BmQGen : Biomedical query generator for knowledge discovery. Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 1092-1097
@inproceedings{81566ec08cf44799bf091c48abd58095,
title = "BmQGen: Biomedical query generator for knowledge discovery",
abstract = "A large number of structured and unstructured data (e.g., EHRs, ontologies, reports) have been introduced by the biomedical community. Cross-domain data integration is identified as an important research problem for translational research. From an application perspective, identifying related concepts among medical ontologies is an important goal of life science research. It is essential to analyze how relations are specified to connect concepts in a single ontology or across multiple ontologies. With the explosion of cross domain datasets, it is extremely hard for researchers to discover knowledge from current infrastructures of ontologies. It is mainly a lack of the connectivity between the ontologies' cross domains and ontologies to unstructured data; even if they have specific biomedical knowledge in a more general and comprehensive level. Therefore, there is a need for a mechanism to do semantic partition and query generation for cross domain biomedical knowledge discovery. In this paper, we present such a model that clusters integrated data based on semantic closeness of predicates into different groups and produces meaningful queries to fully discover knowledge over a set of interlinked data sources. We have implemented a prototype of the BmQGen system and evaluated the proposed query model based on the predicate oriented clustering with colorectal surgical cohort from the Mayo Clinic.",
keywords = "biomedical cross domain, knowledge discovery, query generation, semantic",
author = "Feichen Shen and Liu, {Hongfang D} and Sunghwan Sohn and David Larson and Yugyung Lee",
year = "2015",
month = "12",
day = "16",
doi = "10.1109/BIBM.2015.7359833",
language = "English (US)",
isbn = "9781467367981",
pages = "1092--1097",
booktitle = "Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - BmQGen

T2 - Biomedical query generator for knowledge discovery

AU - Shen, Feichen

AU - Liu, Hongfang D

AU - Sohn, Sunghwan

AU - Larson, David

AU - Lee, Yugyung

PY - 2015/12/16

Y1 - 2015/12/16

N2 - A large number of structured and unstructured data (e.g., EHRs, ontologies, reports) have been introduced by the biomedical community. Cross-domain data integration is identified as an important research problem for translational research. From an application perspective, identifying related concepts among medical ontologies is an important goal of life science research. It is essential to analyze how relations are specified to connect concepts in a single ontology or across multiple ontologies. With the explosion of cross domain datasets, it is extremely hard for researchers to discover knowledge from current infrastructures of ontologies. It is mainly a lack of the connectivity between the ontologies' cross domains and ontologies to unstructured data; even if they have specific biomedical knowledge in a more general and comprehensive level. Therefore, there is a need for a mechanism to do semantic partition and query generation for cross domain biomedical knowledge discovery. In this paper, we present such a model that clusters integrated data based on semantic closeness of predicates into different groups and produces meaningful queries to fully discover knowledge over a set of interlinked data sources. We have implemented a prototype of the BmQGen system and evaluated the proposed query model based on the predicate oriented clustering with colorectal surgical cohort from the Mayo Clinic.

AB - A large number of structured and unstructured data (e.g., EHRs, ontologies, reports) have been introduced by the biomedical community. Cross-domain data integration is identified as an important research problem for translational research. From an application perspective, identifying related concepts among medical ontologies is an important goal of life science research. It is essential to analyze how relations are specified to connect concepts in a single ontology or across multiple ontologies. With the explosion of cross domain datasets, it is extremely hard for researchers to discover knowledge from current infrastructures of ontologies. It is mainly a lack of the connectivity between the ontologies' cross domains and ontologies to unstructured data; even if they have specific biomedical knowledge in a more general and comprehensive level. Therefore, there is a need for a mechanism to do semantic partition and query generation for cross domain biomedical knowledge discovery. In this paper, we present such a model that clusters integrated data based on semantic closeness of predicates into different groups and produces meaningful queries to fully discover knowledge over a set of interlinked data sources. We have implemented a prototype of the BmQGen system and evaluated the proposed query model based on the predicate oriented clustering with colorectal surgical cohort from the Mayo Clinic.

KW - biomedical cross domain

KW - knowledge discovery

KW - query generation

KW - semantic

UR - http://www.scopus.com/inward/record.url?scp=84962388411&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84962388411&partnerID=8YFLogxK

U2 - 10.1109/BIBM.2015.7359833

DO - 10.1109/BIBM.2015.7359833

M3 - Conference contribution

AN - SCOPUS:84962388411

SN - 9781467367981

SP - 1092

EP - 1097

BT - Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015

PB - Institute of Electrical and Electronics Engineers Inc.

ER -