Seed-based generation of personalized bio-ontologies for information extraction

Cui Tao, David W. Embley

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

Biologists usually focus on only a small, individualized, subdomain of the huge domain of biology. With respect to their sub-domain, they often need data collected from various different web resources. In this research, we provide a tool with which biologists can generate a sub-domain-size, user-specific ontology that can extract data from web resources. The central idea is to let a user provide a seed, which consists of a single data instance embedded within the concepts of interest, Given a seed, the system can generate an extraction ontology, match information with the user's view based on the seed, and collect information from online repositories. Our initial experimentations Indicate that our prototype system can successfully match source data with an ontology seed and gather information from different sources with respect to user-specific, personalized views.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages74-84
Number of pages11
Volume4802 LNCS
StatePublished - 2007
Externally publishedYes
Event26th International Conference on Conceptual Modeling, ER 2007 - Auckland, New Zealand
Duration: Nov 5 2007Nov 9 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4802 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other26th International Conference on Conceptual Modeling, ER 2007
CountryNew Zealand
CityAuckland
Period11/5/0711/9/07

Fingerprint

Information Storage and Retrieval
Information Extraction
Ontology
Seed
Seeds
Resources
Experimentation
Repository
Biology
Prototype
Research

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Cite this

Tao, C., & Embley, D. W. (2007). Seed-based generation of personalized bio-ontologies for information extraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4802 LNCS, pp. 74-84). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4802 LNCS).

Seed-based generation of personalized bio-ontologies for information extraction. / Tao, Cui; Embley, David W.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4802 LNCS 2007. p. 74-84 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4802 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tao, C & Embley, DW 2007, Seed-based generation of personalized bio-ontologies for information extraction. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 4802 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4802 LNCS, pp. 74-84, 26th International Conference on Conceptual Modeling, ER 2007, Auckland, New Zealand, 11/5/07.
Tao C, Embley DW. Seed-based generation of personalized bio-ontologies for information extraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4802 LNCS. 2007. p. 74-84. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Tao, Cui ; Embley, David W. / Seed-based generation of personalized bio-ontologies for information extraction. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4802 LNCS 2007. pp. 74-84 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{5f87377ffe104418a0e6ec247039f83f,
title = "Seed-based generation of personalized bio-ontologies for information extraction",
abstract = "Biologists usually focus on only a small, individualized, subdomain of the huge domain of biology. With respect to their sub-domain, they often need data collected from various different web resources. In this research, we provide a tool with which biologists can generate a sub-domain-size, user-specific ontology that can extract data from web resources. The central idea is to let a user provide a seed, which consists of a single data instance embedded within the concepts of interest, Given a seed, the system can generate an extraction ontology, match information with the user's view based on the seed, and collect information from online repositories. Our initial experimentations Indicate that our prototype system can successfully match source data with an ontology seed and gather information from different sources with respect to user-specific, personalized views.",
author = "Cui Tao and Embley, {David W.}",
year = "2007",
language = "English (US)",
isbn = "9783540762911",
volume = "4802 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "74--84",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Seed-based generation of personalized bio-ontologies for information extraction

AU - Tao, Cui

AU - Embley, David W.

PY - 2007

Y1 - 2007

N2 - Biologists usually focus on only a small, individualized, subdomain of the huge domain of biology. With respect to their sub-domain, they often need data collected from various different web resources. In this research, we provide a tool with which biologists can generate a sub-domain-size, user-specific ontology that can extract data from web resources. The central idea is to let a user provide a seed, which consists of a single data instance embedded within the concepts of interest, Given a seed, the system can generate an extraction ontology, match information with the user's view based on the seed, and collect information from online repositories. Our initial experimentations Indicate that our prototype system can successfully match source data with an ontology seed and gather information from different sources with respect to user-specific, personalized views.

AB - Biologists usually focus on only a small, individualized, subdomain of the huge domain of biology. With respect to their sub-domain, they often need data collected from various different web resources. In this research, we provide a tool with which biologists can generate a sub-domain-size, user-specific ontology that can extract data from web resources. The central idea is to let a user provide a seed, which consists of a single data instance embedded within the concepts of interest, Given a seed, the system can generate an extraction ontology, match information with the user's view based on the seed, and collect information from online repositories. Our initial experimentations Indicate that our prototype system can successfully match source data with an ontology seed and gather information from different sources with respect to user-specific, personalized views.

UR - http://www.scopus.com/inward/record.url?scp=38349044139&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=38349044139&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9783540762911

VL - 4802 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 74

EP - 84

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -