FOCIH

Form-based ontology creation and information harvesting

Cui Tao, David W. Embley, Stephen W. Liddle

Research output: Chapter in Book/Report/Conference proceedingConference contribution

12 Citations (Scopus)

Abstract

Creating an ontology and populating it with data are both labor-intensive tasks requiring a high degree of expertise. Thus, scaling ontology creation and population to the size of the web in an effort to create a web of data - which some see as Web 3.0 - is prohibitive. Can we find ways to streamline these tasks and lower the barrier enough to enable Web 3.0? Toward this end we offer a form-based approach to ontology creation that provides a way to create Web 3.0 ontologies without the need for specialized training. And we offer a way to semi-automatically harvest data from the current web of pages for a Web 3.0 ontology. In addition to harvesting information with respect to an ontology, the approach also annotates web pages and links facts in web pages to ontological concepts, resulting in a web of data superimposed over the web of pages. Experience with our prototype system shows that mappings between conceptual-model-based ontologies and forms are sufficient for creating the kind of ontologies needed for Web 3.0, and experiments with our prototype system show that automatic harvesting, automatic annotation, and automatic superimposition of a web of data over a web of pages work well.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages346-359
Number of pages14
Volume5829 LNCS
DOIs
StatePublished - 2009
Externally publishedYes
Event28th International Conference on Conceptual Modeling, ER 2009 - Gramado, Brazil
Duration: Nov 9 2009Nov 12 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5829 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other28th International Conference on Conceptual Modeling, ER 2009
CountryBrazil
CityGramado
Period11/9/0911/12/09

Fingerprint

Harvesting
Ontology
Websites
Form
World Wide Web
Prototype
Personnel
Conceptual Model
Streamlines
Expertise
Annotation

Keywords

  • Automatic annotation of web pages
  • Information harvesting from the web
  • Ontology generation from forms
  • Web 3.0
  • Web of data

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Tao, C., Embley, D. W., & Liddle, S. W. (2009). FOCIH: Form-based ontology creation and information harvesting. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5829 LNCS, pp. 346-359). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5829 LNCS). https://doi.org/10.1007/978-3-642-04840-1_26

FOCIH : Form-based ontology creation and information harvesting. / Tao, Cui; Embley, David W.; Liddle, Stephen W.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5829 LNCS 2009. p. 346-359 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5829 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tao, C, Embley, DW & Liddle, SW 2009, FOCIH: Form-based ontology creation and information harvesting. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 5829 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5829 LNCS, pp. 346-359, 28th International Conference on Conceptual Modeling, ER 2009, Gramado, Brazil, 11/9/09. https://doi.org/10.1007/978-3-642-04840-1_26
Tao C, Embley DW, Liddle SW. FOCIH: Form-based ontology creation and information harvesting. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5829 LNCS. 2009. p. 346-359. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-04840-1_26
Tao, Cui ; Embley, David W. ; Liddle, Stephen W. / FOCIH : Form-based ontology creation and information harvesting. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5829 LNCS 2009. pp. 346-359 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{9f016a65fde64e5eb696a94b8ec2c078,
title = "FOCIH: Form-based ontology creation and information harvesting",
abstract = "Creating an ontology and populating it with data are both labor-intensive tasks requiring a high degree of expertise. Thus, scaling ontology creation and population to the size of the web in an effort to create a web of data - which some see as Web 3.0 - is prohibitive. Can we find ways to streamline these tasks and lower the barrier enough to enable Web 3.0? Toward this end we offer a form-based approach to ontology creation that provides a way to create Web 3.0 ontologies without the need for specialized training. And we offer a way to semi-automatically harvest data from the current web of pages for a Web 3.0 ontology. In addition to harvesting information with respect to an ontology, the approach also annotates web pages and links facts in web pages to ontological concepts, resulting in a web of data superimposed over the web of pages. Experience with our prototype system shows that mappings between conceptual-model-based ontologies and forms are sufficient for creating the kind of ontologies needed for Web 3.0, and experiments with our prototype system show that automatic harvesting, automatic annotation, and automatic superimposition of a web of data over a web of pages work well.",
keywords = "Automatic annotation of web pages, Information harvesting from the web, Ontology generation from forms, Web 3.0, Web of data",
author = "Cui Tao and Embley, {David W.} and Liddle, {Stephen W.}",
year = "2009",
doi = "10.1007/978-3-642-04840-1_26",
language = "English (US)",
isbn = "3642048390",
volume = "5829 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "346--359",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - FOCIH

T2 - Form-based ontology creation and information harvesting

AU - Tao, Cui

AU - Embley, David W.

AU - Liddle, Stephen W.

PY - 2009

Y1 - 2009

N2 - Creating an ontology and populating it with data are both labor-intensive tasks requiring a high degree of expertise. Thus, scaling ontology creation and population to the size of the web in an effort to create a web of data - which some see as Web 3.0 - is prohibitive. Can we find ways to streamline these tasks and lower the barrier enough to enable Web 3.0? Toward this end we offer a form-based approach to ontology creation that provides a way to create Web 3.0 ontologies without the need for specialized training. And we offer a way to semi-automatically harvest data from the current web of pages for a Web 3.0 ontology. In addition to harvesting information with respect to an ontology, the approach also annotates web pages and links facts in web pages to ontological concepts, resulting in a web of data superimposed over the web of pages. Experience with our prototype system shows that mappings between conceptual-model-based ontologies and forms are sufficient for creating the kind of ontologies needed for Web 3.0, and experiments with our prototype system show that automatic harvesting, automatic annotation, and automatic superimposition of a web of data over a web of pages work well.

AB - Creating an ontology and populating it with data are both labor-intensive tasks requiring a high degree of expertise. Thus, scaling ontology creation and population to the size of the web in an effort to create a web of data - which some see as Web 3.0 - is prohibitive. Can we find ways to streamline these tasks and lower the barrier enough to enable Web 3.0? Toward this end we offer a form-based approach to ontology creation that provides a way to create Web 3.0 ontologies without the need for specialized training. And we offer a way to semi-automatically harvest data from the current web of pages for a Web 3.0 ontology. In addition to harvesting information with respect to an ontology, the approach also annotates web pages and links facts in web pages to ontological concepts, resulting in a web of data superimposed over the web of pages. Experience with our prototype system shows that mappings between conceptual-model-based ontologies and forms are sufficient for creating the kind of ontologies needed for Web 3.0, and experiments with our prototype system show that automatic harvesting, automatic annotation, and automatic superimposition of a web of data over a web of pages work well.

KW - Automatic annotation of web pages

KW - Information harvesting from the web

KW - Ontology generation from forms

KW - Web 3.0

KW - Web of data

UR - http://www.scopus.com/inward/record.url?scp=78650700298&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78650700298&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-04840-1_26

DO - 10.1007/978-3-642-04840-1_26

M3 - Conference contribution

SN - 3642048390

SN - 9783642048395

VL - 5829 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 346

EP - 359

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -