FOCIH: Form-based ontology creation and information harvesting

Cui Tao, David W. Embley, Stephen W. Liddle

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

12 Scopus citations

Abstract

Creating an ontology and populating it with data are both labor-intensive tasks requiring a high degree of expertise. Thus, scaling ontology creation and population to the size of the web in an effort to create a web of data - which some see as Web 3.0 - is prohibitive. Can we find ways to streamline these tasks and lower the barrier enough to enable Web 3.0? Toward this end we offer a form-based approach to ontology creation that provides a way to create Web 3.0 ontologies without the need for specialized training. And we offer a way to semi-automatically harvest data from the current web of pages for a Web 3.0 ontology. In addition to harvesting information with respect to an ontology, the approach also annotates web pages and links facts in web pages to ontological concepts, resulting in a web of data superimposed over the web of pages. Experience with our prototype system shows that mappings between conceptual-model-based ontologies and forms are sufficient for creating the kind of ontologies needed for Web 3.0, and experiments with our prototype system show that automatic harvesting, automatic annotation, and automatic superimposition of a web of data over a web of pages work well.

Original language: English (US)
Title of host publication: Conceptual Modeling - ER 2009 - 28th International Conference on Conceptual Modeling, Proceedings
Pages: 346-359
Number of pages: 14
State: Published - 2009
Event: 28th International Conference on Conceptual Modeling, ER 2009 - Gramado, Brazil
Duration: Nov 9 2009 to Nov 12 2009

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5829 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 28th International Conference on Conceptual Modeling, ER 2009
Country/Territory: Brazil
City: Gramado
Period: 11/9/09 to 11/12/09

Keywords

  • Automatic annotation of web pages
  • Information harvesting from the web
  • Ontology generation from forms
  • Web 3.0
  • Web of data

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
