Learning classifiers from distributed, ontology-extended data sources

Doina Caragea, Jun Zhang, Jyotishman Pathak, Vasant Honavar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

There is an urgent need for sound approaches to integrative and collaborative analysis of large, autonomous (and hence, inevitably semantically heterogeneous) data sources in several increasingly data-rich application domains. In this paper, we precisely formulate and solve the problem of learning classifiers from such data sources, in a setting where each data source has a hierarchical ontology associated with it and semantic correspondences between data source ontologies and a user ontology are supplied. The proposed approach yields algorithms for learning a broad class of classifiers (including Bayesian networks, decision trees, etc.) from semantically heterogeneous distributed data with strong performance guarantees relative to their centralized counterparts. We illustrate the application of the proposed approach in the case of learning Naive Bayes classifiers from distributed, ontology-extended data sources.

Original languageEnglish (US)
Title of host publicationData Warehousing and Knowledge Discovery - 8th International Conference, DaWaK 2006, Proceedings
PublisherSpringer Verlag
Pages363-373
Number of pages11
ISBN (Print)3540377360, 9783540377368
DOIs
StatePublished - 2006
Event8th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2006 - Krakow, Poland
Duration: Sep 4 2006Sep 8 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4081 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other8th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2006
CountryPoland
CityKrakow
Period9/4/069/8/06

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Learning classifiers from distributed, ontology-extended data sources'. Together they form a unique fingerprint.

  • Cite this

    Caragea, D., Zhang, J., Pathak, J., & Honavar, V. (2006). Learning classifiers from distributed, ontology-extended data sources. In Data Warehousing and Knowledge Discovery - 8th International Conference, DaWaK 2006, Proceedings (pp. 363-373). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4081 LNCS). Springer Verlag. https://doi.org/10.1007/11823728_35