Not logged in.

Contribution Details

Type Technical Report
Scope Discipline-based scholarship
Title SimPack: A Generic Java Library for Similarity Measures in Ontologies
Organization Unit
Authors
  • Abraham Bernstein
  • Esther Kaufmann
  • Christoph Kiefer
  • Christoph Bürki
Number IFI-2008.0008
Date August 2005
Abstract Text Good similarity measures are central for techniques such as retrieval, matchmaking, clustering, data-mining, ontology translations, automatic database schema matching, and simple object comparisons. Measures for the use with complex (or aggregated) objects in ontologies are, however, rare, even though they are central for semantic web applications. This paper first introduces SimPack, a library of similarity measures for the use in ontologies (of complex objects). The measures of the library are then experimentally compared with a similarity ``gold standard'' established by surveying 94 human subjects in two ontologies. Results show that human and algorithm assessments vary (both between people and across ontologies), but can be grouped into cohesive clusters, each of which is well modeled by one of the measures in the library. Furthermore, we show two increasingly accurate methods to predict the cluster membership of the subjects providing the foundation for the construction of personalized similarity measures.
PDF File Download
Export BibTeX