Not logged in.

Contribution Details

Type Journal Article
Scope Discipline-based scholarship
Title Towards the Web of Embeddings: Integrating multiple knowledge graph embedding spaces with FedCoder
Organization Unit
Authors
  • Matthias Baumgartner
  • Daniele Dell’Aglio
  • Heiko Paulheim
  • Abraham Bernstein
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Journal Title Journal of Web Semantics
Publisher Elsevier
Geographical Reach international
ISSN 1570-8268
Volume 75
Page Range 100741
Date 2023
Abstract Text The Semantic Web is distributed yet interoperable: Distributed since resources are created and published by a variety of producers, tailored to their specific needs and knowledge; Interoperable as entities are linked across resources, allowing to use resources from different providers in concord. Complementary to the explicit usage of Semantic Web resources, embedding methods made them applicable to machine learning tasks. Subsequently, embedding models for numerous tasks and structures have been developed, and embedding spaces for various resources have been published. The ecosystem of embedding spaces is distributed but not interoperable: Entity embeddings are not readily comparable across different spaces. To parallel the Web of Data with a Web of Embeddings, we must thus integrate available embedding spaces into a uniform space. Current integration approaches are limited to two spaces and presume that both of them were embedded with the same method — both assumptions are unlikely to hold in the context of a Web of Embeddings. In this paper, we present FedCoder— an approach that integrates multiple embedding spaces via a latent space. We assert that linked entities have a similar representation in the latent space so that entities become comparable across embedding spaces. FedCoder employs an autoencoder to learn this latent space from linked as well as non-linked entities. Our experiments show that FedCoder substantially outperforms state-of-the-art approaches when faced with different embedding models, that it scales better than previous methods in the number of embedding spaces, and that it improves with more graphs being integrated whilst performing comparably with current approaches that assumed joint learning of the embeddings and were, usually, limited to two sources. Our results demonstrate that FedCoder is well adapted to integrate the distributed, diverse, and large ecosystem of embeddings spaces into an interoperable Web of Embeddings.
Digital Object Identifier 10.1016/j.websem.2022.100741
Other Identification Number merlin-id:22641
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)