Contribution Details

Type Journal Article
Scope Discipline-based scholarship
Title Dynamic Interleaving of Content and Structure for Robust Indexing of Semi-Structured Hierarchical Data
Organization Unit
Authors
  • Kevin Wellenzohn
  • Michael Hanspeter Böhlen
  • Sven Helmer
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Journal Title Proceedings of the VLDB Endowment
Publisher Association for Computing Machinery
Geographical Reach International
ISSN 2150-8097
Volume 13
Number 10
Page Range 1641 - 1653
Date 2020
Abstract Text We propose a robust index for semi-structured hierarchical data that supports content-and-structure (CAS) queries specified by path and value predicates. At the heart of our approach is a novel dynamic interleaving scheme that merges the path and value dimensions of composite keys in a balanced way. We store these keys in our trie-based Robust Content-And-Structure index, which efficiently supports a wide range of CAS queries, including queries with wildcards and descendant axes. Additionally, we show important properties of our scheme, such as robustness against varying selectivities, and demonstrate improvements of up to two orders of magnitude over existing approaches in our experimental evaluation.
Official URL http://www.vldb.org/pvldb/vol13/p1641-wellenzohn.pdf
Digital Object Identifier 10.14778/3401960.3401963
Other Identification Number merlin-id:20733