Not logged in.

Contribution Details

Type Journal Article
Scope Discipline-based scholarship
Title Disjoint interval partitioning
Organization Unit
Authors
  • Francesco Cafagna
  • Michael Hanspeter Böhlen
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Journal Title VLDB Journal
Publisher Springer
Geographical Reach international
ISSN 1066-8888
Volume 26
Number 3
Page Range 447 - 466
Date 2017
Abstract Text In databases with time interval attributes, query processing techniques that are based on sort-merge or sort-aggregate deteriorate. This happens because for intervals no total order exists and either the start or end point is used for the sorting. Doing so leads to inefficient solutions with lots of unproductive comparisons that do not produce an output tuple. Even if just one tuple with a long interval is present in the data, the number of unproductive comparisons of sort-merge and sort-aggregate gets quadratic. In this paper we propose disjoint interval partitioning (\(\mathcal {DIP}\)), a technique to efficiently perform sort-based operators on interval data. \(\mathcal {DIP}\) divides an input relation into the minimum number of partitions, such that all tuples in a partition are non-overlapping. The absence of overlapping tuples guarantees efficient sort-merge computations without backtracking. With \(\mathcal {DIP}\) the number of unproductive comparisons is linear in the number of partitions. In contrast to current solutions with inefficient random accesses to the active tuples, \(\mathcal {DIP}\) fetches the tuples in a partition sequentially. We illustrate the generality and efficiency of \(\mathcal {DIP}\) by describing and evaluating three basic database operators over interval data: join, anti-join and aggregation.
Digital Object Identifier 10.1007/s00778-017-0456-7
Other Identification Number merlin-id:14867
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)
Keywords Hardware and Architecture, Information Systems