Not logged in.

Contribution Details

Type Journal Article
Scope Discipline-based scholarship
Title Parsimonious temporal aggregation
Organization Unit
Authors
  • Juozas Gordevicius
  • Johann Gamper
  • Michael Böhlen
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Journal Title VLDB Journal
Publisher Springer
Geographical Reach international
ISSN 1066-8888 (P) 0949-877X (E)
Volume 21
Number 3
Page Range 309 - 332
Date 2012
Abstract Text Temporal aggregation is an important operationin temporal databases, and different variants thereof havebeen proposed. In this paper, we introduce a novel temporalaggregation operator, termed parsimonious temporal aggregation (PTA), that overcomes major limitations of existingapproaches. PTA takes the result of instant temporal aggregation (ITA) of size n, which might be up to twice as largeas the argument relation, and merges similar tuples untila given error () or size (c) bound is reached. The newoperator is data-adaptive and allows the user to control thetrade-off between the result size and the error introduced bymerging. For the precise evaluation of PTA queries, we propose two dynamic programming-based algorithms for sizeand error-bounded queries, respectively, with a worst-casecomplexity that is quadratic in n. We present two optimizations that take advantage of temporal gaps and differentaggregation groups and achieve a linear runtime in experiments with real-world data. For the quick computation ofan approximate PTA answer, we propose an efficient greedymerging strategy with a precision that is upper bounded byO(log n). We present two algorithms that implement thisstrategy and begin to merge as ITA tuples are produced. Theyrequire O(n log(c + β)) time and O(c + β) space, where βis the size of a read-ahead buffer and is typically very small. An empirical evaluation on real-world and synthetic datashows that PTA considerably reduces the size of the aggregation result, yet introducing only small errors. The greedyalgorithms are scalable for large data sets and introduce lesserror than other approximation techniques.
Free access at Related URL
Related URLs
Digital Object Identifier 10.1007/s00778-011-0243-9
Other Identification Number merlin-id:7323
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)