Not logged in.

Quick Search - Contribution

Contribution Details

Type	Conference or Workshop Paper
Scope	Discipline-based scholarship
Published in Proceedings	Yes
Title	Parsimonious temporal aggregation
Organization Unit	Database Technology (Michael Hanspeter Böhlen)
Authors	Juozas Gordevicius Johann Gamper Michael Böhlen
Presentation Type	paper
Item Subtype	Original Work
Refereed	Yes
Status	Published in final form
Language	English
ISSN	0949-877X
Page Range	1006 - 1017
Event Title	EDBT/ICDT 2009 Joint Conference
Event Type	conference
Event Location	St. Petersburg, Russia
Event Start Date	March 23 - 2009
Event End Date	March 26 - 2009
Number	21(3)
Abstract Text	Temporal aggregation is a crucial operator in temporal databases and has been studied in various flavors, including instant temporal aggregation (ITA) and span temporal aggregation (STA), each having its strengths and weaknesses. In this paper we define a new temporal aggregation operator, called parsimonious temporal aggregation (PTA), which comprises two main steps: (i) it computes the ITA result over the input relation and (ii) it compresses this intermediate result to a user-specified size c by merging adjacent tuples and keeping the induced total error minimal; the compressed ITA result is returned as the final result. By considering the distribution of the input data and allowing to control the result size, PTA combines the best features of ITA and STA. We provide two evaluation algorithms for PTA queries. First, the oPTA algorithm computes an exact solution, by applying dynamic programming to explore all possibilities to compress the ITA result and selecting the compression with the minimal total error. It runs in O(n2pc) time and O(n2) space, where n is the size of the input relation and p is the number of aggregation functions in the query. Second, the more efficient gPTA algorithm computes an approximate solution by greedily merging the most similar ITA result tuples, which, however, does not guarantee a compression with a minimal total error. gPTA intermingles the two steps of PTA and avoids large intermediate results. The compression step of gPTA runs in O(np log(c + ?)) time and O(c + ?) space, where ? is a small buffer for ""look ahead"". An empirical evaluation shows good results: considerable reductions of the result size introduce only small errors, and gPTA scales to large data sets and is only slightly worse than the exact solution of PTA.
Official URL	http://www.edbt.org/Proceedings/2009-StPetersburg/edbt/papers/p1006-Gordevicius.pdf
Related URLs	https://www.zora.uzh.ch/65048/
Digital Object Identifier	10.1145/1516360.1516475
Other Identification Number	merlin-id:2293
PDF File	Download from ZORA
Export	BibTeX EP3 XML (ZORA)
Additional Information	The VLDB Journal, June 2012, Volume 21, Issue 3, pp 309-332