Not logged in.

Contribution Details

Type Conference or Workshop Paper
Scope Discipline-based scholarship
Published in Proceedings Yes
Title Perm: Processing provenance and data on the same data model through query rewriting
Organization Unit
Authors
  • B Glavic
  • G Alonso
Presentation Type paper
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
ISBN 978-0-7695-3545-6
ISSN 1084-4627
Page Range 174 - 185
Event Title 25th International Conference on Data Engineering
Event Type conference
Event Location Shanghai
Event Start Date March 29 - 2009
Event End Date April 2 - 2009
Series Name International Conference on Data Engineering. Proceedings
Number 25
Place of Publication Shanghai, China.
Publisher IEEE Computer Society
Abstract Text Data provenance is information that describes how a given data item was produced. The provenance includes source and intermediate data as well as the transformations involved in producing the concrete data item. In the context of a relational databases, the source and intermediate data items are relations, tuples and attribute values. The transformations are SQL queries and/or functions on the relational data items. Existing approaches capture provenance information by extending the underlying data model. This has the intrinsic disadvantage that the provenance must be stored and accessed using a different model than the actual data. In this paper, we present an alternative approach that uses query rewriting to annotate result tuples with provenance information. The rewritten query and its result use the same model and can, thus, be queried, stored and optimized using standard relational database techniques. In the paper we formalize the query rewriting procedures, prove their correctness, and evaluate a first implementation of the ideas using PostgreSQL. As the experiments indicate, our approach efficiently provides provenance information inducing only a small overhead on normal operations.
Free access at Official URL
Digital Object Identifier 10.1109/ICDE.2009.15
Other Identification Number merlin-id:254
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)
Keywords provenance, relational databases, query rewrite