Not logged in.

Quick Search - Contribution

Contribution Details

Type	Master's Thesis
Scope	Discipline-based scholarship
Title	Semantic Flow Processing with Events and Facts
Organization Unit	Dynamic and Distributed Information Systems (Abraham Bernstein)
Authors	Pascal Muther
Supervisors	Abraham Bernstein Shen Gao
Language	English
Institution	University of Zurich
Faculty	Faculty of Economics, Business Administration and Information Technology
Number of Pages	80
Date	2014
Abstract Text	The combination of the semantic web with stream data opens new possibilities for information retrieval. This thesis develops a system for processing Linked Data streams – Linked Esper. It takes TEF-SPARQL as input query, which handles both event and derived facts. The syntax of the TEF-SPARQL grammar is improved and implemented in this project. The Linked Esper compiles the input query into runnable statements of a stream engine – Esper. Aside from the local execution mode, it also offers a distributed execution mode, which enables scalable processing of large amounts of data. In local as well as in distributed execution mode, different optimization algorithms are applied which improves the overall performances as shown by the experiment results.The evaluation reveals that: first, Linked Esper can handle the most important query features of TEF-SPARQL. Second, it compiles input query to a optimized query plan and then to a distributed execution plan. The results also show that the optimized plan achieves a higher throughput. In addition, partitioning the data flow graph based on optimization algorithms as well as parallelization increases the efficiency of Linked Esper.
Zusammenfassung	Die Kombination von Semantic Web mit Datenströmen eröffnet neue Möglichkeiten in der Wissensgewinnung. In dieser Arbeit wird ein System für das Verarbeiten von Linked Data Datenströmen entwickelt – Linked Esper. Es ermöglicht auch die Ableitung und Verarbeitung von temporären Fakten. Die Syntax von TEF-SPARQL wird formal definiert und dient als Eingabesprache für Linked Esper. Linked Esper ist sowohl lokal als auch verteilt ausführbar und ist daher ein Werkzeug um auch grosse Datenmengen verarbeiten zu können. Im lokalen sowie auch im verteilten Modus werden Prozessoptimierungsalgorithmen angewendet, welche die Leistung des Systems erhöhen. Die Evaluation zeigt, dass durch die Verteilung auf verschiedene ausführende Systeme ein erhöhter Datendurchsatz erreicht werden kann. Eine optimale Partitionierung des Datenflussgraphen und Parallelisierung erhöht die Effizienz von Linked Esper.
PDF File	Download
Export	BibTeX