Not logged in.

Quick Search - Contribution

Contribution Details

Type	Bachelor's Thesis
Scope	Discipline-based scholarship
Title	SPARQL query evaluation in Big Data processors
Organization Unit	Dynamic and Distributed Information Systems (Abraham Bernstein)
Authors	Cyrus Einsele
Supervisors	Daniele Dell' Aglio Abraham Bernstein
Language	English
Institution	University of Zurich
Faculty	Faculty of Business, Economics and Informatics
Date	2017
Abstract Text	This thesis aims to investigate the usage of Big Data processors, namely Apache Beam, to extract information from Semantic Web data, specifically RDF. We present a Java application that can match SPARQL basic graph patterns, within a stream of triples. It does so utilizing the open-source Big Data processing framework Apache Beam, which is largely based upon Google Cloud Dataflow. We describe the principles of the algorithm used to achieve this goal and evaluate the correctness and the performance of the application by comparing it to a traditional SPARQL query execution engine, namely Apache Jena ARQ.
Zusammenfassung	Das Ziel dieser Arbeit besteht darin herauszufinden wie Big Data Verarbeitungswerkzeuge, namentlich Apache Beam, daf¸r verwendet werden kˆnnen um Informationen aus Daten des Semantic Webs, insbesondere RDF, zu extrahieren. Wir pr‰sentieren eine Java Applikation, welche es ermˆglicht Basic Graph Patterns welche in SPARQL kodiert sind in einem Stream von Tripeln zu entdecken. Daf¸r verwenden wir das Open-Source Big Data Framework Apache Beam. Dieses Framework basiert stark auf Google Cloud Dataflow. Wir beschreiben das Funktionsprinzip des Algorithmus welcher der Applikation zugrunde liegt und evaluieren die Korrektheit und die Performanz der Applikation anhand eines Vergleichs mit einem herkˆmmlichen SPARQL Query Exekutionsengine, namentlich Apache Jena ARQ.
PDF File	Download
Export	BibTeX