Not logged in.

Contribution Details

Type Master's Thesis
Scope Discipline-based scholarship
Title End-to-End lmplementation of Pair-Wise Correlation Computation in a Streaming System
Organization Unit
Authors
  • Wenqing Chang
Supervisors
  • Sven Helmer
  • Michael Hanspeter Böhlen
Language
  • English
Institution University of Zurich
Faculty Faculty of Business, Economics and Informatics
Date 2023
Abstract Text This thesis aims to address a key challenge in time-series data analysis - efficiently identifying correlations between various data streams. As the prominence of sensor networks rises, the analysis of time-series data has become increasingly crucial. The insights derived from these data streams hold significant intelligence, but extracting them efficiently remains a complex task. This thesis brings into focus the dimensionality-reduction filter-and-refine techniques, designed to expedite the process of identifying correlations. Despite their utility, these techniques lack a comprehensive comparative analysis over streaming systems, and this thesis seeks to fill this gap. The core objective is to implement these techniques within a streaming platform, enabling benchmarking under realistic conditions. The thesis is divided into several chapters that provide a comprehensive overview of the problem, delve into the dimensionality reduction algorithms, and discuss their implementation within a streaming system. Particular emphasis is placed on the Filter-and-Refine Algorithm on the Kafka Streaming Platform. Two distinct design approaches, one based on Kafka Streams and another simpler Producer-Consumer design, are implemented, compared, and evaluated. The thesis culminates with an exhaustive series of experiments assessing the performance of the implemented algorithms. The ultimate goal is to provide a framework that not only implements the techniques but also evaluates their performance, aiming to contribute to the field of time-series data analysis.
PDF File Download
Export BibTeX