Tim Draws, Oana Inel, Nava Tintarev, Christian Baden, Benjamin Timmermans, Comprehensive Viewpoint Representations for a Deeper Understanding of User Interactions With Debated Topics, In: ACM SIGIR Conference on Human Information Interaction and Retrieval, ACM, 2022. (Conference or Workshop Paper published in Proceedings)
Research in the area of human information interaction (HII) typically represents viewpoints on debated topics in a binary fashion, as either against or in favor of a given topic (e.g., the feminist movement). This simple taxonomy, however, greatly reduces the latent richness of viewpoints and thereby limits the potential of research and practical applications in this field. Work in the communication sciences has already demonstrated that viewpoints can be represented in much more comprehensive ways, which could enable a deeper understanding of users’ interactions with debated topics online. For instance, a viewpoint's stance usually has a degree of strength (e.g., mild or strong), and, even if two viewpoints support or oppose something to the same degree, they may use different logics of evaluation (i.e., underlying reasons). In this paper, we draw from communication science practice to propose a novel, two-dimensional way of representing viewpoints that incorporates a viewpoint's stance degree as well as its logic of evaluation. We show in a case study of tweets on debated topics how our proposed viewpoint label can be obtained via crowdsourcing with acceptable reliability. By analyzing the resulting data set and conducting a user study, we further show that the two-dimensional viewpoint representation we propose allows for more meaningful analyses and diversification interventions compared to current approaches. Finally, we discuss what this novel viewpoint label implies for HII research and how obtaining it may be made cheaper in the future. |
|
Florian Ruosch, Cristina Sarasua, Abraham Bernstein, BAM: Benchmarking Argument Mining on Scientific Documents, In: The AAAI-22 Workshop on Scientific Document Understanding at the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22), CEUR Workshop Proceedings, 2022. (Conference or Workshop Paper published in Proceedings)
In this paper, we present BAM, a unified Benchmark for Argument Mining (AM). We propose a method to homogenize both the evaluation process and the data to provide a common view in order to ultimately produce comparable results. Built as a four stage and end-to-end pipeline, the benchmark allows for the direct inclusion of additional argument miners to be evaluated. First, our system pre-processes a ground truth set used both for training and testing. Then, the benchmark calculates a total of four measures to assess different aspects of the mining process. To showcase an initial implementation of our approach, we apply our procedure and evaluate a set of systems on a corpus of scientific publications. With the obtained comparable results we can homogeneously assess the current state of AM in this domain. |
|
Silvan Heller, Viktor Gsteiger, Werner Bailer, Cathal Gurrin, Björn þóR Jónsson, Jakub Lokoč, Andreas Leibetseder, František Mejzlík, Ladislav Peška, Luca Rossetto, Konstantin Schall, Klaus Schoeffmann, Heiko Schuldt, Florian Spiess, Ly-Duyen Tran, Lucia Vadicamo, Patrik Veselý, Stefanos Vrochidis, Jiaxin Wu, Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown, International Journal of Multimedia Information Retrieval, Vol. 11 (1), 2022. (Journal Article)
The Video Browser Showdown addresses difficult video search challenges through an annual interactive evaluation campaign attracting research teams focusing on interactive video retrieval. The campaign aims to provide insights into the performance of participating interactive video retrieval systems, tested by selected search tasks on large video collections. For the first time in its ten year history, the Video Browser Showdown 2021 was organized in a fully remote setting and hosted a record number of sixteen scoring systems. In this paper, we describe the competition setting, tasks and results and give an overview of state-of-the-art methods used by the competing systems. By looking at query result logs provided by ten systems, we analyze differences in retrieval model performances and browsing times before a correct submission. Through advances in data gathering methodology and tools, we provide a comprehensive analysis of ad-hoc video search tasks, discuss results, task design and methodological challenges. We highlight that almost all top performing systems utilize some sort of joint embedding for text-image retrieval and enable specification of temporal context in queries for known-item search. Whereas a combination of these techniques drive the currently top performing systems, we identify several future challenges for interactive video search engines and the Video Browser Showdown competition itself. |
|
Lukas Yu, Style Transfer Algorithm for Online News, University of Zurich, Faculty of Business, Economics and Informatics, 2022. (Master's Thesis)
In an experimental setting, data anonymization is vital to get valid results. For studies dealing with news articles, white-labelling their source is a non-trivial task, since news outlets might possess traceable writing styles. In this thesis, modern neural network architectures for natural language processing are utilized to transfer texts to a uniform style. The method does not rely on parallel corpora, which is usually the bottleneck for many systems. Instead, a pseudo-parallel corpus is created using monolingual data and masked-language modeling. Additionally, a new scraper architecture is designed and implemented to easily obtain article from news websites and store them in a homogeneous format. |
|
Sanjay Seetharaman, Shubham Malaviya, Rosni Vasu, Manish Shukla, Sachin Lodha, Influence based defense against data poisoning attacks in online learning, In: 2022 14th International Conference on COMmunication Systems \& NETworkS (COMSNETS), IEEE, Bangalore, India, 2022. (Conference or Workshop Paper published in Proceedings)
|
|
Luca Rossetto, You were saying? - Spoken Language in the V3C Dataset, 2022. (Other Publication)
|
|
Suzanne Tolmeijer, The Right Thing To Do? Artificial Intelligence for Ethical Decision Making, University of Zurich, Faculty of Business, Economics and Informatics, 2022. (Dissertation)
With the advancement of AI technology, an increasing amount of AI applications are being developed and applied in various domains. While some tasks in such applications lend itself well for the strengths of AI, other tasks are more challenging to automate. One example of this is ethical decision making. AI for ethical decision making has not been explored much, among other reasons, for its possibly impactful and ethically loaded results, as well as a lack of ‘ground truth’ on what is considered the right thing to do. However, AI for ethical decision making could both be valuable in explicit ethical decision making domains and increase the ethical use of other AI applications. This thesis fills the mentioned research gap by focusing on if and how AI for ethical decision making can be designed in a way that is acceptable for users.
The investigated research topics that are part of AI for ethical decision making are presented according to the incremental and iterative design cycle (IID), which is often applied in the development of new technology. After an initial planning phase, a design cycle consists of the following phases: planning and requirements, analysis and implementation, testing, and evaluation.
During the first phase, initial planning, we investigate the state of the art of implementing ethical theory in AI, by performing an extensive literature review. Among other results, we find that the field is scattered in terms of the ethical theory and AI types used to create AI for ethical decision making. Additionally, the developed applications consist mostly of prototypes. These results imply that a Wizard of Oz approach is appropriate for the implementation and testing in the design cycle presented in this thesis.
The success of any AI application depends on whether the users trust the AI enough to rely on it. Given the varying opinions regarding a ground truth for ethical AI, where AI decisions can easily be considered to be wrong, we focus on how AI mistakes influence user trust. In the second phase of the design cycle, called planning and requirements, we perform an experiment to investigate the effect of AI mistakes and their timing on user trust and reliance. We find that system inaccuracy negatively influences trust and reliance. Furthermore, the negative effect of AI mistakes is stronger when mistakes are made during the first interaction with the user.
To mitigate these negative effect of AI mistakes, the third phase of analysis and implementation focuses on AI mistakes and how their negative effects can be mitigated, by presenting different interaction designs. This is done by introducing a taxonomy of AI mistakes and appropriate mitigation strategies.
In the fourth testing phase, we use a Wizard of Oz application to test user perception of AI for ethical decision making. We find that while participants had higher moral trust in a human expert and find humans more responsible, they had more capacity trust and overall trust in an AI system for ethical decision making.
In the final phase, evaluation, we describe the consequences of our finding. Since people perceive AI and humans to have different strengths that are both valuable for ethical decision making, we propose an interaction paradigm that utilizes the strengths of both: human-autonomy teaming. For AI and humans to be able to form an effective team, further development of different AI capabilities is needed: agency, communication, shared mental models, intent, and interdependence.
In conclusion, this work contributes to the understanding of user perception of AI for ethical decision making, and suggests design strategies to move research on AI for ethical decision making forward. |
|
Dhivyabharathi Ramasamy, Cristina Sarasua, Alberto Bacchelli, Abraham Bernstein, Workflow analysis of data science code in public GitHub repositories, Empirical Software Engineering, Vol. 28 (7), 2022. (Journal Article)
Despite the ubiquity of data science, we are far from rigorously understanding how coding in data science is performed. Even though the scientific literature has hinted at the iterative and explorative nature of data science coding, we need further empirical evidence to understand this practice and its workflows in detail. Such understanding is critical to recognise the needs of data scientists and, for instance, inform tooling support. To obtain a deeper understanding of the iterative and explorative nature of data science coding, we analysed 470 Jupyter notebooks publicly available in GitHub repositories. We focused on the extent to which data scientists transition between different types of data science activities, or steps (such as data preprocessing and modelling), as well as the frequency and co-occurrence of such transitions. For our analysis, we developed a dataset with the help of five data science experts, who manually annotated the data science steps for each code cell within the aforementioned 470 notebooks. Using the first-order Markov chain model, we extracted the transitions and analysed the transition probabilities between the different steps. In addition to providing deeper insights into the implementation practices of data science coding, our results provide evidence that the steps in a data science workflow are indeed iterative and reveal specific patterns. We also evaluated the use of the annotated dataset to train machine-learning classifiers to predict the data science step(s) of a given code cell. We investigate the representativeness of the classification by comparing the workflow analysis applied to (a) the predicted data set and (b) the data set labelled by experts, finding an F1-score of about 71% for the 10-class data science step prediction problem. |
|
Oana Inel, Lora Aroyo, Fine-tuning machine confidence with human relevance for video discovery, Interactions, Vol. 29 (4), 2022. (Journal Article)
|
|
Jakub Lokoč, Werner Bailer, Kai Uwe Barthel, Cathal Gurrin, Silvan Heller, Björn Þór Jónsson, Ladislav Peška, Luca Rossetto, Klaus Schoeffmann, Lucia Vadicamo, Stefanos Vrochidis, Jiaxin Wu, A Task Category Space for User-Centric Comparative Multimedia Search Evaluations, In: MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science. Part I, Springer, Cham, p. 193 - 204, 2022. (Book Chapter)
In the last decade, user-centric video search competitions have facilitated the evolution of interactive video search systems. So far, these competitions focused on a small number of search task categories, with few attempts to change task category configurations. Based on our extensive experience with interactive video search contests, we have analyzed the spectrum of possible task categories and propose a list of individual axes that define a large space of possible task categories. Using this concept of category space, new user-centric video search competitions can be designed to benchmark video search systems from different perspectives. We further analyse the three task categories considered so far at the Video Browser Showdown and discuss possible (but sometimes challenging) shifts within the task category space. |
|
Alexander Theus, Luca Rossetto, Abraham Bernstein, HyText – A Scene-Text Extraction Method for Video Retrieval, In: MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science, Part II, Springer, Cham, p. 182 - 193, 2022. (Book Chapter)
Scene-text has been shown to be an effective query target for video retrieval applications in a known-item search context. While much progress has been made in scene-text extraction from individual pictures, the special case of video has so far received less attention. This paper introduces HyText, a scene-text extraction method for video with a focus on retrieval applications. HyText uses intermittent scene-text detection in combination with bi-directional tracking in order to increase throughput without reducing detection accuracy. |
|
Silvan Heller, Rahel Arnold, Ralph Gasser, Viktor Gsteiger, Mahnaz Parian-Scherb, Luca Rossetto, Loris Sauter, Florian Spiess, Heiko Schuldt, Multi-modal Interactive Video Retrieval with Temporal Queries, In: MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science. Part II, Springer, Cham, p. 493 - 498, 2022. (Book Chapter)
This paper presents the version of vitrivr participating at the Video Browser Showdown (VBS) 2022. vitrivr already supports a wide range of query modalities, such as color and semantic sketches, OCR, ASR and text embedding. In this paper, we briefly introduce the system, then describe our new approach to queries specifying temporal context, ideas for color-based sketches in a competitive retrieval setting and a novel approach to pose-based queries. |
|
Florian Spiess, Ralph Gasser, Silvan Heller, Mahnaz Parian-Scherb, Luca Rossetto, Loris Sauter, Heiko Schuldt, Multi-modal Video Retrieval in Virtual Reality with vitrivr-VR, In: MultiMedia Modeling. MMM 2022. Lecture Notes in Computer Science. Part II, Springer, Cham, p. 499 - 504, 2022. (Book Chapter)
In multimedia search, appropriate user interfaces (UIs) are essential to enable effective specification of the user’s information needs and the user-friendly presentation of search results. vitrivr-VR addresses these challenges and provides a novel Virtual Reality-based UI on top of the multimedia retrieval system vitrivr. In this paper we present the version of vitrivr-VR participating in the Video Browser Showdown (VBS) 2022. We describe our visual-text co-embedding feature and new query interfaces, namely text entry, pose queries and temporal queries. |
|
Lucien Heitz, Juliane A. Lischka, Alena Birrer, Bibek Paudel, Suzanne Tolmeijer, Laura Laugwitz, Abraham Bernstein, Benefits of Diverse News Recommendations for Democracy: A User Study, Digital Journalism, Vol. 10 (10), 2022. (Journal Article)
News recommender systems provide a technological architecture that helps shaping public discourse. Following a normative approach to news recommender system design, we test utility and external effects of a diversity-aware news recommender algorithm. In an experimental study using a custom-built news app, we show that diversity-optimized recommendations (1) perform similar to methods optimizing for user preferences regarding user utility, (2) that diverse news recommendations are related to a higher tolerance for opposing views, especially for politically conservative users, and (3) that diverse news recommender systems may nudge users towards preferring news with differing or even opposing views. We conclude that diverse news recommendations can have a depolarizing capacity for democratic societies.
|
|
Fan Feng, Natural Language Question Answering via Knowledge Graph Reasoning, University of Zurich, Faculty of Business, Economics and Informatics, 2021. (Master's Thesis)
Knowledge graphs (KGs) have drawn a wide research attention in recent years, since they enable semi-structured information to be stored in an unified, connected and organized way. The inherent features of this data structure are leveraged in many tasks, such as information retrieval, recommendation systems, etc.
Meanwhile, there are challenges in understanding and reasoning on a subset of a KG. One scenario would be question answering over KGs. Natural language questions can be flexible in expressions, which means that it is difficult for machines to retrieve an answer from a KG given a question posed by human.
[Qiu et al., 2020] proposed a reinforcement learning-based (RL-based) approach, which finds answer entities for multi-hop questions via stepwise reasoning over KGs. Inspired by its work, this thesis adopts the model’s main body as a baseline architecture and investigates three research questions.
The premise of KG reasoning is the accurate selection of topic entities. This work adapts a passive entity linker to link question mentions to KG nodes. In reasoning processes, an attention mechanism is implemented to associate history of actions with semantic information from questions, such that an agent can learn on which part of questions to focus.
Conventional RL-based reasoning returns terminal rewards after complete reasoning episodes, resulting in a lack of guidance in sequential decision process. To address this problem, we use potential-based shaping rewards instead. The empirical results show that the reward shaping function improves the hits@1 performances on two benchmarks. |
|
Marco Heiniger, Recommender System for Portfolio Management Based on Social Media, University of Zurich, Faculty of Business, Economics and Informatics, 2021. (Bachelor's Thesis)
In this thesis, a recommender system is built for portfolio management based on social media. With the emergence of social media and so-called influencers, people hold on to recommendations from famous financial investors. However, to what extent the social media posts and other mediums are able to explain changes in the composition of the financial actors remains unknown. This thesis is aimed at answering this question through a pipeline which consists of news scraping, content analysis, and a recommender system. The first two parts are used to create the data model inspired by a knowledge graph, consisting of various information about the financial influencer or the entity. Whereas
the third part, the recommender system, proposes user-based or item-based recommendations, with the addition that various parameters can be set to create different investing strategies. Moreover, it should be included that the system allows user-specific recommendations for a certain period of time, which sets a basis for future research questions. |
|
Chandrayee Basu, Rosni Vasu, Michihiro Yasunaga, Sohyeong Kim, Qian Yang, Automatic Medical Text Simplification: Challenges of Data Quality and Curation., In: HUMAN@ AAAI Fall Symposium, CEUR Workshop Proceedings, 2021. (Conference or Workshop Paper published in Proceedings)
|
|
Simon Widmer, Large-scale Active Learning for Concept Detection in Video, University of Zurich, Faculty of Business, Economics and Informatics, 2021. (Master's Thesis)
Modern neural network based classifications system often require large training sets and struggle with degrading classification performance when confronted with unseen objects categories. This thesis investigates practical and effective ways to implement a large-scale active learning pipeline for concept detection in videos, which is capable to constantly learn new object categories from annotated images provided by human supervisors. The proposed pipeline uses an active learning loop with a simple uncertainty-based heuristic to select the most informative images for annotation to achieve this goal. The evaluation of four different convolutional neural networks for image feature embedding showed that the InceptionResNetV2 architecture delivers the best performance over all studied classification scenarios. Furthermore, there is no single classification methods which works best in all classification scenarios. It is advantageous to let the system chose the ‘best’ classifier for each classification task. Moreover, the classification performance can be further improved for very small training sets if extracted box images are added as training instances. |
|
Joel Watter, The Argument Annotator Pipeline - Generate Visually Annotated Documents, University of Zurich, Faculty of Business, Economics and Informatics, 2021. (Bachelor's Thesis)
The research on argumentation in natural text is evolving, but a perfect way to model, annotate and mine argumentative structures is yet to be found.
High-quality annotation corpora are created in complex and time consuming manual work, to represent annotations for the training, testing and improvement of automated Argument Mining tools.
The value such corpora have for a machine is out of question.
But the fact, that the referenced argumentative structures in the annotation file of the corpus are completely separated from their actual context, within their original text, makes it difficult for a human reader to benefit on a similar level from the data they incorporate.
In this thesis, we address that problem and implement a tool to generate visually annotated PDF documents from corpus data.
The produced documents support human readers to understand and comprehend the visible annotations and the presented relationships they have to other annotations within the text.
Attaching and embedding the original text and annotation files as well as the annotation structure, created during the creation process to our documents, makes these PDF documents to an all in one file solution.
As proof of our concept, we processed an example corpus with our tool. |
|
Cathal Gurrin, Björn þóR Jónsson, Klaus Schöffmann, Duc-Tien Dang-Nguyen, Jakub Lokoč, Minh-Triet Tran, Wolfgang Hürst, Luca Rossetto, Graham Healy, Introduction to the Fourth Annual Lifelog Search Challenge, LSC'21, In: ICMR '21: International Conference on Multimedia Retrieval, ACM, New York, NY, USA, 2021-09-21. (Conference or Workshop Paper published in Proceedings)
The Lifelog Search Challenge (LSC) is an annual benchmarking challenge for comparing approaches to interactive retrieval from multi-modal lifelogs. LSC'21, the fourth challenge, attracted sixteen participants, each of which had developed interactive retrieval systems for large multimodal lifelogs. These interactive retrieval systems participated in a comparative evaluation in front of an online live-audience at the LSC workshop at ACM ICMR'21. This overview presents the motivation for LSC'21, the lifelog dataset used in the competition, and the participating systems. |
|