Loris Sauter, Ralph Gasser, Abraham Bernstein, Heiko Schuldt, Luca Rossetto, An Asynchronous Scheme for the Distributed Evaluation of Interactive Multimedia Retrieval, In: MM '22: The 30th ACM International Conference on Multimedia, ACM, New York, NY, USA, 2022-11-10. (Conference or Workshop Paper published in Proceedings)
 
Evaluation campaigns for interactive multimedia retrieval, such as the Video Browser Shodown (VBS) or the Lifelog Search Challenge (LSC), so far imposed constraints on both simultaneity and locality of all participants, requiring them to solve the same tasks in the same place, at the same time and under the same conditions. These constraints are in contrast to other evaluation campaigns that do not focus on interactivity, where participants can process the tasks in any place at any time. The recent travel restrictions necessitated the relaxation of the locality constraint of interactive campaigns, enabling participants to take place from an arbitrary location. Born out of necessity, this relaxation turned out to be a boon since it greatly simplified the evaluation process and enabled organisation of ad-hoc evaluations outside of the large campaigns. However, it also introduced an additional complication in cases where participants were spread over several time zones. In this paper, we introduce an evaluation scheme for interactive retrieval evaluation that relaxes both the simultaneity and locality constraints, enabling participation from any place at any time within a predefined time frame. This scheme, as implemented in the Distributed Retrieval Evaluation Server (DRES), enables novel ways of conducting interactive retrieval evaluation and bridged the gap between interactive campaigns and non-interactive ones. |
|
Florent Thouvenin, Jacques de Werra, Yaniv Benhamou, Abraham Bernstein, Felix Gille, Diego Kuonen, Christian Lovis, Stephanie Volz, Viktor von Wyl, Governance Mechanisms for Access and Use of Data in Public Health Crises: Call for Action, Jusletter (17.10.2022), 2022. (Journal Article)
 
The Covid 19 pandemic demonstrated the importance of access to data to ensure that authorities can make informed decisions in the event of a crisis. However, there are currently three types of barriers preventing access and use of data: technical barriers, especially the lack of uniform data formats and semantics; legal barriers, especially data protection that limits the use of personal data; and societal barriers, especially the lack of data literacy and trust. This call for action presents ways to overcome these barriers and pro- poses new governance mechanisms for the access and use of data in public health crises. |
|
Ombretta Strafforello, Vanathi Rajasekart, Osman S Kayhan, Oana Inel, Jan van Gemert, Humans disagree with the IoU for measuring object detector localization error, In: 2022 IEEE International Conference on Image Processing (ICIP), -, 2022. (Conference or Workshop Paper published in Proceedings)

|
|
Silvan Heller, Luca Rossetto, Loris Sauter, Heiko Schuldt, vitrivr at the Lifelog Search Challenge 2022, In: ICMR '22: International Conference on Multimedia Retrieval, ACM, New York, NY, USA, 2022. (Conference or Workshop Paper published in Proceedings)
 
In this paper, we present the iteration of the multimedia retrieval system vitrivr participating at LSC 2022. vitrivr is a general-purpose retrieval system which has previously participated at LSC. We describe the system architecture and functionality, and show initial results based on the test and validation topics. |
|
Cathal Gurrin, Liting Zhou, Graham Healy, Björn Þór Jónsson, Duc-Tien Dang-Nguyen, Jakub Lokoč, Minh-Triet Tran, Wolfgang Hürst, Luca Rossetto, Klaus Schöffmann, Introduction to the Fifth Annual Lifelog Search Challenge, LSC'22, In: ICMR '22: International Conference on Multimedia Retrieval, ACM, New York, NY, USA, 2022. (Conference or Workshop Paper)
 
For the fifth time since 2018, the Lifelog Search Challenge (LSC) facilitated a benchmarking exercise to compare interactive search systems designed for multimodal lifelogs. LSC'22 attracted nine participating research groups who developed interactive lifelog retrieval systems enabling fast and effective access to lifelogs. The systems competed in front of a hybrid audience at the LSC workshop at ACM ICMR'22. This paper presents an introduction to the LSC workshop, the new (larger) dataset used in the competition, and introduces the participating lifelog search systems. |
|
Alessandro Piscopo, Oana Inel, Sanne Vrijenhoek, Martijn Millecamp, Krisztian Balog, QUARE: 1st Workshop on Measuring the Quality of Explanations in Recommender Systems, In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Association for Computing Machinery, New York, NY, USA, 2022. (Conference or Workshop Paper)

QUARE - measuring the QUality of explAnations in REcommender systems - is the first workshop that aims to promote discussion upon future research and practice directions around evaluation methodologies for explanations in recommender systems. To that end, we bring together researchers and practitioners from academia and industry to facilitate discussions about the main issues and best practices in the respective areas, identify possible synergies, and outline priorities regarding future research directions. Additionally, we want to stimulate reflections around methods to systematically and holistically assess explanation approaches, impact, and goals, at the interplay between organisational and human values. The homepage of the workshop is available at: https://sites.google.com/view/quare-2022/. |
|
Alessandro Piscopo and
Oana Inel and
Sanne Vrijenhoek and
Martijn Millecamp and
Krisztian Balog, QUARE: 1st Workshop on Measuring the Quality of Explanations in Recommender Systems, In: SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11 - 15, 2022, ACM, Madrid, Spain, 2022. (Conference or Workshop Paper)

|
|
Cataldo Musto, Amra Delic, Oana Inel, Marco Polignano, Amon Rapp, Giovanni Semeraro, Jürgen Ziegler, Workshop on Explainable User Models and Personalised Systems (ExUM), In: Adjunct Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization, Barcelona, Spain, 2022. (Conference or Workshop Paper)

|
|
Vincent Robbemond, Oana Inel, Ujwal Gadiraju, Understanding the Role of Explanation Modality in AI-assisted Decision-making, In: Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization, Barcelona, Spain, 2022. (Conference or Workshop Paper published in Proceedings)

|
|
Ruijie Wang, Luca Rossetto, Michael Cochez, Abraham Bernstein, QAGCN: A Graph Convolutional Network-based Multi-Relation Question Answering System, In: ArXiv.org, No. arxiv.2206, 2022. (Working Paper)
 
Answering multi-relation questions over knowledge graphs is a challenging task as it requires multi-step reasoning over a huge number of possible paths. Reasoning-based methods with complex reasoning mechanisms, such as reinforcement learning-based sequential decision making, have been regarded as the default pathway for this task. However, these mechanisms are difficult to implement and train, which hampers their reproducibility and transferability to new domains. In this paper, we propose QAGCN - a simple but effective and novel model that leverages attentional graph convolutional networks that can perform multi-step reasoning during the encoding of knowledge graphs. As a consequence, complex reasoning mechanisms are avoided. In addition, to improve efficiency, we retrieve answers using highly-efficient embedding computations and, for better interpretability, we extract interpretable paths for returned answers. On widely adopted benchmark datasets, the proposed model has been demonstrated competitive against state-of-the-art methods that rely on complex reasoning mechanisms. We also conducted extensive experiments to scrutinize the efficiency and contribution of each component of our model. |
|
Romana Pernisch, Daniele Dell'Aglio, Mirko Serbak, Rafael S. Gonçalves, Abraham Bernstein, Visualising the effects of ontology changes and studying their understanding with ChImp, Journal of Web Semantics, Vol. 74, 2022. (Journal Article)
 
Due to the Semantic Web’s decentralised nature, ontology engineers rarely know all applications that leverage their ontology. Consequently, they are unaware of the full extent of possible consequences that changes might cause to the ontology. Our goal is to lessen the gap between ontology engineers and users by investigating ontology engineers’ understanding of ontology changes’ impact at editing time. Hence, this paper introduces the Protégé plugin ChImp which we use to reach our goal. We elicited requirements for ChImp through a questionnaire with ontology engineers. We then developed ChImp according to these requirements and it displays all changes of a given session and provides selected information on said changes and their effects. For each change, it computes a number of metrics on both the ontology and its materialisation. It displays those metrics on both the originally loaded ontology at the beginning of the editing session and the current state to help ontology engineers understand the impact of their changes. We investigated the informativeness of materialisation impact measures, the meaning of severe impact, and also the usefulness of ChImp in an online user study with 36 ontology engineers. We asked the participants to solve two ontology engineering tasks – with and without ChImp (assigned in random order) – and answer in-depth questions about the applied changes as well as the materialisation impact measures. We found that ChImp increased the participants’ understanding of change effects and that they felt better informed. Answers also suggest that the proposed measures were useful and informative. We also learned that the participants consider different outcomes of changes severe, but most would define severity based on the amount of changes to the materialisation compared to its size. The participants also acknowledged the importance of quantifying the impact of changes and that the study will affect their approach of editing ontologies. |
|
Lutharsanen Kunam, High Level Semantic Video Understanding, University of Zurich, Faculty of Business, Economics and Informatics, 2022. (Master's Thesis)
 
High level semantic video understanding deals with the problem of analyzing basic insights from movies like interpersonal relationships, relationships to other entities or interpersonal interactions. The Deep Video Understanding Challenge has focused on this issue and organizes an annual competition in which a set of queries is created which should be answered by the participants. This thesis is written in the context of the Deep Video Understanding Challenge 2021 and describes a pipeline that is able to answer the set of queries on a movie- and scene-level. The pipeline consists of a scene segmentation engine which cuts the scenes into single keyframes and shots. After that, they are processed by two streams, which consists of several feature extraction models. One stream focuses on the visual component, while the other stream focuses on the audio component. After that, the features are combined and processed. Numerous classifiers are trained and used to predict the interpersonal relationships, relationships with other entities or
interpersonal interactions. At the movie-level, a knowledge graph is then created, reflecting all the relationships between all the entities of a movie. This is used to answer the queries at movie-level. There, 8% of all questions could be answered correctly. The queries from scene-level could be answered to 1.5% correctly. The other pipelines from the DVU Challenge 2021 achieves better results as the worst result on movie level is 17% of correctly answered queries and the worst result on scene-level is 27% of correctly answered queries. |
|
Suzanne Tolmeijer, Markus Christen, Serhiy Kandul, Markus Kneer, Abraham Bernstein, Capable but Amoral? Comparing AI and Human Expert Collaboration in Ethical Decision Making, In: ACM CHI Conference on Human Factors in Computing Systems (CHI'22), ACM Press, New York, NY, USA, 2022-04-29. (Conference or Workshop Paper published in Proceedings)
 
While artificial intelligence (AI) is increasingly applied for decision- making processes, ethical decisions pose challenges for AI applica- tions. Given that humans cannot always agree on the right thing to do, how would ethical decision-making by AI systems be perceived and how would responsibility be ascribed in human-AI collabora- tion? In this study, we investigate how the expert type (human vs. AI) and level of expert autonomy (adviser vs. decider) influence trust, perceived responsibility, and reliance. We find that partici- pants consider humans to be more morally trustworthy but less capable than their AI equivalent. This shows in participants’ re- liance on AI: AI recommendations and decisions are accepted more often than the human expert’s. However, AI team experts are per- ceived to be less responsible than humans, while programmers and sellers of AI systems are deemed partially responsible instead. |
|
Xiaolin Han, Daniele Dell’Aglio, Tobias Grubenmann, Reynold Cheng, Abraham Bernstein, A framework for differentially-private knowledge graph embeddings, Journal of Web Semantics, Vol. 72, 2022. (Journal Article)
 
Knowledge graph (KG) embedding methods are at the basis of many KG-based data mining tasks, such as link prediction and node clustering. However, graphs may contain confidential information about people or organizations, which may be leaked via embeddings. Research recently studied how to apply differential privacy to a number of graphs (and KG) analyses, but embedding methods have not been considered so far. This study moves a step towards filling such a gap, by proposing the Differential Private Knowledge Graph Embedding (DPKGE) framework.
DPKGE extends existing KG embedding methods (e.g., TransE, TransM, RESCAL, and DistMult) and processes KGs containing both confidential and unrestricted statements. The resulting embeddings protect the presence of any of the former statements in the embedding space using differential privacy. Our experiments identify the cases where DPKGE produces useful embeddings, by analyzing the training process and tasks executed on top of the resulting embeddings. |
|
Yasamin Klingler, Claude Lehmann, João Pedro Monteiro, Carlo Saladin, Abraham Bernstein, Kurt Stockinger, Evaluation of Algorithms for Interaction-Sparse Recommendations: Neural Networks don’t Always Win, In: 25th International Conference on Extending Database Technology, OpenProceedings, OpenProceedings.org, 2022-03. (Conference or Workshop Paper published in Proceedings)
 
In recent years, top-K recommender systems with implicit feed-back data gained interest in many real-world business scenarios. In particular, neural networks have shown promising results on these tasks. However, while traditional recommender systems are built on datasets with frequent user interactions, insurance recommenders often have access to a very limited amount of user interactions, as people only buy a few insurance products.
In this paper, we shed new light on the problem of top-K recommendations for interaction-sparse recommender problems. In particular, we analyze six different recommender algorithms, namely a popularity-based baseline and compare it against two matrix factorization methods (SVD++, ALS), one neural network approach (JCA) and two combinations of neural network and factorization machine approaches (DeepFM, NeuFM). We evaluate these algorithms on six different interaction-sparse datasets and one dataset with a less sparse interaction pattern to elucidate the unique behavior of interaction-sparse datasets.
In our experimental evaluation based on real-world insurance data, we demonstrate that DeepFM shows the best performance followed by JCA and SVD++, which indicates that neural network approaches are the dominant technologies. However, for the remaining five datasets we observe a different pattern. Overall, the matrix factorization method SVD++ is the winner. Surprisingly, the simple popularity-based approach comes out second followed by the neural network approach JCA. In summary, our experimental evaluation for interaction-sparse datasets demonstrates that in general matrix factorization methods outperform neural network approaches. As a consequence, traditional well- established methods should be part of the portfolio of algorithms to solve real-world interaction-sparse recommender problems. |
|
Tim Draws, Oana Inel, Nava Tintarev, Christian Baden, Benjamin Timmermans, Comprehensive Viewpoint Representations for a Deeper Understanding of User Interactions With Debated Topics, In: ACM SIGIR Conference on Human Information Interaction and Retrieval, ACM, 2022. (Conference or Workshop Paper published in Proceedings)
 
Research in the area of human information interaction (HII) typically represents viewpoints on debated topics in a binary fashion, as either against or in favor of a given topic (e.g., the feminist movement). This simple taxonomy, however, greatly reduces the latent richness of viewpoints and thereby limits the potential of research and practical applications in this field. Work in the communication sciences has already demonstrated that viewpoints can be represented in much more comprehensive ways, which could enable a deeper understanding of users’ interactions with debated topics online. For instance, a viewpoint's stance usually has a degree of strength (e.g., mild or strong), and, even if two viewpoints support or oppose something to the same degree, they may use different logics of evaluation (i.e., underlying reasons). In this paper, we draw from communication science practice to propose a novel, two-dimensional way of representing viewpoints that incorporates a viewpoint's stance degree as well as its logic of evaluation. We show in a case study of tweets on debated topics how our proposed viewpoint label can be obtained via crowdsourcing with acceptable reliability. By analyzing the resulting data set and conducting a user study, we further show that the two-dimensional viewpoint representation we propose allows for more meaningful analyses and diversification interventions compared to current approaches. Finally, we discuss what this novel viewpoint label implies for HII research and how obtaining it may be made cheaper in the future. |
|
Florian Ruosch, Cristina Sarasua, Abraham Bernstein, BAM: Benchmarking Argument Mining on Scientific Documents, In: The AAAI-22 Workshop on Scientific Document Understanding at the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22), CEUR Workshop Proceedings, 2022. (Conference or Workshop Paper published in Proceedings)
 
In this paper, we present BAM, a unified Benchmark for Argument Mining (AM). We propose a method to homogenize both the evaluation process and the data to provide a common view in order to ultimately produce comparable results. Built as a four stage and end-to-end pipeline, the benchmark allows for the direct inclusion of additional argument miners to be evaluated. First, our system pre-processes a ground truth set used both for training and testing. Then, the benchmark calculates a total of four measures to assess different aspects of the mining process. To showcase an initial implementation of our approach, we apply our procedure and evaluate a set of systems on a corpus of scientific publications. With the obtained comparable results we can homogeneously assess the current state of AM in this domain. |
|
Silvan Heller, Viktor Gsteiger, Werner Bailer, Cathal Gurrin, Björn þóR Jónsson, Jakub Lokoč, Andreas Leibetseder, František Mejzlík, Ladislav Peška, Luca Rossetto, Konstantin Schall, Klaus Schoeffmann, Heiko Schuldt, Florian Spiess, Ly-Duyen Tran, Lucia Vadicamo, Patrik Veselý, Stefanos Vrochidis, Jiaxin Wu, Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown, International Journal of Multimedia Information Retrieval, Vol. 11 (1), 2022. (Journal Article)
 
The Video Browser Showdown addresses difficult video search challenges through an annual interactive evaluation campaign attracting research teams focusing on interactive video retrieval. The campaign aims to provide insights into the performance of participating interactive video retrieval systems, tested by selected search tasks on large video collections. For the first time in its ten year history, the Video Browser Showdown 2021 was organized in a fully remote setting and hosted a record number of sixteen scoring systems. In this paper, we describe the competition setting, tasks and results and give an overview of state-of-the-art methods used by the competing systems. By looking at query result logs provided by ten systems, we analyze differences in retrieval model performances and browsing times before a correct submission. Through advances in data gathering methodology and tools, we provide a comprehensive analysis of ad-hoc video search tasks, discuss results, task design and methodological challenges. We highlight that almost all top performing systems utilize some sort of joint embedding for text-image retrieval and enable specification of temporal context in queries for known-item search. Whereas a combination of these techniques drive the currently top performing systems, we identify several future challenges for interactive video search engines and the Video Browser Showdown competition itself. |
|
Lukas Yu, Style Transfer Algorithm for Online News, University of Zurich, Faculty of Business, Economics and Informatics, 2022. (Master's Thesis)
 
In an experimental setting, data anonymization is vital to get valid results. For studies dealing with news articles, white-labelling their source is a non-trivial task, since news outlets might possess traceable writing styles. In this thesis, modern neural network architectures for natural language processing are utilized to transfer texts to a uniform style. The method does not rely on parallel corpora, which is usually the bottleneck for many systems. Instead, a pseudo-parallel corpus is created using monolingual data and masked-language modeling. Additionally, a new scraper architecture is designed and implemented to easily obtain article from news websites and store them in a homogeneous format. |
|
Sanjay Seetharaman, Shubham Malaviya, Rosni Vasu, Manish Shukla, Sachin Lodha, Influence based defense against data poisoning attacks in online learning, In: 2022 14th International Conference on COMmunication Systems \& NETworkS (COMSNETS), IEEE, Bangalore, India, 2022. (Conference or Workshop Paper published in Proceedings)

|
|