Not logged in.

Contribution Details

Type Conference or Workshop Paper
Scope Discipline-based scholarship
Published in Proceedings Yes
Title When process data quality affects the number of bugs: correlations in software engineering datasets
Organization Unit
Authors
  • Abraham Bernstein
  • Adrian Bachmann
Presentation Type paper
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Page Range 62 - 71
Event Title MSR '10: 7th IEEE Working Conference on Mining Software Repositories
Event Type conference
Event Location Cape Town, South Africa
Event Start Date January 1 - 2010
Event End Date January 1 - 2010
Abstract Text Software engineering process information extracted from version control systems and bug tracking databases are widely used in empirical software engineering. In prior work, we showed that these data are plagued by quality deficiencies, which vary in its characteristics across projects. In addition, we showed that those deficiencies in the form of bias do impact the results of studies in empirical software engineering. While these findings affect software engineering researchers the impact on practitioners has not yet been substantiated. In this paper we, therefore, explore (i) if the process data quality and characteristics have an influence on the bug fixing process and (ii) if the process quality as measured by the process data has an influence on the product (i.e., software) quality. Specifically, we analyze six Open Source as well as two Closed Source projects and show that process data quality and characteristics have an impact on the bug fixing process: the high rate of empty commit messages in Eclipse, for example, correlates with the bug report quality. We also show that the product quality -- measured by number of bugs reported -- is affected by process data quality measures. These findings have the potential to prompt practitioners to increase the quality of their software process and its associated data quality.
Digital Object Identifier 10.1109/MSR.2010.5463286
Other Identification Number 1371; merlin-id:130
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)