Not logged in.

Contribution Details

Type Journal Article
Scope Discipline-based scholarship
Title Classifying code comments in Java software systems
Organization Unit
Authors
  • Luca Pascarella
  • Magiel Bruntink
  • Alberto Bacchelli
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
Journal Title Empirical Software Engineering
Publisher Springer
Geographical Reach international
ISSN 1382-3256
Volume 24
Number 3
Page Range 1499 - 1537
Date 2019
Abstract Text Code comments are a key software component containing information about the underlying implementation. Several studies have shown that code comments enhance the readability of the code. Nevertheless, not all the comments have the same goal and target audience. In this paper, we investigate how 14 diverse Java open and closed source software projects use code comments, with the aim of understanding their purpose. Through our analysis, we produce a taxonomy of source code comments; subsequently, we investigate how often each category occur by manually classifying more than 40,000 lines of code comments from the aforementioned projects. In addition, we investigate how to automatically classify code comments at line level into our taxonomy using machine learning; initial results are promising and suggest that an accurate classification is within reach, even when training the machine learner on projects different than the target one. Data and Materials [https://doi.org/10.5281/zenodo.2628361].
Digital Object Identifier 10.1007/s10664-019-09694-w
Other Identification Number merlin-id:20242
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)