Not logged in.

Contribution Details

Type Conference or Workshop Paper
Scope Discipline-based scholarship
Published in Proceedings Yes
Title PathMiner: A Library for Mining of Path-Based Representations of Code
Organization Unit
Authors
  • Vladimir Kovalenko
  • Egor Bogomolov
  • Timofey Bryksin
  • Alberto Bacchelli
Presentation Type paper
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
ISBN 978-1-7281-3412-3
Page Range 13 - 17
Event Title 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR)
Event Type conference
Event Location Montreal, QC, Canada
Event Start Date June 25 - 2019
Event End Date July 1 - 2019
Place of Publication USA
Publisher IEEE
Abstract Text One recent, significant advance in modeling source code for machine learning algorithms has been the introduction of path-based representation - an approach consisting in representing a snippet of code as a collection of paths from its syntax tree. Such representation efficiently captures the structure of code, which, in turn, carries its semantics and other information. Building the path-based representation involves parsing the code and extracting the paths from its syntax tree; these steps build up to a substantial technical job. With no common reusable toolkit existing for this task, the burden of mining diverts the focus of researchers from the essential work and hinders newcomers in the field of machine learning on code. In this paper, we present PathMiner - an open-source library for mining path-based representations of code. PathMiner is fast, flexible, well-tested, and easily extensible to support input code in any common programming language. Preprint [https://doi.org/10.5281/zenodo.2595271]; released tool [https://doi.org/10.5281/zenodo.2595257].
Digital Object Identifier 10.1109/MSR.2019.00013
Other Identification Number merlin-id:20229
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)