MSR 2019
Sun 26 - Mon 27 May 2019 Montreal, QC, Canada
co-located with ICSE 2019

Data from software repositories have become an important foundation for the empirical study of software engineering processes. A recurring theme in the repository mining literature is the inference of developer networks that capture, for example, collaboration, coordination, or communication from the commit history of projects. Most of the studied networks are based on the co-authorship of software artefacts defined at the level of files, modules, or packages. While this approach has led to insights into the social aspects of software development, it neglects detailed information on code changes and code ownership that is contained in the commit log of software projects, e.g. which exact lines of code have been authored by which developers. Addressing this issue, we introduce git2net, a scalable Python tool that facilitates the extraction of fine-grained co-editing networks in large git repositories. It uses text mining techniques to analyse the detailed history of textual modifications within files. This information allows us to construct directed, weighted, and time-stamped networks, where a link signifies that one developer has edited a block of source code originally written by another developer. We apply the tool in case studies of an open-source project and a commercial software project. We argue that it opens up a massive new source of high-resolution data on human collaboration patterns.
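To make the idea of a directed, weighted, and time-stamped co-editing network concrete, here is a minimal sketch in plain Python. It is not git2net's API or its algorithm: the `CoEdit` record, the `build_coediting_network` function, and the sample data are hypothetical, and the input is assumed to be a list of already-extracted line-level edits. The sketch also assumes that a link points from the editing developer to the original author and that self-edits are ignored; the abstract specifies neither.

```python
from collections import Counter
from typing import NamedTuple


class CoEdit(NamedTuple):
    """Hypothetical record: `editor` changed code last written by `original_author`."""
    editor: str
    original_author: str
    timestamp: int  # Unix time of the editing commit


def build_coediting_network(events):
    """Return (time-stamped edges, edge weights) for a directed co-editing network."""
    # Assumption: a developer revising their own code does not create a link.
    edges = sorted(
        (e for e in events if e.editor != e.original_author),
        key=lambda e: e.timestamp,
    )
    # Weight of a directed link = number of co-editing events on that link.
    weights = Counter((e.editor, e.original_author) for e in edges)
    return edges, weights


# Made-up sample data, for illustration only.
events = [
    CoEdit("alice", "bob", 1558951200),
    CoEdit("alice", "bob", 1558954800),
    CoEdit("bob", "carol", 1558953000),
    CoEdit("carol", "carol", 1558956600),  # self-edit, dropped
]

edges, weights = build_coediting_network(events)
for (editor, author), weight in sorted(weights.items()):
    print(f"{editor} -> {author} (weight {weight})")
```

Keeping the sorted edge list alongside the aggregated weights preserves the time stamps, so the same data can be viewed either as a static weighted network or as a sequence of events over time.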

Mon 27 May

Displayed time zone: Eastern Time (US & Canada)

11:00 - 11:45
Session V: Collaboration & Communication (Part 1)
MSR 2019 Technical Papers at Place du Canada
Chair(s): Peter Rigby Concordia University, Montreal, Canada
11:00
15m
Full-paper
An Empirical Study of Multiple Names and Email Addresses in OSS Version Control Repositories
MSR 2019 Technical Papers
Jiaxin Zhu Institute of Software at Chinese Academy of Sciences, China, Jun Wei Institute of Software, Chinese Academy of Sciences, China
11:15
15m
Full-paper
Characterizing the Roles of Contributors in Open-source Scientific Software Projects
MSR 2019 Technical Papers
Reed Milewicz Sandia National Laboratories, Gustavo Pinto UFPA, Paige Rodeghero University of Notre Dame
11:30
15m
Full-paper
git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories
MSR 2019 Technical Papers
Christoph Gote Chair of Systems Design, ETH Zurich, Ingo Scholtes, Frank Schweitzer