Write a Blog >>
MSR 2019
Sun 26 - Mon 27 May 2019 Montreal, QC, Canada
co-located with ICSE 2019
Sun 26 May 2019 15:24 - 15:30 at Place du Canada - Session V: Large-Scale Mining Chair(s): Robert Dyer

Large-scale software repository mining typically requires substantial storage and computational resources, and often involves a large number of calls to (rate-limited) APIs such as those of GitHub and StackOverflow. This creates a growing need for distributed execution of repository mining programs to which remote collaborators can contribute computational and storage resources, as well as API quotas (ideally without sharing API access tokens or credentials). In this paper we introduce Crossflow, a novel framework for building distributed repository mining programs. We demonstrate how Crossflow can delegate mining jobs to remote workers and cache their results, and how workers can implement advanced behaviour such as load balancing and rejecting jobs they cannot perform (e.g. due to lack of space, credentials for a specific API).

Sun 26 May
Times are displayed in time zone: Eastern Time (US & Canada) change

14:45 - 15:30
Session V: Large-Scale MiningMSR 2019 Technical Papers / MSR 2019 Data Showcase at Place du Canada
Chair(s): Robert DyerBowling Green State University
14:45
15m
Full-paper
Time Present and Time Past: Analyzing the Evolution of JavaScript Code in the Wild
MSR 2019 Technical Papers
Dimitris Mitropoulos, Panos Louridas, Vitalis Salis, Diomidis SpinellisAthens University of Economics and Business
Pre-print
15:01
6m
Talk
The Software Heritage Graph Dataset: public software development under one roof
MSR 2019 Data Showcase
Antoine PietriInria, Diomidis SpinellisAthens University of Economics and Business, Stefano ZacchiroliUniversity Paris Diderot and Inria, France
Pre-print
15:08
15m
Full-paper
World of Code: An Infrastructure for Mining the Universe of Open Source VCS Data
MSR 2019 Technical Papers
Yuxing Ma, Christopher BogartCarnegie Mellon University, Sadika Amreen, Russell Zaretzki, Audris MockusUniversity of Tennessee - Knoxville
15:24
6m
Short-paper
Crossflow: A Framework for Distributed Mining of Software Repositories
MSR 2019 Technical Papers
Dimitris KolovosUniversity of York, Patrick NeubauerUniversity of York, UK, Konstantinos Barmpis , Nicholas Matragkas, Richard PaigeMcMaster University
Pre-print