Write a Blog >>
MSR 2019
Sun 26 - Mon 27 May 2019 Montreal, QC, Canada
co-located with ICSE 2019
Sun 26 May 2019 15:24 - 15:30 at Place du Canada - Session V: Large-Scale Mining Chair(s): Robert Dyer

Large-scale software repository mining typically requires substantial storage and computational resources, and often involves a large number of calls to (rate-limited) APIs such as those of GitHub and StackOverflow. This creates a growing need for distributed execution of repository mining programs to which remote collaborators can contribute computational and storage resources, as well as API quotas (ideally without sharing API access tokens or credentials). In this paper we introduce Crossflow, a novel framework for building distributed repository mining programs. We demonstrate how Crossflow can delegate mining jobs to remote workers and cache their results, and how workers can implement advanced behaviour such as load balancing and rejecting jobs they cannot perform (e.g. due to lack of space, credentials for a specific API).

Sun 26 May

msr-2019-Paper-Presentations
14:45 - 15:30: MSR 2019 Paper Presentations - Session V: Large-Scale Mining at Place du Canada
Chair(s): Robert DyerBowling Green State University
msr-2019-papers14:45 - 15:00
Full-paper
Dimitris Mitropoulos , Panos Louridas , Vitalis Salis, Diomidis SpinellisAthens University of Economics and Business
Pre-print
msr-2019-Data-Showcase15:01 - 15:07
Talk
Antoine PietriInria, Diomidis SpinellisAthens University of Economics and Business, Stefano ZacchiroliUniversity Paris Diderot and Inria, France
Pre-print
msr-2019-papers15:08 - 15:23
Full-paper
Yuxing Ma, Christopher BogartCarnegie Mellon University, Sadika Amreen, Russell Zaretzki, Audris MockusUniversity of Tennessee - Knoxville
msr-2019-papers15:24 - 15:30
Short-paper
Dimitris KolovosUniversity of York, Patrick NeubauerUniversity of York, UK, Konstantinos Barmpis , Nicholas Matragkas, Richard PaigeMcMaster University
Pre-print