Write a Blog >>
MSR 2019
Sun 26 - Mon 27 May 2019 Montreal, QC, Canada
co-located with ICSE 2019
Mon 27 May 2019 15:00 - 15:06 at Centre-Ville - Session X: Building on Data Chair(s): Cor-Paul Bezemer

The popularity of Python programming language has surged in recent years due to its increasing usage in Data Science. The availability of Python repositories in Github presents an opportunity for mining software repository research, e.g., suggesting the best practices in developing Data Science applications, identifying bug-patterns, recommending code enhancements, etc. To enable this research, we have created a new dataset that includes 1,558 mature Github projects that develop Python software for Data Science tasks. By analyzing the metadata and code, we have included the projects in our dataset which use a diverse set of machine learning libraries and managed by a variety of users and organizations. The dataset is made publicly available through Boa infrastructure both as a collection of raw projects as well as in a processed form that could be used for performing large scale analysis using Boa language. We also present two initial applications to demonstrate the potential of the dataset that could be leveraged by the community.

Mon 27 May

msr-2019-Paper-Presentations
14:45 - 15:30: MSR 2019 Paper Presentations - Session X: Building on Data at Centre-Ville
Chair(s): Cor-Paul BezemerUniversity of Alberta, Canada
msr-2019-papers14:45 - 15:00
Full-paper
Zoe KottiAthens University of Economics and Business, Diomidis SpinellisAthens University of Economics and Business
Pre-print
msr-2019-Data-Showcase15:00 - 15:06
Talk
Sumon BiswasIowa State University, Md Johirul IslamIowa State University, Yijia Huang, Hridesh RajanIowa State University
Pre-print Media Attached
msr-2019-Data-Showcase15:06 - 15:12
Talk
Oliviero Riganelli, Marco Mobilio, Daniela MicucciUniversity of Milano-Bicocca, Italy, Leonardo MarianiUniversity of Milano Bicocca
msr-2019-Data-Showcase15:12 - 15:18
Talk
Saket JoshiIndian Institute of Technology Tirupati, Sridhar ChimalakondaIndian Institute of Technology Tirupati
msr-2019-papers15:18 - 15:24
Short-paper
Ahmed ZeroualiUMONS, Valerio CosentinoBitergia, Jesus M. Gonzalez-BarahonaUniversidad Rey Juan Carlos, Gregorio RoblesUniversidad Rey Juan Carlos, Tom MensUniversity of Mons
Pre-print
msr-2019-Data-Showcase15:24 - 15:30
Talk
Gian Luca Scoccia, Anthony PerumaRochester Institute of Technology, Virginia Pujols, Ben Christians, Daniel KrutzRochester Institute of Technology