Write a Blog >>
MSR 2019
Sun 26 - Mon 27 May 2019 Montreal, QC, Canada
co-located with ICSE 2019
Mon 27 May 2019 14:45 - 15:00 at Centre-Ville - Session X: Building on Data Chair(s): Cor-Paul Bezemer

Introduction: The establishment of the Mining Software Repositories (MSR) Data Showcase conference track has encouraged researchers to provide more data sets as a basis for further empirical studies.

Objectives: Examine the usage of the data papers published in the MSR proceedings in terms of use frequency, users, and use purpose.

Methods: Data track papers were collected from the MSR Data Showcase and through the manual inspection of older MSR proceedings. The use of data papers was established through citation searching followed by reading the studies that have cited them. Data papers were then clustered based on their content, whereas their citations were classified according to the knowledge areas of the Guide to the Software Engineering Body of Knowledge.

Results: We found that 65% of the data papers have been used in other studies, with a long-tail distribution in the number of citations. MSR data papers are cited less than other MSR papers. A considerable number of the citations stem from the teams that authored the data papers. Publications providing repository data and metadata are the most frequent data papers and the most often cited ones. Mobile application data papers are the least common ones, but the second most frequently cited.

Conclusion: Data papers have provided the foundation for a significant number of studies, but there is room for improvement in their utilization. This can be done by setting a higher bar for their publication, by encouraging their use, and by providing incentives for the enrichment of existing data collections.

Mon 27 May

Displayed time zone: Eastern Time (US & Canada) change

14:45 - 15:30
Session X: Building on DataMSR 2019 Data Showcase / MSR 2019 Technical Papers at Centre-Ville
Chair(s): Cor-Paul Bezemer University of Alberta, Canada
14:45
15m
Full-paper
Standing on Shoulders or Feet? The Usage of the MSR Data Papers
MSR 2019 Technical Papers
Zoe Kotti Athens University of Economics and Business, Diomidis Spinellis Athens University of Economics and Business
Pre-print
15:00
6m
Talk
Boa Meets Python: A Boa Dataset of Data Science Software in Python Language
MSR 2019 Data Showcase
Sumon Biswas Iowa State University, Md Johirul Islam Iowa State University, Yijia Huang , Hridesh Rajan Iowa State University
Pre-print Media Attached
15:06
6m
Talk
A Benchmark of Data Loss Bugs for Android Apps
MSR 2019 Data Showcase
Oliviero Riganelli , Marco Mobilio , Daniela Micucci University of Milano-Bicocca, Italy, Leonardo Mariani University of Milano Bicocca
15:12
6m
Talk
RapidRelease - A Dataset of Projects and Issues on GitHub with Rapid Release
MSR 2019 Data Showcase
Saket Joshi Indian Institute of Technology Tirupati, Sridhar Chimalakonda Indian Institute of Technology Tirupati
15:18
6m
Short-paper
A Tool to Analyze Packages in Software Containers
MSR 2019 Technical Papers
Ahmed Zerouali UMONS, Valerio Cosentino Bitergia, Jesus M. Gonzalez-Barahona Universidad Rey Juan Carlos, Gregorio Robles Universidad Rey Juan Carlos, Tom Mens University of Mons
Pre-print
15:24
6m
Talk
An Empirical History of Permission Requests and Mistakes in Open Source Android Apps
MSR 2019 Data Showcase
Gian Luca Scoccia , Anthony Peruma Rochester Institute of Technology, Virginia Pujols , Ben Christians , Daniel Krutz Rochester Institute of Technology