To quote our research community’s succinct mission statement: “The Mining Software Repositories (MSR) field analyzes the rich data available in software repositories to uncover interesting and actionable information about software systems and projects.” In the earliest days of this conference, this mission was a novel possibility that the flourishing Open Source movement created. These days, however, the practice of turning repository data into actionable insights and deployed models has become bog standard. So, congratulations to the MSR community for leading the way! But now what? MSR finds itself caught in a heated competition among industry researchers and data scientists to find novel ways to exploit data and apply models. Given the resources and energy that industry now invests in data science and machine learning, MSR cannot hope to succeed by working on the same types of problems, using the same techniques. It’s time to pivot. Luckily there are hard open problems for which industry is hungry for results: How can we continue to get insights and build models while upholding privacy laws (GDPR) and user privacy preferences? How can we make trained models understandable to all relevant stakeholders? How can we ensure that our insights and models are not harmed by human biases like sexism, racism, political manipulation, etc.? The first half of this talk will describe current industry practice in data science and machine learning, based on recent studies. In the second half, I’ll describe some difficult new problems, to prod energetic discussion about the future direction of MSR.
Sun 26 May Times are displayed in time zone: (GMT-04:00) Eastern Time (US & Canada) change
|09:05 - 09:50|
A: Rob DeLineMicrosoft Research
|09:50 - 10:00|
|10:00 - 10:30|