A Large-scale Study about Quality and Reproducibility of Jupyter Notebooks (MSR 2019 - Technical Papers)

Who

João Felipe Pimentel, Leonardo Murta, Vanessa Braganholo, Juliana Freire

Track

MSR 2019 MSR Technical Papers

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 27 May 2019 11:55 - 12:10 at Centre-Ville - Session VIII: Software Quality (part 2) Chair(s): Yasutaka Kamei

Abstract

Jupyter Notebooks have been widely adopted by many different communities, both in science and industry. They support the creation of literate programming documents that combine code, text, and execution results with visualizations and all sorts of rich media. The self-documenting aspects and the ability to reproduce results have been touted as significant benefits of notebooks. At the same time, there has been growing criticism that the way notebooks are being used leads to unexpected behavior, encourage poor coding practices, and that their results can be hard to reproduce. To understand good and bad practices used in the development of real notebooks, we studied 1.4 million notebooks from GitHub. We present a detailed analysis of their characteristics that impact reproducibility. We also propose a set of best practices that can improve the rate of reproducibility and discuss open challenges that require further research and development.

Link to Preprint

http://www.ic.uff.br/~leomurta/papers/pimentel2019a.pdf

João Felipe Pimentel

Leonardo Murta

Universidade Federal Fluminense (UFF)

Brazil

Vanessa Braganholo

Juliana Freire