Graphics Group @ ISU

We are interested in graphics and computational tools.

Managing Computational Reproducibility in a Shifting R Package Landscape

Most modern data analysis requires the use of statistical software. The results of data analysis then rely on the underlying software utilized and the actions applied to data. R, one of the most widely-used statistical softwares for data analysis, relies on user-developed “packages” for many data science and data analysis tasks. These packages are subject to change over time, which can impact computational reproducibility efforts, as well as frustrate users who are left to hunt down problems in broken code. We propose an adaptive approach to computational reproducibility in R that focuses on identifying changes in packages over time. In this talk, I will discuss some prior approaches to achieving computational reproducibility in R and present our proposed framework for managing computational reproducibility. I will also demonstrate the software tools we are developing to assist users as a part of the “manager” package through two R package case studies.

The slides for the talk can be found here.


Share