Data7 Reproducible Research with GitHub and RStudio Book


Greg Chism

UArizona Data7 Reproducible Research with GitHub and RStudio Book!

Here you will find a collection of materials prepared by the staff of the Data Science Institute

Table of contents:


Introduction to the book and what you can expect to take away

Create a Research Compendium

Create a template Research Compendium from rrtools

Manage functionality as a package

Make your Compendium an R Package to ensure reproducibility

Reproduce a paper with Distill

Produce a Distill R Markdown version of a paper

Knowledge Level



Familiarity with Version Control through RStudio and R Markdown.

System Requirements:

Pandoc (>= 1.17.2)


If you don’t have LaTeX installed, consider installing TinyTeX, a custom LaTeX distribution based on TeX Live that is small in size but functions well in most cases, especially for R users.


Check docs before before installing.

devtools requirements

You might also need a set of development tools to install and run devtools. On Windows, download and install Rtools, and devtools takes care of the rest. On Mac, install the Xcode command line tools. On Linux, install the R development package, usually called r-devel or r-base-dev.


This book is derived from materials authored by Anna Krystalli, and (Marwick 2019). The abridged materials here have been updated to current best practices. Additionally, the gillespie.csv dataset was replaced with the open source diabetes.csv Pima County Native American Diabetes dataset.

The original materials are licensed under a Creative Commons Attribution 4.0 International License.

CC BY Created: 8/22/2022 (G. Chism); Last update: 11/17/2022 ————————————————————————

Original workshop based on:

  • Research compendium cboettig/noise-phenomena: Supplement to: “From noise to knowledge: how randomness generates novel phenomena and reveals information” by Carl Boettiger licensed under CC BY 4.0. DOI

  • Marwick, B., Boettiger, C. & L. Mullen (2017). Packaging data analytical work reproducibly using R (and friends). PeerJ Preprints 5:e3192v1