Together, the checkpoint package and the checkpoint server act as a CRAN time machine. The
checkpoint() function installs the packages referenced in the specified project to a local library exactly as they existed at the specified point in time. Only those packages are available to your session, thereby avoiding any package updates that came later and may have altered your results. In this way, anyone using the checkpoint
checkpoint() function can ensure the reproducibility of your scripts or projects at any time.
checkpoint(snapshotDate, project = getwd(), R.version, scanForPackages = TRUE, checkpointLocation = "~/", verbose = TRUE, use.knitr, auto.install.knitr = TRUE, scan.rnw.with.knitr = FALSE, forceInstall = FALSE, forceProject = FALSE)
Date of snapshot to use in
YYYY-MM-DD format, e.g.
"2014-09-17". Specify a date on or after
"2014-09-17". MRAN takes one snapshot per day. To list all valid snapshot dates on MRAN use
A project path. This is the path to the root of the project that references the packages to be installed from the MRAN snapshot for the date specified for
snapshotDate. Defaults to current working directory using
Optional character string, e.g.
"3.1.2". If specified, compares the current R.version to the specified R.version. If these differ, stops processing with an error, making no changes to the system. Specifically, if the check fails, the library path is NOT modified. This argument allows the original script author to specify a specific version of R to obtain the desired results.
TRUE, scans for packages in project folder (see details). If FALSE, skips the scanning process. A use case for
scanForPackages = FALSE is to skip the scanning and installation process, e.g. in production environments with a large number of R scripts in the project. Only set
scanForPackages = FALSE if you are certain that all package dependencies are already in the checkpoint folder.
File path where the checkpoint library is stored. Default is
"~/", i.e. the user's home directory. A use case for changing this is to create a checkpoint library on a portable drive (e.g. USB drive).
TRUE, displays progress messages.
TRUE, parses all
Rmarkdown files using the
TRUE and the project contains rmarkdown files, then automatically included the packages
knitr in packages to install.
TRUE, forces the re-installation of all discovered packages and their dependencies. This is useful if, for some reason, the checkpoint archive becomes corrupted.
TRUE, forces the checkpoint process, even if the provided project folder doesn't look like an R project. A commonly reported user problem is that they accidentally trigger the checkpoint process from their home folder, resulting in scanning many R files and downloading many packages. To prevent this, we use a heuristic to determine if the project folder looks like an R project. If the project folder is the home folder, and also contains no R files, then
checkpoint() asks for confirmation to continue.
Checkpoint is called for its side-effects (see the details section), but invisibly returns a list with elements:
To reset the checkpoint, simply restart your R session.
You can also use the experimental function
checkpoint() uses https to download packages. The default MRAN snapshot defaults to https://mran.microsoft.com/snapshot in R versions 3.2.0 and later, if https support is enabled.
You can modify the default URL. To change the URL, use
options(checkpoint.mranUrl = ...).
As a side effect, the
checkpoint function writes a log file with information about the downloaded files, in particular the package downloaded and the associated file size in bytes. The log is stored at the root of the
checkpointLocation. For example, if
checkpointLocation is the user home folder (the default) then the log file is at
~/.checkpoint/checkpoint_log.csv. This file contains columns for:
checkpoint() function stores a marker in the snapshot folder every time the function gets called. This marker contains the system date, thus indicating the the last time the snapshot was accessed. See also
getAccessDate(). To remove snapshots that have not been used since a given date, use
checkpoint() creates a local library into which it installs a copy of the packages required by your project as they existed on CRAN on the specified snapshot date. Your R session is updated to use only these packages.
To automatically determine all packages used in your project, the function scans all R code (
.Rpres files) for
require() statements. In addition, scans for occurrences of code that accesses functions in namespaces using
package[:::]foo(). Finally, any occurrences of the functions methods::setClass, methods::setRefClass, methods::setMethod or methods::setGeneric will also identify the
methods package as a dependency.
Specifically, the function will:
Create a new local snapshot library to install packages. By default this library folder is at
~/.checkpoint but you can modify the path using the
Update the options for your CRAN mirror and point to an MRAN snapshot using options
Scan your project folder for all required packages and install them from the snapshot using
# Create temporary project and set working directory example_project <- paste0("~/checkpoint_example_project_", Sys.Date()) dir.create(example_project, recursive = TRUE) oldwd <- setwd(example_project) # Write dummy code file to project cat("library(MASS)", "library(foreach)", sep="\n", file="checkpoint_example_code.R") # Create a checkpoint by specifying a snapshot date library(checkpoint) checkpoint("2014-09-17") # Check that CRAN mirror is set to MRAN snapshot getOption("repos") # Check that library path is set to ~/.checkpoint .libPaths() # Check which packages are installed in checkpoint library installed.packages() # cleanup unlink(example_project, recursive = TRUE) setwd(oldwd)