Learn R Programming

Tufte-Inspired Slopegraphs in R

This repository holds some working code for creating "slopegraphs" in R.

This is very much a work in progress. Once it's more stable, I will release the package to CRAN.

Pull requests welcome. Please report any issues on the issues page.

The package currently includes one mainfunction, slopegraph(), which produces a slopegraph from an observation-by-period data frame. Everything is more or less drawn automatically, but is highly customizable in terms of line and text colors, font sizes and styles, axes, titles, and plotting behind and in front of the slopegraph lines. An underlying function, segmentize() produces the data structure used for the actual plotting. And a new function, ggslopegraph() does the same as slopegraph() but using ggplot2 graphics.

Examples

The current output of the slopegraph() function (for the examples included in documentation) are shown below.

Tufte's most famous slopegraph example is probably the "cancer survival graph," depicting 5, 10, 15, and 20 year survival rates for various cancers. The first example mimics this result but draws it to the correct scale (unlike Tufte's original):

library("slopegraph")
data(cancer)
slopegraph(cancer, col.lines = 'gray', col.lab = "black", 
           xlim = c(-.5,5.5), cex.lab = 0.5, cex.num = 0.5,
           xlabels = c('5 Year','10 Year','15 Year','20 Year'))

The second example, also from Tufte, shows changes in gross domestic product for a small set of countries over two points in time:

data(gdp)
slopegraph(gdp, col.lines = 'gray', col.lab = "black", xlabels = c('1970','1979'),  
           main = 'Current Receipts of Goverment as a Percentage of Gross Domestic Product')

This third example comes from an 1878 publication (a copy of which is available here), showing the relative ranking of the population of various U.S. states. This example features a reversed y-axis to better display the ranking and I demonstrate the col.lines argument to highlight South Carolina:

data(states)
cols <- `[<-`(rep("black", 37), 7, "red")
slopegraph(states, xlim = c(-1, 12), ylim = c(37,0), offset.x = 0.06,
           col.lines = cols, col.lab = cols, 
           main = 'Relative Rank of U.S. State Populations, 1790-1870')

As of v0.1.9, there is also a ggplot2-based function, ggslopegraph() that produces a similar representation but using ggplot2 graphics:

require("ggplot2")
## Loading required package: ggplot2
data(states)
cols <- `[<-`(rep("black", 37), 7, "red")
ggslopegraph(states, offset.x = 0.06, yrev = TRUE,
  col.lines = cols, col.lab = cols, 
  main = 'Relative Rank of U.S. State Populations, 1790-1870') +
 theme_bw()    
## Warning: Removed 84 rows containing missing values (geom_text).

Installation

To install the latest development version of slopegraph from GitHub:

if (!require("remotes")) {
    install.packages("remotes")
}
remotes::install_github("leeper/slopegraph")

Copy Link

Version

Version

0.1.14

License

GPL-2

Issues

Pull Requests

Stars

Forks

Maintainer

Thomas J. Leeper

Last Published

August 7th, 2018

Functions in slopegraph (0.1.14)

states

U.S. State Populations
segmentize

Segmentize an observation-by-period data frame
slopegraph

Create Slopegraph from a data frame
gdp

Current receipts of government as a percentage of Gross Domestic Product, 1970 and 1979
ggslopegraph

Create Slopegraph from a data frame using ggplot2
bachelors

US Bachelors Degrees
cancer

Estimates of relative survival rates, by cancer site
ggslopegraph2

Plot a Slopegraph a la Tufte using dplyr and ggplot2
cancer2

Estimates of relative survival rates, by cancer site (long)