Ben Baumer

Ben Baumer

9 packages on CRAN

2 packages on GitHub

etl

cran
86th

Percentile

A predictable and pipeable framework for performing ETL (extract-transform-load) operations on publicly-accessible medium-sized data set. This package sets up the method structure and implements generic functions. Packages that depend on this package download specific data sets from the Internet, clean them up, and import them into a local or remote relational database management system.

mdsr

cran
85th

Percentile

A complement to *Modern Data Science with R*, both the first (ISBN: 978-1498724487, publisher URL: <https://www.routledge.com/Modern-Data-Science-with-R/Baumer-Kaplan-Horton/p/book/9781498724487>) and second editions (ISBN: 978-0367191498, publisher URL: <https://www.routledge.com/Modern-Data-Science-with-R/Baumer-Kaplan-Horton/p/book/9780367191498>). This package contains data and code to complete exercises and reproduce examples from the text. It also facilitates connections to the SQL database server used in the book. Both editions of the book are supported by this package.

macleish

cran
57th

Percentile

Download data from the Ada and Archibald MacLeish Field Station in Whately, MA. The Ada and Archibald MacLeish Field Station is a 260-acre patchwork of forest and farmland located in West Whately, MA that provides opportunities for faculty and students to pursue environmental research, outdoor education, and low-impact recreation (see <http://www.smith.edu/ceeds/macleish.php> for more information). This package contains weather data over several years, and spatial data on various man-made and natural structures.

teamcolors

cran
57th

Percentile

Provides color palettes corresponding to professional and amateur, sports teams. These can be useful in creating data graphics that are themed for particular teams.

openWAR

github
16th

Percentile

This package serves two primary purposes: 1) it facilitates the computation of openWAR, a fully open-source implementation of Wins Above Replacement (WAR) that could serve as a reference implementation for the sabermetric community; and 2) it downloads raw XML files from the MLBAM GameDay web application and processes them into play-by-play data in a tabular format. This play-by-play information is similar in spirit, though not in syntax, to play-by-play data made available by Retrosheet. Those interested in the modeling choices that we have made in our computation of openWAR should consult our JQAS or arXiv paper on that subject. This implementation of openWAR includes functions for constructing interval estimates of WAR for each player, as well as comparing openWAR point estimates to those of Baseball-Reference.com's rWAR.

knitr

cran
99.9th

Percentile

Provides a general-purpose tool for dynamic report generation in R using Literate Programming techniques.

infer

cran
98th

Percentile

The objective of this package is to perform inference using an expressive statistical grammar that coheres with the tidy design framework.

openintro

cran
91th

Percentile

Supplemental functions and data for 'OpenIntro' resources, which includes open-source textbooks and resources for introductory statistics (<https://www.openintro.org/>). The package contains data sets used in our open-source textbooks along with custom plotting functions for reproducing book figures. Note that many functions and examples include color transparency; some plotting elements may not show up properly (or at all) when run in some versions of Windows operating system.

90th

Percentile

Datasets and code published by the data journalism website 'FiveThirtyEight' available at <https://github.com/fivethirtyeight/data>. Note that while we received guidance from editors at 'FiveThirtyEight', this package is not officially published by 'FiveThirtyEight'.

fec16

cran
68th

Percentile

Easily analyze relational data from the United States 2016 federal election cycle as reported by the Federal Election Commission. This package contains data about candidates, committees, and a variety of different financial expenditures. Data is from <https://www.fec.gov/data/browse-data/?tab=bulk-data>.

baseballr

github
16th

Percentile

Provides numerous functions for acquiring and analyzing baseball data. Data can be acquired from various online sources from within R. Custom metrics can also be calculated, such as wOBA, FIP, and Edge%.