runner an R package for running operations.
About
Package contains standard running functions (aka. windowed, rolling, cumulative) with additional options. runner provides extended functionality like date windows, handling missings and varying window size. runner brings also rolling streak and rolling which, what extends beyond range of functions already implemented in R packages.
Installation
Install package from from GitHub or from CRAN.
# devtools::install_github("gogonzo/runner")
install.packages("runner")Using runner
runner package provides functions applied on running windows. Diagram below illustrates what running windows are - in this case running k = 4 windows. For each of 15 elements of a vector each window contains current 4 elements.
Using runner one can apply any R function f in running window of length defined by k, window lag, observation indexes idx.
Window size
k denotes number of elements in window. If k is a single value then window size is constant for all elements of x. For varying window size one should specify k as integer vector of length(k) == length(x) where each element of k defines window length. If k is empty it means that window will be cumulative (like base::cumsum). Example below illustrates window of k = 4 for 10th element of vector x.
Window lag
lag denotes how many observations windows will be lagged by. If lag is a single value than it's constant for all elements of x. For varying lag size one should specify lag as integer vector of length(lag) == length(x) where each element of lag defines lag of window. Default value of lag = 0. Example below illustrates window of k = 4 lagged by lag = 2 for 10'th element of vector x. Lag can also be negative value, which shifts window forward instead of backward.
Windows depending on date
Sometimes data points in dataset are not equally spaced (missing weekends, holidays, other missings) and thus window size should vary to keep expected time frame. If one specifies idx argument, than running functions are applied on windows depending on date. idx should be the same length as x of class Date or integer. Including idx can be combined with varying window size, than k will denote number of periods in window different for each data point. Example below illustrates window of size k = 5 lagged by lag = 2. In parentheses ranges for each window.
NA padding
Using runner one can also specify na_pad = TRUE which would return NA for any window which is partially out of range - meaning that there is no sufficient number of observations to fill the window. By default na_pad = FALSE, which means that incomplete windows are calculated anyway. na_pad is applied on normal cumulative windows and on windows depending on date.
Build-in functions
With runner one can use any R functions, but some of them are optimized for speed reasons. These functions are:
- aggregating functions -
length_run,min_run,max_run,minmax_run,sum_run,mean_run,streak_run - utility functions -
fill_run,lag_run,which_run
Simple example
14-days trimmed mean
library(runner)
x <- rnorm(20)
date <- seq.Date(Sys.Date(), Sys.Date() + 19, by = "1 day")
runner(x, k = 14, idx = date, f = function(xi) mean(xi, na.rm = TRUE, trim = 0.05))## [1] 0.4394014 0.3126355 0.4827135 0.5236178 0.4819801 0.6976055 0.6023376
## [8] 0.6202999 0.5219315 0.3677634 0.3204818 0.4331546 0.5947573 0.5615686
## [15] 0.5020783 0.4840329 0.4259051 0.3359465 0.3505671 0.3486055