Methods to get a grip on working with remote data sources ('SQL' databases, sparklyr' 'Spark' 2.0.0 and above) through 'dplyr'. Adds convenience functions to make such tasks more like working with an in-memory 'data.frame'. Results do depend on which 'dplyr' data service you use.
replyr
helps with the following:
Package common data manipulation tasks into operators such as the gapply
function.
Provide "remote data" (SQL, Spark) replacements for functions commonly used on in-memory data frames.
Provide bug-fixes and work-arounds for various data services.
Collect and document clever dplyr tricks.
To learn more about replyr, please start with the vignette:
NA