Applies an R function to a Spark object (typically, a Spark DataFrame).
spark_apply(x, f, names = colnames(x), memory = TRUE, group_by = NULL,
packages = TRUE, ...)
x: An object (usually a spark_tbl) coercible to a Spark DataFrame.
f: A function that transforms a data frame partition into a data frame.
names: The column names for the transformed object; defaults to the names from the original object.
memory: Boolean; should the table be cached into memory?
group_by: Column name by which to group data frame partitions.
packages: Boolean; distribute .libPaths() packages to nodes?
...: Optional arguments; currently unused.
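As a minimal sketch of how these arguments fit together, the example below defines a partition function, checks it on an ordinary R data frame, and then wraps the Spark call in a helper. The names scale_partition, run_on_spark, and example_tbl are hypothetical and chosen only for illustration. The helper assumes that sparklyr, dplyr, and a local Spark installation are available, and it follows the signature shown above, so other sparklyr versions may differ.

```r
# Partition function: receives one partition as a plain R data frame and
# returns a data frame. It runs on the worker nodes, so it should depend
# only on its input.
scale_partition <- function(df) {
  data.frame(
    id = df$id,
    value_scaled = df$value * 10
  )
}

# Local check on an ordinary data frame; no Spark is needed for this step.
local_input <- data.frame(id = 1:3, value = c(0.5, 1, 1.5))
local_result <- scale_partition(local_input)
print(local_result)

# Hypothetical helper that applies the same function on Spark.
# Assumption: sparklyr, dplyr, and a local Spark installation are present.
run_on_spark <- function() {
  sc <- sparklyr::spark_connect(master = "local")
  on.exit(sparklyr::spark_disconnect(sc))

  # Copy the sample data to Spark ("example_tbl" is an illustrative name).
  tbl <- sparklyr::sdf_copy_to(sc, local_input, name = "example_tbl",
                               overwrite = TRUE)

  # The function returns columns that differ from the input, so the output
  # column names are passed explicitly through `names`.
  result <- sparklyr::spark_apply(tbl, scale_partition,
                                  names = c("id", "value_scaled"))

  # Bring the transformed rows back into the R session.
  dplyr::collect(result)
}
```

Calling run_on_spark() in a session with Spark available should return the transformed rows as a local data frame. Testing the partition function locally first is a deliberate choice here, since errors raised inside worker processes tend to be harder to diagnose.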