sdf_mutate
Mutate a Spark DataFrame
Use Spark's feature transformers to mutate a Spark DataFrame.
Usage
sdf_mutate(.data, ...)

sdf_mutate_(.data, ..., .dots)
Arguments
- .data
A spark_tbl.
- ...
Named arguments, mapping new column names to the transformations to be applied.
- .dots
A named list, mapping output names to transformations.
Transforming Spark DataFrames
The family of functions prefixed with sdf_ generally access the Scala
Spark DataFrame API directly, as opposed to the dplyr interface, which
uses Spark SQL. These functions 'force' any pending SQL in a dplyr
pipeline, so the returned tbl_spark object no longer carries the
attached 'lazy' SQL operations. Note that the underlying Spark DataFrame
still executes its own operations lazily: even though the pending set of
operations is no longer exposed at the R level, those operations are
only executed when you explicitly collect() the table.
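The forcing behavior described above can be sketched as follows. This is an illustrative example only: it assumes an active Spark connection named sc, the sparklyr and dplyr packages loaded, and the iris_tmp table name is hypothetical.

```r
library(sparklyr)
library(dplyr)

# assumes `sc` is an existing spark_connection
iris_tbl <- copy_to(sc, iris, "iris_tmp")

# dplyr verbs accumulate lazy SQL on the tbl_spark ...
lazy_tbl <- iris_tbl %>%
  filter(Sepal_Length > 5)

# ... an sdf_* call forces that pending SQL, returning a tbl_spark
# with no attached lazy operations. The Spark DataFrame itself still
# computes lazily; rows reach R only on collect().
forced_tbl <- lazy_tbl %>%
  sdf_mutate(wide = ft_binarizer(Petal_Width, 1))

collect(forced_tbl)
```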
Examples
# NOT RUN {
# using the 'beaver1' dataset, binarize the 'temp' column
data(beavers, package = "datasets")
beaver_tbl <- copy_to(sc, beaver1, "beaver")

mutated_tbl <- beaver_tbl %>%
  mutate(squared = temp ^ 2) %>%
  sdf_mutate(warm = ft_binarizer(squared, 1000)) %>%
  sdf_register("mutated")

# view our newly constructed tbl
head(mutated_tbl)

# note that we have two separate tbls registered
dplyr::src_tbls(sc)
# }