
Copy an R data.frame to Spark, and return a reference to the generated Spark DataFrame as a tbl_spark. The returned object will act as a dplyr-compatible interface to the underlying Spark table.
Usage

# S3 method for spark_connection
copy_to(dest, df, name = deparse(substitute(df)),
  memory = TRUE, repartition = 0L, overwrite = FALSE, ...)
Arguments

dest: A spark_connection.

df: An R data.frame.

name: The name to assign to the copied table in Spark.

memory: Boolean; should the table be cached into memory?

repartition: The number of partitions to use when distributing the table across the Spark cluster. The default (0) can be used to avoid partitioning.

overwrite: Boolean; should a pre-existing table with this name be overwritten if one already exists?

...: Optional arguments; currently unused.
Value

A tbl_spark, representing a dplyr-compatible interface to a Spark DataFrame.