sdf_repartition

<p>A <code>spark_connection</code>, <code>ml_pipeline</code>, or a <code>tbl_spark</code>.</p>

partitions

<p>Vector of column names used for partitioning. Only supported for Spark 2.0+.</p>

partition_by

R interface to Apache Spark, a fast and general engine for big data
processing, see <http://spark.apache.org>. This package supports connecting to
local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end,
and provides an interface to Spark's built-in machine learning algorithms.

Javier Luraschi

sparklyr

R Interface to Apache Spark

Kevin Kuo

Kevin Ushey

JJ Allaire

 RStudio

 The Apache Software Foundation

sdf_repartition function

Repartition a Spark DataFrame — sdf_repartition


Description

Usage
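A minimal usage sketch, assuming a local Spark installation is available and that the arguments listed on this page are passed by name. Here `partitions` is taken to be the target number of partitions, and the table name `mtcars_tbl` is purely illustrative. This is one plausible way to call the function, not a definitive recipe.

```r
library(sparklyr)

# Assumption: Spark is installed locally (for example via spark_install()).
sc <- spark_connect(master = "local")

# Copy a small built-in data set to Spark; the name is illustrative only.
mtcars_tbl <- sdf_copy_to(sc, mtcars, name = "mtcars_tbl", overwrite = TRUE)

# Repartition into a fixed number of partitions.
by_count <- sdf_repartition(mtcars_tbl, partitions = 4)
sdf_num_partitions(by_count)

# Repartition by the values of a column (Spark 2.0+ only).
by_cyl <- sdf_repartition(mtcars_tbl, partition_by = "cyl")

spark_disconnect(sc)
```

`sdf_num_partitions()` is used only to inspect the result. Repartitioning by column is useful when later operations group or join on that column.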

Arguments