sdf_repartition

x

<p>A <code>spark_connection</code>, <code>ml_pipeline</code>, or a <code>tbl_spark</code>.</p>

partitions

<p>number of partitions</p>

partition_by

<p>vector of column names used for partitioning, only supported for Spark 2.0+</p>
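
A minimal sketch of how the arguments of sdf_repartition might be used. It assumes a local Spark installation is available to sparklyr; the mtcars data set, the table name "mtcars_tbl", the partition count 4, and the column "cyl" are illustrative choices, not part of the original documentation.

```r
library(sparklyr)

# Assumption: Spark is installed locally (e.g. via spark_install()).
sc <- spark_connect(master = "local")

# Copy a small built-in data set to Spark, purely for illustration.
cars_tbl <- sdf_copy_to(sc, mtcars, name = "mtcars_tbl", overwrite = TRUE)

# Repartition into a fixed number of partitions.
by_count <- sdf_repartition(cars_tbl, partitions = 4)
n_parts <- sdf_num_partitions(by_count)

# Repartition by the values of one or more columns
# (only supported for Spark 2.0+, per the argument description).
by_column <- sdf_repartition(cars_tbl, partition_by = "cyl")

spark_disconnect(sc)
```

Passing only partitions asks for a target partition count, while partition_by groups rows that share the same column values into the same partition. Which of the two fits better depends on the downstream operation.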

R interface to Apache Spark, a fast and general
engine for big data processing; see <http://spark.apache.org>. This
package supports connecting to local and remote Apache Spark clusters,
provides a 'dplyr'-compatible back-end, and offers an interface to
Spark's built-in machine learning algorithms.

Yitao Li

sparklyr

R Interface to Apache Spark

Javier Luraschi

Kevin Kuo

Kevin Ushey

JJ Allaire

Samuel Macedo

Hossein Falaki

Lu Wang

Andy Zhang

Jozef Hajnala

Maciej Szymkiewicz

Wil Davis

RStudio

The Apache Software Foundation

sdf_repartition function

Repartition a Spark DataFrame — sdf_repartition

Description

Usage

Arguments