sparklyr (version 0.8.1-9001)

sdf_repartition: Repartition a Spark DataFrame

Description

Repartition a Spark DataFrame

Usage

sdf_repartition(x, partitions = NULL, partition_by = NULL)

Arguments

x

A spark_connection, ml_pipeline, or a tbl_spark.

partitions

number of partitions

partition_by

vector of column names used for partitioning, only supported for Spark 2.0+