
Repartition a Spark DataFrame
sdf_repartition(x, partitions = NULL, partition_by = NULL)
x: A spark_connection, ml_pipeline, or a tbl_spark.
partitions: Number of partitions.
partition_by: Vector of column names used for partitioning. Only supported for Spark 2.0+.
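
The following is a minimal usage sketch, not an official example from the package. It assumes a local Spark installation is available to sparklyr, uses the built-in mtcars data set, and picks the table name "mtcars" and the column "cyl" purely for illustration. It repartitions once by partition count and once by column, then checks the result with sdf_num_partitions().

```r
library(sparklyr)

# Assumption: Spark is installed locally (for example via spark_install()).
sc <- spark_connect(master = "local")

# Copy a small built-in data set to Spark; the result is a tbl_spark.
mtcars_tbl <- sdf_copy_to(sc, mtcars, name = "mtcars", overwrite = TRUE)

# Repartition into a fixed number of partitions.
by_count <- sdf_repartition(mtcars_tbl, partitions = 4)
sdf_num_partitions(by_count)

# Repartition by column name (Spark 2.0+ only, per the argument description).
by_cyl <- sdf_repartition(mtcars_tbl, partition_by = "cyl")
sdf_num_partitions(by_cyl)

spark_disconnect(sc)
```

When only partition_by is given, the resulting partition count is decided by Spark, so it is worth inspecting with sdf_num_partitions() rather than assuming a value.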