Repartition a Spark DataFrame

Repartition a Spark DataFrame

sdf_repartition(x, partitions = NULL, partition_by = NULL)

Arguments

x

A spark_connection, ml_pipeline, or a tbl_spark.

partitions

number of partitions

partition_by

vector of column names used for partitioning, only supported for Spark 2.0+