Repartition a Spark DataFrame

R/sdf_interface.R

sdf_repartition

Description

Repartition a Spark DataFrame into a given number of partitions, optionally partitioning by one or more columns.

Usage

sdf_repartition(x, partitions = NULL, partition_by = NULL) 

Arguments

Argument      Description
x             A spark_connection, ml_pipeline, or a tbl_spark.
partitions    Number of partitions.
partition_by  Vector of column names used for partitioning; only supported for Spark 2.0+.
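
Examples

The following is a minimal sketch of how the two arguments might be used. It is not taken from the package documentation. It assumes a local Spark installation is available to sparklyr, and the table name "mtcars_tbl" and the choice of the cyl column are purely illustrative. sdf_num_partitions() is used only to inspect the result.

```r
library(sparklyr)

# Assumption: a local Spark installation is available (e.g. via spark_install()).
sc <- spark_connect(master = "local")

# Copy a built-in R dataset to Spark; the table name is illustrative.
mtcars_tbl <- sdf_copy_to(sc, mtcars, name = "mtcars_tbl", overwrite = TRUE)

# Request an explicit number of partitions.
by_count <- sdf_repartition(mtcars_tbl, partitions = 4)
n_by_count <- sdf_num_partitions(by_count)

# Partition by a column (Spark 2.0+), so that rows sharing a `cyl`
# value are placed in the same partition.
by_column <- sdf_repartition(mtcars_tbl, partition_by = "cyl")
n_by_column <- sdf_num_partitions(by_column)

spark_disconnect(sc)
```

When only partition_by is supplied, the resulting number of partitions is chosen by Spark rather than by the caller, and typically follows the spark.sql.shuffle.partitions setting. Passing both arguments should give the requested count while still grouping rows by the named columns.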