Fast cbind for Spark DataFrames

R/sdf_utils.R

sdf_fast_bind_cols

Description

This is a version of sdf_bind_cols that works by zipping RDDs. From the API docs: “Assumes that the two RDDs have the same number of partitions and the same number of elements in each partition (e.g. one was made through a map on the other).”

Usage

sdf_fast_bind_cols(...) 

Arguments

Arguments Description
Spark DataFrames to cbind