One-hot encoding maps a column of label indices to a column of binary vectors, each with at most a single one-value. This encoding allows algorithms that expect continuous features, such as Logistic Regression, to use categorical features.
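
To make "at most a single one-value" concrete, here is a minimal base R sketch of the mapping. The helper `one_hot()` and its arguments are hypothetical and are not part of sparklyr. It assumes zero-based label indices and returns a dense matrix purely for illustration; Spark itself produces vector columns. The `drop_last` argument is meant to mirror `drop.last` in the usage below.

```r
# Hypothetical helper for illustration only (not a sparklyr function):
# encode zero-based label indices as rows of a dense 0/1 matrix.
one_hot <- function(idx, n_categories = max(idx) + 1, drop_last = TRUE) {
  # Dropping the last category removes one column from the encoding.
  width <- if (drop_last) n_categories - 1 else n_categories
  out <- matrix(0, nrow = length(idx), ncol = width)
  for (i in seq_along(idx)) {
    # Indices are zero-based, so category k sets column k + 1.
    # A dropped last category sets nothing and stays all zeros.
    if (idx[i] < width) out[i, idx[i] + 1] <- 1
  }
  out
}

labels <- c(0, 1, 2, 1)
one_hot(labels, drop_last = FALSE)  # three columns, exactly one 1 per row
one_hot(labels)                     # two columns, category 2 is an all-zero row
```

With the last category dropped, a row holds either one 1 or none, which is why the description says "at most" a single one-value.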

ft_one_hot_encoder(x, input.col = NULL, output.col = NULL,
  drop.last = TRUE, ...)



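x
An object (usually a spark_tbl) coercible to a Spark DataFrame.

input.col
The name of the input column(s).

output.col
The name of the output column.

drop.last
Boolean; drop the last category? Defaults to TRUE, in which case the last category is encoded as an all-zero vector.

...
Optional arguments; currently unused.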
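
The following is a hedged usage sketch, not an official example. It assumes a local Spark installation, a sparklyr version whose argument names match the signature above (`input.col`, `output.col`), and that `ft_string_indexer()` accepts the same argument names. The table and column names are illustrative.

```r
library(sparklyr)
library(dplyr)

# Assumption: Spark is installed locally (for example via spark_install()).
sc <- spark_connect(master = "local")

# Copy a small data set to Spark; the table name "iris" is arbitrary.
iris_tbl <- copy_to(sc, iris, "iris", overwrite = TRUE)

encoded <- iris_tbl %>%
  # The encoder expects label indices, so index the string column first.
  ft_string_indexer(input.col = "Species", output.col = "species_idx") %>%
  # Encode the indices; drop.last = TRUE is the default from the usage above.
  ft_one_hot_encoder(input.col = "species_idx", output.col = "species_vec")

# Printing the result triggers evaluation in Spark.
encoded

spark_disconnect(sc)
```

Passing `drop.last = FALSE` would keep a vector slot for every category instead of encoding the last one as all zeros.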

See also

See the Spark documentation for more information on the set of transformations available for DataFrame columns.

