We are excited to share that sparklyr 0.7 is now available on CRAN! sparklyr provides an R interface to Apache Spark. It supports dplyr syntax for working with Spark DataFrames and exposes the full range of machine learning algorithms available in Spark. This release:
Adds support for ML Pipelines, which provide a uniform set of high-level APIs to help create, tune, and deploy machine learning pipelines at scale.
Enhances machine learning capabilities by supporting the full range of ML algorithms and feature transformers.
Improves data serialization, specifically by adding support for date columns.
Adds support for YARN cluster mode connections.
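As a rough illustration of the ML Pipelines support mentioned above, here is a minimal sketch. It assumes a local Spark installation (for example, one set up with spark_install()); the mtcars data set and the formula are placeholders chosen only for illustration and do not come from the release notes.

```r
library(sparklyr)
library(dplyr)

# Assumption: Spark is installed locally; adjust `master` for a real cluster.
sc <- spark_connect(master = "local")

# Copy a small built-in data set to Spark (illustrative data only).
mtcars_tbl <- copy_to(sc, mtcars, "mtcars", overwrite = TRUE)

# Define a pipeline: an R formula feature transformer followed by
# a logistic regression estimator.
pipeline <- ml_pipeline(sc) %>%
  ft_r_formula(am ~ mpg + wt) %>%
  ml_logistic_regression()

# Fitting the pipeline returns a pipeline model, which can then be
# applied to a Spark DataFrame to produce predictions.
model <- ml_fit(pipeline, mtcars_tbl)
predictions <- ml_transform(model, mtcars_tbl)

spark_disconnect(sc)
```

The pipeline is only a description of the stages until ml_fit() is called, which is what makes it convenient to reuse on different data sets.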
The full blog post is available on the RStudio blog: https://blog.rstudio.com/2018/01/29/sparklyr-0-7/