Demo and presentation of sparklyr on a YARN-managed cluster by Edgar Ruiz at the 2017 Strata+Hadoop conference in San Jose, CA.
Link to presentation deck - RStudio sparklyr Strata Hadoop
Link to demo Notebook - sparklyr - R Notebook
Demo and presentation by Nathan Stephens at the 2017 Spark Summit.
Source code of the examples used in the web demo showing how to analyze 1.1 billion records using sparklyr and Cloudera.
Link to the video at cloudera.com - Analyzing Hadoop with sparklyr
Video and materials from Javier Luraschi’s presentation at the RStudio Conference 2017. It shows how to sample 1 trillion pages using sparklyr to answer the question: What is the most used keyword or library on the web?
Link to the Presentation Deck - R and Spark
Link to the RPubs article - Analyzing 4 Billions of Tags with R and Spark
Announcement of the release of sparklyr 0.5 to CRAN. This article expands on the newest features in sparklyr.
Post on the Cloudera Engineering Blog that shows how to automate a sparklyr environment on AWS using Cloudera Director.