Source code of the examples used in the web demo showing how to use spaklyr to analyze 1.1 Billion records using sparklyr and Cloudera.
Presentation delivered during the RStudio Conference 2017. It shows how to sample 1 trillion pages using sparklyr to answer the question: What is the most used keyword or library in the web?
Link to the Presentation Deck - R and Spark
Link to the RPubs article - Analyzing 4 Billions of Tags with R and Spark
Announcement regarding the release of sparklyr 0.5 into CRAN. This article expands on the newest features in sparklyr.
Post in the Cloudera Engineering Blog that shows how to automate a sparklyr environment on AWS using Cloudera Director