Demos and Presentations

Analyzing Hadoop Data Using sparklyr

Source code of the examples used in the web demo showing how to use spaklyr to analyze 1.1 Billion records using sparklyr and Cloudera.

R and Spark

Presentation delivered during the RStudio Conference 2017. It shows how to sample 1 trillion pages using sparklyr to answer the question: What is the most used keyword or library in the web?

Link to the Presentation Deck - R and Spark

Link to the RPubs article - Analyzing 4 Billions of Tags with R and Spark

Blog Posts

sparklyr 0.5

Announcement regarding the release of sparklyr 0.5 into CRAN. This article expands on the newest features in sparklyr.

How-to: Automate Your sparklyr Environment with Cloudera Director

Post in the Cloudera Engineering Blog that shows how to automate a sparklyr environment on AWS using Cloudera Director

sparklyr is an RStudio project. © 2016 RStudio, Inc.