Loading…
This event has ended. Visit the official site or create your own event on Sched.
Click here to return to main conference site. For a one page, printable overview of the schedule, see this.
View analytic
Monday, June 27 • 1:00pm - 2:15pm
Introduction to SparkR (Part 1)

Log in to save this to your schedule and see who's attending!

Apache Spark is a popular cluster computing framework used for performing large scale data analysis. This tutorial will introduce cluster computing using SparkR: the R language API for Spark. SparkR provides a distributed data frame API that enables structured data processing with a syntax familiar to R users. In this tutorial we will provide example workflows for ingesting data, performing data analysis and doing interactive queries using distributed data frames. Finally, participants will be able to try SparkR on realworld datasets using Databricks R notebooks to get hands-on experience using SparkR.

For details, refer to tutorial description.

Speakers
HF

Hossein Falaki

Databricks Inc.


Monday June 27, 2016 1:00pm - 2:15pm
Econ 140 579 Serra Mall, Stanford, CA 94305

Attendees (83)