Wednesday, June 29 • 11:40am - 11:45am
Chunked, dplyr for large text files

During a data analysis project, a new version of the raw data may become available, or the data may be changed outside of your control. `daff` is an R package that helps keep track of such changes. It can find differences in values between data.frames, store these differences, render them, and apply them as a patch to a new data.frame. It can also merge two versions of a data.frame that share a common parent version. It wraps the daff.js library by Paul Fitzpatrick (http://github.com/paulfitz/daff) using the V8 package.

Moderators
Joseph Rickert

Program Manager, Microsoft
Joseph is a Program Manager at Microsoft, which he joined through the acquisition of Revolution Analytics. He is a data scientist and R language evangelist, passionate about analyzing data and teaching people about R. He is a regular contributor to the Revolutions blog and an organizer of the Bay Area R Users Group. Joseph is a long-time Silicon Valley start-up guy with experience building statistical models in industries as diverse as...

Speakers
Edwin de Jonge

Statistical consultant / Data Scientist, Statistics Netherlands (CBS)


Wednesday June 29, 2016 11:40am - 11:45am
SIEPR 120 366 Galvez St, Stanford, CA 94305