Data Science LA
September 06, 2014
Juan Natera - Highlights of useR conference - LA R meetup - Sep 2014
Transcript
A short introduction to dplyr
Juan Natera
Los Angeles R Meetup, 09/04/2014
A bit about me
• Software Engineer
• Interested in R and its use for gaining insights about data
• Open Source enthusiast
• Baseball fanatic
About dplyr
• Developed by Hadley Wickham, Chief Scientist @ RStudio.
• Part of a suite of packages meant to facilitate working on the “data pipeline”.
Why?
• People spend a lot of time getting data ready for analysis
• Almost no learning curve (just need to learn 5 verbs)
• Improves readability
• It's FAST
The data pipeline
Tidy → Transform → Model → Visualize
The 5 verbs
• filter: remove rows (keep only those that match a condition)
• select: choose columns
• arrange: reorder rows
• mutate: change data (add or modify columns)
• summarize: guess...
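The five verbs above can be sketched roughly as follows. This is an illustrative example, not code from the talk: the built-in mtcars dataset and the column name wt_per_hp are assumptions chosen for the demo.

```r
library(dplyr)

# mtcars ships with base R; it is only a stand-in dataset for this sketch.
cars <- mtcars

# filter: keep only the rows that satisfy a condition
four_cyl <- filter(cars, cyl == 4)

# select: choose columns by name
slim <- select(cars, mpg, cyl, wt)

# arrange: reorder rows (desc() sorts in descending order)
by_mpg <- arrange(cars, desc(mpg))

# mutate: add a new column computed from existing ones
# (wt_per_hp is a hypothetical name used for illustration)
with_ratio <- mutate(cars, wt_per_hp = wt / hp)

# summarize: collapse all rows into a one-row summary
avg <- summarize(cars, mean_mpg = mean(mpg))
```

Each call takes the data frame first and describes the operation with the remaining arguments, so the calls read much like the verb list itself.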
No learning curve, how?
• First parameter is always a data.frame
• Other parameters describe what you want to do with it.
• Always returns a new data.frame
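Because every verb takes a data.frame first and returns a new one, the verbs can be chained. The sketch below assumes the %>% pipe that dplyr re-exports from magrittr (the slide text itself does not mention it) and again uses mtcars and a hypothetical wt_per_hp column purely for illustration.

```r
library(dplyr)

# mtcars is a stand-in dataset; the talk's own data is not in the transcript.
cars <- mtcars

# Each step receives the data frame produced by the previous step
# as its first argument and returns a new data frame.
result <- cars %>%
  filter(cyl == 4) %>%                 # keep four-cylinder cars
  mutate(wt_per_hp = wt / hp) %>%      # add a derived column (hypothetical name)
  select(mpg, wt_per_hp) %>%           # keep two columns
  arrange(desc(mpg))                   # highest mpg first

# The input is left unmodified; the verbs return new data frames.
unchanged <- identical(cars, mtcars)
```

The same result could be written as nested function calls; the piped form is one way to keep the steps in reading order.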
It's Fast
Let's see some code!
A great book I picked up at useR 2014
Questions or Comments?
[email protected]