Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Juan Natera - Highlights of useR conference - L...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Data Science LA
September 06, 2014
1
3.3k
Juan Natera - Highlights of useR conference - LA R meetup - Sep 2014
Data Science LA
September 06, 2014
Tweet
Share
More Decks by Data Science LA
See All by Data Science LA
Opening the Black Box: Attempts to Understand the Results of Machine Learning Models - Michael Tiernay - LA Data Science Meetup - May 2017
datasciencela
2
1.6k
Scott Le Grand - DSSTNE - LA Data Science Meetup - Oct 2016
datasciencela
1
430
Tianqi Chen - XGBoost: Implementation Details - LA Workshop Talk
datasciencela
4
28k
Tianqi Chen - XGBoost: Overview and Latest News - LA Meetup Talk
datasciencela
9
790k
Erin LeDell - Intro to H2O Machine Learning in Python - Python Data Science LA Meetup - Jan 2016
datasciencela
1
210
Jeong-Yoon Lee - Winning Data Science Competitions - Data Science Meetup - Oct 2015
datasciencela
8
11k
Ulas Bardak, Maarten Bosma, Rohan Monga - Data Science @Whisper - LA Data Science Meetup - March 2015
datasciencela
5
1.7k
Eduardo Arino de la Rubia - Big Data is not Hadoop - LA DW/BI/Analytics Meetup - Febr 2015
datasciencela
3
1k
Eric Klusman - The BI software market - LA DW/BI/Analytics Meetup - Febr 2015
datasciencela
2
1k
Featured
See All Featured
Noah Learner - AI + Me: how we built a GSC Bulk Export data pipeline
techseoconnect
PRO
0
150
Dominate Local Search Results - an insider guide to GBP, reviews, and Local SEO
greggifford
PRO
0
120
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
430
How to optimise 3,500 product descriptions for ecommerce in one day using ChatGPT
katarinadahlin
PRO
1
3.5k
The Illustrated Guide to Node.js - THAT Conference 2024
reverentgeek
1
320
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
16
1.9k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
2.6k
コードの90%をAIが書く世界で何が待っているのか / What awaits us in a world where 90% of the code is written by AI
rkaga
61
43k
Between Models and Reality
mayunak
2
250
The #1 spot is gone: here's how to win anyway
tamaranovitovic
2
1k
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.4k
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
130
Transcript
A short introduction to dplyr Juan Natera Los Angeles R
Meetup 09/04/2014
A bit about me • Software Engineer • Interested in
R and its use for gaining insights about data • Open Source enthusiast • Baseball fanatic
About dplyr • Developed by Hadley Wickham, Chief Scientist @
Rstudio. • Part of a suite of packages meant to facilitate working on the “data pipeline”.
Why? • People spend a lot of time getting data
ready for analysis • Almost no learning curve (just need to learn 5 verbs) • Improves readability • It's FAST
The data pipeline Tidy Transform Model Visualize
The 5 verbs • flter: remove rows • select: choose
columns • arrange: reorder rows • mutate: change data • summarize: guess...
No learning curve, how? • First parameter is always a
data.frame • Other parameters describe what you want to do with it. • Always returns a new data.frame
It's Fast
Let's see some code!
A great book I picked up at useR 2014
Questions or Comments?
[email protected]