About dplyr • Developed by Hadley Wickham, Chief Scientist @ Rstudio. • Part of a suite of packages meant to facilitate working on the “data pipeline”.
Why? • People spend a lot of time getting data ready for analysis • Almost no learning curve (just need to learn 5 verbs) • Improves readability • It's FAST
No learning curve, how? • First parameter is always a data.frame • Other parameters describe what you want to do with it. • Always returns a new data.frame