Slide 1

Slide 1 text

Build a Career in Data Science Emily Robinson @robinson_es

Slide 2

Slide 2 text

Build a Career in Data Science 40% off with code ctwdsgo20 at datascicareer.com (code is good for 40% off everything on Manning) Build a Career in Data Science teaches you what school leaves out, from how to land you first job, to the lifecycle of a data science project, and even how to become a manager.

Slide 3

Slide 3 text

Build a Career in Data Science

Slide 4

Slide 4 text

What is Data Science?

Slide 5

Slide 5 text

One definition https://hackernoon.com/what-on-earth-is-data-science-eb1237d8cb37, Cassie Kozyrkov Data science is the discipline of making data useful

Slide 6

Slide 6 text

Classic data science venn diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram

Slide 7

Slide 7 text

Our (slightly updated) version

Slide 8

Slide 8 text

Programming: What you need to know OR

Slide 9

Slide 9 text

Mathematics & statistics 1. What techniques exists • I need to group customers together -> I should try clustering 2. How to apply them • How to do a k-means clustering in R/Python 3. How to choose which to try • What clustering method will work best?

Slide 10

Slide 10 text

How can we split our customers into different groups to market to? How can we run a clustering algorithm to segment customer data? Business question Data science question A k-means clustering found 3 distinct groups Data science answer Business answer Here are 3 types of customers: new, high spending, commercial Domain knowledge - Renee Teate, @BecomingDataSci Skills: • Communication • Empathy • Understanding your data (where it lives, built-in assumptions, edge cases)

Slide 11

Slide 11 text

How Do I Become a Data Scientist?

Slide 12

Slide 12 text

“Fake data scientists”

Slide 13

Slide 13 text

“Fake data scientists”

Slide 14

Slide 14 text

“Must know” lists

Slide 15

Slide 15 text

“Must know” lists

Slide 16

Slide 16 text

You don’t need to know everything

Slide 17

Slide 17 text

How to find a data science job Create a portfolio Expand your network Find the right jobs

Slide 18

Slide 18 text

Creating a Portfolio

Slide 19

Slide 19 text

What is a portfolio? A public body of work that illustrates your data science skills

Slide 20

Slide 20 text

How?

Slide 21

Slide 21 text

Dataset -> Question

Slide 22

Slide 22 text

Dataset -> Question

Slide 23

Slide 23 text

Question -> Dataset http://varianceexplained.org/r/trump-tweets/

Slide 24

Slide 24 text

Tip 1: Include visualizations https://hackernoon.com/more-than-a-million-pro-repeal-net-neutrality-comments-were-likely-faked-e9f0e3ed36a6

Slide 25

Slide 25 text

Tip 2: choose a topic you’re excited about https://masalmon.eu/2018/01/01/sortinghat/

Slide 26

Slide 26 text

Tip 3: Limit your scope https://kkulma.github.io/2017-08-13-friendships-among-top-r-twitterers/

Slide 27

Slide 27 text

Making progress Inspired by bit.ly/drob-rstudio-2019 Less valuable More valuable Idea Getting data Cleaning Exploratory Final result Modeling Less valuable More valuable Work only on your computer Work online (GitHub, Blog, Kaggle) How I used to think about analyses How I think about analyses now

Slide 28

Slide 28 text

The full process

Slide 29

Slide 29 text

Put it on GitHub

Slide 30

Slide 30 text

Where? ➔ Easy & quick to set up ➔ Organic traffic (medium) ➔ Less customizability/control

Slide 31

Slide 31 text

Where? ➔ Complete control ➔ Always free ➔ Little longer to set-up ➔ May get stuck debugging issues

Slide 32

Slide 32 text

Explain your analysis https://theambitiouseconomist.com/an-analysis-of-the-gender-wage-gap-in-australia/

Slide 33

Slide 33 text

Teach a concept https://juliasilge.com/blog/stack-overflow-pca/

Slide 34

Slide 34 text

Share your experience https://d4tagirl.com/2018/08/rstudio-conf-diversity-scholarships-for-the-win

Slide 35

Slide 35 text

Give advice www.rladiesnyc.org/post/2019-nyr-conference-tips/ towardsdatascience.com/prioritizing-data-science-work-936b3765fd45

Slide 36

Slide 36 text

Expanding your Network

Slide 37

Slide 37 text

How people get data science jobs http://www.rctatman.com/files/Tatman_2018_DataSciencePortfolios_DC.pdf

Slide 38

Slide 38 text

Meetups – search on meetup.com

Slide 39

Slide 39 text

Twitter

Slide 40

Slide 40 text

Ask for help (use hashtags)

Slide 41

Slide 41 text

Live tweet talks

Slide 42

Slide 42 text

Share your work

Slide 43

Slide 43 text

Share other people’s work

Slide 44

Slide 44 text

Reaching out • Mentions their work • Offers a topic • Suggests a specific time (that’s limited) https://medium.com/@treycausey/do-you-have-time-for-a-quick-chat-c3f7e46de89d

Slide 45

Slide 45 text

Reaching out Check out https://www.datahelpers.org/ by Angela Bassa

Slide 46

Slide 46 text

Reaching out

Slide 47

Slide 47 text

Finding the right job

Slide 48

Slide 48 text

Let go of the “data scientist” title “Think about how attached you are to the data scientist title. If you decide to not concern yourself with what you’re called and to instead focus on the work that you’re doing, you’ll have a lot more flexibility to find jobs.” - Jesse Mostipak, Community Advocate at Kaggle, Chapter 5

Slide 49

Slide 49 text

Figure out your specialty https://www.linkedin.com/pulse/one-data-science-job-doesnt-fit-all-elena-grewal/

Slide 50

Slide 50 text

You don’t need to meet all of the “requirements”

Slide 51

Slide 51 text

Consider the type of company matters Criteria Massive tech Retailer Startup Mid tech Government Contractor Bureaucracy Freedom Salary Job security Chances to learn Chapter 2, How data science works at different companies

Slide 52

Slide 52 text

Conclusion

Slide 53

Slide 53 text

Take-away points • You don’t need to know everything • There’s no such thing as a “fake data scientist” • Let go of the “data scientist” title • Focus on: • Creating a portfolio • Expanding your network • Finding the right job

Slide 54

Slide 54 text

Thank you! hookedondata.org @robinson_es datascicareer.com 40% off w/ code ctwdsgo20