Slide 1

Slide 1 text

How to Become a Rockstar Data Scientist Akmal B. Chaudhri

Slide 2

Slide 2 text

Welcome

Slide 3

Slide 3 text

Thank you

Slide 4

Slide 4 text

Introduction

Slide 5

Slide 5 text

My background •  Wide-range of roles –  Blue Chip companies –  Big Data startups •  Wide-range of techs –  Database systems –  Programming languages •  Developer relations •  University relations •  10 books, many presentations

Slide 6

Slide 6 text

No content

Slide 7

Slide 7 text

Cloud computing in agriculture

Slide 8

Slide 8 text

Data Scientist has been named the sexiest job of the 21st century, and are so in demand that there won’t be enough of them to fill every position by 2018, according to a report by McKinsey Global Institute. How to help fill this demand and become a Rockstar Data Scientist? Abstract

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

Sexy data scientist (for ladies)

Slide 11

Slide 11 text

Sexy data scientist (for gents)

Slide 12

Slide 12 text

What is a data scientist? Source: http://www.mobilefish.com/services/wanted_poster/wanted_poster.php

Slide 13

Slide 13 text

Data science is not new Source: Rina Piccolo, used with permission •  https://www.rinapiccolo.com/piccolo- cartoons/

Slide 14

Slide 14 text

Data scientist skills Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram/

Slide 15

Slide 15 text

Modern data scientist Source: http://www.marketingdistillery.com

Slide 16

Slide 16 text

Too much to know Hacking Skills Substantive Expertise Math & Statistics

Slide 17

Slide 17 text

Super man/woman

Slide 18

Slide 18 text

Role of a data scientist •  Data mining expert •  Statistics SME •  Trusted advisor •  Experiment designer •  Advanced analytics software expert

Slide 19

Slide 19 text

60% 19% 9% 5% 4% 3% Cleaning and organizing data Collec7ng data sets Mining data for pa

Slide 20

Slide 20 text

57% 21% 10% 5% 4% 3% Cleaning and organizing data Collec7ng data sets Building training sets Other Refining algorithms Mining data for pa

Slide 21

Slide 21 text

Jobs analysis

Slide 22

Slide 22 text

Job postings Source: Indeed

Slide 23

Slide 23 text

Jobseeker interest Source: Indeed

Slide 24

Slide 24 text

Division of labour ... Source: Shutterstock Image ID 159183185 Data Scientist Data Engineer Data Analyst

Slide 25

Slide 25 text

Division of labour •  Data scientist – Builds analytical models •  Data engineer – Infrastructure and tools •  Data analyst – Communicates business insights

Slide 26

Slide 26 text

Job postings Source: Indeed

Slide 27

Slide 27 text

Jobseeker interest Source: Indeed

Slide 28

Slide 28 text

Data scientist jobs in the UK •  Top related IT skills –  Python (663) –  ML (605) –  R (504) –  Analytics (433) –  SQL (379) –  Mathematics (355) –  Big Data (348) –  Statistics (345) Source: http://www.itjobswatch.co.uk/jobs/uk/data scientist.do (18 March 2017)

Slide 29

Slide 29 text

Source: Big Cloud

Slide 30

Slide 30 text

Good data scientists are rare •  http://www.condenaststore.com/-sp/Your- three-o-clock-hallucination-is-here-New- Yorker-Cartoon-Prints_i8476186_.htm Source: After “Your three-o’clock hallucination is here.” by Michael Maslin for The New Yorker

Slide 31

Slide 31 text

And everyone wants experience •  https://www.cartoonstock.com/ cartoonview.asp?catref=jmo0676 Source: “Impossible to Fill Vacancies” by John Morris

Slide 32

Slide 32 text

Roadmap

Slide 33

Slide 33 text

Data science roadmap ... Source: http://nirvacana.com/thoughts/becoming-a-data-scientist/

Slide 34

Slide 34 text

Data science roadmap 1.  Fundamentals 2.  Statistics 3.  Programming 4.  Machine Learning 5.  Text Mining / NLP 6.  Data Visualization 7.  Big Data 8.  Data Ingestion 9.  Data Munging 10. Toolbox Source: http://nirvacana.com/thoughts/becoming-a-data-scientist/

Slide 35

Slide 35 text

Best practices in data science •  Understand the business problem •  Define your methodology •  Know your data sources •  Prepare your data •  Understand the results

Slide 36

Slide 36 text

Citizen data scientist A person who creates or generates models that leverage predictive or prescriptive analytics but whose primary job function is outside of the field of statistics. -- Gartner

Slide 37

Slide 37 text

Why citizen data scientists? •  Skilled data scientists are rare – Specialized skills and business knowledge •  Growth in self-service data preparation – Visual exploration with immediate feedback •  Growth in advanced analytics platforms – IBM Watson

Slide 38

Slide 38 text

No content

Slide 39

Slide 39 text

No content

Slide 40

Slide 40 text

Prepare for data science ... •  Spreadsheet software •  Python data science software – pandas, numpy, scipy •  SQL – Used for relational and non-relational •  Kaggle or equivalent – Get experience – Look at solutions

Slide 41

Slide 41 text

Prepare for data science •  Local meetups and conferences •  Online courses – edX, Coursera, Udacity, ... – Audit the courses, gain knowledge •  Bootcamps •  Master’s or PhD – Many data scientists have advanced degrees •  Build your online portfolio

Slide 42

Slide 42 text

Theory is not enough

Slide 43

Slide 43 text

Resources

Slide 44

Slide 44 text

Guides

Slide 45

Slide 45 text

ASEAN research

Slide 46

Slide 46 text

Indonesia •  Data Science Indonesia – http://datascience.or.id – https://www.facebook.com/datascienceindo/ – @DataScienceIndo

Slide 47

Slide 47 text

Thank you

Slide 48

Slide 48 text

Contact details

Slide 49

Slide 49 text

Find me on – http://www.linkedin.com/in/akmalchaudhri/ – http://twitter.com/akmalchaudhri/ – http://www.quora.com/Akmal-Chaudhri/ – http://www.facebook.com/akmal.chaudhri/ – http://plus.google.com/+AkmalChaudhri/ – http://www.slideshare.net/VeryFatBoy/ – http://www.youtube.com/VeryFatBoyVideos/

Slide 50

Slide 50 text

Akmal B. Chaudhri [email protected]