Upgrade to Pro — share decks privately, control downloads, hide ads and more …

How to Become a Rockstar Data Scientist

How to Become a Rockstar Data Scientist

Originally presented at:

Big Data Week, Jakarta, Indonesia, 23 March 2017
http://jakarta.bigdataweek.com

VeryFatBoy

March 23, 2017
Tweet

More Decks by VeryFatBoy

Other Decks in Technology

Transcript

  1. My background •  Wide-range of roles –  Blue Chip companies

    –  Big Data startups •  Wide-range of techs –  Database systems –  Programming languages •  Developer relations •  University relations •  10 books, many presentations
  2. Data Scientist has been named the sexiest job of the

    21st century, and are so in demand that there won’t be enough of them to fill every position by 2018, according to a report by McKinsey Global Institute. How to help fill this demand and become a Rockstar Data Scientist? Abstract
  3. Data science is not new Source: Rina Piccolo, used with

    permission •  https://www.rinapiccolo.com/piccolo- cartoons/
  4. Role of a data scientist •  Data mining expert • 

    Statistics SME •  Trusted advisor •  Experiment designer •  Advanced analytics software expert
  5. 60% 19% 9% 5% 4% 3% Cleaning and organizing data

    Collec7ng data sets Mining data for pa<erns Other Refining algorithms Building training sets How a data scientist spends their day Source: http://visit.crowdflower.com/rs/416-ZBE-142/images/CrowdFlower_DataScienceReport_2016.pdf
  6. 57% 21% 10% 5% 4% 3% Cleaning and organizing data

    Collec7ng data sets Building training sets Other Refining algorithms Mining data for pa<erns What’s the least enjoyable part of data science? Source: http://visit.crowdflower.com/rs/416-ZBE-142/images/CrowdFlower_DataScienceReport_2016.pdf
  7. Division of labour •  Data scientist – Builds analytical models • 

    Data engineer – Infrastructure and tools •  Data analyst – Communicates business insights
  8. Data scientist jobs in the UK •  Top related IT

    skills –  Python (663) –  ML (605) –  R (504) –  Analytics (433) –  SQL (379) –  Mathematics (355) –  Big Data (348) –  Statistics (345) Source: http://www.itjobswatch.co.uk/jobs/uk/data scientist.do (18 March 2017)
  9. Data science roadmap 1.  Fundamentals 2.  Statistics 3.  Programming 4. 

    Machine Learning 5.  Text Mining / NLP 6.  Data Visualization 7.  Big Data 8.  Data Ingestion 9.  Data Munging 10. Toolbox Source: http://nirvacana.com/thoughts/becoming-a-data-scientist/
  10. Best practices in data science •  Understand the business problem

    •  Define your methodology •  Know your data sources •  Prepare your data •  Understand the results
  11. Citizen data scientist A person who creates or generates models

    that leverage predictive or prescriptive analytics but whose primary job function is outside of the field of statistics. -- Gartner
  12. Why citizen data scientists? •  Skilled data scientists are rare

    – Specialized skills and business knowledge •  Growth in self-service data preparation – Visual exploration with immediate feedback •  Growth in advanced analytics platforms – IBM Watson
  13. Prepare for data science ... •  Spreadsheet software •  Python

    data science software – pandas, numpy, scipy •  SQL – Used for relational and non-relational •  Kaggle or equivalent – Get experience – Look at solutions
  14. Prepare for data science •  Local meetups and conferences • 

    Online courses – edX, Coursera, Udacity, ... – Audit the courses, gain knowledge •  Bootcamps •  Master’s or PhD – Many data scientists have advanced degrees •  Build your online portfolio