Upgrade to Pro — share decks privately, control downloads, hide ads and more …

How to Become a Rockstar Data Scientist

How to Become a Rockstar Data Scientist

Originally presented at:

Big Data Week, Jakarta, Indonesia, 23 March 2017
http://jakarta.bigdataweek.com

D92714958de22e97c3c0461a3238a2c3?s=128

VeryFatBoy

March 23, 2017
Tweet

More Decks by VeryFatBoy

Other Decks in Technology

Transcript

  1. How to Become a Rockstar Data Scientist Akmal B. Chaudhri

  2. Welcome

  3. Thank you

  4. Introduction

  5. My background •  Wide-range of roles –  Blue Chip companies

    –  Big Data startups •  Wide-range of techs –  Database systems –  Programming languages •  Developer relations •  University relations •  10 books, many presentations
  6. None
  7. Cloud computing in agriculture

  8. Data Scientist has been named the sexiest job of the

    21st century, and are so in demand that there won’t be enough of them to fill every position by 2018, according to a report by McKinsey Global Institute. How to help fill this demand and become a Rockstar Data Scientist? Abstract
  9. None
  10. Sexy data scientist (for ladies)

  11. Sexy data scientist (for gents)

  12. What is a data scientist? Source: http://www.mobilefish.com/services/wanted_poster/wanted_poster.php

  13. Data science is not new Source: Rina Piccolo, used with

    permission •  https://www.rinapiccolo.com/piccolo- cartoons/
  14. Data scientist skills Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram/

  15. Modern data scientist Source: http://www.marketingdistillery.com

  16. Too much to know Hacking Skills Substantive Expertise Math &

    Statistics
  17. Super man/woman

  18. Role of a data scientist •  Data mining expert • 

    Statistics SME •  Trusted advisor •  Experiment designer •  Advanced analytics software expert
  19. 60% 19% 9% 5% 4% 3% Cleaning and organizing data

    Collec7ng data sets Mining data for pa<erns Other Refining algorithms Building training sets How a data scientist spends their day Source: http://visit.crowdflower.com/rs/416-ZBE-142/images/CrowdFlower_DataScienceReport_2016.pdf
  20. 57% 21% 10% 5% 4% 3% Cleaning and organizing data

    Collec7ng data sets Building training sets Other Refining algorithms Mining data for pa<erns What’s the least enjoyable part of data science? Source: http://visit.crowdflower.com/rs/416-ZBE-142/images/CrowdFlower_DataScienceReport_2016.pdf
  21. Jobs analysis

  22. Job postings Source: Indeed

  23. Jobseeker interest Source: Indeed

  24. Division of labour ... Source: Shutterstock Image ID 159183185 Data

    Scientist Data Engineer Data Analyst
  25. Division of labour •  Data scientist – Builds analytical models • 

    Data engineer – Infrastructure and tools •  Data analyst – Communicates business insights
  26. Job postings Source: Indeed

  27. Jobseeker interest Source: Indeed

  28. Data scientist jobs in the UK •  Top related IT

    skills –  Python (663) –  ML (605) –  R (504) –  Analytics (433) –  SQL (379) –  Mathematics (355) –  Big Data (348) –  Statistics (345) Source: http://www.itjobswatch.co.uk/jobs/uk/data scientist.do (18 March 2017)
  29. Source: Big Cloud

  30. Good data scientists are rare •  http://www.condenaststore.com/-sp/Your- three-o-clock-hallucination-is-here-New- Yorker-Cartoon-Prints_i8476186_.htm Source:

    After “Your three-o’clock hallucination is here.” by Michael Maslin for The New Yorker
  31. And everyone wants experience •  https://www.cartoonstock.com/ cartoonview.asp?catref=jmo0676 Source: “Impossible to

    Fill Vacancies” by John Morris
  32. Roadmap

  33. Data science roadmap ... Source: http://nirvacana.com/thoughts/becoming-a-data-scientist/

  34. Data science roadmap 1.  Fundamentals 2.  Statistics 3.  Programming 4. 

    Machine Learning 5.  Text Mining / NLP 6.  Data Visualization 7.  Big Data 8.  Data Ingestion 9.  Data Munging 10. Toolbox Source: http://nirvacana.com/thoughts/becoming-a-data-scientist/
  35. Best practices in data science •  Understand the business problem

    •  Define your methodology •  Know your data sources •  Prepare your data •  Understand the results
  36. Citizen data scientist A person who creates or generates models

    that leverage predictive or prescriptive analytics but whose primary job function is outside of the field of statistics. -- Gartner
  37. Why citizen data scientists? •  Skilled data scientists are rare

    – Specialized skills and business knowledge •  Growth in self-service data preparation – Visual exploration with immediate feedback •  Growth in advanced analytics platforms – IBM Watson
  38. None
  39. None
  40. Prepare for data science ... •  Spreadsheet software •  Python

    data science software – pandas, numpy, scipy •  SQL – Used for relational and non-relational •  Kaggle or equivalent – Get experience – Look at solutions
  41. Prepare for data science •  Local meetups and conferences • 

    Online courses – edX, Coursera, Udacity, ... – Audit the courses, gain knowledge •  Bootcamps •  Master’s or PhD – Many data scientists have advanced degrees •  Build your online portfolio
  42. Theory is not enough

  43. Resources

  44. Guides

  45. ASEAN research

  46. Indonesia •  Data Science Indonesia – http://datascience.or.id – https://www.facebook.com/datascienceindo/ – @DataScienceIndo

  47. Thank you

  48. Contact details

  49. Find me on – http://www.linkedin.com/in/akmalchaudhri/ – http://twitter.com/akmalchaudhri/ – http://www.quora.com/Akmal-Chaudhri/ – http://www.facebook.com/akmal.chaudhri/ – http://plus.google.com/+AkmalChaudhri/ – http://www.slideshare.net/VeryFatBoy/ – http://www.youtube.com/VeryFatBoyVideos/

  50. Akmal B. Chaudhri firstname.lastname@live.com