Upgrade to Pro — share decks privately, control downloads, hide ads and more …

#datacareer - Analyst? Engineer? Scientist? Roles in industry and startups with lead data scientist Alexey Grigorev

#datacareer - Analyst? Engineer? Scientist? Roles in industry and startups with lead data scientist Alexey Grigorev

“The problem for AI in Europe is not the money, it is finding the talent” (Leading European AI practitioner)

Data and Artificial Intelligence constitute the fastest-growing job market for the highly qualified. This workshop offers you the following:

- How to find the right role for you among the emerging specialized roles in e.g. data engineering, data analytics, data science, machine learning, and deep learning.
- Pragmatic advice on handling your CV and skills profile for your next role.
- Orientation on the labour market, what employers miss most, and which #aiusecase are winning.

Dânia Meira

October 26, 2020
Tweet

More Decks by Dânia Meira

Other Decks in Technology

Transcript

  1. #DATACAREER “No matter who you are, self-improvement is one of

    the most important and most overlooked attributes of young AI talent. It only takes four years of experience to become a senior AI researcher, or five years of experience to lead an entire institute. The determination and discipline to improve both the hard and soft skills continually will be the deciding factor in an AI researcher’s career.” Jean-François Gagné
  2. DÂNIA MEIRA Founding member, AI Guild ML models for predictive

    analytics Former bootcamp teacher #datacareer since 2012 LinkedIn
  3. ALEXEY GRIGOREV Software Engineering, 10 years of experience Machine Learning

    and Data Science, 6 years of experience Author of the book “Machine Learning Bookcamp” LinkedIn Twitter
  4. CHRIS ARMBRUSTER Founding member, AI Guild 10,000 Data Scientists for

    Europe Former bootcamp director #datacareer coaching since 2017 LinkedIn
  5. INSIGHTS FROM CAREER COACHING Search for the 1st as well

    as the 2nd role may take >6 months Upgrading inside a company may be easier Job advertisements may be misleading and confusing The role ‘in real life’ may not match the talents expectations
  6. OBSERVING THE MARKET Specialization and differentiation of roles Rising value

    of domain expertise Experimental phase with PoC plays ending Increasing focus on deployment
  7. PRODUCTIONIZING MACHINE LEARNING ML Models Data Collection Data Quality Infrastructure

    Process Management Tools Monitoring Feature Extraction Analysis Data Preprocessing Parameter Configuration Offline Validation A/B Testing Data Engineer Data Scientist Data Analyst ML Engineer AI Researcher #dataroles See also: “Hidden Technical Debt in Machine Learning System” by Sculley et al, Google inc, 2015 Machine Resource Management Configuration Business Logic
  8. #DATAROLES Task Understand business case, build features to train predictive

    models to address such use cases Skill Statistics, SQL, programming (e.g. python, R), ML & DL techniques. Data Scientist Task Business and data understanding to report on what happens Skill Descriptive analytics, SQL, statistics, dashboarding and visualization tools Data Analyst Data Engineer Task Build and maintain infrastructure and pipeline to collect, clean and pre-process data Skill Distributed systems, databases, software engineering Task Optimize, deploy and maintain machine learning models in production Skill Software engineering, devOps and systems architecture Machine Learning Engineer Task Build new machine learning algorithms, find custom scientific solutions Skill Research, presenting at conferences, writing publications AI Researcher
  9. ‚COOKING‘ DATA: EXPLAINING SPECIALIZATION ML Models Data Collection Data Quality

    Infrastructure Process Management Tools Machine Resource Management Monitoring Configuration Feature Extraction Analysis Data Preprocessing Parameter Configuration Offline Validation Business Logic A/B Testing See also: Understanding a Machine Learning Workflow Through Food by Daniel Godoy Sowing Harvesting Choose recipe Prepare ingredients Customers tasting Kitchen Tasting Use utensils Try combinations of appliances and recipes Kitchen space and available appliances
  10. UNDERSTANDING #DATAROLES Build Kitchen Appliances Create and use recipes to

    cook Check quality of ingredients and recipes Process ingredients at scale Turn a recipe into many dishes served efficiently Data Engineer Data Scientist Data Analyst ML Engineer AI Researcher
  11. D e e p SKILLS SETS FOR DATA ROLES Data

    Engineer Data Scientist Data Analyst ML Engineer AI Researcher #dataroles Cross-discipline ML Algorithms Visualization Domain Expertise Programming SW Engineering Communication Tools Platforms Statistics
  12. Data Analyst SQL R, Python Descriptive statistics Hypothesis testing Probability

    distributions Regression Excel Tableau Data interpretation Logical approach Marketing Healthcare E-commerce Mobility Manufacturing ... Cross-discipline ML Algorithms Visualization Domain Expertise Programming SW Engineering Communication Tools Platforms Statistics
  13. Data Scientist Jupyter Lab, RStudio Git, Docker, Airflow, Jenkins SQL,

    R, Python Python: pandas, scikit-learn R: dplyr, forecast Regression Classification Clustering Deep Learning Marketing Healthcare E-commerce Mobility Manufacturing ... Cross-discipline ML Algorithms Visualization Domain Expertise Programming SW Engineering Communication Tools Platforms Statistics Data interpretation Logical approach
  14. Hadoop: Hive, Pig, Spark Databases Git, Docker, Airflow, Jenkins SQL,

    Bash, Java, Scala, Python Data pipelines Data structures Linux, AWS, Google Cloud Platform, Microsoft Azure Cross-discipline ML Algorithms Visualization Domain Expertise Programming SW Engineering Communication Tools Platforms Statistics Data Engineer
  15. ML Engineer Hadoop: Hive, Pig, Spark Git, Docker, Airflow, Jenkins

    SQL, Bash, Java, Scala, Python sk-learn Linux, AWS, Google Cloud Platform, Microsoft Azure Microservices Infrastructure Cross-discipline ML Algorithms Visualization Domain Expertise Programming SW Engineering Communication Tools Platforms Statistics
  16. ALEXEY ON TRANSITIONING FROM SOFTWARE ENGINEERING TO DATA SCIENTIST Online

    courses & books Project-based learning Collaboration Real life experience
  17. KEY INDUSTRY CHALLENGES* ◼ Data volume, accessibility, and quality ◼

    Trust of customers, stakeholders, and employees, including governance, compliance, and reputation ◼ Competence of employees, management, and company *Based on the 2019 PWC report “Künstliche Intelligenz in Unternehmen”, p. 12
  18. SOME STARTUP CHALLENGES • Data volume, accessibility, and quality •

    Company funding and runway • Expertise levels and team size
  19. WRAPPING UP Keep observing the market Look for matches between

    employers’ needs and your skills profile Scan the industry and startups for the most promising #aiusecase