Lux Tech & DSEA Python For Data Science

Slides by Victor Omondi for Lux Tech and Data Science East Africa: Python for Data Science, with real examples.

VICTOR OMONDI

April 16, 2022

Transcript

  1. Vick 2022

    01 Intro: intro to data science and Python.
    02 Application areas: where data science has been used and the impact it has had.
    03 DS Tools: tools and libraries used by data scientists.
    04 DS Curriculum: how to learn data science effectively, with a list of important resources that can help you.
  2. ABOUT VICK

    Data Analyst @ Maisha Meds and Volunteer @ #PyConKE & DataQuest
  3. Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from noisy, structured and unstructured data, and applies that knowledge and those actionable insights across a broad range of application domains.
  4. MATHS: research, probability, statistics, and descriptive analysis such as mean, median and standard deviation (SD).

    CS: machine learning and software development (programming languages).
    DOMAIN: business knowledge, i.e. being an expert in a particular field, e.g. health or finance.
    Others: design and soft skills, e.g. communication.
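The descriptive measures named on this slide (mean, median, SD) can be computed with Python's standard `statistics` module. This is a minimal sketch; the `daily_sales` numbers are made up for illustration and do not come from the talk.

```python
import statistics

# Hypothetical sample: units sold per day (made-up numbers, not from the slides).
daily_sales = [12, 15, 11, 20, 15, 18, 14]

mean_sales = statistics.mean(daily_sales)      # arithmetic average
median_sales = statistics.median(daily_sales)  # middle value once the data is sorted
sd_sales = statistics.stdev(daily_sales)       # sample standard deviation (divides by n - 1)

print(f"mean={mean_sales:.2f} median={median_sales} sd={sd_sales:.2f}")
```

`statistics.stdev` treats the list as a sample; `statistics.pstdev` would be the choice if the list were the whole population.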
  5. TYPES OF DATA SCIENCE

    • Business Intelligence
    • Machine Learning
    • Decision Science
  6. Business Intelligence

    • Taking the data the company already has
    • Getting the data to the right people
    • In the form of dashboards, reports or emails
  7. Machine Learning

    • Predictive analytics
    • Supervised (SL), unsupervised (UL) and reinforcement learning (RL)
    • Put models continuously into production
  8. Decision Making

    • Prescriptive analytics
    • Take the insights discovered in the data science work
    • Use them to support the company's decision making
    • Example question: how likely is a customer to take this product?
  9. DATA SCIENCE BIG PICTURE

    Data Engineer: data collection, data pipelines and storage. Python as an automation tool (the Prefect library).
    Data Scientist: experimentation and prediction. Performing A/B tests and machine learning. Sklearn, scipy, Tensorflow.
    Data Analyst: preparation, exploration & visualization. Using Python in data cleaning and EDA. Pandas, matplotlib & numpy.
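As an illustration of the A/B testing mentioned on this slide, here is a minimal sketch of a two-proportion z-test using only the standard library. The function name `two_proportion_z_test` and the conversion counts are assumptions made for this example; the slides name scipy for this kind of work, and this sketch does not use scipy's API.

```python
import math

def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for the difference in conversion rates."""
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both variants convert equally.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution, via erfc.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

# Hypothetical experiment: variant A converts 120 of 1000, variant B 150 of 1000.
z, p = two_proportion_z_test(conv_a=120, n_a=1000, conv_b=150, n_b=1000)
print(f"z = {z:.3f}, p = {p:.4f}")
```

A small p-value suggests the difference between the variants is unlikely to be due to chance alone; the threshold to act on is a business decision made before the test runs.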
  10. Tools for data science with Python

    Getting the data: data collection and data cleaning: pandas, numpy, thefuzz, …
    Modelling: building models (statistical inference and machine learning): scipy, scikit-learn
    Business: decisions made …
    Visualization: building visualizations and reports: plotly, matplotlib, dash, bokeh, seaborn.
    Communication: communicate results to stakeholders: Jupyter Notebook, IPython
    Repeat: monitoring and evaluation of models, adding more data, etc.
  11. WHAT DO WE DO?

    • Drug list data cleaning (string matching) and normal day-to-day EDA using Jupyter Notebook, Python, pandas & thefuzz.
    • Product A/B testing and customer retention predictions.
    • Building credit scoring models using scikit-learn & Python.
    • Movie recommendation based on how you watch (YouTube too).
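The drug-list string matching described on this slide can be sketched as follows. The slide names thefuzz for this job; to keep the example dependency-free, this sketch swaps in `difflib` from the standard library, whose similarity score is not the same as thefuzz's scorers. The drug names, the `match_drug_name` helper and the `cutoff` value are all hypothetical.

```python
import difflib

# Hypothetical reference list of clean drug names (illustrative only).
CANONICAL_DRUGS = ["Amoxicillin", "Paracetamol", "Ibuprofen", "Metformin"]

def match_drug_name(raw_name, choices=CANONICAL_DRUGS, cutoff=0.8):
    """Return the closest canonical name, or None if nothing is similar enough."""
    # Compare in lower case so that capitalisation does not affect the score.
    lookup = {choice.lower(): choice for choice in choices}
    cleaned = raw_name.strip().lower()
    hits = difflib.get_close_matches(cleaned, list(lookup), n=1, cutoff=cutoff)
    return lookup[hits[0]] if hits else None

# Messy entries as they might appear in a hand-typed list (made-up examples).
messy = ["amoxicilin ", "PARACETAMOL", "ibuprofin", "aspirin"]
matches = {name: match_drug_name(name) for name in messy}
for raw, clean in matches.items():
    print(f"{raw!r} -> {clean}")
```

Entries with no sufficiently close match come back as `None`, so they can be reviewed by hand rather than silently mapped to the wrong drug.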
  12. HOW DO WE DO IT?

    BUILD: build something amazing, an innovative solution (like crop disease prediction).
    LEARN: learn by taking a bootcamp or a course.
    SHARE: write blogs and give talks to share your learnings.
  13. Tools

    01 Jupyter Notebook, Python basics (variables, lists, loops, dictionaries, strings and string methods)
    02 OOP with Python, numpy, pandas, matplotlib
    03 Seaborn, scikit-learn
    04 Take part in Zindi/Kaggle, continue learning.
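The Python basics listed on this slide (variables, lists, loops, dictionaries, strings and string methods) fit in a few lines. The values below are illustrative and not taken from the talk.

```python
# Variables and string methods
course = "python for data science"
title = course.title()  # capitalise the first letter of each word

# Lists and loops: collect the tools whose names have six letters or fewer
tools = ["numpy", "pandas", "matplotlib", "seaborn", "scikit-learn"]
short_names = []
for tool in tools:
    if len(tool) <= 6:
        short_names.append(tool.upper())

# Dictionaries: map each tool to what it is mainly used for
tool_purpose = {"numpy": "arrays", "pandas": "tables", "matplotlib": "plots"}
for name, purpose in tool_purpose.items():
    print(f"{name} is used for {purpose}")

print(title, short_names)
```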
  14. Python for Data Science Beginner Road Map

    Business understanding: understand the goal of the project and the question to be answered.
    Data understanding: explore the data to know what it looks like.
    Data preparation: clean and standardize your data.
    Modeling: train your model, know the predictor…
    Evaluation: perform hyperparameter tuning and monitor how your model performs on new data.
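The modeling and evaluation steps of this road map (train a model, tune a hyperparameter, check performance on new data) can be sketched without any third-party library. This is a toy k-nearest-neighbours classifier on made-up data, written under the assumption that a held-out validation split is used for tuning. It stands in for the scikit-learn workflow the deck refers to and does not use scikit-learn's API; every name in it is hypothetical.

```python
import random
from collections import Counter

def make_data(n_per_class, rng):
    """Two made-up 2-D clusters: class 0 near (0, 0), class 1 near (3, 3)."""
    rows = []
    for label, centre in ((0, 0.0), (1, 3.0)):
        for _ in range(n_per_class):
            point = (rng.gauss(centre, 1.0), rng.gauss(centre, 1.0))
            rows.append((point, label))
    rng.shuffle(rows)
    return rows

def knn_predict(train, point, k):
    """Majority label among the k training points nearest to `point`."""
    nearest = sorted(
        train,
        key=lambda row: (row[0][0] - point[0]) ** 2 + (row[0][1] - point[1]) ** 2,
    )[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def accuracy(train, rows, k):
    """Share of `rows` whose label is predicted correctly from `train`."""
    correct = sum(knn_predict(train, x, k) == y for x, y in rows)
    return correct / len(rows)

rng = random.Random(0)  # fixed seed so the run is repeatable
data = make_data(100, rng)
train, valid, test = data[:120], data[120:160], data[160:]

# Hyperparameter tuning: choose k using the validation split only.
candidates = [1, 3, 5, 7]  # odd values avoid ties between the two classes
best_k = max(candidates, key=lambda k: accuracy(train, valid, k))

# Final check on data that played no part in training or tuning.
test_accuracy = accuracy(train, test, best_k)
print(f"best k = {best_k}, test accuracy = {test_accuracy:.2f}")
```

Keeping the test split out of the tuning step is what makes the final number an honest estimate of performance on new data.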
  15. JOBS

    • Maisha Meds research analyst: https://maishameds.bamboohr.com/jobs/

    LEARNING RESOURCES

    • DataQuest Resources: https://www.dataquest.io/data-science-resources/
    • DataCamp Resources: https://www.datacamp.com/resources
    • DataCamp scholarship: https://ingressive.org/i4g-datacamp-application/
    • DataQuest
    • Udemy
    • Coursera