Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Python for Data Science

Python for Data Science

Introduction to Data-Analytics and Visualization through python packages like NumPy, pandas, Matplotlib, Seaborn and Scikit-Learn.

Shadab Hussain

February 27, 2019
Tweet

More Decks by Shadab Hussain

Other Decks in Technology

Transcript

  1. Shadab Hussain, IOTShow 2019 D A T A S C

    I E N C E P Y T H O N F O R
  2. Intro Data Science Tools NumPy Visualization Python For Data Science,

    Shadab Hussain, IOTShow 2019 Q & A About Me Shadab Hussain Education, Training & Assessment Infosys Ltd. https://www.linkedin.com/in/shadabhussain96/ Background: • Computer Science Engineer, AKTU • Pursuing PG Diploma in Data Science, IIIT-B • Program Committee Member, Open Data Science Conference India • Organizer, Bangalore Data Science #ODSC • Organizer, PyData Mysore … using a diverse set of tools: SQL, Excel, R, Python, Tableau Pandas ML
  3. Python For Data Science, Shadab Hussain, IOTShow 2019 About this

    talk Objective: Introduction to Data-Analytics and Visualization through python packages like NumPy, pandas, Matplotlib, Seaborn and Scikit-Learn. Structure: • Data Science Tools • Python Basics • NumPy • Pandas • Matplotlib/Seaborn • Scikit-Learn Python For Data Science Intro Data Science Tools NumPy Visualization Q & A Pandas ML
  4. Python For Data Science, Shadab Hussain, IOTShow 2019 What’s a

    Data Scientist? Intro Data Science Tools NumPy Visualization Q & A Pandas ML
  5. Python For Data Science, Shadab Hussain, IOTShow 2019 What’s a

    Data Scientist? • Solid hands-on experience in developing analytical solutions using statistical tools • Experience in implementing Machine Learning systems which may include classification, clustering, natural language processing and time series analysis. • Hands-on experience in database management • Solid hands-on coding experience in Python, R, Julia or similar • Experience in dealing with large data sets and a solid understanding of Big Data technologies and applications • Sound presentation skills, visualizing complicated data science results in Tableau, or similar • Comfortable working with front-end development technologies, including: HTML, JavaScript, D3.js, Django, etc. Intro Data Science Tools NumPy Visualization Q & A Pandas ML
  6. Python For Data Science, Shadab Hussain, IOTShow 2019 “ At

    my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Intro Data Science Tools NumPy Visualization Q & A Pandas ML
  7. Python For Data Science, Shadab Hussain, IOTShow 2019 “ At

    my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Let’s make it easier for users to explore and extract useful insights out of data. Intro Data Science Tools NumPy Visualization Q & A Pandas ML
  8. Python For Data Science, Shadab Hussain, IOTShow 2019 “ At

    my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Let’s make it easier for users to explore and extract useful insights out of data. Anaconda Search and download popular Python/R packages Conda Package manager NumPy/Pandas Python Library for Data Analysis Matplotlib/Seaborn Python Library for Data Visualization Scikit-Learn Python Library for Machine Learning NumPy Visualization Q & A Pandas ML Data Science Tools Intro
  9. Python For Data Science, Shadab Hussain, IOTShow 2019 Downloading and

    Installing Anaconda NumPy Visualization Q & A Pandas ML Data Science Tools Intro
  10. Python For Data Science, Shadab Hussain, IOTShow 2019 Downloading and

    Installing Anaconda NumPy Visualization Q & A Pandas ML Data Science Tools Intro
  11. Python For Data Science, Shadab Hussain, IOTShow 2019 Downloading Packages

    ! pip install <package-name> eg. ! pip install folium NumPy Visualization Q & A Pandas ML Data Science Tools Intro
  12. Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science

    Lifecycle NumPy Visualization Q & A Pandas ML Data Science Tools Intro Image Source- internet
  13. Python For Data Science, Shadab Hussain, IOTShow 2019 • Basics

    of NumPy Arrays • Mathematical operations in NumPy • NumPy Array manipulation • NumPy Array broadcasting Data Science Tools NumPy Visualization Q & A Pandas ML Intro
  14. Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science

    Tools NumPy Visualization Q & A Pandas ML Intro • Data Structures in Pandas-Series and DataFrames • Data cleaning in Pandas • Data manipulation in Pandas • Handling missing values in datasets
  15. Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science

    Tools NumPy Visualization Q & A Pandas ML Intro • Plotting basic charts in Python • Data visualization with Matplotlib • Statistical data visualization with Seaborn
  16. Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science

    Tools NumPy Visualization Q & A Pandas ML Intro • What is Machine Learning? • Use Cases of Machine Learning • Types of Machine Learning - Supervised to Unsupervised methods • Machine Learning workflow
  17. Python For Data Science, Shadab Hussain, IOTShow 2019 Thank You

     Intro Data Science Tools NumPy Visualization Q & A Pandas ML