Slide 1

Slide 1 text

Shadab Hussain, IOTShow 2019 D A T A S C I E N C E P Y T H O N F O R

Slide 2

Slide 2 text

Intro Data Science Tools NumPy Visualization Python For Data Science, Shadab Hussain, IOTShow 2019 Q & A About Me Shadab Hussain Education, Training & Assessment Infosys Ltd. https://www.linkedin.com/in/shadabhussain96/ Background: • Computer Science Engineer, AKTU • Pursuing PG Diploma in Data Science, IIIT-B • Program Committee Member, Open Data Science Conference India • Organizer, Bangalore Data Science #ODSC • Organizer, PyData Mysore … using a diverse set of tools: SQL, Excel, R, Python, Tableau Pandas ML

Slide 3

Slide 3 text

Python For Data Science, Shadab Hussain, IOTShow 2019 About this talk Objective: Introduction to Data-Analytics and Visualization through python packages like NumPy, pandas, Matplotlib, Seaborn and Scikit-Learn. Structure: • Data Science Tools • Python Basics • NumPy • Pandas • Matplotlib/Seaborn • Scikit-Learn Python For Data Science Intro Data Science Tools NumPy Visualization Q & A Pandas ML

Slide 4

Slide 4 text

Python For Data Science, Shadab Hussain, IOTShow 2019 What’s a Data Scientist? Intro Data Science Tools NumPy Visualization Q & A Pandas ML

Slide 5

Slide 5 text

Python For Data Science, Shadab Hussain, IOTShow 2019 What’s a Data Scientist? • Solid hands-on experience in developing analytical solutions using statistical tools • Experience in implementing Machine Learning systems which may include classification, clustering, natural language processing and time series analysis. • Hands-on experience in database management • Solid hands-on coding experience in Python, R, Julia or similar • Experience in dealing with large data sets and a solid understanding of Big Data technologies and applications • Sound presentation skills, visualizing complicated data science results in Tableau, or similar • Comfortable working with front-end development technologies, including: HTML, JavaScript, D3.js, Django, etc. Intro Data Science Tools NumPy Visualization Q & A Pandas ML

Slide 6

Slide 6 text

Python For Data Science, Shadab Hussain, IOTShow 2019 “ At my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Intro Data Science Tools NumPy Visualization Q & A Pandas ML

Slide 7

Slide 7 text

Python For Data Science, Shadab Hussain, IOTShow 2019 “ At my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Let’s make it easier for users to explore and extract useful insights out of data. Intro Data Science Tools NumPy Visualization Q & A Pandas ML

Slide 8

Slide 8 text

Python For Data Science, Shadab Hussain, IOTShow 2019 “ At my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Let’s make it easier for users to explore and extract useful insights out of data. Anaconda Search and download popular Python/R packages Conda Package manager NumPy/Pandas Python Library for Data Analysis Matplotlib/Seaborn Python Library for Data Visualization Scikit-Learn Python Library for Machine Learning NumPy Visualization Q & A Pandas ML Data Science Tools Intro

Slide 9

Slide 9 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Downloading and Installing Anaconda NumPy Visualization Q & A Pandas ML Data Science Tools Intro

Slide 10

Slide 10 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Downloading and Installing Anaconda NumPy Visualization Q & A Pandas ML Data Science Tools Intro

Slide 11

Slide 11 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Downloading Packages ! pip install eg. ! pip install folium NumPy Visualization Q & A Pandas ML Data Science Tools Intro

Slide 12

Slide 12 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science Lifecycle NumPy Visualization Q & A Pandas ML Data Science Tools Intro Image Source- internet

Slide 13

Slide 13 text

Python For Data Science, Shadab Hussain, IOTShow 2019 • Basics of NumPy Arrays • Mathematical operations in NumPy • NumPy Array manipulation • NumPy Array broadcasting Data Science Tools NumPy Visualization Q & A Pandas ML Intro

Slide 14

Slide 14 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science Tools NumPy Visualization Q & A Pandas ML Intro • Data Structures in Pandas-Series and DataFrames • Data cleaning in Pandas • Data manipulation in Pandas • Handling missing values in datasets

Slide 15

Slide 15 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science Tools NumPy Visualization Q & A Pandas ML Intro • Plotting basic charts in Python • Data visualization with Matplotlib • Statistical data visualization with Seaborn

Slide 16

Slide 16 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Data Science Tools NumPy Visualization Q & A Pandas ML Intro • What is Machine Learning? • Use Cases of Machine Learning • Types of Machine Learning - Supervised to Unsupervised methods • Machine Learning workflow

Slide 17

Slide 17 text

Python For Data Science, Shadab Hussain, IOTShow 2019 Thank You  Intro Data Science Tools NumPy Visualization Q & A Pandas ML