Shadab Hussain, IOTShow 2019 Q & A About Me Shadab Hussain Education, Training & Assessment Infosys Ltd. https://www.linkedin.com/in/shadabhussain96/ Background: • Computer Science Engineer, AKTU • Pursuing PG Diploma in Data Science, IIIT-B • Program Committee Member, Open Data Science Conference India • Organizer, Bangalore Data Science #ODSC • Organizer, PyData Mysore … using a diverse set of tools: SQL, Excel, R, Python, Tableau Pandas ML
talk Objective: Introduction to Data-Analytics and Visualization through python packages like NumPy, pandas, Matplotlib, Seaborn and Scikit-Learn. Structure: • Data Science Tools • Python Basics • NumPy • Pandas • Matplotlib/Seaborn • Scikit-Learn Python For Data Science Intro Data Science Tools NumPy Visualization Q & A Pandas ML
Data Scientist? • Solid hands-on experience in developing analytical solutions using statistical tools • Experience in implementing Machine Learning systems which may include classification, clustering, natural language processing and time series analysis. • Hands-on experience in database management • Solid hands-on coding experience in Python, R, Julia or similar • Experience in dealing with large data sets and a solid understanding of Big Data technologies and applications • Sound presentation skills, visualizing complicated data science results in Tableau, or similar • Comfortable working with front-end development technologies, including: HTML, JavaScript, D3.js, Django, etc. Intro Data Science Tools NumPy Visualization Q & A Pandas ML
my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Intro Data Science Tools NumPy Visualization Q & A Pandas ML
my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Let’s make it easier for users to explore and extract useful insights out of data. Intro Data Science Tools NumPy Visualization Q & A Pandas ML
my company X, we have peta/terabytes of data, just lying around, waiting for someone to explore it” - someone at some conference Let’s make it easier for users to explore and extract useful insights out of data. Anaconda Search and download popular Python/R packages Conda Package manager NumPy/Pandas Python Library for Data Analysis Matplotlib/Seaborn Python Library for Data Visualization Scikit-Learn Python Library for Machine Learning NumPy Visualization Q & A Pandas ML Data Science Tools Intro
Tools NumPy Visualization Q & A Pandas ML Intro • Data Structures in Pandas-Series and DataFrames • Data cleaning in Pandas • Data manipulation in Pandas • Handling missing values in datasets
Tools NumPy Visualization Q & A Pandas ML Intro • Plotting basic charts in Python • Data visualization with Matplotlib • Statistical data visualization with Seaborn
Tools NumPy Visualization Q & A Pandas ML Intro • What is Machine Learning? • Use Cases of Machine Learning • Types of Machine Learning - Supervised to Unsupervised methods • Machine Learning workflow