Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Intro to Data Science and Preparation

Intro to Data Science and Preparation

Bethany Jepchumba

October 19, 2022
Tweet

More Decks by Bethany Jepchumba

Other Decks in Technology

Transcript

  1. Introduction to Data Science  as a scientific field that

    uses scientific methods to extract knowledge and insights from structured and unstructured data and apply knowledge and actionable insights from data across a broad range of application domains.  The main goal of data science is to extract knowledgefrom data, in other words - to understand data, find some hidden relationships and build a model.
  2. Related fields Databases: data storage Big Data: large data quantities

    Machine Learning: developing models from existing data Artificial Intelligence: building complex models that mimic human beings Visualization: visualizing information to make sense of data
  3. Elements of Data Science Data acquisition: collecting data Data Storage:

    how to store and query data for future Data Processing: converting data for use Visualization: helps in understanding data and gaining insights Modelling: creating machine learning models
  4. Libraries: Pandas: manipulate data in dataframes. Numpy: numerical python to

    work with multidimensional arrays. Matplotlib and Seaborn: used for vizualization and plotting graphs