Upgrade to Pro — share decks privately, control downloads, hide ads and more …

A Visual Introduction to Machine Learning - Vlad Golubev

A Visual Introduction to Machine Learning - Vlad Golubev

A Visual Introduction to Machine Learning - Vlad Golubev

GDG Ternopil

March 16, 2016
Tweet

More Decks by GDG Ternopil

Other Decks in Technology

Transcript

  1. Why it is important? • Trend: 90% of the data

    in the world today has been created in the last two years • Recommendation systems: Amazon, eBay, IMDb • Text & Speech recognition: Google • Sales forecasting: Everywhere Use cases: • Problem: people can’t operate on such amounts • Solution: apply machine learning to provide valuable insights Predictions, not causality
  2. — What the heck is Machine Learning anyway? Definition Machine

    Learning is a field of study that gives computers the ability to learn without being explicitly programmed.
  3. — What the heck is Machine Learning anyway? Definition Machine

    Learning is a field of study that gives computers the ability to learn without being explicitly programmed. … explores algorithms that can learn from … … and make prediction on data
  4. size color shape fruit big red round apple small red

    heart-shaped cherry big green long cylinder banana small green oval grape Okay, we have some data features decision variable How does it look like? observations usually muuuch more wider
  5. Basic concepts: Train vs Test data Train data: • outcome

    is known • used for learning Test data: • outcome is unknown • used for classifying size color shape fruit big red round apple small red heart-shaped cherry big green long cylinder banana small green oval grape size color shape fruit big red round ? small red heart-shaped ? big green long cylinder ? small green oval ?
  6. Basic concepts: Supervised vs Unsupervised Supervised: • Usually classification •

    70% of machine learning Unsupervised: • Usually clusterization • Group similar data input
  7. Basic concepts: Supervised learning size color shape fruit big red

    round apple small red heart-shaped cherry big green long cylinder banana small green oval grape Train data: Is size big? apple cherry Is shape oval? Is color red? yes no grape banana Decision trees!
  8. Basic concepts: Decision trees Assuming we have this dataset elevation

    year bathroom bedroom price area city 120m 1950 2 1 514500 80 San Francisco 21m 1982 1 2 357800 45 New York 198m 2000 3 3 1100000 120 San Francisco 15m 1993 4 5 850000 30 New York …. …. …. …. …. …. ….
  9. Basic concepts: Decision trees It’s all about finding patterns 3.

    Repeat 1. Look at one variable at a time 2. Use statistical learning to identify boundaries best split elevation greedy
  10. Useful Resources • Datacamp.com - the easiest way to learn

    data science • Kaggle.com - the home of data science • https://www.coursera.org/learn/machine-learning/