in the world today has been created in the last two years • Recommendation systems: Amazon, eBay, IMDb • Text & Speech recognition: Google • Sales forecasting: Everywhere Use cases: • Problem: people can’t operate on such amounts • Solution: apply machine learning to provide valuable insights Predictions, not causality
Learning is a field of study that gives computers the ability to learn without being explicitly programmed. … explores algorithms that can learn from … … and make prediction on data
heart-shaped cherry big green long cylinder banana small green oval grape Okay, we have some data features decision variable How does it look like? observations usually muuuch more wider
is known • used for learning Test data: • outcome is unknown • used for classifying size color shape fruit big red round apple small red heart-shaped cherry big green long cylinder banana small green oval grape size color shape fruit big red round ? small red heart-shaped ? big green long cylinder ? small green oval ?
round apple small red heart-shaped cherry big green long cylinder banana small green oval grape Train data: Is size big? apple cherry Is shape oval? Is color red? yes no grape banana Decision trees!
year bathroom bedroom price area city 120m 1950 2 1 514500 80 San Francisco 21m 1982 1 2 357800 45 New York 198m 2000 3 3 1100000 120 San Francisco 15m 1993 4 5 850000 30 New York …. …. …. …. …. …. ….