A basic introduction to key concepts underpinning machine learning and provides useful resources to get started, suitable for someone new to the field. Slides presented at The App Business (Dec, 2017) and Socrates UK conference (June, 2018).
why has it become the new cool thing? 3 Abundant data Abundant computation Advances in the field New possibilities and applications have been created by…
value(s) Example algorithms: decision trees, logistic regression, k- nearerst neighbours Example usage: spam filtering Prediction: continuous value Example algorithms: linear regression, regression tree Example usage: predicting house prices Regression Classificiation Prediction: group based on relationship Example algorithms: k-means, affinity propagation, hierarchical Example usage: customer segmentation Clustering
value(s) Example algorithms: decision trees, logistic regression, k- nearerst neighbours Example usage: spam filtering Prediction: continuous value Example algorithms: linear regression, regression tree Example usage: predicting house prices Regression Classificiation Prediction: group based on relationship Example algorithms: k-means, affinity propagation, hierarchical Example usage: customer segmentation Clustering
value(s) Example algorithms: decision trees, logistic regression, k- nearerst neighbours Example usage: spam filtering Prediction: continuous value Example algorithms: linear regression, regression tree Example usage: predicting house prices Regression Classificiation Prediction: group based on relationship Example algorithms: k-means, affinity propagation, hierarchical Example usage: customer segmentation Clustering
training data includes desired solutions e.g. example spam/not spam messages Unsupervised training data doesn’t have labels or any penalties/rewards to guide e.g. suggest similar news articles Semi-supervised a mixture of the above two e.g. photo tagging where users tags a person in one picture & others with them found Reinforcement receive rewards and penalties from environment and learns using those e.g. show/hide this ad on Facebook
online learning learn on the fly from a continuous flow of data batch learning requires all training data to learn; once it’s live it doesn’t keep learning
spam filter that looks for similarity to previous spam messages model-based e.g. house price predictor that looks for trends in training data to predict new values