A brief introduction to Machine Learning

Slide 1

Slide 1 text

A brief introduction to Machine Learning A summary of basic feature selection and classiﬁcation methods on medical data Zena Hira1 - Loizos Makrides1 1Imperial College London, Department of Computing January 30, 2014 Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 1 / 25

Slide 2

Slide 2 text

Outline 1 Dimensionality Reduction Overview Methods Linear Discriminant Analysis Principal Components Analysis Independent Components Analysis Manifold Learning 2 Classiﬁcation Overview Methods Support Vector Machines k - Nearest Neighbours Decision Trees Neural Networks 3 Applications Medical Applications Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 2 / 25

Slide 3

Slide 3 text

Slide 4

Slide 4 text

Dimensionality Reduction Overview Dimensionality reduction is the process of reducing the number of random variables under consideration and can be divided into two steps: Feature extraction transforms the data in the high-dimensional space to a space of fewer dimensions. Feature selection process of selecting a subset of relevant features for use in model construction. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 4 / 25

Slide 5

Slide 5 text

Dimensionality Reduction Why is it needed? The curse of dimensionality (Small number of samples - Large number of features) Cancer data involve tens of thousands of gene expressions and thousands of different patients in which they are expressed. Functional MRI data model the brain with almost 400.000 volumetric pixels for each time point. Real-life data contain a lot of noise Noise coming from the hardware that is used to capture the measurements Noise coming from other effects that are happening alongside the effect we are studying Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 5 / 25

Slide 6

Slide 6 text

Slide 7

Slide 7 text

Linear Discriminant Analysis Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 7 / 25

Slide 8

Slide 8 text

Principal Components Analysis PCA transforms a number of possibly correlated variables into a (smaller) number of uncorrelated variables called principal components. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 8 / 25

Slide 9

Slide 9 text

Independent Components Analysis ICA transforms a multivariate signal into a number of additive subcomponents which are statistically independent. The assumption that the input signals are not normally distributed must hold. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 9 / 25

Slide 10

Slide 10 text

Manifold Learning / Isomap ISOMAP High Dimensional Space mapped to a lower one by perceiving topological characteristics. Works by determining the geodesic distances of the neighbourhood graph and creating the multidimensional scaling. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 10 / 25

Slide 11

Slide 11 text

Slide 12

Slide 12 text

Classiﬁcation Classiﬁcation is the problem of identifying to which of a set of categories (sub-populations) a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is known. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 12 / 25

Slide 13

Slide 13 text

Slide 14

Slide 14 text

Support Vector Machines SVM represents all the observations as points in space, mapped so that observations of separate categories are divided by a clear gap that is as wide as possible. New observations are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall on. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 14 / 25

Slide 15

Slide 15 text

Support Vector Machines In addition to performing linear classification, SVMs can efficiently perform a non-linear classification using what is called the kernel trick, implicitly mapping their inputs into high-dimensional feature spaces. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 15 / 25

Slide 16

Slide 16 text

k - Nearest Neighbours In k-NN classiﬁcation an object is classiﬁed by a majority vote of its neighbours, with the object being assigned to the class most common among its k nearest neighbours. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 16 / 25

Slide 17

Slide 17 text

Decision Trees Creates a model that predicts the value of a target variable Learns simple decision rules inferred from the data features. Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 17 / 25

Slide 18

Slide 18 text

Neural Networks Inspired by the way biological nervous systems Starting with random weights on each arc tries to ﬁnd the solution that minimizes the overall solution error by adjusting them in each cycle Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 18 / 25

Slide 19

Slide 19 text

Slide 20

Slide 20 text

Microarrays A Quick Description Consist of a small membrane or glass slide containing samples of many genes arranged in a regular pattern. Used for Gene Discovery, Disease Diagnosis, Drug Discovery, Toxicological Research Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 20 / 25

Slide 21

Slide 21 text

Microarrays Illustrated Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 21 / 25

Slide 22

Slide 22 text

Leukaemia Diagnosis Thanks To George Trigeorgis Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 22 / 25

Slide 23

Slide 23 text

Functional MRI Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 23 / 25

Slide 24

Slide 24 text

Brain Decoding Predicting what task the subject was performing in the MRI scanner Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 24 / 25

Slide 25

Slide 25 text

Zena Hira - Loizos Makrides (Imperial College London) A brief introduction to Machine Learning January 30, 2014 25 / 25