on data • system is both the algorithm and the data • only as good as your data • starts with a hypothesis about how we can represent the data (for linear regression : a straight line) • can deal poorly with outliers • lots of calculation to learn, but very fast to apply (can run on mobile)