Slide 26
Slide 26 text
Supervised
Training Loop
breakdown..
(raw) Data - a.k.a. Observations / Input
Items about which we want to predict something. We usually will
denote observation with x.
Labels - a.k.a. Targets (i.e. Ground Truth)
Labels corresponding to observations. These are usually the things being
predicted. Following standard notations in ML/DL, we will use y to
refer to these.
Model f(x) = ˆy
A mathematica expression or a function that takes an observation x and
predicts the value of its target label.
Predictions - a.k.a. Estimates: Values of the Targets generated by the
model - usually referred to as ˆy
Parameters - a.k.a. Weights (in DL terminology)
Parameters of the Model. We will refer to them using the w.
Loss Function L(y, ˆy):
Function that compares how far off a prediction is from its target for
observations in the training data. The loss function assigns a scalar real
value called the loss. The lower the value of the loss, the better the
model is predicting. The Loss is usually referred to as L
Source:D. Rao et al. - Natural Language Processing with PyTorch, O’Reilly 2019