Slide 19
Supervised vs. Unsupervised vs. Reinforcement Learning
• Labeled training data
• Want to predict labels of new, unlabeled data
• Ex: Classification, k-nearest neighbors
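To make the supervised case concrete, here is a minimal sketch of k-nearest-neighbor classification. The toy points, the labels "a" and "b", and the function name `knn_predict` are assumptions made for illustration; they are not from the slide.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Rank training indices by Euclidean distance to the query point.
    order = sorted(range(len(train_points)),
                   key=lambda i: math.dist(train_points[i], query))
    # Count the labels of the k closest points and return the most common one.
    votes = Counter(train_labels[i] for i in order[:k])
    return votes.most_common(1)[0][0]

# Hypothetical labeled training data: two well-separated groups.
train_points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
                (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
train_labels = ["a", "a", "a", "b", "b", "b"]

# Predict labels for new, unlabeled points.
print(knn_predict(train_points, train_labels, (1.1, 1.0)))  # prints "a"
print(knn_predict(train_points, train_labels, (5.1, 5.0)))  # prints "b"
```

The key property is that the labels are given up front; the algorithm only has to carry them over to new points.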
• Algorithm finds trends in unlabeled data; what gets optimized depends on the algorithm
• Ex: Automated clustering, k-means, data exploration
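For the unsupervised case, a minimal k-means sketch shows how structure can be found without any labels. The toy points, the fixed starting centroids, and the function name `kmeans` are assumptions for illustration; practical implementations usually pick starting centroids randomly and check for convergence.

```python
import math

def kmeans(points, centroids, iterations=10):
    """Alternate between assigning points to centroids and recomputing centroids."""
    centroids = list(centroids)
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda j: math.dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        for j, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    # Final cluster index for every point.
    assignments = [min(range(len(centroids)),
                       key=lambda j: math.dist(p, centroids[j]))
                   for p in points]
    return centroids, assignments

# Hypothetical unlabeled data: the same shape as two blobs, but with no labels.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
          (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]

# Assumed initialization: start from two of the data points.
centroids, assignments = kmeans(points, [points[0], points[3]])
print(assignments)
```

Here the quantity being minimized (distance from each point to its cluster centroid) is built into k-means itself, which is one way to read "optimization depends on the algorithm".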
• Complex problem/reward space
• Consecutive actions by an agent result in an outcome/score
• Ex: Agent for autonomous driving
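For the reinforcement case, the idea of consecutive actions leading to a score can be sketched with tabular Q-learning. Autonomous driving is far too large for a short example, so this uses a hypothetical five-state corridor as a stand-in: the agent starts at state 0 and is rewarded only on reaching state 4. The environment, the hyperparameters, and all names here are assumptions for illustration.

```python
import random

N_STATES = 5      # states 0..4; state 4 is the goal and ends the episode
ACTIONS = (0, 1)  # 0 = move left, 1 = move right

def step(state, action):
    """Apply an action and return (next_state, reward)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward

def train(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Learn action values from random exploration (Q-learning is off-policy)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        while state != N_STATES - 1:
            action = rng.choice(ACTIONS)  # explore uniformly at random
            next_state, reward = step(state, action)
            # Move the estimate toward reward plus discounted best future value.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q

q_table = train()
# Greedy policy: the highest-valued action in each non-terminal state.
policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(policy)
```

No single action is labeled correct; the agent only sees a reward at the end of a sequence of moves, and the learned values spread that reward back to the earlier actions.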
Three classes of problem