Slide 3
Slide 3 text
Types of Computer Vision Problems
Classification Detection, Segmentation,
Tracking
• Whole image - binary,
multiclass or multilabel
• Cats vs Dogs, MNIST, CIFAR,
ImageNet
• LeNet, AlexNet, VGG, etc
• Conv layers are feature
extractors
• Metrics have whole images as
data points
• Multiple ML problems within
a single image
• PASCAL VOC, COCO,
LSUN
• FCNs, UNets, R-CNNs,
YOLO
• Conv layers are more like
predictors
• Metrics have image subsets /
pixels as data points