Slide 37
Faster R-CNN: base network
Image of arbitrary size → feature map.
Common architectures:
● VGG (16, 19)
● ResNet (50, 101, 152, ...)
● Inception (V2, V3)
● Xception
● MobileNet
● ...
For ResNet-101: feature map is 1/16 of the input size spatially and 1024 channels deep.
[Figure: 800 × 600 × 3 input image → CNN (ResNet) → 50 × 37 × 1024 feature map]
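The numbers in the figure follow from the 1/16 spatial reduction. Here is a minimal sketch of that arithmetic, assuming the downsampling rounds down (which matches the 37 shown for a 600-pixel side). The helper name `feature_map_shape` and its parameters are hypothetical and used only for illustration; real implementations may round differently depending on padding.

```python
def feature_map_shape(width, height, stride=16, depth=1024):
    """Estimate the feature map shape for an input image.

    Assumes an overall stride of 16 and 1024 output channels (the slide's
    ResNet-101 figures) and floor rounding, which is an assumption here.
    """
    return width // stride, height // stride, depth


# The slide's example: an 800 x 600 RGB image.
print(feature_map_shape(800, 600))  # (50, 37, 1024)
```

Because the base network is convolutional, the same calculation applies to an image of any size; only the spatial dimensions of the feature map change, not its depth.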