Slide 91
#    Layer
     Input: 3 x 224 x 224 (RGB image, zero-mean)
1    64C3
2    64C3
     MP2
3    128C3
4    128C3
     MP2
Early part: blocks consisting of a few convolutional layers, often with 3x3 kernels, followed by down-sampling (max-pooling or striding).
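To make the down-sampling step concrete, here is a minimal sketch of 2x2 max-pooling in plain Python. The helper name `max_pool_2x2` and the sample feature map are illustrative assumptions, not from the slide, and the sketch assumes a stride of 2 and an input with even height and width.

```python
def max_pool_2x2(grid):
    """2x2 max-pooling with stride 2 on a list of lists (even height and width assumed)."""
    return [
        [
            max(grid[r][c], grid[r][c + 1], grid[r + 1][c], grid[r + 1][c + 1])
            for c in range(0, len(grid[0]), 2)
        ]
        for r in range(0, len(grid), 2)
    ]


# Hypothetical 4x4 single-channel feature map.
feature_map = [
    [1, 3, 2, 0],
    [4, 2, 1, 1],
    [0, 1, 5, 6],
    [2, 2, 7, 8],
]

pooled = max_pool_2x2(feature_map)
print(pooled)  # [[4, 2], [2, 8]]
```

Each output value is the maximum of one non-overlapping 2x2 window, so height and width are both halved. A strided convolution would reach the same output size with learned weights instead of a fixed maximum.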
64C3 = 3x3 conv, 64 filters
MP2 = max-pooling, 2x2
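The notation above can be traced through the early part of the network with a short sketch. The function name `trace_shapes` is hypothetical. The slide does not state padding or stride, so the sketch assumes stride-1 convolutions with "same" padding (spatial size unchanged) and a pooling stride equal to the window size.

```python
import re


def trace_shapes(layers, shape=(3, 224, 224)):
    """Return (token, (channels, height, width)) after each layer token."""
    c, h, w = shape
    shapes = []
    for token in layers:
        conv = re.fullmatch(r"(\d+)C(\d+)", token)
        pool = re.fullmatch(r"MP(\d+)", token)
        if conv:
            # Assumption: stride 1 with "same" padding, so only the channel count changes.
            c = int(conv.group(1))
        elif pool:
            # Assumption: pooling stride equals the window size.
            k = int(pool.group(1))
            h, w = h // k, w // k
        else:
            raise ValueError(f"unknown layer token: {token}")
        shapes.append((token, (c, h, w)))
    return shapes


early_part = ["64C3", "64C3", "MP2", "128C3", "128C3", "MP2"]
for token, shape in trace_shapes(early_part):
    print(f"{token:>6} -> {shape}")
```

Under these assumptions the 3 x 224 x 224 input becomes 64 x 112 x 112 after the first block and 128 x 56 x 56 after the second.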