
Reading Circle (Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation)

pyman
July 29, 2020


Explanation of Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation [Wang+, CVPR20]


Transcript

  1. Reading Circle: Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation [Wang+, CVPR20]
  2. Semantic Segmentation: Assign a semantic category to each pixel.
    Supervised dataset: image & pixel-level class labels. ✗ Huge annotation cost.
    ↓ Weakly-supervised dataset: image & image-level class labels.
  3. Weakly-Supervised SS Methods: Three steps to train on image-level labels.
    1. Predict an initial category-wise response map to localize the object.
    2. Refine the initial response into the pseudo ground truth (GT).
    3. Train the segmentation network on the pseudo labels.
  4. Weakly-Supervised SS Methods: Three steps to train on image-level labels.
    1. Predict an initial category-wise response map to localize the object.
    2. Refine the initial response into the pseudo ground truth (GT).
    3. Train the segmentation network on the pseudo labels.
    (Same slide as above; this paper focuses on step 1.)
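The initial category-wise response map in step 1 is typically a Class Activation Map (CAM). A minimal NumPy sketch of computing CAMs from backbone features and classifier weights (names and shapes here are illustrative, not from the paper):

```python
import numpy as np

def class_activation_map(features, weights):
    """Compute category-wise response maps (CAMs).

    features: (C, H, W) feature maps from the backbone's last conv layer.
    weights:  (K, C) classifier weights, one row per image-level class.
    Returns:  (K, H, W) response maps, max-normalized per class.
    """
    K = weights.shape[0]
    cam = np.einsum('kc,chw->khw', weights, features)  # per-class weighted sum of channels
    cam = np.maximum(cam, 0)                           # ReLU: keep positive evidence only
    flat = cam.reshape(K, -1)                          # normalize each class map to [0, 1]
    mx = flat.max(axis=1, keepdims=True)
    return (flat / np.maximum(mx, 1e-8)).reshape(cam.shape)

rng = np.random.default_rng(0)
cams = class_activation_map(rng.standard_normal((64, 8, 8)),
                            rng.standard_normal((3, 64)))
print(cams.shape)  # (3, 8, 8)
```

Thresholding these maps per class gives the rough object localization that step 2 then refines.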
  5. What's New: Introduce a self-supervised equivariant attention mechanism (SEAM).
    - Narrows the supervision gap between fully and weakly supervised semantic segmentation.
  6. What's New: Introduce a self-supervised equivariant attention mechanism (SEAM).
    - Focuses on affine transformations. (Figure: Previous vs. Proposed)
  7. What's New: Introduce a self-supervised equivariant attention mechanism (SEAM).
    - Focuses on affine transformations. Previous: the CAM varies depending on the size of the input image.
  8. What's New: Introduce a self-supervised equivariant attention mechanism (SEAM).
    - Focuses on affine transformations. Proposed: a consistent CAM regardless of the size of the input image.
  9. Network Architecture of SEAM: Uses a Siamese network and three kinds of losses.
  10. Network Architecture of SEAM: Uses a Siamese network and three kinds of losses.
  11. Pixel Correlation Module (PCM): Revises the CAM with a self-attention mechanism.
  12. Pixel Correlation Module (PCM): Revises the CAM with a self-attention mechanism,
    following "Non-local Neural Networks" [Wang+, CVPR18].
  13. Self-Attention [Wang+, CVPR18]: the non-local mean operation
    y_i = (1/C(x)) Σ_j f(x_i, x_j) g(x_j),
    where x is the input signal, y the output signal, g a representation function,
    and f a scalar similarity function (Gaussian, Embedded Gaussian, Dot product, or Concatenation).
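The non-local mean operation can be sketched with the Embedded Gaussian choice of f, where the normalization C(x) turns the pairwise similarities into a softmax over positions (matrix shapes here are illustrative):

```python
import numpy as np

def nonlocal_embedded_gaussian(x, w_theta, w_phi, w_g):
    """Non-local mean operation with the Embedded Gaussian similarity.

    x: (N, C) input signal (N positions, C channels).
    w_theta, w_phi, w_g: (C, C') projection matrices for theta, phi, g.
    Computes y_i = sum_j softmax_j(theta(x_i) . phi(x_j)) * g(x_j).
    """
    theta, phi, g = x @ w_theta, x @ w_phi, x @ w_g
    f = theta @ phi.T                         # (N, N) pairwise similarity logits
    f = f - f.max(axis=1, keepdims=True)      # shift for numerical stability
    attn = np.exp(f)
    attn /= attn.sum(axis=1, keepdims=True)   # C(x) normalization -> row-wise softmax
    return attn @ g                           # weighted mean of g(x_j) per position

rng = np.random.default_rng(1)
x = rng.standard_normal((16, 8))
w = [rng.standard_normal((8, 4)) for _ in range(3)]
y = nonlocal_embedded_gaussian(x, *w)
print(y.shape)  # (16, 4)
```

Each output position is thus a similarity-weighted mean over all positions, which is what makes the operation "non-local".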
  14. Self-Attention [Wang+, CVPR18]: the non-local block, which wraps the non-local
    operation in a residual connection, z = W_z · y + x.
  15. Pixel Correlation Module (PCM): Revises the CAM with a self-attention mechanism,
    following "Non-local Neural Networks" [Wang+, CVPR18].
  16. Pixel Correlation Module (PCM): Revises the CAM with a self-attention mechanism.
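A rough sketch of the PCM idea: use pixel-wise feature affinity to spread class responses across similar pixels. The affinity here is assumed to be a ReLU'd cosine similarity with per-pixel normalization, which follows the spirit of SEAM's PCM; the paper's exact details differ.

```python
import numpy as np

def pixel_correlation_module(cam, feat):
    """Sketch of PCM: revise the CAM with pixel-wise feature affinity.

    cam:  (K, N) class response maps, flattened over N = H*W pixels.
    feat: (C, N) pixel features used to measure pixel similarity.
    Assumption: affinity = ReLU(cosine similarity), normalized per pixel.
    """
    f = feat / np.maximum(np.linalg.norm(feat, axis=0, keepdims=True), 1e-8)
    aff = np.maximum(f.T @ f, 0)                                  # (N, N) non-negative affinity
    aff = aff / np.maximum(aff.sum(axis=0, keepdims=True), 1e-8)  # normalize per target pixel
    return cam @ aff                                              # propagate class responses

rng = np.random.default_rng(3)
refined = pixel_correlation_module(rng.random((3, 16)), rng.standard_normal((8, 16)))
print(refined.shape)  # (3, 16)
```

The effect is that a pixel's revised response becomes an affinity-weighted mean of the responses at visually similar pixels, smoothing the raw CAM toward object boundaries.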
  17. Network Architecture of SEAM: Uses a Siamese network and three kinds of losses.
  18. Loss Design of SEAM: 1. Classification loss: a multi-label soft margin loss on the original CAM.
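The multi-label soft margin loss (the formulation used by torch.nn.MultiLabelSoftMarginLoss) is a per-class binary cross-entropy on sigmoid scores; a NumPy sketch with illustrative inputs:

```python
import numpy as np

def multilabel_soft_margin_loss(logits, targets):
    """Multi-label soft margin loss, averaged over classes.

    logits:  (K,) class scores, e.g. global-average-pooled CAM values.
    targets: (K,) binary image-level labels.
    """
    sig = 1.0 / (1.0 + np.exp(-logits))   # per-class probability
    eps = 1e-12                           # avoid log(0)
    loss = -(targets * np.log(sig + eps) + (1 - targets) * np.log(1 - sig + eps))
    return loss.mean()

# Confident logits with matching labels give a small loss:
print(round(multilabel_soft_margin_loss(np.array([5.0, -5.0]),
                                        np.array([1.0, 0.0])), 4))  # 0.0067
```

Because each class is scored independently, this handles images containing multiple object classes, which is exactly the image-level supervision available here.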
  19. Network Architecture of SEAM: Uses a Siamese network and three kinds of losses.
  20. Loss Design of SEAM: 2. Equivariant Regularization (ER) loss: enforces consistency
    between the CAMs before and after the affine transformation.
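The ER loss can be sketched as an L1 consistency term between the spatially transformed CAM of the original image and the CAM predicted on the transformed image. A horizontal flip stands in for a general affine transform A here:

```python
import numpy as np

def er_loss(cam_original, cam_transformed, affine):
    """Equivariant Regularization loss (sketch).

    Penalizes the L1 distance between A(CAM(I)) and CAM(A(I)):
    cam_original:    CAM of the original image I.
    cam_transformed: CAM of the transformed image A(I).
    affine:          function applying the same spatial transform A to a CAM.
    """
    return np.abs(affine(cam_original) - cam_transformed).mean()

flip = lambda cam: cam[..., ::-1]   # example affine transform: horizontal flip
rng = np.random.default_rng(2)
cam = rng.random((3, 8, 8))
print(er_loss(cam, flip(cam), flip))  # 0.0 for perfectly equivariant CAMs
```

When the network's CAM is equivariant to A, the two maps match and the loss vanishes; this is the self-supervision signal that needs no extra labels.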
  21. Network Architecture of SEAM: Uses a Siamese network and three kinds of losses.
  22. Loss Design of SEAM: 3. Equivariant Cross Regularization (ECR) loss: further improves
    the network's ability to learn equivariance.
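A rough sketch of the cross term, assuming (in the spirit of SEAM) that each branch's PCM-revised CAM is matched against the other branch's original CAM; argument names are illustrative:

```python
import numpy as np

def ecr_loss(cam_o, cam_t, cam_o_rev, cam_t_rev, affine):
    """Sketch of the Equivariant Cross Regularization (ECR) loss.

    cam_o / cam_t:         original CAMs of the original / transformed branch.
    cam_o_rev / cam_t_rev: PCM-revised CAMs of the two branches.
    Each branch's revised map is regularized against the (transformed)
    original map of the other branch.
    """
    return (np.abs(cam_t_rev - affine(cam_o)).mean()
            + np.abs(affine(cam_o_rev) - cam_t).mean())

# With a flip as the affine transform and perfectly equivariant maps the loss is 0:
flip = lambda m: m[..., ::-1]
rng = np.random.default_rng(4)
cam = rng.random((3, 8, 8))
print(ecr_loss(cam, flip(cam), cam, flip(cam), flip))  # 0.0
```

Crossing the two branches prevents the PCM from degenerating, since the revised map of one view must stay consistent with the raw prediction of the other.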
  23. Network Architecture: Uses a Siamese network and three kinds of losses.
  24. Dataset: PASCAL VOC 2012 semantic segmentation benchmark.
    21 categories; each image contains one or multiple object classes.
    1,464 training images, 1,449 validation images, 1,456 test images.
  25. Result (CAM). (Figure columns: Original, GT, Baseline, Proposed)
  26. Result (Semantic Segmentation). (Figure columns: Input, GT, Ours)
  27. Quantitative Comparison: Evaluation of transformations for equivariant regularization.
    Evaluation metric: mIoU (%, higher is better).
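The mIoU metric used in these comparisons is the mean, over classes, of the intersection-over-union between predicted and ground-truth pixel sets; a small NumPy sketch with toy labels:

```python
import numpy as np

def mean_iou(pred, gt, num_classes):
    """Mean Intersection-over-Union over classes, as a percentage.

    pred, gt: integer label maps of the same shape.
    Classes absent from both prediction and ground truth are skipped.
    """
    ious = []
    for c in range(num_classes):
        p, g = pred == c, gt == c
        union = np.logical_or(p, g).sum()
        if union == 0:                      # class absent everywhere: skip
            continue
        ious.append(np.logical_and(p, g).sum() / union)
    return 100.0 * np.mean(ious)

pred = np.array([[0, 0, 1, 1],
                 [0, 1, 1, 1]])
gt   = np.array([[0, 0, 0, 1],
                 [0, 1, 1, 1]])
print(round(mean_iou(pred, gt, 2), 2))  # 77.5 (IoU: 3/4 for class 0, 4/5 for class 1)
```

On PASCAL VOC the mean is taken over all 21 categories (20 object classes plus background).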
  28. Quantitative Comparison: Per-category performance comparison.
    Evaluation metric: mIoU (%, higher is better).
  29. Quantitative Comparison: Evaluation of WSSS performance.
    [Chang+, CVPR20]: val 66.1, test 65.9 (mIoU %).
  30. Conclusion: Weakly-supervised learning: 1. generate pseudo GT labels, 2. apply supervised learning.
    For weakly-supervised semantic segmentation, the self-supervised equivariant attention
    mechanism yields a more appropriate CAM and therefore better pseudo GT labels.