9. • Average Similar Pixels
• Do not Average non-Similar Pixels
Problem)
Not Enough Similar Pixels in LOCAL REGIONS
10. • Average Similar Pixels
• Do not Average non-Similar Pixels
Problem)
Not Enough Similar Pixels in LOCAL REGIONS
Get More Samples in Non-LOCAL REGIONS
42. Stride=2 Conv Stride=1 ConvStride=2 Pool
256x56x56 512x28x28
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
128128512
512
43. Stride=2 Conv Stride=1 ConvStride=2 Pool
512x28x28
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
256256 1024
1024
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1024x14x14
44. Stride=2 Conv Stride=1 ConvStride=2 Pool
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1024x14x14 2048x7x7
512 512 2048
2048
2048x1x1
1000
7
x
7
F
C
52. • Add 1 Non-local Block
• Right before the last residual block of res4
• The Attentional Behavior is Not the Key to the Improvement
• Similarity + Learning >> Similarity (Gaussian)
53. Stride=2 Conv Stride=1 ConvStride=2 Pool
512x28x28
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
256256 1024
1024
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1
x
1
3
x
3
1
x
1
1
x
1
+
1024x14x14