FickleNet: Weakly and Semi-supervised Semantic
Image Segmentation using Stochastic Inference
Hwang seung hyun
Yonsei University Severance Hospital CCIDS
SNU, Korea | CVPR 2019
2020.03.22
Contents
01 Introduction
02 Related Work
03 Methods and Experiments
04 Conclusion
FickleNet
Introduction – Limitation of Prior Works
• Real-world semantic segmentation must handle a large variety of object classes and requires
large amounts of labeled data
• Current weakly supervised segmentation methods perform worse than fully supervised
segmentation
• The main obstacle to weakly supervised semantic image segmentation is obtaining pixel-level
information (locations or boundaries)
• Most weakly supervised segmentation methods depend on localization maps obtained from a
classification network
• These localization maps focus only on the small discriminative parts of objects, which makes
boundaries hard to locate
Introduction – FickleNet
• Generates a variety of localization maps from a single image using random combinations of
hidden units in a CNN
• Chooses hidden units at random for each sliding window position
(similar to the dropout technique)
• Random selection of hidden units (a stochastic approach) produces regions of different shapes
• Many existing studies use stochastic regularization during training (e.g. dropout), but
not in the inference phase
[Figure: hidden unit selection; multiple localization maps]
Introduction – Contributions
• FickleNet discovers the relationship between locations in an image
and enlarges the regions activated by the classifier.
• Introduces a method of expanding feature maps that makes the
model run faster at only a small cost in GPU memory.
• FickleNet achieved state-of-the-art performance on the PASCAL VOC 2012
benchmark in both weakly supervised and semi-supervised settings.
Related Work
Image Level Processing
• The Class Activation Map (CAM) is a good starting point for
classifying pixels from image-level annotations
• CAM discovers the contribution of each hidden unit in a neural network
to the classification score, but it tends to focus on the small
discriminative region of a target.
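As a minimal illustration of the CAM idea, the sketch below computes a class activation map as the weighted sum of the last convolutional feature maps, using the classifier weights of one class. Plain Python lists stand in for tensors; the function name `class_activation_map` and the toy values are illustrative assumptions, not taken from the slides.

```python
def class_activation_map(feature_maps, class_weights):
    """Weighted sum of feature maps: cam[i][j] = sum_k w[k] * f[k][i][j]."""
    h, w = len(feature_maps[0]), len(feature_maps[0][0])
    cam = [[0.0] * w for _ in range(h)]
    for fmap, weight in zip(feature_maps, class_weights):
        for i in range(h):
            for j in range(w):
                cam[i][j] += weight * fmap[i][j]
    return cam


# Two 2 x 2 feature maps and the weights that one class assigns to them
feature_maps = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 2.0], [0.0, 0.0]],
]
class_weights = [0.5, 1.5]
cam = class_activation_map(feature_maps, class_weights)
```

High values in `cam` mark the locations that contribute most to the class score, which is why such a map tends to concentrate on the most discriminative region.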
Feature Level Processing
• Multi-dilated convolution (MDC) uses several convolutional blocks,
dilated at different rates, and aggregates the CAMs obtained from each block,
which resembles ensemble learning
• The available dilation rates are limited
• A standard dilated convolution has a square receptive field of fixed size, so MDC tends to
identify false-positive regions
Region Growing
• SRG (Seeded Region Growing)
• DSRG (Deep Seeded Region Growing)
→ Seeds for region growing are obtained from CAM
→ VGG for the classification network
→ DeepLab-ASPP for the segmentation network
→ Seeds come only from discriminative parts of objects, so they are difficult to grow into
non-discriminative parts.
Methods and Experiments
Stochastic Hidden Unit Selection
• Randomly select hidden units so that a non-discriminative part of an
object becomes associated with a discriminative part of the same object.
Stochastic Hidden Unit Selection - Feature Map Expansion
• Apply spatial dropout to the feature map X at each sliding window position.
• This differs from the standard dropout technique, which samples hidden units in the feature
maps only once.
• This method of selecting hidden units can generate receptive fields of many different shapes
and sizes
• Calling the convolution and dropout functions w × h times in each forward pass is very
inefficient
• Therefore, the feature maps are expanded so that no sliding window positions overlap, and
dropout and a strided convolution can each be applied once
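The expansion step can be sketched in plain Python as follows. This is a minimal illustration, assuming a single-channel feature map stored as a list of lists, a k × k kernel, stride 1 and zero padding; the function name `expand_feature_map` is hypothetical. Each k × k sliding window is copied into its own non-overlapping tile of a larger map.

```python
def expand_feature_map(x, k):
    """Copy every k x k sliding window of x (stride 1, zero padding)
    into its own non-overlapping k x k tile of a larger map."""
    h, w = len(x), len(x[0])
    pad = k // 2
    out = [[0.0] * (w * k) for _ in range(h * k)]
    for i in range(h):              # row of the window centre
        for j in range(w):          # column of the window centre
            for di in range(k):
                for dj in range(k):
                    src_i, src_j = i + di - pad, j + dj - pad
                    if 0 <= src_i < h and 0 <= src_j < w:
                        out[i * k + di][j * k + dj] = x[src_i][src_j]
    return out


x = [[1.0, 2.0],
     [3.0, 4.0]]
expanded = expand_feature_map(x, k=3)   # 6 x 6 map made of four 3 x 3 tiles
```

On the expanded map, one dropout call followed by one convolution with stride k covers every window, instead of one call per window position. The price is a map that is k × k times larger, which is where the extra memory goes.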
Stochastic Hidden Unit Selection – Center preserving spatial dropout
• Do not drop the center of the kernel at each sliding window
position
• This way, relationships between the kernel center and the other locations
in each window can be found
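A center-preserving mask can be sketched as below. This is an illustrative sketch only, assuming an expanded map made of h × w tiles of size k × k (one tile per sliding window); the function name `center_preserving_mask` and the parameter values are assumptions, and the rescaling of kept units that dropout implementations usually apply is omitted.

```python
import random


def center_preserving_mask(h, w, k, drop_rate, rng):
    """Build a 0/1 mask for an expanded (h*k) x (w*k) map.
    Each unit is dropped with probability drop_rate, except the
    centre of every k x k tile, which is always kept."""
    mask = [[0 if rng.random() < drop_rate else 1 for _ in range(w * k)]
            for _ in range(h * k)]
    c = k // 2
    for i in range(h):
        for j in range(w):
            mask[i * k + c][j * k + c] = 1   # never drop the tile centre
    return mask


rng = random.Random(0)   # fixed seed only to make the sketch repeatable
mask = center_preserving_mask(h=4, w=4, k=3, drop_rate=0.9, rng=rng)
centres_kept = all(mask[i * 3 + 1][j * 3 + 1] == 1
                   for i in range(4) for j in range(4))
```

Because every tile draws its own random pattern around a fixed center, each window pairs its center location with a different random subset of its neighbours.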
Inference Localization Map
• Use gradient-based CAM (Grad-CAM), which is a generalization of
the class activation map (CAM)
• Grad-CAM discovers the class-specific contribution of each hidden
unit to the classification score from the gradient flow
• Apply global average pooling (GAP) and a sigmoid function to the
final output feature map to obtain the classification score
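The scoring step (GAP followed by a sigmoid) can be sketched as below. This is a minimal illustration, assuming the final output feature map has one channel per class; the function name `classification_scores` and the toy values are hypothetical. Grad-CAM itself needs gradients and is not shown.

```python
import math


def classification_scores(feature_maps):
    """feature_maps: one h x w map (list of lists) per class.
    Returns one sigmoid score per class after global average pooling."""
    scores = []
    for fmap in feature_maps:
        values = [v for row in fmap for v in row]
        pooled = sum(values) / len(values)               # global average pooling
        scores.append(1.0 / (1.0 + math.exp(-pooled)))   # sigmoid
    return scores


maps = [
    [[2.0, 2.0], [2.0, 2.0]],     # class 0: strong response everywhere
    [[-1.0, 1.0], [1.0, -1.0]],   # class 1: responses average to zero
]
scores = classification_scores(maps)
```

A sigmoid per class, rather than a softmax over classes, suits images that can contain several object classes at once, since each score is independent of the others.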
Inference Localization Map – Aggregate localization map
• FickleNet constructs N different localization maps from a single image and
aggregates them into a single localization map.
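The slide does not spell out the aggregation rule, so the sketch below assumes one plausible rule: a pixel is assigned to a class if its score for that class exceeds a threshold in at least one of the N maps, a pixel claimed by several classes goes to the class with the highest mean score over the N maps, and unclaimed pixels are marked as ignored. The names `aggregate_maps` and `IGNORE` and the threshold value are hypothetical.

```python
IGNORE = -1  # label for pixels that no class claims (hypothetical convention)


def aggregate_maps(maps, threshold):
    """maps[n][c][i][j]: score of class c at pixel (i, j) in the n-th map.
    Returns an h x w grid of class indices, or IGNORE."""
    n_maps, n_cls = len(maps), len(maps[0])
    h, w = len(maps[0][0]), len(maps[0][0][0])
    labels = [[IGNORE] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            best_cls, best_mean = IGNORE, float("-inf")
            for c in range(n_cls):
                scores = [maps[n][c][i][j] for n in range(n_maps)]
                if max(scores) > threshold:      # fired in at least one map
                    mean = sum(scores) / n_maps
                    if mean > best_mean:
                        best_cls, best_mean = c, mean
            labels[i][j] = best_cls
    return labels


# Two stochastic maps, two classes, a 1 x 3 image
maps = [
    [[[0.9, 0.1, 0.2]], [[0.1, 0.2, 0.8]]],
    [[[0.2, 0.1, 0.9]], [[0.1, 0.3, 0.7]]],
]
labels = aggregate_maps(maps, threshold=0.5)
```

In this toy input the first pixel is claimed only by class 0, the second by no class, and the third by both classes, where class 1 wins on mean score.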
Inference Localization Map – Training Process
• The localization map provides pseudo-labels for training a semantic image
segmentation network
• Uses the same background cues as DSRG
• Using the aggregated map as a seed, apply a region growing method based on the
probabilities obtained from the segmentation network.
[Figure: aggregated map; segmentation network]
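The seed-and-grow step can be illustrated with a simplified region growing routine. This is a sketch under stated assumptions, not DSRG's actual procedure: it uses 4-connectivity and a single threshold, and the names `grow_regions` and `IGNORE` are hypothetical. An unlabeled neighbour of a labeled pixel joins that pixel's class when the segmentation network's probability for the class is high enough.

```python
from collections import deque

IGNORE = -1  # unlabeled pixel (hypothetical convention)


def grow_regions(seeds, probs, threshold):
    """seeds[i][j]: class index or IGNORE. probs[c][i][j]: segmentation
    probability of class c. An unlabeled 4-neighbour of a labeled pixel
    joins that pixel's class if its probability for the class exceeds
    threshold."""
    h, w = len(seeds), len(seeds[0])
    labels = [row[:] for row in seeds]   # leave the input seeds untouched
    queue = deque((i, j) for i in range(h) for j in range(w)
                  if labels[i][j] != IGNORE)
    while queue:
        i, j = queue.popleft()
        c = labels[i][j]
        for ni, nj in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
            if 0 <= ni < h and 0 <= nj < w and labels[ni][nj] == IGNORE:
                if probs[c][ni][nj] > threshold:
                    labels[ni][nj] = c
                    queue.append((ni, nj))
    return labels


seeds = [[0, IGNORE, IGNORE, IGNORE]]
probs = [[[0.9, 0.8, 0.7, 0.2]]]   # one class, 1 x 4 image
grown = grow_regions(seeds, probs, threshold=0.5)
```

Here the single seed spreads over the two neighbouring pixels with high probability and stops at the last pixel, whose probability is below the threshold.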
FickleNet – Experimental Setup
• Dataset – PASCAL VOC 2012 image segmentation
(21 classes, i.e. 20 object classes plus background / 10,582 training images with image-level annotation)
• The classifier is based on a VGG-16 network pre-trained on ImageNet
(modified by removing all fully connected layers and the last pooling layer)
• Segmentation is performed by DSRG, which is based on DeepLab-CRF
• The number of different localization maps N is set to 200
FickleNet – Weakly Supervised Semantic Segmentation
FickleNet – Weakly Supervised Semantic Segmentation with ResNet
FickleNet – Semi Supervised Semantic Segmentation with ResNet
FickleNet – Semi and Weakly Supervised Semantic Segmentation
Ablation Study
1. Effects of the Map Expansion Technique
• Training and CAM extraction times are reduced by factors of 15.4
and 14.2, respectively, at a cost of 12% more GPU memory use
2. Iterative Inference and Dropout Rate
• Additional random selections identify more regions of a target object
• The segmentation performance converges as N increases
• A dropout rate of 0.9 allows FickleNet to cover larger regions of the target object
than DSRG: more randomness reveals more non-discriminative parts
3. Comparison to General Dropout
• A hidden unit in FickleNet may be activated at some window positions and dropped
at others, so every hidden unit is able to affect the classification score; with general
dropout, a dropped unit is dropped at every window position
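The difference can be made concrete with a small calculation. This is a sketch assuming stride 1, an interior hidden unit covered by k × k windows, and independent masks per window; the function name is hypothetical. It gives the probability that a unit contributes to no window at all.

```python
def prob_unit_never_contributes(drop_rate, k, per_window_masks, keep_center):
    """Probability that one interior hidden unit is dropped in every
    k x k window that covers it (stride 1, independent masks)."""
    if not per_window_masks:
        # General dropout: one mask shared by all windows.
        return drop_rate
    if keep_center:
        # The unit is the centre of its own window, which is never dropped.
        return 0.0
    # Independent masks for each of the k * k windows covering the unit.
    return drop_rate ** (k * k)


general = prob_unit_never_contributes(0.9, 3, per_window_masks=False, keep_center=False)
per_window = prob_unit_never_contributes(0.9, 3, per_window_masks=True, keep_center=False)
fickle = prob_unit_never_contributes(0.9, 3, per_window_masks=True, keep_center=True)
```

With a dropout rate of 0.9 and a 3 × 3 kernel, a unit is silenced with probability 0.9 under general dropout, about 0.39 with independent per-window masks, and never once the window center is preserved.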
Conclusion
• Addresses the problem of semantic image segmentation using only
image-level annotations
• Obtains many different localization maps and aggregates those maps into
a single localization map
• Implemented efficiently by expanding the feature maps
• Results of FickleNet on both weakly supervised and semi-supervised
segmentation are better than those of other state-of-the-art methods

The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Recently uploaded (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stochastic Inference

  • 1. FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using Stochastic Inference
      • Hwang seung hyun, Yonsei University Severance Hospital CCIDS
      • SNU, Korea | CVPR 2019
      • 2020.03.22
  • 2. Contents
      • 01 Introduction
      • 02 Related Work
      • 03 Methods and Experiments
      • 04 Conclusion
  • 3. Introduction – Limitation of Prior Works
      • Real-world semantic segmentation must cover a large variety of object classes and therefore needs large amounts of labeled data
      • Current weakly supervised segmentation methods give inferior results to fully supervised segmentation
      • The main obstacle to weakly supervised semantic image segmentation is obtaining pixel-level information (locations or boundaries)
      • Most weakly supervised segmentation methods depend on localization maps obtained from a classification network
      • These localization maps focus only on the small discriminative parts of objects, which makes boundaries hard to locate
  • 4. Introduction – FickleNet
      • Generates a variety of localization maps from a single image using random combinations of hidden units in a CNN
      • Chooses hidden units at random for each sliding window position (similar to the dropout technique)
      • Random selection of hidden units (a stochastic approach) produces regions of different shapes
      • Many existing studies use stochastic regularization (e.g. dropout) during training, but not in the inference phase
  • 5. Introduction – FickleNet
      • [Figure: hidden unit selection producing multiple localization maps]
  • 6. Introduction – Contributions
      • FickleNet discovers the relationship between locations in an image and enlarges the regions activated by the classifier
      • Introduces a feature map expansion method that makes the model run faster at only a small cost in GPU memory
      • FickleNet achieved SOTA performance on the PASCAL VOC 2012 benchmark in both weakly and semi-supervised settings
  • 7. Related Work – Image Level Processing
      • The class activation map (CAM) is a good starting point for classifying pixels from image-level annotations
      • CAM discovers the contribution of each hidden unit in the network, but it tends to focus on the small discriminative region of a target
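As a rough illustration of the CAM idea on slide 7, the sketch below computes a class activation map as a weighted sum of the last convolutional feature maps, using the classifier weights of one class. The function name and the nested-list data layout are assumptions made for this example; they are not taken from the slides.

```python
def class_activation_map(features, class_weights):
    """Weighted sum of K feature maps (each h x w) for a single class.

    features: list of K maps, each a list of h rows with w floats.
    class_weights: list of K classifier weights for the target class.
    Both names and the list-based layout are hypothetical.
    """
    h, w = len(features[0]), len(features[0][0])
    return [
        [sum(wk * f[i][j] for wk, f in zip(class_weights, features))
         for j in range(w)]
        for i in range(h)
    ]
```

Locations where highly weighted channels fire strongly dominate the map, which is one way to see why CAM concentrates on the most discriminative part of an object.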
  • 8. Related Work – Feature Level Processing
      • Multi-dilated convolution (MDC) uses several convolutional blocks, dilated at different rates, and aggregates the CAMs obtained from each block, which resembles ensemble learning
      • The set of dilation rates is limited
      • A standard dilated convolution has a square receptive field of fixed size, so MDC tends to identify false positive regions
  • 9. Related Work – Region Growing
      • SRG (Seeded Region Growing)
  • 10. Related Work – Region Growing
      • DSRG (Deep Seeded Region Growing)
      • Seeds for region growing are obtained from CAM
      • VGG is used as the classification network
      • DeepLab-ASPP is used as the segmentation network
      • Seeds only come from discriminative parts of objects, so it is difficult to grow them into non-discriminative parts
  • 11. Methods and Experiments – Stochastic Hidden Unit Selection
      • Randomly select hidden units to associate a non-discriminative part of an object with a discriminative part of the same object
  • 12. Methods and Experiments – Stochastic Hidden Unit Selection: Feature Map Expansion
      • Apply spatial dropout to the feature map X at each sliding window position
      • This differs from the standard dropout technique, which samples hidden units in the feature maps only once
      • Selecting hidden units this way can generate receptive fields of many different shapes and sizes
      • Calling the convolution and dropout functions w × h times in each forward pass is very inefficient
      • Therefore, expand the feature maps so that no sliding window positions overlap
  • 13. Methods and Experiments – Stochastic Hidden Unit Selection: Center-Preserving Spatial Dropout
      • Do not drop the center of the kernel at each sliding window position
      • This way, relationships between the kernel center and the other locations in each stride can be found
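The expansion and center-preserving dropout described on slides 12 and 13 can be sketched on a single-channel map as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes an odd kernel size `k`, zero padding, stride 1, and no rescaling of the kept activations, and all function names are hypothetical.

```python
import random

def expand_feature_map(x, k):
    """Copy every k x k window of a zero-padded, stride-1 convolution
    into its own block, so that windows no longer overlap.
    Input is h x w; output is (k*h) x (k*w)."""
    h, w = len(x), len(x[0])
    pad = k // 2
    out = [[0.0] * (k * w) for _ in range(k * h)]
    for i in range(h):
        for j in range(w):
            for di in range(k):
                for dj in range(k):
                    si, sj = i + di - pad, j + dj - pad
                    if 0 <= si < h and 0 <= sj < w:
                        out[i * k + di][j * k + dj] = x[si][sj]
    return out

def center_preserving_dropout(expanded, k, p, rng):
    """Zero each entry with probability p, but never the window centers.
    Assumption: kept values are not rescaled by 1 / (1 - p)."""
    c = k // 2
    return [
        [v if (r % k == c and col % k == c) or rng.random() >= p else 0.0
         for col, v in enumerate(row)]
        for r, row in enumerate(expanded)
    ]

def strided_conv(expanded, kernel):
    """Convolution with stride k over the expanded map: one output value
    per non-overlapping block, i.e. per original window position."""
    k = len(kernel)
    h, w = len(expanded) // k, len(expanded[0]) // k
    return [
        [sum(expanded[i * k + di][j * k + dj] * kernel[di][dj]
             for di in range(k) for dj in range(k))
         for j in range(w)]
        for i in range(h)
    ]
```

Because every window owns its own copy of the activations, a single dropout call followed by a single strided convolution gives each window position an independent random mask, which replaces the w × h separate calls mentioned on slide 12.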
  • 14. Methods and Experiments – Inference Localization Map
      • Use gradient-based CAM (Grad-CAM), which is a generalization of the class activation map (CAM)
      • Grad-CAM discovers the class-specific contribution of each hidden unit to the classification score from the gradient flow
      • Apply global average pooling (GAP) and a sigmoid function to the final output feature map to obtain the classification score
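The scoring step on slide 14 (GAP followed by a sigmoid) is small enough to write out. The sketch assumes the final output has one h × w map per class, which the slide does not state explicitly; the function name is hypothetical.

```python
import math

def classification_scores(feature_maps):
    """Per-class score: sigmoid of the global average of that class's map.

    feature_maps: list with one h x w map (list of rows) per class.
    Assumption: the final layer already has one channel per class.
    """
    scores = []
    for fmap in feature_maps:
        values = [v for row in fmap for v in row]
        gap = sum(values) / len(values)          # global average pooling
        scores.append(1.0 / (1.0 + math.exp(-gap)))  # sigmoid
    return scores
```

A sigmoid rather than a softmax fits the multi-label setting, since one image can contain several object classes at once.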
  • 15. Methods and Experiments – Inference Localization Map: Aggregate Localization Map
      • FickleNet constructs N different localization maps from a single image and aggregates them into a single localization map
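The slide does not spell out the aggregation rule, so the sketch below uses one plausible rule as an assumption: a pixel becomes a candidate for a class if that class's score exceeds a threshold in at least one of the N maps, ties between candidate classes are broken by the highest mean score over all maps, and pixels with no candidate stay unlabeled (`None`). The function name and data layout are hypothetical.

```python
def aggregate_localization_maps(maps, threshold):
    """maps[n][c][i][j]: score of class c at pixel (i, j) in the n-th run.
    Returns an h x w grid of class indices, or None where no class fires.
    The thresholding and tie-breaking rule is an assumption."""
    n_runs, n_classes = len(maps), len(maps[0])
    h, w = len(maps[0][0]), len(maps[0][0][0])
    labels = [[None] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            best_class, best_mean = None, float("-inf")
            for c in range(n_classes):
                scores = [maps[n][c][i][j] for n in range(n_runs)]
                if max(scores) > threshold:
                    mean = sum(scores) / n_runs
                    if mean > best_mean:
                        best_class, best_mean = c, mean
            labels[i][j] = best_class
    return labels
```

Taking the union over runs is what lets regions that are activated only under some random selections of hidden units still enter the final map.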
  • 16. Methods and Experiments – Inference Localization Map: Training Process
      • The localization map provides pseudo-labels to train a semantic image segmentation network
      • Use the same background cues as DSRG
      • Using the aggregated map as a seed, apply a region growing method based on the probabilities obtained from the segmentation network
      • [Figure labels: Segmentation Network, Aggregated Map]
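A simplified view of seeded region growing, as used on slide 16, is sketched below. It is an illustration only: the 8-connected neighborhood, the single shared threshold, and the function name are assumptions for this example, and DSRG's actual procedure may differ in detail.

```python
from collections import deque

def grow_regions(seeds, probs, threshold):
    """Grow labeled seeds into unlabeled neighbors.

    seeds: h x w grid of class indices, with None for unlabeled pixels.
    probs[c][i][j]: segmentation network probability of class c at (i, j).
    An unlabeled neighbor joins a region when its probability for that
    region's class exceeds the threshold (assumed rule).
    """
    h, w = len(seeds), len(seeds[0])
    labels = [row[:] for row in seeds]
    queue = deque((i, j) for i in range(h) for j in range(w)
                  if labels[i][j] is not None)
    while queue:
        i, j = queue.popleft()
        c = labels[i][j]
        for di in (-1, 0, 1):
            for dj in (-1, 0, 1):
                ni, nj = i + di, j + dj
                if (di or dj) and 0 <= ni < h and 0 <= nj < w:
                    if labels[ni][nj] is None and probs[c][ni][nj] > threshold:
                        labels[ni][nj] = c
                        queue.append((ni, nj))
    return labels
```

Growth stops at pixels where the network is not confident, so a seed cannot jump over a low-probability gap to reach a disconnected region.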
  • 17. Methods and Experiments – FickleNet: Experimental Setup
      • Dataset: PASCAL VOC 2012 image segmentation (21 classes, i.e. 20 object classes plus background; 10,582 training images with image-level annotation)
      • Based on the VGG-16 network pre-trained on ImageNet (modified by removing all fc layers and the last pooling layer)
      • Segmentation is performed by DSRG, based on DeepLab-CRF
      • The number of different localization maps is set to 200
  • 18. Methods and Experiments – FickleNet: Weakly Supervised Semantic Segmentation
  • 19. Methods and Experiments – FickleNet: Weakly Supervised Semantic Segmentation with ResNet
  • 20. Methods and Experiments – FickleNet: Semi-Supervised Semantic Segmentation with ResNet
  • 21. Methods and Experiments – FickleNet: Semi and Weakly Supervised Semantic Segmentation
  • 22. Methods and Experiments – Ablation Study: 1. Effects of the Map Expansion Technique
      • Training and CAM extraction times are reduced by factors of 15.4 and 14.2, respectively, at a cost of 12% in GPU memory use
  • 23. Methods and Experiments – Ablation Study: 2. Iterative Inference and Dropout Rate
      • Additional random selections identify more regions of a target object
      • The segmentation performance converges as N increases
      • A dropout rate of 0.9 allows FickleNet to cover larger regions of the target object than DSRG
      • More randomness leads to more non-discriminative parts being covered
  • 24. Methods and Experiments – Ablation Study: 3. Comparison to General Dropout
      • A hidden unit in FickleNet may be activated at some window positions and dropped at others, so every hidden unit is able to affect the classification score
  • 25. Conclusion
      • Addressed the problem of semantic image segmentation using only image-level annotations
      • Obtains many different localization maps and aggregates them into a single localization map
      • Implemented efficiently by expanding the feature maps
      • FickleNet's results on both weakly supervised and semi-supervised segmentation are better than those of other SOTA methods