Instance Segmentation - Míriam Bellver - UPC Barcelona 2018

•

1 gefällt mir•774 views

https://telecombcn-dl.github.io/2018-dlcv/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or image captioning.

Daten & Analysen

Míriam Bellver
miriam.bellver@bsc.edu
PhD Candidate
Barcelona Supercomputing Center
Instance Segmentation
Day 2 Lecture 4
#DLUPC
http://bit.ly/dlcv2018

Semantic Segmentation
Label every pixel!
Don’t differentiate
instances (cows)
Classic computer
vision problem
Slide Credit: CS231n 2

Instance Segmentation
Detect instances,
give category, label
pixels
“simultaneous
detection and
segmentation” (SDS)
Label are
class-aware and
instance-aware
Slide Credit: CS231n 3

Outline
Instance Segmentation Methods
● Proposal-Based
● Recurrent
● Instance Embedding
4

Proposal-based
Slide Credit: CS231nHariharan et al. Simultaneous Detection and Segmentation. ECCV 2014
External
Segment
proposals
Mask out background
with mean image
Similar to R-CNN, but with segment proposals
5

He et al. Mask R-CNN. ICCV 2017
Proposal-based Instance Segmentation: Mask R-CNN
Faster R-CNN for Pixel Level Segmentation as a parallel prediction of masks
and class labels
6

He et al. Mask R-CNN. ICCV 2017
Mask R-CNN
● Classification & box detection losses are identical to those in Faster R-CNN
● Addition of a new loss term for mask prediction:
The network outputs a K x m x m volume for mask prediction, where K is the
number of categories and m is the size of the mask (square)
7

Mask R-CNN
He et al. Mask R-CNN. ICCV 2017

He et al. Mask R-CNN. ICCV 2017
Mask R-CNN: RoI Align
Reminder: RoI Pool from Fast R-CNN
Hi-res input image:
3 x 800 x 600
with region
proposal
Convolution
and Pooling
Hi-res conv features:
C x H x W
with region proposal
Fully-connected
layers
Max-pool within
each grid cell
RoI conv features:
C x h x w
for region proposal
Fully-connected layers expect
low-res conv features:
C x h x w
x/16 & rounding → misalignment ! + not differentiable
9

Mask R-CNN: RoI Align
10
Roi Pooling example
Figure Credit: Deepsense.aiHe et al. Mask R-CNN. ICCV 2017

Figure Credit: Silvio Galesso
Mask R-CNN: RoI Align
He et al. Mask R-CNN. ICCV 2017
use bilinear
interpolation

Limitations of Proposal-based models
13
1. Two objects might share the same bounding box: Only
one will be kept after NMS step.
2. Choice of NMS threshold is application dependant
3. Same pixel can be assigned to multiple instances
4. Number of predictions is limited by the number of
proposals.

Recurrent Instance Segmentation
Romera-Paredes & H.S. Torr. Recurrent Instance Segmentation ECCV 2016
14
Sequential mask generation

Recurrent Instance Segmentation
Romera-Paredes & H.S. Torr. Recurrent Instance Segmentation ECCV 2016 15

Recurrent Semantic Instance Segmentation
Salvador, A., Bellver, Campos. V, M., Baradad, M., Marqués, F., Torres, J., & Giro-i-Nieto, X. (2018) From
Pixels to Object Sequences: Recurrent Semantic Instance Segmentation.

Recurrent Semantic Instance Segmentation
Pascal VOC
Cityscapes
CVPPP
Salvador, A., Bellver, Campos. V, M., Baradad, M., Marqués, F., Torres, J., & Giro-i-Nieto, X. (2018) From
Pixels to Object Sequences: Recurrent Semantic Instance Segmentation.

Instance Embedding
Basis of Instance Embedding Segmentation
● Each pixel in the output of the network is a point in the embedding space
● Pixel belonging to same object are close in the embedding space, and pixels belonging to different
objects, are distant
● Parsing the image embeddings involves some clustering
embedding dimension K

Instance Embedding
Brabandere et al. Semantic Instance Segmentation with a Discriminative Loss Function. CVPRW 2017

Instance Embedding
22
Mapping pixels to a N-dimensional space where pixels belonging to the same object are close to each other.
Results on Cityscapes
Brabandere et al. Semantic Instance Segmentation with a Discriminative Loss Function. CVPRW 2017

Outline
Instance Segmentation Methods
● Proposal-Based
● Recurrent
● Instance Embedding
23

Weitere ähnliche Inhalte

Was ist angesagt?

Object RecognitionEman Abed AlWahhab

Faster R-CNN: Towards real-time object detection with region proposal network...Universitat Politècnica de Catalunya

Object Detection using Deep Neural NetworksUsman Qayyum

Deep learning for object detectionWenjing Chen

Mask R-CNNChanuk Lim

Introduction to Object recognitionAshiq Ullah

Deep learning based object detection basicsBrodmann17

Deep Learning for Computer Vision: Object Detection (UPC 2016)Universitat Politècnica de Catalunya

ViT (Vision Transformer) Review [CDM]Dongmin Choi

Image feature extractionRushin Shah

Object Detection & TrackingAkshay Gujarathi

Pr057 mask rcnnTaeoh Kim

04 image enhancement edge detectionRumah Belajar

AlexNetBertil Hatt

Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI

Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華杜

Convolutional Neural Networks (CNN)Gaurav Mittal

Object detection with deep learningSushant Shrivastava

Computer vision and roboticsBiniam Asnake

SSD: Single Shot MultiBox Detector (UPC Reading Group)Universitat Politècnica de Catalunya

Was ist angesagt? (20)

Object Recognition

Faster R-CNN: Towards real-time object detection with region proposal network...

Object Detection using Deep Neural Networks

Deep learning for object detection

Mask R-CNN

Introduction to Object recognition

Deep learning based object detection basics

Deep Learning for Computer Vision: Object Detection (UPC 2016)

ViT (Vision Transformer) Review [CDM]

Image feature extraction

Object Detection & Tracking

Pr057 mask rcnn

04 image enhancement edge detection

AlexNet

Semantic Segmentation on Satellite Imagery

Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation

Convolutional Neural Networks (CNN)

Object detection with deep learning

Computer vision and robotics

SSD: Single Shot MultiBox Detector (UPC Reading Group)

Ähnlich wie Instance Segmentation - Míriam Bellver - UPC Barcelona 2018

Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

2019 cvpr paper_overviewLEE HOSEONG

2019 cvpr paper overview by Ho Seong LeeMoazzem Hossain

Reconciling Event-Based Knowledge through RDF2VECMehwish Alam

Large Scale Image Retrieval 2022.pdfSamuCerezo

[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...NAVER Engineering

Semantic segmentation with Convolutional Neural Network ApproachesFellowship at Vodafone FutureLab

IRJET - Object Detection using Deep Learning with OpenCV and PythonIRJET Journal

Object Detection with TransformersDatabricks

D3L4-objects.pdfssusere945ae

Recent Progress on Object Detection_20170331Jihong Kang

ppt icitisee 2022_without_recording.pptxssusera4da91

最近の研究情勢についていくために - Deep Learningを中心に - Hiroshi Fukui

TMS workshop on machine learning in materials science: Intro to deep learning...BrianDeCost

教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...cvpaper. challenge

Classification accuracy of sar images for various landeSAT Publishing House

reducing firearms based on deep learningkonkalasilpa1319

Fn2611681170IJERA Editor

Convolutional Features for Instance SearchUniversitat Politècnica de Catalunya

Dj31514517IJMER

Ähnlich wie Instance Segmentation - Míriam Bellver - UPC Barcelona 2018 (20)

Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)

2019 cvpr paper_overview

2019 cvpr paper overview by Ho Seong Lee

Reconciling Event-Based Knowledge through RDF2VEC

Large Scale Image Retrieval 2022.pdf

[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...

Semantic segmentation with Convolutional Neural Network Approaches

IRJET - Object Detection using Deep Learning with OpenCV and Python

Object Detection with Transformers

D3L4-objects.pdf

Recent Progress on Object Detection_20170331

ppt icitisee 2022_without_recording.pptx

最近の研究情勢についていくために - Deep Learningを中心に -

TMS workshop on machine learning in materials science: Intro to deep learning...

教師なし画像特徴表現学習の動向 {Un, Self} supervised representation learning (CVPR 2018 完全読破...

Classification accuracy of sar images for various land

reducing firearms based on deep learning

Fn2611681170

Convolutional Features for Instance Search

Dj31514517

Mehr von Universitat Politècnica de Catalunya

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Universitat Politècnica de Catalunya

Deep Generative Learning for AllUniversitat Politècnica de Catalunya

The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...Universitat Politècnica de Catalunya

Towards Sign Language Translation & Production | Xavier Giro-i-NietoUniversitat Politècnica de Catalunya

The Transformer - Xavier Giró - UPC Barcelona 2021Universitat Politècnica de Catalunya

Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Universitat Politècnica de Catalunya

Open challenges in sign language translation and productionUniversitat Politècnica de Catalunya

Generation of Synthetic Referring Expressions for Object Segmentation in VideosUniversitat Politècnica de Catalunya

Discovery and Learning of Navigation Goals from Pixels in MinecraftUniversitat Politècnica de Catalunya

Learn2Sign : Sign language recognition and translation using human keypoint e...Universitat Politècnica de Catalunya

Intepretability / Explainable AI for Deep Neural NetworksUniversitat Politècnica de Catalunya

Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya

Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Universitat Politècnica de Catalunya

Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya

Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Universitat Politècnica de Catalunya

Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Universitat Politècnica de Catalunya

Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Universitat Politècnica de Catalunya

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Universitat Politècnica de Catalunya

Curriculum Learning for Recurrent Video Object SegmentationUniversitat Politècnica de Catalunya

Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020Universitat Politècnica de Catalunya

Mehr von Universitat Politècnica de Catalunya (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)

Deep Generative Learning for All

The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...

Towards Sign Language Translation & Production | Xavier Giro-i-Nieto

The Transformer - Xavier Giró - UPC Barcelona 2021

Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...

Open challenges in sign language translation and production

Generation of Synthetic Referring Expressions for Object Segmentation in Videos

Discovery and Learning of Navigation Goals from Pixels in Minecraft

Learn2Sign : Sign language recognition and translation using human keypoint e...

Intepretability / Explainable AI for Deep Neural Networks

Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020

Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...

Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020

Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...

Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020

Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)

Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...

Curriculum Learning for Recurrent Video Object Segmentation

Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020

Kürzlich hochgeladen

Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Riyadh +966572737505 get cytotec

April 2024 - Crypto Market Report's Analysismanisha194592

BabyOno dropshipping via API with DroFx.pptxolyaivanovalion

Smarteg dropshipping via API with DroFx.pptxolyaivanovalion

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823

CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823

Predicting Loan Approval: A Data Science ProjectBoston Institute of Analytics

Ravak dropshipping via API with DroFx.pptxolyaivanovalion

VidaXL dropshipping via API with DroFx.pptxolyaivanovalion

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823

ELKO dropshipping via API with DroFx.pptxolyaivanovalion

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823

Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7Call Girls in Nagpur High Profile Call Girls

Probability Grade 10 Third Quarter LessonsJoseMangaJr1

Anomaly detection and data imputation within time seriesParis Women in Machine Learning and Data Science

Kürzlich hochgeladen (20)

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

April 2024 - Crypto Market Report's Analysis

BabyOno dropshipping via API with DroFx.pptx

Smarteg dropshipping via API with DroFx.pptx

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...

CebaBaby dropshipping via API with DroFX.pptx

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

Predicting Loan Approval: A Data Science Project

Ravak dropshipping via API with DroFx.pptx

VidaXL dropshipping via API with DroFx.pptx

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...

ELKO dropshipping via API with DroFx.pptx

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Generative AI on Enterprise Cloud with NiFi and Milvus

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

Probability Grade 10 Third Quarter Lessons

Anomaly detection and data imputation within time series

Instance Segmentation - Míriam Bellver - UPC Barcelona 2018

1. Míriam Bellver miriam.bellver@bsc.edu PhD Candidate Barcelona Supercomputing Center Instance Segmentation Day 2 Lecture 4 #DLUPC http://bit.ly/dlcv2018

2. Semantic Segmentation Label every pixel! Don’t differentiate instances (cows) Classic computer vision problem Slide Credit: CS231n 2

3. Instance Segmentation Detect instances, give category, label pixels “simultaneous detection and segmentation” (SDS) Label are class-aware and instance-aware Slide Credit: CS231n 3

4. Outline Instance Segmentation Methods ● Proposal-Based ● Recurrent ● Instance Embedding 4

5. Proposal-based Slide Credit: CS231nHariharan et al. Simultaneous Detection and Segmentation. ECCV 2014 External Segment proposals Mask out background with mean image Similar to R-CNN, but with segment proposals 5

6. He et al. Mask R-CNN. ICCV 2017 Proposal-based Instance Segmentation: Mask R-CNN Faster R-CNN for Pixel Level Segmentation as a parallel prediction of masks and class labels 6

7. He et al. Mask R-CNN. ICCV 2017 Mask R-CNN ● Classification & box detection losses are identical to those in Faster R-CNN ● Addition of a new loss term for mask prediction: The network outputs a K x m x m volume for mask prediction, where K is the number of categories and m is the size of the mask (square) 7

8. Mask R-CNN He et al. Mask R-CNN. ICCV 2017

9. He et al. Mask R-CNN. ICCV 2017 Mask R-CNN: RoI Align Reminder: RoI Pool from Fast R-CNN Hi-res input image: 3 x 800 x 600 with region proposal Convolution and Pooling Hi-res conv features: C x H x W with region proposal Fully-connected layers Max-pool within each grid cell RoI conv features: C x h x w for region proposal Fully-connected layers expect low-res conv features: C x h x w x/16 & rounding → misalignment ! + not differentiable 9

10. Mask R-CNN: RoI Align 10 Roi Pooling example Figure Credit: Deepsense.aiHe et al. Mask R-CNN. ICCV 2017

11. Figure Credit: Silvio Galesso Mask R-CNN: RoI Align He et al. Mask R-CNN. ICCV 2017 use bilinear interpolation

12. 12

13. Limitations of Proposal-based models 13 1. Two objects might share the same bounding box: Only one will be kept after NMS step. 2. Choice of NMS threshold is application dependant 3. Same pixel can be assigned to multiple instances 4. Number of predictions is limited by the number of proposals.

14. Recurrent Instance Segmentation Romera-Paredes & H.S. Torr. Recurrent Instance Segmentation ECCV 2016 14 Sequential mask generation

15. Recurrent Instance Segmentation Romera-Paredes & H.S. Torr. Recurrent Instance Segmentation ECCV 2016 15

16. Recurrent Semantic Instance Segmentation Salvador, A., Bellver, Campos. V, M., Baradad, M., Marqués, F., Torres, J., & Giro-i-Nieto, X. (2018) From Pixels to Object Sequences: Recurrent Semantic Instance Segmentation.

17. Recurrent Semantic Instance Segmentation Salvador, A., Bellver, Campos. V, M., Baradad, M., Marqués, F., Torres, J., & Giro-i-Nieto, X. (2018) From Pixels to Object Sequences: Recurrent Semantic Instance Segmentation.

18. Recurrent Semantic Instance Segmentation Pascal VOC Cityscapes CVPPP Salvador, A., Bellver, Campos. V, M., Baradad, M., Marqués, F., Torres, J., & Giro-i-Nieto, X. (2018) From Pixels to Object Sequences: Recurrent Semantic Instance Segmentation.

19. Recurrent Semantic Instance Segmentation Salvador, A., Bellver, Campos. V, M., Baradad, M., Marqués, F., Torres, J., & Giro-i-Nieto, X. (2018) From Pixels to Object Sequences: Recurrent Semantic Instance Segmentation. Object discovery patterns

20. Instance Embedding Basis of Instance Embedding Segmentation ● Each pixel in the output of the network is a point in the embedding space ● Pixel belonging to same object are close in the embedding space, and pixels belonging to different objects, are distant ● Parsing the image embeddings involves some clustering embedding dimension K

21. Instance Embedding Brabandere et al. Semantic Instance Segmentation with a Discriminative Loss Function. CVPRW 2017

22. Instance Embedding 22 Mapping pixels to a N-dimensional space where pixels belonging to the same object are close to each other. Results on Cityscapes Brabandere et al. Semantic Instance Segmentation with a Discriminative Loss Function. CVPRW 2017

23. Outline Instance Segmentation Methods ● Proposal-Based ● Recurrent ● Instance Embedding 23

24. Questions? 24

Instance Segmentation - Míriam Bellver - UPC Barcelona 2018

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Ähnlich wie Instance Segmentation - Míriam Bellver - UPC Barcelona 2018

Ähnlich wie Instance Segmentation - Míriam Bellver - UPC Barcelona 2018 (20)

Mehr von Universitat Politècnica de Catalunya

Mehr von Universitat Politècnica de Catalunya (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Instance Segmentation - Míriam Bellver - UPC Barcelona 2018