SlideShare ist ein Scribd-Unternehmen logo
1 von 59
Multiclass object detection
Multiclass object detection
Context: objects appear in configurations
Generalization: objects share parts
How many categories?
“Muchas” Slide by Aude Oliva
How many object categories are there? ~10,000 to 30,000 Biederman 1987
How many categories? Probably this question is not even specific enough to have an answer
? Which level of categorization is the right one? Car is an object composed of: 	a few doors, four wheels (not all visible at all times), a roof,  	front lights, windshield  If you are thinking in buying a car, you might want to be a bit more specific about your categorization level.
A bird An ostrich Entry-level categories(Jolicoeur, Gluck, Kosslyn 1984) Typical member of a basic-level category are categorized at the expected level Atypical members tend to be classified at a subordinate level.
We do not need to recognize the exact category A new class can borrow information from similar categories
So, where is computer vision? Well…
Multiclass object detectionthe not so early days
[object Object],- two cascades for each view  (a) One detector for each class Multiclass object detectionthe not so early days Using a set of independent binary classifiers was a common strategy: ,[object Object],There is nothing wrong with this approach if you have access to lots of training data and you do not care about efficiency.
Generalizing Across Categories Can we transfer knowledge from one object category to another? Slide by Erik Sudderth
Shared features Is learning the object class 1000 easier than learning the first? Can we transfer knowledge from one object to another? Are the shared properties interesting by themselves?  …
Multitask learning R. Caruana. Multitask Learning. ML 1997 “MTL improves generalization by leveraging the domain-specific information contained in the training signals of related tasks. It does this by training tasks in parallel while using a shared representation”. vs. Sejnowski & Rosenberg 1986; Hinton 1986; Le Cun et al. 1989; Suddarth & Kergosien 1990; Pratt et al. 1991; Sharkey & Sharkey 1992; …
Multitask learning R. Caruana. Multitask Learning. ML 1997 Primary task: detect door knobs Tasks used: ,[object Object]
width of left door jamb
width of right door jamb
horizontal location of left edge of door
horizontal location of right edge of door
horizontal location of doorknob
single or double door
horizontal location of doorway center
width of doorway
horizontal location of left door jamb,[object Object]
Convolutional Neural Network Le Cun et al, 98 Translation invariance is already built into the network The output neurons share all the intermediate levels
Sharing transformations Miller, E., Matsakis, N., and Viola, P. (2000). Learning from one example through shared densities on transforms. In IEEE Computer Vision and Pattern Recognition. Transformations are shared and can be learnt from other tasks.
Sharing in constellation models Pictorial StructuresFischler & Elschlager, IEEE Trans. Comp. 1973 SVM DetectorsHeisele, Poggio, et. al., NIPS 2001 Constellation ModelFei-Fei, Fergus, Perona, ICCV 2003  Model-Guided SegmentationMori, Ren, Efros, & Malik, CVPR 2004
Reusable Parts Krempp, Geman, & Amit “Sequential Learning of Reusable Parts for Object Detection”. TR 2002 Goal: Look for a vocabulary of edges that reduces the number of features. Examples of reused parts Number of features Number of classes
Specific feature pedestrian chair Traffic light sign face Background class Non-shared feature: this feature is too specific to faces.
Shared feature shared feature
[object Object],Screen detector Car detector Face detector ,[object Object],Screen detector Car detector Face detector Additive models and boosting Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
50 training samples/class 29 object classes 2000 entries in the dictionary Results averaged on 20 runs Error bars = 80% interval Class-specific features Shared features Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
12 viewpoints 12 unrelated object classes Generalization as a function of object similarities K = 2.1 K = 4.8 Area under ROC Area under ROC Number of training samples per class Number of training samples per class Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
J. Shotton, A. Blake, R. Cipolla.Multi-Scale Categorical Object Recognition Using Contour Fragments. In IEEETrans. on PAMI, 30(7):1270-1281, July 2008. Efficiency Generalization Opelt, Pinz, Zisserman, CVPR 2006
Sharing patches Bart and Ullman, 2004 For a new class, use only features similar to features that where good for other classes: Proposed Dog  features
Some more references Baxter 1996 Caruana 1997 Schapire, Singer, 2000 Thrun, Pratt 1997 Krempp, Geman, Amit, 2002 E.L.Miller, Matsakis, Viola, 2000 Mahamud, Hebert, Lafferty, 2001 Fink et al. 2003, 2004 LeCun, Huang, Bottou, 2004 Holub, Welling, Perona, 2005 …
Modeling object relationships
The “guess what I am trying to detect” challenge The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
What object is detector trying to detect? The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
1. chair, 2. table, 3. road, 4. road, 5. table, 6. car, 7. keyboard.
The context challenge How far can you go without using an object detector?
1 2 What are the hidden objects?
What are the hidden objects? Chance ~ 1/30000
objects image p(O | I) ap(I|O) p(O) Object model Context model
p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Aprox. joint
p(O | I) ap(I|O) p(O) Object model Context model Full joint Approx. joint Scene model
p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint p(O) = S Pp(Oi|S=s) p(S=s) i s office street
p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint
Pixel labeling using MRFs Enforce consistency between neighboring labels, and between labels and pixels Oi Carbonetto, de Freitas & Barnard, ECCV’04
Beyond nearest-neighbor grids Most MRF/CRF models assume nearest-neighbor graph topology This cannot capture long-distance correlations
Object-Object Relationships Use latent variables to induce long distance correlations between labels in a Conditional Random Field (CRF) He, Zemel & Carreira-Perpinan (04)
Object-Object Relationships [Kumar Hebert 2005]
Fink & Perona (NIPS 03) Use output of boosting from other objects at previous iterations as input into boosting for this iteration Object-Object Relationships
Objects in Context Building Building, boat, person Road Road Building, boat, motorbike Boat Water, sky Water Most consistent labeling according to object co-occurrences& locallabel probabilities. A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora and S. Belongie. Objects in Context. ICCV 2007

Weitere ähnliche Inhalte

Ähnlich wie Iccv2009 recognition and learning object categories p2 c02 - recognizing muliple objects in an image - sharing and context

Cvpr2007 object category recognition p3 - discriminative models
Cvpr2007 object category recognition   p3 - discriminative modelsCvpr2007 object category recognition   p3 - discriminative models
Cvpr2007 object category recognition p3 - discriminative modelszukun
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2zukun
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2zukun
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11zukun
 
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...L. Thorne McCarty
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1zukun
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligenceMayank Saxena
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"ieee_cis_cyprus
 
A Featherweight Approach to FOOL
A Featherweight Approach to FOOLA Featherweight Approach to FOOL
A Featherweight Approach to FOOLgreenwop
 
Iccv2009 recognition and learning object categories p0 c00 - introduction
Iccv2009 recognition and learning object categories   p0 c00 - introductionIccv2009 recognition and learning object categories   p0 c00 - introduction
Iccv2009 recognition and learning object categories p0 c00 - introductionzukun
 
Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based modelszukun
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008Jason Morris
 
Using binary classifiers
Using binary classifiersUsing binary classifiers
Using binary classifiersbutest
 
David Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AIDavid Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AIBayes Nets meetup London
 
Natural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsNatural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsMark Billinghurst
 

Ähnlich wie Iccv2009 recognition and learning object categories p2 c02 - recognizing muliple objects in an image - sharing and context (20)

Cvpr2007 object category recognition p3 - discriminative models
Cvpr2007 object category recognition   p3 - discriminative modelsCvpr2007 object category recognition   p3 - discriminative models
Cvpr2007 object category recognition p3 - discriminative models
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2
 
Tangible Interfaces
Tangible InterfacesTangible Interfaces
Tangible Interfaces
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11
 
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1
 
artificial intelligence
artificial intelligenceartificial intelligence
artificial intelligence
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"
 
nnml.ppt
nnml.pptnnml.ppt
nnml.ppt
 
Fame cvpr
Fame cvprFame cvpr
Fame cvpr
 
A Featherweight Approach to FOOL
A Featherweight Approach to FOOLA Featherweight Approach to FOOL
A Featherweight Approach to FOOL
 
The Tower of Knowledge A Generic System Architecture
The Tower of Knowledge A Generic System ArchitectureThe Tower of Knowledge A Generic System Architecture
The Tower of Knowledge A Generic System Architecture
 
Iccv2009 recognition and learning object categories p0 c00 - introduction
Iccv2009 recognition and learning object categories   p0 c00 - introductionIccv2009 recognition and learning object categories   p0 c00 - introduction
Iccv2009 recognition and learning object categories p0 c00 - introduction
 
Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based models
 
The Role Of Ontology In Modern Expert Systems Dallas 2008
The Role Of Ontology In Modern Expert Systems   Dallas   2008The Role Of Ontology In Modern Expert Systems   Dallas   2008
The Role Of Ontology In Modern Expert Systems Dallas 2008
 
Using binary classifiers
Using binary classifiersUsing binary classifiers
Using binary classifiers
 
lecun-01.ppt
lecun-01.pptlecun-01.ppt
lecun-01.ppt
 
David Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AIDavid Barber - Deep Nets, Bayes and the story of AI
David Barber - Deep Nets, Bayes and the story of AI
 
Natural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality ApplicationsNatural Interaction for Augmented Reality Applications
Natural Interaction for Augmented Reality Applications
 

Mehr von zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Informationzukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 

Mehr von zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Kürzlich hochgeladen

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseAnaAcapella
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 

Kürzlich hochgeladen (20)

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 

Iccv2009 recognition and learning object categories p2 c02 - recognizing muliple objects in an image - sharing and context

  • 3.
  • 4.
  • 5. Context: objects appear in configurations
  • 9. How many object categories are there? ~10,000 to 30,000 Biederman 1987
  • 10. How many categories? Probably this question is not even specific enough to have an answer
  • 11. ? Which level of categorization is the right one? Car is an object composed of: a few doors, four wheels (not all visible at all times), a roof, front lights, windshield If you are thinking in buying a car, you might want to be a bit more specific about your categorization level.
  • 12. A bird An ostrich Entry-level categories(Jolicoeur, Gluck, Kosslyn 1984) Typical member of a basic-level category are categorized at the expected level Atypical members tend to be classified at a subordinate level.
  • 13. We do not need to recognize the exact category A new class can borrow information from similar categories
  • 14. So, where is computer vision? Well…
  • 15. Multiclass object detectionthe not so early days
  • 16.
  • 17. Generalizing Across Categories Can we transfer knowledge from one object category to another? Slide by Erik Sudderth
  • 18. Shared features Is learning the object class 1000 easier than learning the first? Can we transfer knowledge from one object to another? Are the shared properties interesting by themselves? …
  • 19. Multitask learning R. Caruana. Multitask Learning. ML 1997 “MTL improves generalization by leveraging the domain-specific information contained in the training signals of related tasks. It does this by training tasks in parallel while using a shared representation”. vs. Sejnowski & Rosenberg 1986; Hinton 1986; Le Cun et al. 1989; Suddarth & Kergosien 1990; Pratt et al. 1991; Sharkey & Sharkey 1992; …
  • 20.
  • 21. width of left door jamb
  • 22. width of right door jamb
  • 23. horizontal location of left edge of door
  • 24. horizontal location of right edge of door
  • 27. horizontal location of doorway center
  • 29.
  • 30. Convolutional Neural Network Le Cun et al, 98 Translation invariance is already built into the network The output neurons share all the intermediate levels
  • 31. Sharing transformations Miller, E., Matsakis, N., and Viola, P. (2000). Learning from one example through shared densities on transforms. In IEEE Computer Vision and Pattern Recognition. Transformations are shared and can be learnt from other tasks.
  • 32. Sharing in constellation models Pictorial StructuresFischler & Elschlager, IEEE Trans. Comp. 1973 SVM DetectorsHeisele, Poggio, et. al., NIPS 2001 Constellation ModelFei-Fei, Fergus, Perona, ICCV 2003 Model-Guided SegmentationMori, Ren, Efros, & Malik, CVPR 2004
  • 33. Reusable Parts Krempp, Geman, & Amit “Sequential Learning of Reusable Parts for Object Detection”. TR 2002 Goal: Look for a vocabulary of edges that reduces the number of features. Examples of reused parts Number of features Number of classes
  • 34. Specific feature pedestrian chair Traffic light sign face Background class Non-shared feature: this feature is too specific to faces.
  • 36.
  • 37. 50 training samples/class 29 object classes 2000 entries in the dictionary Results averaged on 20 runs Error bars = 80% interval Class-specific features Shared features Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
  • 38. 12 viewpoints 12 unrelated object classes Generalization as a function of object similarities K = 2.1 K = 4.8 Area under ROC Area under ROC Number of training samples per class Number of training samples per class Torralba, Murphy, Freeman. CVPR 2004. PAMI 2007
  • 39. J. Shotton, A. Blake, R. Cipolla.Multi-Scale Categorical Object Recognition Using Contour Fragments. In IEEETrans. on PAMI, 30(7):1270-1281, July 2008. Efficiency Generalization Opelt, Pinz, Zisserman, CVPR 2006
  • 40. Sharing patches Bart and Ullman, 2004 For a new class, use only features similar to features that where good for other classes: Proposed Dog features
  • 41. Some more references Baxter 1996 Caruana 1997 Schapire, Singer, 2000 Thrun, Pratt 1997 Krempp, Geman, Amit, 2002 E.L.Miller, Matsakis, Viola, 2000 Mahamud, Hebert, Lafferty, 2001 Fink et al. 2003, 2004 LeCun, Huang, Bottou, 2004 Holub, Welling, Perona, 2005 …
  • 43. The “guess what I am trying to detect” challenge The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
  • 44. What object is detector trying to detect? The detector challenge: by looking at the output of a detector on a random setof images, can you guess which object is it trying to detect?
  • 45. 1. chair, 2. table, 3. road, 4. road, 5. table, 6. car, 7. keyboard.
  • 46. The context challenge How far can you go without using an object detector?
  • 47. 1 2 What are the hidden objects?
  • 48. What are the hidden objects? Chance ~ 1/30000
  • 49. objects image p(O | I) ap(I|O) p(O) Object model Context model
  • 50. p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Aprox. joint
  • 51. p(O | I) ap(I|O) p(O) Object model Context model Full joint Approx. joint Scene model
  • 52. p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint p(O) = S Pp(Oi|S=s) p(S=s) i s office street
  • 53. p(O | I) ap(I|O) p(O) Object model Context model Full joint Scene model Approx. joint
  • 54. Pixel labeling using MRFs Enforce consistency between neighboring labels, and between labels and pixels Oi Carbonetto, de Freitas & Barnard, ECCV’04
  • 55. Beyond nearest-neighbor grids Most MRF/CRF models assume nearest-neighbor graph topology This cannot capture long-distance correlations
  • 56. Object-Object Relationships Use latent variables to induce long distance correlations between labels in a Conditional Random Field (CRF) He, Zemel & Carreira-Perpinan (04)
  • 58. Fink & Perona (NIPS 03) Use output of boosting from other objects at previous iterations as input into boosting for this iteration Object-Object Relationships
  • 59. Objects in Context Building Building, boat, person Road Road Building, boat, motorbike Boat Water, sky Water Most consistent labeling according to object co-occurrences& locallabel probabilities. A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora and S. Belongie. Objects in Context. ICCV 2007
  • 60. 52 Objects in Context:Contextual Refinement Building Road Boat Independent segment classification Water Mean interaction of all label pairs Φ(i,j) is basically the observed label co-occurrences in training set. Contextual model based on co-occurrences Try to find the most consistent labeling with high posterior probability and high mean pairwise interaction. Use CRF for this purpose. Slide by GokberkCinbis
  • 61. Detecting difficult objects Maybe there is a mouse Office Start recognizing the scene Torralba, Murphy, Freeman. NIPS 2004.
  • 62. Detecting difficult objects Detect first simple objects (reliable detectors) that provide strong contextual constraints to the target (screen -> keyboard -> mouse) Torralba, Murphy, Freeman. NIPS 2004.
  • 63. Detecting difficult objects Detect first simple objects (reliable detectors) that provide strong contextual constraints to the target (screen -> keyboard -> mouse) Torralba, Murphy, Freeman. NIPS 2004.
  • 64. BRF for car detection: topology Torralba Murphy Freeman (2004)
  • 65. BRF for car detection: results Torralba Murphy Freeman (2004)
  • 66. A “car” out of context is less of a car From image Thresholded beliefs From detectors Road Car Building b F G b F G b F G
  • 67. Contextual object relationships Carbonetto, de Freitas & Barnard (2004) Kumar, Hebert (2005) Torralba Murphy Freeman (2004) E. Sudderth et al (2005) Fink & Perona (2003)