SlideShare ist ein Scribd-Unternehmen logo
1 von 56
Classical Methods for Object Recognition  Rob Fergus (NYU)
Classical Methods Bag of words approaches Parts and structure approaches  Discriminative methods Condensed version of sections from  2007 edition of  tutorial
Bag of Words Models
Object Bag of ‘words’
Bag of Words Independent features  Histogram representation
1.Feature detectionand representation Compute descriptor  e.g. SIFT [Lowe’99] Normalize patch Detect patches [Mikojaczyk and Schmid ’02] [Mata, Chum, Urban & Pajdla, ’02]  [Sivic & Zisserman, ’03] Local interest operator or Regular grid Slide credit: Josef Sivic
… 1.Feature detectionand representation
… 2. Codewords dictionary formation 128-D SIFT space
… 2. Codewords dictionary formation Codewords + + + Vector quantization 128-D SIFT space Slide credit: Josef Sivic
Image patch examples of codewords Sivic et al. 2005
….. Image representation Histogram of features assigned to each cluster  frequency codewords
Uses of BoW representation Treat as feature vector for standard classifier e.g SVM Cluster BoW vectors over image collection Discover visual themes Hierarchical models  Decompose scene/object Scene
BoW as input to classifier SVM for object classification Csurka, Bray, Dance & Fan, 2004 Naïve Bayes See 2007 edition of this course
Clustering BoW vectors  Use models from text document literature Probabilistic latent semantic analysis (pLSA) Latent Dirichlet allocation (LDA) See 2007 edition for explanation/code d = image,   w = visual word,    z = topic (cluster)
Clustering BoW vectors Scene classification (supervised) Vogel & Schiele, 2004 Fei-Fei & Perona, 2005 Bosch, Zisserman & Munoz, 2006 Object discovery (unsupervised) Each cluster corresponds to visual theme Sivic, Russell, Efros, Freeman & Zisserman, 2005
Related work Early “bag of words” models: mostly texture recognition Cula & Dana, 2001; Leung & Malik 2001; Mori, Belongie & Malik, 2001; Schmid 2001; Varma & Zisserman, 2002, 2003; Lazebnik, Schmid & Ponce, 2003 Hierarchical Bayesian models for documents (pLSA, LDA, etc.) Hoffman 1999; Blei, Ng & Jordan, 2004; Teh, Jordan, Beal & Blei, 2004 Object categorization Csurka, Bray, Dance & Fan, 2004; Sivic, Russell, Efros, Freeman & Zisserman, 2005; Sudderth, Torralba, Freeman & Willsky, 2005; Natural scene categorization Vogel & Schiele, 2004; Fei-Fei & Perona, 2005; Bosch, Zisserman & Munoz, 2006
What about spatial info? ?
Adding spatial info. to BoW Feature level Spatial influence through correlogram features: Savarese, Winn and Criminisi, CVPR 2006
Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Hierarchical model of scene/objects/parts
P1 P2 P3 P4 w Image Bg Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Niebles & Fei-Fei, CVPR 2007
Adding spatial info. to BoW Feature level Generative models Discriminative methods Lazebnik, Schmid & Ponce, 2006
Part-based Models
Problem with bag-of-words All have equal probability for bag-of-words methods Location information is important BoW + location still doesn’t give correspondence
Model: Parts and Structure
Representation Object as set of parts Generative representation Model: Relative locations between parts Appearance of part Issues: How to model location How to represent appearance How to handle occlusion/clutter Figure from [Fischler & Elschlager 73]
History of Parts and Structure approaches ,[object Object]
Yuille ‘91
Brunelli & Poggio ‘93
Lades, v.d. Malsburg et al. ‘93
Cootes, Lanitis, Taylor et al. ‘95
Amit & Geman ‘95, ‘99
Perona et al. ‘95, ‘96, ’98, ’00, ’03, ‘04, ‘05
Felzenszwalb & Huttenlocher ’00, ’04
Crandall & Huttenlocher ’05, ’06
Leibe & Schiele ’03, ’04
Many papers since 2000,[object Object]
The correspondence problem Model with P parts Image with N possible assignments for each part Consider mapping to be 1-1 ,[object Object],[object Object]
Efficient methods ,[object Object]
Felzenszwalb and Huttenlocher ‘00 and ‘05
 O(N2P)  O(NP) for tree structured   models
 Removes need for region detectors,[object Object]
Appearance representation ,[object Object],Decision trees [Lepetit and Fua CVPR 2005] ,[object Object],Figure from Winn & Shotton, CVPR ‘06
Learn Appearance Generative models of appearance Can learn with little supervision E.g. Fergus et al’ 03 Discriminative training of part appearance model SVM part detectors Felzenszwalb, Mcallester, Ramanan, CVPR 2008 Much better performance
Felzenszwalb, Mcallester, Ramanan, CVPR 2008 2-scale model Whole object Parts HOG representation +SVM training to obtainrobust part detectors Distancetransforms allowexamination of every location in the image
Hierarchical Representations  Pixels  Pixel groupings  Parts  Object ,[object Object]
Amit and Geman’98
Ullman et al.
Bouchard & Triggs’05
Zhu and Mumford
Jin & Geman‘06
Zhu & Yuille ’07
Fidler & Leonardis ‘07Images from [Amit98]
Stochastic Grammar of ImagesS.C. Zhu et al. and D. Mumford
Context and Hierarchy in a Probabilistic Image ModelJin & Geman (2006) animal head instantiated by bear head e.g. animals, trees, rocks e.g. contours, intermediate objects e.g. linelets, curvelets, T-junctions e.g. discontinuities, gradient  animal head instantiated by tiger head
A Hierarchical Compositional System for Rapid Object DetectionLong Zhu, Alan L. Yuille, 2007. Able to learn #parts at each level
Learning a Compositional Hierarchy of Object Structure Fidler & Leonardis, CVPR’07; Fidler, Boben & Leonardis, CVPR 2008 Parts model The architecture Learned parts

Weitere ähnliche Inhalte

Was ist angesagt?

SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015Jia-Bin Huang
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Titlebutest
 
Information Visualisation (Multimedia 2009 course)
Information Visualisation (Multimedia 2009 course)Information Visualisation (Multimedia 2009 course)
Information Visualisation (Multimedia 2009 course)Joris Klerkx
 

Was ist angesagt? (6)

Blurclassification
BlurclassificationBlurclassification
Blurclassification
 
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
SSII2021 [OS2-03] 自己教師あり学習における対照学習の基礎と応用
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
No Slide Title
No Slide TitleNo Slide Title
No Slide Title
 
Information Visualisation (Multimedia 2009 course)
Information Visualisation (Multimedia 2009 course)Information Visualisation (Multimedia 2009 course)
Information Visualisation (Multimedia 2009 course)
 

Andere mochten auch

美团技术沙龙02 - 滴滴订单分配策略
美团技术沙龙02 - 滴滴订单分配策略美团技术沙龙02 - 滴滴订单分配策略
美团技术沙龙02 - 滴滴订单分配策略美团点评技术团队
 
机器学习概述
机器学习概述机器学习概述
机器学习概述Dong Guo
 
第一讲 机器学习概述
第一讲 机器学习概述第一讲 机器学习概述
第一讲 机器学习概述juzihua1102
 
An Introduction to Computer Vision
An Introduction to Computer VisionAn Introduction to Computer Vision
An Introduction to Computer Visionguestd1b1b5
 
Computer Vision Basics
Computer Vision BasicsComputer Vision Basics
Computer Vision BasicsSuren Kumar
 

Andere mochten auch (6)

美团技术沙龙02 - 滴滴订单分配策略
美团技术沙龙02 - 滴滴订单分配策略美团技术沙龙02 - 滴滴订单分配策略
美团技术沙龙02 - 滴滴订单分配策略
 
机器学习概述
机器学习概述机器学习概述
机器学习概述
 
第一讲 机器学习概述
第一讲 机器学习概述第一讲 机器学习概述
第一讲 机器学习概述
 
An Introduction to Computer Vision
An Introduction to Computer VisionAn Introduction to Computer Vision
An Introduction to Computer Vision
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Computer Vision Basics
Computer Vision BasicsComputer Vision Basics
Computer Vision Basics
 

Ähnlich wie Iccv2009 recognition and learning object categories p1 c01 - classical methods

Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based modelszukun
 
Lec18 bag of_features
Lec18 bag of_featuresLec18 bag of_features
Lec18 bag of_featuresBo Li
 
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
Cvpr2007 object category recognition   p4 - combined segmentation and recogni...Cvpr2007 object category recognition   p4 - combined segmentation and recogni...
Cvpr2007 object category recognition p4 - combined segmentation and recogni...zukun
 
Verifiable visualization for isosurface extraction vis 2009
Verifiable visualization for isosurface extraction   vis 2009Verifiable visualization for isosurface extraction   vis 2009
Verifiable visualization for isosurface extraction vis 2009Tiago Queiroz
 
Salient KeypointSelection for Object Representation
Salient KeypointSelection for Object RepresentationSalient KeypointSelection for Object Representation
Salient KeypointSelection for Object RepresentationPrerana Mukherjee
 
Soundarya m.sc
Soundarya m.scSoundarya m.sc
Soundarya m.scsowfi
 
Low level vision - A tuturial
Low level vision - A tuturialLow level vision - A tuturial
Low level vision - A tuturialpotaters
 
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesUniversitat Politècnica de Catalunya
 
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Jonathon Hare
 
Iccv2009 recognition and learning object categories p1 c03 - 3d object models
Iccv2009 recognition and learning object categories   p1 c03 - 3d object modelsIccv2009 recognition and learning object categories   p1 c03 - 3d object models
Iccv2009 recognition and learning object categories p1 c03 - 3d object modelszukun
 
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Hirokatsu Kataoka
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2zukun
 
Histogram of oriented gradients for human detection
Histogram of oriented gradients for human detectionHistogram of oriented gradients for human detection
Histogram of oriented gradients for human detectionzukun
 
Colour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired ModelColour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired Modelijsrd.com
 

Ähnlich wie Iccv2009 recognition and learning object categories p1 c01 - classical methods (20)

Cvpr2007 object category recognition p2 - part based models
Cvpr2007 object category recognition   p2 - part based modelsCvpr2007 object category recognition   p2 - part based models
Cvpr2007 object category recognition p2 - part based models
 
Lec18 bag of_features
Lec18 bag of_featuresLec18 bag of_features
Lec18 bag of_features
 
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
Cvpr2007 object category recognition   p4 - combined segmentation and recogni...Cvpr2007 object category recognition   p4 - combined segmentation and recogni...
Cvpr2007 object category recognition p4 - combined segmentation and recogni...
 
Verifiable visualization for isosurface extraction vis 2009
Verifiable visualization for isosurface extraction   vis 2009Verifiable visualization for isosurface extraction   vis 2009
Verifiable visualization for isosurface extraction vis 2009
 
Salient KeypointSelection for Object Representation
Salient KeypointSelection for Object RepresentationSalient KeypointSelection for Object Representation
Salient KeypointSelection for Object Representation
 
Soundarya m.sc
Soundarya m.scSoundarya m.sc
Soundarya m.sc
 
06_features_slides.pdf
06_features_slides.pdf06_features_slides.pdf
06_features_slides.pdf
 
Low level vision - A tuturial
Low level vision - A tuturialLow level vision - A tuturial
Low level vision - A tuturial
 
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of FeaturesContextless Object Recognition with Shape-enriched SIFT and Bags of Features
Contextless Object Recognition with Shape-enriched SIFT and Bags of Features
 
Iciap 2
Iciap 2Iciap 2
Iciap 2
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
 
Iccv2009 recognition and learning object categories p1 c03 - 3d object models
Iccv2009 recognition and learning object categories   p1 c03 - 3d object modelsIccv2009 recognition and learning object categories   p1 c03 - 3d object models
Iccv2009 recognition and learning object categories p1 c03 - 3d object models
 
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Bagwords
BagwordsBagwords
Bagwords
 
Mit6870 orsu lecture2
Mit6870 orsu lecture2Mit6870 orsu lecture2
Mit6870 orsu lecture2
 
Histogram of oriented gradients for human detection
Histogram of oriented gradients for human detectionHistogram of oriented gradients for human detection
Histogram of oriented gradients for human detection
 
Cmap presentation
Cmap presentationCmap presentation
Cmap presentation
 
Colour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired ModelColour Object Recognition using Biologically Inspired Model
Colour Object Recognition using Biologically Inspired Model
 

Mehr von zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Informationzukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featureszukun
 

Mehr von zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 
Icml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant featuresIcml2012 learning hierarchies of invariant features
Icml2012 learning hierarchies of invariant features
 

Kürzlich hochgeladen

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 

Kürzlich hochgeladen (20)

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 

Iccv2009 recognition and learning object categories p1 c01 - classical methods

  • 1. Classical Methods for Object Recognition Rob Fergus (NYU)
  • 2. Classical Methods Bag of words approaches Parts and structure approaches Discriminative methods Condensed version of sections from 2007 edition of tutorial
  • 3. Bag of Words Models
  • 4. Object Bag of ‘words’
  • 5. Bag of Words Independent features Histogram representation
  • 6. 1.Feature detectionand representation Compute descriptor e.g. SIFT [Lowe’99] Normalize patch Detect patches [Mikojaczyk and Schmid ’02] [Mata, Chum, Urban & Pajdla, ’02] [Sivic & Zisserman, ’03] Local interest operator or Regular grid Slide credit: Josef Sivic
  • 7. … 1.Feature detectionand representation
  • 8. … 2. Codewords dictionary formation 128-D SIFT space
  • 9. … 2. Codewords dictionary formation Codewords + + + Vector quantization 128-D SIFT space Slide credit: Josef Sivic
  • 10. Image patch examples of codewords Sivic et al. 2005
  • 11. ….. Image representation Histogram of features assigned to each cluster frequency codewords
  • 12. Uses of BoW representation Treat as feature vector for standard classifier e.g SVM Cluster BoW vectors over image collection Discover visual themes Hierarchical models Decompose scene/object Scene
  • 13. BoW as input to classifier SVM for object classification Csurka, Bray, Dance & Fan, 2004 Naïve Bayes See 2007 edition of this course
  • 14. Clustering BoW vectors Use models from text document literature Probabilistic latent semantic analysis (pLSA) Latent Dirichlet allocation (LDA) See 2007 edition for explanation/code d = image, w = visual word, z = topic (cluster)
  • 15. Clustering BoW vectors Scene classification (supervised) Vogel & Schiele, 2004 Fei-Fei & Perona, 2005 Bosch, Zisserman & Munoz, 2006 Object discovery (unsupervised) Each cluster corresponds to visual theme Sivic, Russell, Efros, Freeman & Zisserman, 2005
  • 16. Related work Early “bag of words” models: mostly texture recognition Cula & Dana, 2001; Leung & Malik 2001; Mori, Belongie & Malik, 2001; Schmid 2001; Varma & Zisserman, 2002, 2003; Lazebnik, Schmid & Ponce, 2003 Hierarchical Bayesian models for documents (pLSA, LDA, etc.) Hoffman 1999; Blei, Ng & Jordan, 2004; Teh, Jordan, Beal & Blei, 2004 Object categorization Csurka, Bray, Dance & Fan, 2004; Sivic, Russell, Efros, Freeman & Zisserman, 2005; Sudderth, Torralba, Freeman & Willsky, 2005; Natural scene categorization Vogel & Schiele, 2004; Fei-Fei & Perona, 2005; Bosch, Zisserman & Munoz, 2006
  • 18. Adding spatial info. to BoW Feature level Spatial influence through correlogram features: Savarese, Winn and Criminisi, CVPR 2006
  • 19. Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Hierarchical model of scene/objects/parts
  • 20. P1 P2 P3 P4 w Image Bg Adding spatial info. to BoW Feature level Generative models Sudderth, Torralba, Freeman & Willsky, 2005, 2006 Niebles & Fei-Fei, CVPR 2007
  • 21. Adding spatial info. to BoW Feature level Generative models Discriminative methods Lazebnik, Schmid & Ponce, 2006
  • 23. Problem with bag-of-words All have equal probability for bag-of-words methods Location information is important BoW + location still doesn’t give correspondence
  • 24. Model: Parts and Structure
  • 25. Representation Object as set of parts Generative representation Model: Relative locations between parts Appearance of part Issues: How to model location How to represent appearance How to handle occlusion/clutter Figure from [Fischler & Elschlager 73]
  • 26.
  • 29. Lades, v.d. Malsburg et al. ‘93
  • 30. Cootes, Lanitis, Taylor et al. ‘95
  • 31. Amit & Geman ‘95, ‘99
  • 32. Perona et al. ‘95, ‘96, ’98, ’00, ’03, ‘04, ‘05
  • 34. Crandall & Huttenlocher ’05, ’06
  • 35. Leibe & Schiele ’03, ’04
  • 36.
  • 37.
  • 38.
  • 40. O(N2P)  O(NP) for tree structured models
  • 41.
  • 42.
  • 43. Learn Appearance Generative models of appearance Can learn with little supervision E.g. Fergus et al’ 03 Discriminative training of part appearance model SVM part detectors Felzenszwalb, Mcallester, Ramanan, CVPR 2008 Much better performance
  • 44. Felzenszwalb, Mcallester, Ramanan, CVPR 2008 2-scale model Whole object Parts HOG representation +SVM training to obtainrobust part detectors Distancetransforms allowexamination of every location in the image
  • 45.
  • 51. Zhu & Yuille ’07
  • 52. Fidler & Leonardis ‘07Images from [Amit98]
  • 53. Stochastic Grammar of ImagesS.C. Zhu et al. and D. Mumford
  • 54. Context and Hierarchy in a Probabilistic Image ModelJin & Geman (2006) animal head instantiated by bear head e.g. animals, trees, rocks e.g. contours, intermediate objects e.g. linelets, curvelets, T-junctions e.g. discontinuities, gradient animal head instantiated by tiger head
  • 55. A Hierarchical Compositional System for Rapid Object DetectionLong Zhu, Alan L. Yuille, 2007. Able to learn #parts at each level
  • 56. Learning a Compositional Hierarchy of Object Structure Fidler & Leonardis, CVPR’07; Fidler, Boben & Leonardis, CVPR 2008 Parts model The architecture Learned parts
  • 57. Parts and Structure modelsSummary Explicit notion of correspondence between image and model Efficient methods for large # parts and # positions in image With powerful part detectors, can get state-of-the-art performance Hierarchical models allow for more parts
  • 59. Classifier based methods Decision boundary Background Computer screen Bag of image patches In some feature space Object detection and recognition is formulated as a classification problem. The image is partitioned into a set of overlapping windows … and a decision is taken at each window about if it contains a target object or not. Where are the screens?
  • 60.
  • 61.
  • 62.
  • 63. Face recognition using eigenfaces M. Turk and A. Pentland (1991).
  • 64. Human Face Detection in Visual Scenes - Rowley, Baluja, Kanade (1995)
  • 65. Graded Learning for Object Detection - Fleuret, Geman (1999)
  • 66. Robust Real-time Object Detection - Viola, Jones (2001)
  • 67. Feature Reduction and Hierarchy of Classifiers for Fast Object Detection in Video Images - Heisele, Serre, Mukherjee, Poggio (2001)
  • 68.
  • 69. Features: Edges and chamfer distance Gavrila, Philomin, ICCV 1999
  • 70. Features: Edge fragments Opelt, Pinz, Zisserman, ECCV 2006 Weak detector = k edge fragments and threshold. Chamfer distance uses 8 orientation planes
  • 71.
  • 72.
  • 73. Classifier: Neural Networks Fukushima’s Neocognitron, 1980 Rowley, Baluja, Kanade 1998 LeCun, Bottou, Bengio, Haffner 1998 Serre et al. 2005 Riesenhuber, M. and Poggio, T. 1999 LeNetconvolutional architecture (LeCun 1998)
  • 74. Classifier: Support Vector Machine Guyon, Vapnik Heisele, Serre, Poggio, 2001 …….. Dalal & Triggs , CVPR 2005 HOG – Histogram of Oriented gradients Learn weighting of descriptor with linear SVM Image HOG descriptor HOG descriptor weighted by +ve SVM -ve SVM weights
  • 75. Classifier: Boosting Viola & Jones 2001 Haar features via Integral Image Cascade Real-time performance ……. Torralbaet al., 2004 Part-based Boosting Each weak classifier is a part Part location modeled by offset mask
  • 76. Summary of classifier-based methods Many techniques for training discriminative models are used Many not mentioned here Conditional random fields Kernels for object recognition Learning object similarities .....
  • 77.
  • 78. Dalal & Triggs HOG detector HOG – Histogram of Oriented gradients Careful selection of spatial bin size/# orientation bins/normalization Learn weighting of descriptor with learn SVM Image HOG descriptor HOG descriptor weighted by +ve SVM -ve SVM weights