SlideShare ist ein Scribd-Unternehmen logo
1 von 43
Pictures and Words
Vision and language in human brain Language Vision Wernicke Area Broca Area PPA LOC V1 FFA
Vision and language in human brain figure modified from: http://www.colorado.edu/intphys/Class/IPHY3730
Vision and language in human brain ? (Translation: “This is not a pipe.”) figure modified from: http://www.colorado.edu/intphys/Class/IPHY3730
What can you see in a glance of a scene? Fei-Fei, Iyer, Koch, Perona, JoV, 2007
PT = 27ms This was a picture with some dark sploches in it. Yeah. . .that's about it. (Subject: KM) PT = 40ms I think I saw two people on a field. (Subject:  RW)  PT = 67ms Outdoor scene. There were some kind of animals, maybe dogs or horses, in the middle of the picture. It looked like they were running in the middle of a grassy field. (Subject: IV)  PT = 500ms Some kind of game or fight. Two groups of two men? The foregound pair looked like one was getting a fist in the face. Outdoors seemed like because i have an impression of grass and maybe lines on the grass? That would be why I think perhaps a game, rough game though, more like rugby than football because they pairs weren't in pads and helmets, though I did get the impression of similar clothing. maybe some trees? in the background. (Subject: SM) PT = 107ms two people, whose profile was toward me. looked like they were on a field of some sort and engaged in some sort of sport (their attire suggested soccer, but it looked like there was too much contact for that). (Subject: AI)  Fei-Fei, Iyer, Koch, Perona, JoV, 2007
Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation
“Pictures and words” Barnard, Duygulu, de Freitas, Forsyth, Blei, Jordan, Matching words and pictures, JMLR, 2003 Duygulu, Barnard, de Freitas, Forsyth, Object Recognition as Machine Translation: Learning a lexicon for a fixed image vocabulary , ECCV, 2003 Blei & Jordan, Modeling annotated data, ACM SIGIR, 2003 Chang, Goh, Sychay, & Wu, Soft annotation using Bayes point machines, IEEE Transactions on Circuits and Systems for Video Technology, 2003 Goh, Chang, & Cheng, Ensemble of SVM-based classifiers for annotation, 2003 ….
[object Object]
Images are clustered based on priors over concepts.
Learning determines localized concepts models from global annotations.
Addresses the correspondence problem
One possible assumption: concept models simultaneously generate both a word and blob  sun sun sky water waves Barnard et al. JMLR, 2005 Slide courtesy of Kobus Barnard (1 hour ago!)
[object Object]
Chose an image cluster by p(c)
Chose multimodal concept clusters using p(s|c)
From each multimodal cluster, sample a Gaussian for blob features, p(b|s), and a multinomial for words, p(w|s)
(Skip with some probability to account for mismatched numbers of words and blobs)
For a given correspondence*sun sun sky water waves Barnard et al. JMLR, 2005 Slide courtesy of Kobus Barnard (1 hour ago!)
Barnard et al. JMLR, 2005
Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation
Content-based retrieval Elegance Love Symmetry Flower Petals Tower France Rose Corolla Australian Floribunda Rose EiffelTower Paris Slide courtesy of RitendraDatta, Jia Li, James Z. Wang
Literature – MANY!!! A. W. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain, Content-Based Image Retrieval at the End of the Early Years, IEEE Trans. Pattern Analysis and Machine Intelligence , 22(12):1349-1380, 2000.  R. Datta, D. Joshi, J. Li, and J. Z. Wang, Image Retrieval: Ideas, Influences, and Trends of the New Age, ACM Computing Surveys, vol. 40, no. 2, pp. 5:1-60, 2008.
Try out Alipr (www.alipr.com)
Try out Alipr (www.alipr.com)
Automatic Image Annotation: ALIP Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
Automatic Image Annotation: ALIP Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
Automatic Image Annotation: ALIP 2D-MHMM: Two-dimensional multi-resolution hidden Markov model Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
Automatic Image Annotation: ALIP Annotation Process ,[object Object]
Salient words appearing in the classification favored moreFood, indoor, cuisine, dessert Building, sky, lake, landscape, Europe, tree Snow, animal, wildlife, sky, cloth, ice, people Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation Propositions A. Gupta and L. S. Davis, Beyond Nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers, ECCV, 2008 Objects, scenes, activities L.-J. Li and L. Fei-Fei. What, where and who? Classifying event by scene and object recognition. ICCV, 2007 L.-J. Li, R. Socher and L. Fei-Fei. Towards Total Scene Understanding:Classification, Annotation and Segmentation in an Automatic Framework. CVPR, 2009
Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation Propositions A. Gupta and L. S. Davis, Beyond Nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers, ECCV, 2008 Objects, scenes, activities L.-J. Li and L. Fei-Fei. What, where and who? Classifying event by scene and object recognition. ICCV, 2007 L.-J. Li, R. Socher and L. Fei-Fei. Towards Total Scene Understanding:Classification, Annotation and Segmentation in an Automatic Framework. CVPR, 2009
Gupta & Davis, EECV, 2008 “Beyond nouns”
“Beyond nouns” Gupta & Davis, EECV, 2008
Gupta & Davis, EECV, 2008
Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation Propositions A. Gupta and L. S. Davis, Beyond Nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers, ECCV, 2008 Objects, scenes, activities L.-J. Li and L. Fei-Fei. What, where and who? Classifying event by scene and object recognition. ICCV, 2007 L.-J. Li, R. Socher and L. Fei-Fei. Towards Total Scene Understanding:Classification, Annotation and Segmentation in an Automatic Framework. CVPR, 2009
What, where and who? Classifying events by scene and object recognition L-J Li & L. Fei-Fei, ICCV 2007
scene pathway object pathway event PFC “where” pathway “what” pathway L.-J. Li & L. Fei-Fei ICCV 2007
scene pathway “Polo Field” Fei-Fei & Perona, CVPR, 2005 L.-J. Li & L. Fei-Fei ICCV 2007
O= ‘horse’ object pathway G. Wang & L. Fei-Fei, CVPR, 2006 L.-J. Li , G. Wang & L. Fei-Fei, CVPR, 2007 L. Cao & L. Fei-Fei, ICCV, 2007 L.-J. Li & L. Fei-Fei ICCV 2007
The 3W stories what who where L.-J. Li & L. Fei-Fei ICCV 2007

Weitere ähnliche Inhalte

Andere mochten auch

Feature Matching using SIFT algorithm
Feature Matching using SIFT algorithmFeature Matching using SIFT algorithm
Feature Matching using SIFT algorithmSajid Pareeth
 
Machine Learning Experimentation at Sift Science
Machine Learning Experimentation at Sift ScienceMachine Learning Experimentation at Sift Science
Machine Learning Experimentation at Sift ScienceSift Science
 
Michal Erel's SIFT presentation
Michal Erel's SIFT presentationMichal Erel's SIFT presentation
Michal Erel's SIFT presentationwolf
 
Image feature extraction
Image feature extractionImage feature extraction
Image feature extractionRushin Shah
 
LinkedIn SlideShare: Knowledge, Well-Presented
LinkedIn SlideShare: Knowledge, Well-PresentedLinkedIn SlideShare: Knowledge, Well-Presented
LinkedIn SlideShare: Knowledge, Well-PresentedSlideShare
 

Andere mochten auch (8)

Feature Matching using SIFT algorithm
Feature Matching using SIFT algorithmFeature Matching using SIFT algorithm
Feature Matching using SIFT algorithm
 
SURF
SURFSURF
SURF
 
SIFT
SIFTSIFT
SIFT
 
Machine Learning Experimentation at Sift Science
Machine Learning Experimentation at Sift ScienceMachine Learning Experimentation at Sift Science
Machine Learning Experimentation at Sift Science
 
Feature Extraction
Feature ExtractionFeature Extraction
Feature Extraction
 
Michal Erel's SIFT presentation
Michal Erel's SIFT presentationMichal Erel's SIFT presentation
Michal Erel's SIFT presentation
 
Image feature extraction
Image feature extractionImage feature extraction
Image feature extraction
 
LinkedIn SlideShare: Knowledge, Well-Presented
LinkedIn SlideShare: Knowledge, Well-PresentedLinkedIn SlideShare: Knowledge, Well-Presented
LinkedIn SlideShare: Knowledge, Well-Presented
 

Ähnlich wie Iccv2009 recognition and learning object categories p2 c03 - objects and annotations

Cvpr2007 object category recognition p1 - bag of words models
Cvpr2007 object category recognition   p1 - bag of words modelsCvpr2007 object category recognition   p1 - bag of words models
Cvpr2007 object category recognition p1 - bag of words modelszukun
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2zukun
 
NLP in Practice - Part II
NLP in Practice - Part IINLP in Practice - Part II
NLP in Practice - Part IIDelip Rao
 
Towards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic WebTowards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic WebJie Bao
 
Recognizing Human-Object Interactions in Still Images by Modeling the Mutual ...
Recognizing Human-Object Interactions inStill Images by Modeling the Mutual ...Recognizing Human-Object Interactions inStill Images by Modeling the Mutual ...
Recognizing Human-Object Interactions in Still Images by Modeling the Mutual ...أحلام انصارى
 
Iccv2009 recognition and learning object categories p3 c00 - summary and da...
Iccv2009 recognition and learning object categories   p3 c00 - summary and da...Iccv2009 recognition and learning object categories   p3 c00 - summary and da...
Iccv2009 recognition and learning object categories p3 c00 - summary and da...zukun
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesJie Bao
 
Iccv2009 recognition and learning object categories p1 c01 - classical methods
Iccv2009 recognition and learning object categories   p1 c01 - classical methodsIccv2009 recognition and learning object categories   p1 c01 - classical methods
Iccv2009 recognition and learning object categories p1 c01 - classical methodszukun
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang
 
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and ApplicationsICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and ApplicationsForward Gradient
 
scene description
scene descriptionscene description
scene descriptionkhushi2551
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11zukun
 

Ähnlich wie Iccv2009 recognition and learning object categories p2 c03 - objects and annotations (19)

Cvpr2007 object category recognition p1 - bag of words models
Cvpr2007 object category recognition   p1 - bag of words modelsCvpr2007 object category recognition   p1 - bag of words models
Cvpr2007 object category recognition p1 - bag of words models
 
NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2NIPS2009: Understand Visual Scenes - Part 2
NIPS2009: Understand Visual Scenes - Part 2
 
NLP in Practice - Part II
NLP in Practice - Part IINLP in Practice - Part II
NLP in Practice - Part II
 
Bagwords
BagwordsBagwords
Bagwords
 
Image captions.pptx
Image captions.pptxImage captions.pptx
Image captions.pptx
 
Fame cvpr
Fame cvprFame cvpr
Fame cvpr
 
Towards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic WebTowards Linked Ontologies and Data on the Semantic Web
Towards Linked Ontologies and Data on the Semantic Web
 
Iciap 2
Iciap 2Iciap 2
Iciap 2
 
Recognizing Human-Object Interactions in Still Images by Modeling the Mutual ...
Recognizing Human-Object Interactions inStill Images by Modeling the Mutual ...Recognizing Human-Object Interactions inStill Images by Modeling the Mutual ...
Recognizing Human-Object Interactions in Still Images by Modeling the Mutual ...
 
Iccv2009 recognition and learning object categories p3 c00 - summary and da...
Iccv2009 recognition and learning object categories   p3 c00 - summary and da...Iccv2009 recognition and learning object categories   p3 c00 - summary and da...
Iccv2009 recognition and learning object categories p3 c00 - summary and da...
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data Sources
 
Explorations in media visualization
Explorations in media visualizationExplorations in media visualization
Explorations in media visualization
 
Iccv2009 recognition and learning object categories p1 c01 - classical methods
Iccv2009 recognition and learning object categories   p1 c01 - classical methodsIccv2009 recognition and learning object categories   p1 c01 - classical methods
Iccv2009 recognition and learning object categories p1 c01 - classical methods
 
Convolutional Features for Instance Search
Convolutional Features for Instance SearchConvolutional Features for Instance Search
Convolutional Features for Instance Search
 
ICVSS2011 Selected Presentations
ICVSS2011 Selected PresentationsICVSS2011 Selected Presentations
ICVSS2011 Selected Presentations
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum Vitae
 
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and ApplicationsICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
ICDM 2019 Tutorial: Speech and Language Processing: New Tools and Applications
 
scene description
scene descriptionscene description
scene description
 
Mit6870 orsu lecture11
Mit6870 orsu lecture11Mit6870 orsu lecture11
Mit6870 orsu lecture11
 

Mehr von zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Informationzukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 

Mehr von zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Kürzlich hochgeladen

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 

Kürzlich hochgeladen (20)

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 

Iccv2009 recognition and learning object categories p2 c03 - objects and annotations

  • 2. Vision and language in human brain Language Vision Wernicke Area Broca Area PPA LOC V1 FFA
  • 3. Vision and language in human brain figure modified from: http://www.colorado.edu/intphys/Class/IPHY3730
  • 4. Vision and language in human brain ? (Translation: “This is not a pipe.”) figure modified from: http://www.colorado.edu/intphys/Class/IPHY3730
  • 5.
  • 6.
  • 7. What can you see in a glance of a scene? Fei-Fei, Iyer, Koch, Perona, JoV, 2007
  • 8. PT = 27ms This was a picture with some dark sploches in it. Yeah. . .that's about it. (Subject: KM) PT = 40ms I think I saw two people on a field. (Subject: RW) PT = 67ms Outdoor scene. There were some kind of animals, maybe dogs or horses, in the middle of the picture. It looked like they were running in the middle of a grassy field. (Subject: IV) PT = 500ms Some kind of game or fight. Two groups of two men? The foregound pair looked like one was getting a fist in the face. Outdoors seemed like because i have an impression of grass and maybe lines on the grass? That would be why I think perhaps a game, rough game though, more like rugby than football because they pairs weren't in pads and helmets, though I did get the impression of similar clothing. maybe some trees? in the background. (Subject: SM) PT = 107ms two people, whose profile was toward me. looked like they were on a field of some sort and engaged in some sort of sport (their attire suggested soccer, but it looked like there was too much contact for that). (Subject: AI) Fei-Fei, Iyer, Koch, Perona, JoV, 2007
  • 9. Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation
  • 10. “Pictures and words” Barnard, Duygulu, de Freitas, Forsyth, Blei, Jordan, Matching words and pictures, JMLR, 2003 Duygulu, Barnard, de Freitas, Forsyth, Object Recognition as Machine Translation: Learning a lexicon for a fixed image vocabulary , ECCV, 2003 Blei & Jordan, Modeling annotated data, ACM SIGIR, 2003 Chang, Goh, Sychay, & Wu, Soft annotation using Bayes point machines, IEEE Transactions on Circuits and Systems for Video Technology, 2003 Goh, Chang, & Cheng, Ensemble of SVM-based classifiers for annotation, 2003 ….
  • 11.
  • 12. Images are clustered based on priors over concepts.
  • 13. Learning determines localized concepts models from global annotations.
  • 15. One possible assumption: concept models simultaneously generate both a word and blob sun sun sky water waves Barnard et al. JMLR, 2005 Slide courtesy of Kobus Barnard (1 hour ago!)
  • 16.
  • 17. Chose an image cluster by p(c)
  • 18. Chose multimodal concept clusters using p(s|c)
  • 19. From each multimodal cluster, sample a Gaussian for blob features, p(b|s), and a multinomial for words, p(w|s)
  • 20. (Skip with some probability to account for mismatched numbers of words and blobs)
  • 21. For a given correspondence*sun sun sky water waves Barnard et al. JMLR, 2005 Slide courtesy of Kobus Barnard (1 hour ago!)
  • 22. Barnard et al. JMLR, 2005
  • 23. Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation
  • 24. Content-based retrieval Elegance Love Symmetry Flower Petals Tower France Rose Corolla Australian Floribunda Rose EiffelTower Paris Slide courtesy of RitendraDatta, Jia Li, James Z. Wang
  • 25. Literature – MANY!!! A. W. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain, Content-Based Image Retrieval at the End of the Early Years, IEEE Trans. Pattern Analysis and Machine Intelligence , 22(12):1349-1380, 2000. R. Datta, D. Joshi, J. Li, and J. Z. Wang, Image Retrieval: Ideas, Influences, and Trends of the New Age, ACM Computing Surveys, vol. 40, no. 2, pp. 5:1-60, 2008.
  • 26. Try out Alipr (www.alipr.com)
  • 27. Try out Alipr (www.alipr.com)
  • 28. Automatic Image Annotation: ALIP Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
  • 29. Automatic Image Annotation: ALIP Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
  • 30. Automatic Image Annotation: ALIP 2D-MHMM: Two-dimensional multi-resolution hidden Markov model Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
  • 31.
  • 32. Salient words appearing in the classification favored moreFood, indoor, cuisine, dessert Building, sky, lake, landscape, Europe, tree Snow, animal, wildlife, sky, cloth, ice, people Slide courtesy ofRitendraDatta, Jia Li, James Z. Wang
  • 33. Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation Propositions A. Gupta and L. S. Davis, Beyond Nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers, ECCV, 2008 Objects, scenes, activities L.-J. Li and L. Fei-Fei. What, where and who? Classifying event by scene and object recognition. ICCV, 2007 L.-J. Li, R. Socher and L. Fei-Fei. Towards Total Scene Understanding:Classification, Annotation and Segmentation in an Automatic Framework. CVPR, 2009
  • 34. Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation Propositions A. Gupta and L. S. Davis, Beyond Nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers, ECCV, 2008 Objects, scenes, activities L.-J. Li and L. Fei-Fei. What, where and who? Classifying event by scene and object recognition. ICCV, 2007 L.-J. Li, R. Socher and L. Fei-Fei. Towards Total Scene Understanding:Classification, Annotation and Segmentation in an Automatic Framework. CVPR, 2009
  • 35. Gupta & Davis, EECV, 2008 “Beyond nouns”
  • 36. “Beyond nouns” Gupta & Davis, EECV, 2008
  • 37. Gupta & Davis, EECV, 2008
  • 38. Section outline Early “pictures and words” work Content-based retrieval Beyond nouns, towards total scene annotation Propositions A. Gupta and L. S. Davis, Beyond Nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers, ECCV, 2008 Objects, scenes, activities L.-J. Li and L. Fei-Fei. What, where and who? Classifying event by scene and object recognition. ICCV, 2007 L.-J. Li, R. Socher and L. Fei-Fei. Towards Total Scene Understanding:Classification, Annotation and Segmentation in an Automatic Framework. CVPR, 2009
  • 39. What, where and who? Classifying events by scene and object recognition L-J Li & L. Fei-Fei, ICCV 2007
  • 40. scene pathway object pathway event PFC “where” pathway “what” pathway L.-J. Li & L. Fei-Fei ICCV 2007
  • 41. scene pathway “Polo Field” Fei-Fei & Perona, CVPR, 2005 L.-J. Li & L. Fei-Fei ICCV 2007
  • 42. O= ‘horse’ object pathway G. Wang & L. Fei-Fei, CVPR, 2006 L.-J. Li , G. Wang & L. Fei-Fei, CVPR, 2007 L. Cao & L. Fei-Fei, ICCV, 2007 L.-J. Li & L. Fei-Fei ICCV 2007
  • 43. The 3W stories what who where L.-J. Li & L. Fei-Fei ICCV 2007
  • 44. Classification Annotation Segmentation class: Polo Sky Tree Athlete Athlete Horse Grass Trees Sky Saddle Horse Horse Horse Horse Horse Horse Grass L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 45. Our model: a hierarchical representation of the image and its semantic contents Sky Rock Total Scene initialization Mountain Sky Sky Generative Model Tree … Class: Polo Athlete Athlete Horse Grass Trees Sky Saddle Class: Rock climbing Horse Tree noisy images and tags Horse Athlete Athlete Mountain Trees Rock Sky Ascent Athlete Horse Horse Horse Learning Grass Tree sailboat Water Class: Sailing Athlete Sailboat Trees Water Sky Wind Recognition L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 46. Our model: a hierarchical representation of the image and its semantic contents Sky Rock Total Scene initialization Mountain Sky Sky Generative Model Generative Model Tree … Class: Polo Athlete Athlete Horse Grass Trees Sky Saddle Class: Rock climbing Horse Tree noisy images and tags Horse Athlete Athlete Mountain Trees Rock Sky Ascent Athlete Horse Horse Horse Learning Grass Tree sailboat Water Class: Sailing Athlete Sailboat Trees Water Sky Wind Recognition L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 47. The model: a hierarchical representation of the image and its semantic contents Total Scene Polo C “Switch variable” Visible Text Not visible S Visual Athlete Horse Grass Trees Sky Saddle horse Horse O T X R Z Ar NF Nr Nt “Connector variable” D
  • 48. Our model: a hierarchical representation of the image and its semantic contents Sky Rock Total Scene initialization initialization Mountain Sky Sky Generative Model Generative Model Tree … Class: Polo Athlete Athlete Horse Grass Trees Sky Saddle Class: Rock climbing Horse Tree noisy images and tags Horse Athlete Athlete Mountain Trees Rock Sky Ascent Athlete Horse Horse Horse Learning Learning Grass Tree sailboat Water Class: Sailing Athlete Sailboat Trees Water Sky Wind Recognition L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 49. Need some good, initial “guestimate” of O Total Scene C Scene/Event images from the Internet S O T X R Z Nr NF Ar Nt L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 50. Auto-semi-supervised learning: Small # of initialized images + Large # of uninitialized images Total Scene Scene/Event images from the Internet Generative Model Large # of uninitialized images + Athlete Horse Grass Tree Wind Saddle Small # of initialized images L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 51. Our model: a hierarchical representation of the image and its semantic contents Sky Rock Total Scene initialization Mountain Sky Sky Generative Model Tree … Class: Polo Athlete Athlete Horse Grass Trees Sky Saddle Class: Rock climbing Horse Tree noisy images and tags Horse Athlete Athlete Mountain Trees Rock Sky Ascent Athlete Horse Horse Horse Learning Grass Tree sailboat Water Class: Sailing Athlete Sailboat Trees Water Sky Wind Recognition L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 52. 8 Event/Scene Classes Rockclimbing Badminton Bocce Rowing Croquet Sailing Snow boarding Polo
  • 53. 43 Some sample results Total Scene Class: Croquet Class: Bocce Class: Snowboarding Class: Polo Class: Sailing Class: Badminton Class: Rock Climbing Class: Rowing L-J Li , R. Socher & L. Fei-Fei, CVPR, 2009
  • 54. PT = 27ms This was a picture with some dark sploches in it. Yeah. . .that's about it. (Subject: KM) PT = 40ms I think I saw two people on a field. (Subject: RW) PT = 67ms Outdoor scene. There were some kind of animals, maybe dogs or horses, in the middle of the picture. It looked like they were running in the middle of a grassy field. (Subject: IV) PT = 500ms Some kind of game or fight. Two groups of two men? The foregound pair looked like one was getting a fist in the face. Outdoors seemed like because i have an impression of grass and maybe lines on the grass? That would be why I think perhaps a game, rough game though, more like rugby than football because they pairs weren't in pads and helmets, though I did get the impression of similar clothing. maybe some trees? in the background. (Subject: SM) PT = 107ms two people, whose profile was toward me. looked like they were on a field of some sort and engaged in some sort of sport (their attire suggested soccer, but it looked like there was too much contact for that). (Subject: AI) Fei-Fei, Iyer, Koch, Perona, JoV, 2007