SlideShare ist ein Scribd-Unternehmen logo
1 von 83
Downloaden Sie, um offline zu lesen
Sketching out Your Search Intent
  - Towards bridging intent and semantic gaps in
          web-scale multimedia search


                          Xian-Sheng Hua
                Lead Researcher, Microsoft Research Asia

  June 24 2010 – ACM Multimedia 2010 TPC Meeting – University of Amsterdam
Xian-Sheng Hua, MSR Asia




Evolution of CBIR/CBVR

                                      Annotation Based
  Commercial Search                                            ~2006 
      Engines
                                                ~2003     (1) Tagging Based
                                                          (2) Large-Scale CBIR/CBVR
       Conventional CBIR      ~2001
                            Direct-Text Based
                  1990’                           Academia
                                                  Prototypes

        1970~
        1980’
    Manual Labeling Based




                                                                                            2
Xian-Sheng Hua, MSR Asia




Three Schemes

                Example-Based


            Annotation-Based


                 Text-Based
Xian-Sheng Hua, MSR Asia




Limitation of Text-Based Search
‱ High noises
   Ì”   Surrounding text: frequently not related to the visual content
   Ì”   Social tags: also noisy, and a large portion don’t have tags

‱ Mostly only covers simple semantics
   Ì”   Surrounding text: limited
   Ì”   Social tags: People tend to only input simple and personal tags
   Ì”   Query association: powerful but coverage is still limited (non-clicked
       images will never be well indexed)


       A complex example:
          Finding Lady Gaga walking on red carpet with black dress
Xian-Sheng Hua, MSR Asia




Limitation of Annotation-Based Search
‱ Low accuracy
  Ì”   The accuracy of automatic annotation is still far from satisfactory
  Ì”   Difficult to solve due

‱ Limited coverage
  Ì”   The coverage of the semantic concepts is still far from the rich content
      that is contain in an image
  Ì”   Difficult to extend

‱ High computation
  Ì”   Learning costs high computation
  Ì”   Recognition costs high if the number of concepts is large

                   Still has a long way to go
Xian-Sheng Hua, MSR Asia




Limitation of Large-Scale Example-Based Search
‱ You need an example first
   Ì”   How to do it if I don’t have an initial example?

‱ Large-scale (semantic) similarity search is still difficult
   Ì”   Though large-scale (near) duplicate detection is good




                     An example: TinEye
Xian-Sheng Hua, MSR Asia




Possible Way-Outs?
‱ Large-scale high-performance annotation?
  Ì” Model-based? Data-driven? Hybrid? 


‱ Large-scale manual labeling?
  Ì” ESP game? Label Me? Machinery Turk? 


‱ New query interface?
  Ì” Interactive search?


Or 
 To combine text, content and interface?
Xian-Sheng Hua, MSR Asia




Simple Attempts
‱ Exemplary attempts based on this idea
  Ì” Search by dominant color (Google/Bing)
   Ì” Show similar images (Bing)/Find similar images (Google)
    Ì” Show more sizes (Bing)
Xian-Sheng Hua, MSR Asia




Two Recent Projects
‱ Sketching out your search intent
  Ì” Image Search by Color Sketch
   Ì” Image Search by Concept Sketch
Search by Color Sketch
      WWW 2010 Demo
Xian-Sheng Hua, MSR Asia




Seems It Is Not New?
‱ “Query by sketch” has been proposed long time ago
  Ì”   But on small datasets and difficult to scale up
  Ì”   And seldom work well actually
  Ì”   Not use to use – “object sketch”
It is based on a SIGGRAPH 95’s paper:
       C. Jacobs et al. Fast Multiresolution Image Querying. SIGGRAPH 1995
on small scale image set and difficult to scale-up (0.44 second for 10,000 images)
Xian-Sheng Hua, MSR Asia
Xian-Sheng Hua, MSR Asia




Our Approach
‱ Color sketch 

  Ì”   Is not “object sketch”
  Ì”   But only rough color distribution
  Ì”   Supports large-scale data
  Ì”   And uses text at the same time
Xian-Sheng Hua, MSR Asia




Our Approach
‱ Color sketch 

  Ì”   Is not “object sketch”
  Ì”   But only rough color distribution
  Ì”   Supports large-scale data
  Ì”   And uses text at the same time

‱ Advantages of “Color Sketch”
  Ì”   More presentative than dominant color (and text only)
  Ì”   More robust than “object sketch”
  Ì”   Easy to use compared with “object sketch”
  Ì”   Easy to scale up
Xian-Sheng Hua, MSR Asia




Example: Blue flower with green leaf
Xian-Sheng Hua, MSR Asia




Example: Brooke Hogan walking on red carpet
Xian-Sheng Hua, MSR Asia




Color Sketch Is More Powerful
  Blue Flower with Green Leaf   Brooke Hogan walking on red carpet
Xian-Sheng Hua, MSR Asia




Color Sketch Is More Powerful




                                                  30
Xian-Sheng Hua, MSR Asia




Demo
‱ It has been productized in Bing
Gambia flag
Xian-Sheng Hua, MSR Asia




 Summary – Why it Works
‱ What are difficult
   Ì”   Semantics is difficult
   Ì”   Intent prediction is difficult

‱ What are easier
   Ì”   Use color to describe images’ content
   Ì”   Use color to describe users’ intent
                   content                color sketch              intent




   Color sketch is an intermediate representation to connect images’ content and users’ intent.
Xian-Sheng Hua, MSR Asia




Limitation of Color Sketch
‱ Only works well for those semantics that can
  be described by color distribution
‱ Only works well when people is able to transfer
  their desire into color distribution



     A more complex example:
        A flying butterfly on the top-left of a flower.
Search by Concept Sketch
         SIGIR 2010
Xian-Sheng Hua, MSR Asia




Search By Concept Sketch
‱ A system for the image search intentions concerning:
   Ì”   The presence of the semantic concepts (semantic constraints)
   Ì”   the spatial layout of the concepts (spatial constraints)




        butterfly

              flower



 A concept map query             Image search results of our system
Xian-Sheng Hua, MSR Asia




 Framework
A concept map query                                                           A candidate image
                                1
    A: butterfly                    Create Candidate
    B: flower                          Image Set
       butterfly
    A: X=0.5, Y=0.5
    B: X=0.8, flower
              Y=0.8

                                2                                        3
                                    Instant Concept                          Concept Distribution
                                       Modeling                                  Estimation
                                                  Concept Models                      Estimated distributions

                                               butterfly                               butterfly
                   4                              flower
                            Intent                                                      flower
                        Interpretation
        Desired distributions
                                              5
                                                  Spatial distribution
         butterfly                                   comparison

           flower
                                                     Relevance score
Xian-Sheng Hua, MSR Asia




 Search Results Comparison
  Task: searching for the images with “snoopy appear at right”



    snoopy
 Text-based image search




   “show similar image”




              snoopy




Image search by concept map
Xian-Sheng Hua, MSR Asia




Search Results
Task: searching for the images with “snoopy appear at top” / “snoopy appear at left”
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a jeep shown above a piece of grass”




  jeep grass
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a car parked in front of a house”




  house car
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a keyboard and a mouse being side by side”




  keyboard mouse
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “sky/Colosseum/grass appear from top to bottom”




  Colosseum grass
Xian-Sheng Hua, MSR Asia




 Advanced Function: Influence Scope
 Indication
‱ Allow users to explicitly specify the occupied
  region of a concept

           house
                   Default
                                          house


    lawn                           lawn




                                          house


                                   lawn




                             Search for the long narrow lawn at the bottom of the image
Xian-Sheng Hua, MSR Asia




Advanced Function:
Influence Scope Adjustment



      Searching for small windmill




      Searching for large windmill
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment
                               sky


                           house


                               lawn


                 Clear                       Search

                         Advanced Function
                 Visual Assistor




                                                      Previous   Next




  Allow users to indicate or narrow down
  what a desired concept looks like (visual constraints)
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment


                Searching for
                front-view jeeps

 jeep




               Searching for
               side-view jeeps




        Visual instances
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment


                  Searching for
                  butterfly with
                  yellow flower

 butterfly


       flower
                  Searching for
                  butterfly with
                  red flower




                Visual instances
Xian-Sheng Hua, MSR Asia




Conclusions
‱ To bridge the gap between users’ intent and
  visual semantics of images by
  Ì” Sketching out your search intent, and
   Ì” Combining text and visual content

‱ Two exemplary approaches introduced
  Ì” Search by color sketch
   Ì” Search by concept sketch
Xian-Sheng Hua, MSR Asia




Thank You

Weitere Àhnliche Inhalte

Andere mochten auch

Andere mochten auch (8)

Ptc creo fmx sales presentation
Ptc creo fmx sales presentationPtc creo fmx sales presentation
Ptc creo fmx sales presentation
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & Statistics
 
Network Intent Composition in OpenDaylight
Network Intent Composition in OpenDaylightNetwork Intent Composition in OpenDaylight
Network Intent Composition in OpenDaylight
 
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
 
Droidcon it 2015: Android Lollipop for Enterprise
Droidcon it 2015: Android Lollipop for EnterpriseDroidcon it 2015: Android Lollipop for Enterprise
Droidcon it 2015: Android Lollipop for Enterprise
 
Object-oriented design patterns in UML [Software Modeling] [Computer Science...
Object-oriented design patterns  in UML [Software Modeling] [Computer Science...Object-oriented design patterns  in UML [Software Modeling] [Computer Science...
Object-oriented design patterns in UML [Software Modeling] [Computer Science...
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
 
Evolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebEvolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic Web
 

Ähnlich wie Sketching Out Your Search Intent

Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)
paddu123
 

Ähnlich wie Sketching Out Your Search Intent (8)

Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...
 
Reinvent The Wheel: Sketching Your Own Design Process
Reinvent The Wheel: Sketching Your Own Design ProcessReinvent The Wheel: Sketching Your Own Design Process
Reinvent The Wheel: Sketching Your Own Design Process
 
Using content-based multimedia similarity search for learning
Using content-based multimedia similarity search for learningUsing content-based multimedia similarity search for learning
Using content-based multimedia similarity search for learning
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Analyzing visuals ppt
Analyzing visuals pptAnalyzing visuals ppt
Analyzing visuals ppt
 

KĂŒrzlich hochgeladen

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
Christopher Logan Kennedy
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

KĂŒrzlich hochgeladen (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Sketching Out Your Search Intent

  • 1. Sketching out Your Search Intent - Towards bridging intent and semantic gaps in web-scale multimedia search Xian-Sheng Hua Lead Researcher, Microsoft Research Asia June 24 2010 – ACM Multimedia 2010 TPC Meeting – University of Amsterdam
  • 2. Xian-Sheng Hua, MSR Asia Evolution of CBIR/CBVR Annotation Based Commercial Search ~2006  Engines ~2003 (1) Tagging Based (2) Large-Scale CBIR/CBVR Conventional CBIR ~2001 Direct-Text Based 1990’ Academia Prototypes 1970~ 1980’ Manual Labeling Based 2
  • 3. Xian-Sheng Hua, MSR Asia Three Schemes Example-Based Annotation-Based Text-Based
  • 4. Xian-Sheng Hua, MSR Asia Limitation of Text-Based Search ‱ High noises Ì” Surrounding text: frequently not related to the visual content Ì” Social tags: also noisy, and a large portion don’t have tags ‱ Mostly only covers simple semantics Ì” Surrounding text: limited Ì” Social tags: People tend to only input simple and personal tags Ì” Query association: powerful but coverage is still limited (non-clicked images will never be well indexed) A complex example: Finding Lady Gaga walking on red carpet with black dress
  • 5.
  • 6. Xian-Sheng Hua, MSR Asia Limitation of Annotation-Based Search ‱ Low accuracy Ì” The accuracy of automatic annotation is still far from satisfactory Ì” Difficult to solve due ‱ Limited coverage Ì” The coverage of the semantic concepts is still far from the rich content that is contain in an image Ì” Difficult to extend ‱ High computation Ì” Learning costs high computation Ì” Recognition costs high if the number of concepts is large Still has a long way to go
  • 7. Xian-Sheng Hua, MSR Asia Limitation of Large-Scale Example-Based Search ‱ You need an example first Ì” How to do it if I don’t have an initial example? ‱ Large-scale (semantic) similarity search is still difficult Ì” Though large-scale (near) duplicate detection is good An example: TinEye
  • 8. Xian-Sheng Hua, MSR Asia Possible Way-Outs? ‱ Large-scale high-performance annotation? Ì” Model-based? Data-driven? Hybrid? 
 ‱ Large-scale manual labeling? Ì” ESP game? Label Me? Machinery Turk? 
 ‱ New query interface? Ì” Interactive search? Or 
 To combine text, content and interface?
  • 9. Xian-Sheng Hua, MSR Asia Simple Attempts ‱ Exemplary attempts based on this idea Ì” Search by dominant color (Google/Bing) Ì” Show similar images (Bing)/Find similar images (Google) Ì” Show more sizes (Bing)
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. Xian-Sheng Hua, MSR Asia Two Recent Projects ‱ Sketching out your search intent Ì” Image Search by Color Sketch Ì” Image Search by Concept Sketch
  • 17. Search by Color Sketch WWW 2010 Demo
  • 18. Xian-Sheng Hua, MSR Asia Seems It Is Not New? ‱ “Query by sketch” has been proposed long time ago Ì” But on small datasets and difficult to scale up Ì” And seldom work well actually Ì” Not use to use – “object sketch”
  • 19.
  • 20. It is based on a SIGGRAPH 95’s paper: C. Jacobs et al. Fast Multiresolution Image Querying. SIGGRAPH 1995 on small scale image set and difficult to scale-up (0.44 second for 10,000 images)
  • 22.
  • 23. Xian-Sheng Hua, MSR Asia Our Approach ‱ Color sketch 
 Ì” Is not “object sketch” Ì” But only rough color distribution Ì” Supports large-scale data Ì” And uses text at the same time
  • 24.
  • 25.
  • 26. Xian-Sheng Hua, MSR Asia Our Approach ‱ Color sketch 
 Ì” Is not “object sketch” Ì” But only rough color distribution Ì” Supports large-scale data Ì” And uses text at the same time ‱ Advantages of “Color Sketch” Ì” More presentative than dominant color (and text only) Ì” More robust than “object sketch” Ì” Easy to use compared with “object sketch” Ì” Easy to scale up
  • 27. Xian-Sheng Hua, MSR Asia Example: Blue flower with green leaf
  • 28. Xian-Sheng Hua, MSR Asia Example: Brooke Hogan walking on red carpet
  • 29. Xian-Sheng Hua, MSR Asia Color Sketch Is More Powerful Blue Flower with Green Leaf Brooke Hogan walking on red carpet
  • 30. Xian-Sheng Hua, MSR Asia Color Sketch Is More Powerful 30
  • 31. Xian-Sheng Hua, MSR Asia Demo ‱ It has been productized in Bing
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.
  • 61.
  • 62.
  • 63.
  • 64.
  • 65.
  • 66. Xian-Sheng Hua, MSR Asia Summary – Why it Works ‱ What are difficult Ì” Semantics is difficult Ì” Intent prediction is difficult ‱ What are easier Ì” Use color to describe images’ content Ì” Use color to describe users’ intent content color sketch intent Color sketch is an intermediate representation to connect images’ content and users’ intent.
  • 67. Xian-Sheng Hua, MSR Asia Limitation of Color Sketch ‱ Only works well for those semantics that can be described by color distribution ‱ Only works well when people is able to transfer their desire into color distribution A more complex example: A flying butterfly on the top-left of a flower.
  • 68. Search by Concept Sketch SIGIR 2010
  • 69. Xian-Sheng Hua, MSR Asia Search By Concept Sketch ‱ A system for the image search intentions concerning: Ì” The presence of the semantic concepts (semantic constraints) Ì” the spatial layout of the concepts (spatial constraints) butterfly flower A concept map query Image search results of our system
  • 70. Xian-Sheng Hua, MSR Asia Framework A concept map query A candidate image 1 A: butterfly Create Candidate B: flower Image Set butterfly A: X=0.5, Y=0.5 B: X=0.8, flower Y=0.8 2 3 Instant Concept Concept Distribution Modeling Estimation Concept Models Estimated distributions butterfly butterfly 4 flower Intent flower Interpretation Desired distributions 5 Spatial distribution butterfly comparison flower Relevance score
  • 71. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “snoopy appear at right” snoopy Text-based image search “show similar image” snoopy Image search by concept map
  • 72. Xian-Sheng Hua, MSR Asia Search Results Task: searching for the images with “snoopy appear at top” / “snoopy appear at left”
  • 73. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a jeep shown above a piece of grass” jeep grass
  • 74. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a car parked in front of a house” house car
  • 75. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a keyboard and a mouse being side by side” keyboard mouse
  • 76. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “sky/Colosseum/grass appear from top to bottom” Colosseum grass
  • 77. Xian-Sheng Hua, MSR Asia Advanced Function: Influence Scope Indication ‱ Allow users to explicitly specify the occupied region of a concept house Default house lawn lawn house lawn Search for the long narrow lawn at the bottom of the image
  • 78. Xian-Sheng Hua, MSR Asia Advanced Function: Influence Scope Adjustment Searching for small windmill Searching for large windmill
  • 79. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment sky house lawn Clear Search Advanced Function Visual Assistor Previous Next Allow users to indicate or narrow down what a desired concept looks like (visual constraints)
  • 80. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment Searching for front-view jeeps jeep Searching for side-view jeeps Visual instances
  • 81. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment Searching for butterfly with yellow flower butterfly flower Searching for butterfly with red flower Visual instances
  • 82. Xian-Sheng Hua, MSR Asia Conclusions ‱ To bridge the gap between users’ intent and visual semantics of images by Ì” Sketching out your search intent, and Ì” Combining text and visual content ‱ Two exemplary approaches introduced Ì” Search by color sketch Ì” Search by concept sketch
  • 83. Xian-Sheng Hua, MSR Asia Thank You