SlideShare ist ein Scribd-Unternehmen logo
1 von 59
Downloaden Sie, um offline zu lesen
Content-Based Image Retrieval


   Charlie Dagli and Thomas S. Huang
        {dagli,huang}@ifp.uiuc.edu




                                       1
Outline
    What is CBIR ?



    Image Features



    Feature Weighting and Relevance



    Feedback
    User Interface and Visualization





                                       2
What is Content-based Image
Retrieval (CBIR)?
    Image Search Systems that search



    for images by image content
    <-> Keyword-based Image/Video Retrieval
       (ex. Google Image Search, YouTube)




                                              3
Applications of CBIR
    Consumer Digital Photo Albums



        Digital Cameras
    


        Flickr
    


    Medical Images



    Digital Museum



    Trademarks Search



    MPEG-7 Content Descriptors





                                    4
Basic Components of CBIR
    Feature Extractor



        Create the metadata
    



    Query Engine



        Calculate similarity
    



    User Interface





                               5
How does CBIR work ?
    Extract Features from Images



    Let the user do Query



        Query by Sketch
    


        Query by Keywords
    


        Query by Example
    



    Refine the result by Relevance Feedback



        Give feedback to the previous result
    




                                               6
Query by Example
    Pick example images, then ask the system



    to retrieve “similar” images.

           “Get similar images”
                   CBIR

Query Sample


                                     Results


                                               7
Relevance Feedback
    User gives a feedback to the query results



    System recalculates feature weights





              Query                                        Feedback
                                   Feedback


    Initial
    sample            1st Result              2nd Result


                                                                      8
Basic Components of CBIR
    Feature Extractor



        Create the metadata
    



    Query Engine



        Calculate similarity
    



    User Interface





                               9
Image Features (Metadata)
    Color



    Texture



    Structure



    etc





                            10
Color Features
    Which Color Space?



        RGB, CMY, YCrCb, CIE, YIQ, HLS, …
    



    Our Favorite is HSV



        Designed to be similar to human perception
    




                                                     11
HSV Color Space
    H (Hue)



        Dominant color (spectral)
    



    S (Saturation)



        Amount of white
    



    V (Value)



        Brightness
    




               How to Use This?


                                    12
Straightforward way to use
HSV as color features
    Histogram for each H, S, and V



    Then compare in each bin



    Is this a good idea?





                                     13
Are these two that different?
    Histogram comparison is very



    sensitive




                                   14
Color Moments [Stricker ‘95]
    For each image, the color distribution in



    each of H, S and V is calculated
        1st (mean), 2nd (var) and 3rd moment for HSV
    



                    N
              1
                ∑ pij
         Ei =
              N j=1
                                           i : color channel {i=h,s,v}
                     N
                  1
                                           N = # of pixels in image
                    ∑ (pij − Ei ) 2 )1 2
         σi = (
                  N j=1
                                           Total 9 features
               1N
         si = ( ∑ (pij − Ei ) 3)1 3
               N j=1



                                                                         15
Shape Features

    Region-Based Shape



         Outer Boundary
     



    Contour-Based Shape



         Features of Contour
     



    Edge-Based Shape



         Ex. Histogram of edge length and
     


         orientation


                                            16
Region-based vs. Contour-based
    Region-based



        Suitable for Complex objects with
    


        disjoint region
    Contour-based



        Suitable when semantics are contained in
    


        the contour




                                                   17
Region-based vs. Contour-based
                     Region Complexity




  Contour
Complexity



                                         18
Region-based vs. Contour-based

 Good Examples for Region-based shape




 Good Examples for Contour-based shape




                                         19
Angular Radial Transformation (ART)
[Kim’99]

    A Region-based shape



    Calculate the coefficients based on image



    intensities in polar coordinates (n<3, m<12)
                 2π       1
             ∫        ∫
     Fnm =                  Vnm (ρ,θ ) f   (ρ,θ )ρdρdθ
                 0        o
                  f (ρ,θ )L image intensity in polar coordinates
                         Vnm (ρ,θ )L ART basis function
                        Vnm (ρ,θ ) = 1 / 2π exp( jmθ )Rn (ρ)
                                          1        n=0
                           Rn (ρ) = 
                                     2 cos(πnρ) n ≠ 0
     Total 35 coefficients in 140 bits (4 bits/coeff)

        €                                                          20
Curvature Scale-Space (CSS)
[Mokhtaarian ‘92]

    A contour-based shape



    1) Apply lowpass filter repeatedly until
       concave contours smoothed out
    2) “How contours are filtered” becomes
       the features
           Zero crossing in the curvature functions
       •
           after each application of the lowpass
           filter
           CSS Image
       •


                                                      21
CSS Image
Tracks zero-crossing locations of each concavity in the contour
          Contour                   Curvature

                            B
                    A
                                C
                                D
                                                          3 ITR
                        F
                                E

                                                         29 ITR
                            B
                    A




                            E
                        F



                                                        100
                                                        ITR




                                                                  22
                                                S
CSS Features
    # of peaks in CSS images



    Highest peak



    Circularity (perimeter2/ area)



    Eccentricity



    Etc.





                                     23
Texture Features
    Wavelet-based Texture Features



    [Smith’94]




                                     24
Wavelet Filter Bank
                     Coarse Info (low
                     freq)




                 Wavelet
                 Filter
Original Image
                               Detail (high freq)
                                                    25
Texture Features from Wavelet

                                Wavelet
Original                                       Feature
                                Image
Image                                         Extraction
                                           f6,v6 f3,v3
                                                         f1,v1
               Wavelet                     f5,v5 f4,v4

                Filter
                                             f2,v2 f0,vo
                Bank

                                          Take the mean
           Decompose the
                                          and variance of
           images into
                                          each subband
           frequency subbands



                                                                 26
Other approaches: Region-Based
    Global features often times fail to



    capture local content in an image
                         GLOBAL DESCRIPTION
                          {Green, Grassy, Hillside}
                           color, texture, shape


    No sheep? No fence? No houses?



                                                      27
Other approaches: Region-Based
    Segmentation-Based



        Images are segmented by color/texture similarities:
    


        Blobworld [Carson ‘99], Netra [Ma and Manjunath ‘99]

    Grid-Based



        Images are partitioned, features are calculated from
    


        blocks: [Tian ‘00],[Moghaddam ‘99]




                                                               28
Other approaches: Region-Based
    Combine Grid and Segmentation: [Dagli and Huang, ‘04]





                                                            29
Basic Components of CBIR
    Feature Extractor



        Create the metadata
    



    Query Engine



        Calculate similarity
    



    User Interface





                               30
Now, We have many features
(too many?)

    How to express visual “similarity”



    with these features?




                                         31
Visual Similarity ?
    “Similarity” is Subjective and Context-



    dependent.
    “Similarity” is High-level Concept.



        Cars, Flowers, …
    



    But, our features are Low-level features.



        Semantic Gap!
    




                                                32
Which features are most important?

    Not all features are always important.



    “Similarity” measure is always changing



    The system has to weight features on the fly.



                       How ?




                                                    33
Online Feature Weighting
    Approach #1 - Manual



        Ask the user to specify number
    


        “35% of color and 50% of texture…”
        Very difficult to determine the numbers
    


    Approach #2 - Automatic



        Learn feature weights from examples
    


        Relevance Feedback
    




                                                  34
Online Feature Weighting
    From Query Examples, the system



    determines feature weighting matrix W




                    CBIR
                  Calculate W

    Query
                                     Result
               rr         r rT        rr
     distance( x, y ) = ( x − y ) W ( x − y )
                                                35
How to Calculate W ?
    No Negative Examples (1-class)



    Positive and Negative Examples (2-class)



    One Positive and Many Negative classes


    (1+x)-class
    Many Positive and Many Negative classes


    (x+y)-class



                                               36
When there are only relevant
images available…
    We want to give more weights to



    common features among example
    images.
    Use the variance.



        Features with low variance
    


        -> Common features
        -> Give higher weight


                                      37
One Class Relevance Feedback in
    MARS [Rui ‘98]
     Calculates the Variance among relevant examples.



     The inverse of variance becomes the weight of each


     feature.
     This means “common features” between positive


     examples have larger weights.


       1/σ12                 0
                                 
                  2
              1/σ 2
                                 
    W =                          
                        2
                    1/σ 3
                                      W is a k x k diagonal matrix
                                 
                          O
                                 
                                2
       0                   1/σ k 
       


                                                                 38
Relevance Feedback as Two-Class
 Problem (positive and negative)
  Fisherʼs Discriminant Analysis (FDA)

                            Find a W that …
                        


                            minimizes the scatter
                        

                            of each class cluster
                            (within scatter)
positive
                            maximizes the scatter
                        

                            between the clusters
           negative
                            (between scatter)

                                                    39
Two-Class problem
    Target function

                                 W T SBW
                       W = argmax T
    W is full matrix             W SW W
                             W

                       SB LBetween Scatter Matrix
                       SW LWithin Scatter Matrix
                           2
                  SW = ∑         ∑                (x j − mi )(x j − mi )T
                                     j ∈group # i
                           i=1

                  SB = (m1 − m2 )(m1 − m2 )T
              €
                  m1,m2 Lmean of each class

                                                                       40
Solution
    The problem is reduced to



    generalized eigenvalue problem

    SB w i = λi SW w i
                1/ 2
    W = ΦΛ
    ΛLdiagonal matrix of eigenvalues
    ΦLeigenvectors



                                       41
From Two-class to (1+x)-class
    Positive examples are usually from


    one class such as flower
    Negative examples can be from any


    classes such as “car”, “elephant”,
    “orange”…
    It is not desirable to assume negative


    images as one class.


                                             42
RF as (1+x)-Class Problem
• Biased Discriminant Analysis [Zhou et al. ‘01]
• Negative examples can be any images
• Each negative image has its own group
                            positive
                            negative

                           SW = ∑                  (x − m)(x − m)T
                                    x ∈ positive

                           SB = ∑                  (x − m)(x − m)T
                                    x ∈negative

                           mLmean of positive class
                          The solution is similar to FDA
                                                                     43
RF as (x+y)-Class Problem
    Group BDA [Nakazato, Dagli ‘03]



    Multiple Positive classes



    Scattered Negative classes





                                      positive

                                      negative



                                                 44
RF as (x+y)-Class Problem
    Group BDA [Nakazato, Dagli ‘03]



    Multiple Positive classes



    Scattered Negative classes



                            positive
                           negative
                        SW = ∑ i ∑ x ∈i (x − mi )(x − mi )T
                        SB = ∑ ∑                  (y − mi )(y − mi )T
                                i   y ∈negative

                        mi Lmean of positive class i


                                                                        45
Basic Components of CBIR
    Feature Extractor



        Create the metadata
    


    Query Engine



        Calculate similarity
    


    User Interface





                               46
User Interface and Visualization

    Basic GUI



    Direct Manipulation GUI



        El Nino [UC San Diego]
    


        Image Grouper [Nakazato and Huang]
    



    3D Virtual Reality Display





                                             47
Traditional GUI for Relevance
Feedback
    User selects


    relevant images
    If good images


    are found, add
    them
    When no more


    images to add, the
    search converges
           Slider or Checkbox


                                48
ImageGrouper [Nakazato and Huang]
    Query by Groups



        Make a query by creating groups of images
    


        Easier to try different combinations of
    


        query sets (trial-and-Error Query)




                                                    49
ImageGrouper




   Result View   Pallete Panel


                                 50
Note
    Trial-and-Error Query is very important



    because
        Image similarity is subjective and context-
    


        dependent.
        In addition, we are using low-level image features.
    


        (semantic gap)
        Thus, it is VERY difficult to express the user’s
    


        concept by these features.




                                                              51
Image Retrieval in 3D

    Image retrieval and browsing in 3D


    Virtual Reality
    The user can see more images without


    occlusion
    Query results can be displayed in


    various criteria
        Results by Color features, by texture, by
    


        combination of color and texture

                                                    52
3D MARS



                  Texture



                                 Structure


                        color
Initial Display
                                Result

                                             53
3D MARS in CAVE™

    Shuttered glasses for



    immersive 3D
    experience
    Click and Drag images



    by WAND
    Fly-through by



    Joystick




                            54
Demos
    Traditional GUI



        IBM QBIC
    


        • http://wwwqbic.almaden.ibm.com/
        UIUC MARS
    


        • http://chopin.ifp.uiuc.edu:8080

    ImageGrouper



        http://www.ifp.uiuc.edu/~nakazato/grouper
    




                                                    55
References (Image Features)
    Bober, M., “MPEG-7 Visual Descriptors,” In IEEE Transactions on Circuits and


    Systems for Video Technology, Vol. 11, No. 6, June 2001.
    Stricker, M. and Orengo, M., “Similarity of Color Images,” In Proceedings of SPIE,


    Vol. 2420 (Storage and Retrieval of Image and Video Databases III), SPIE Press,
    Feb. 1995.
    Zhou, X. S. and Huang, T. S., “Edge-based structural feature for content-base


    image retrieval,” Pattern Recognition Letters, Special issue on Image and Video
    Indexing, 2000.
    Smith, J. R. and Chang S-F. Transform features for texture classification and


    discrimination in large image databases. In Proceedings of IEEE Intl. Conf. on
    Image Processing, 1994.
    Smith J. R. and Chang S-F. “Quad-Tree Segmentation for Texture-based Image


    Query.” In Proceedings of ACM 2nd International Conference on Multimedia, 1994.
    Dagli, C. K. and Huang, T.S., “A Framework for Color Grid-Based Image Retrieval,”


    In Proceedings of International Conference on Pattern Recognition, 2004.
    Tian, Q. et. al. “Combine user defined region-of-interest and spatial layout in image


    retrieval,” in IEEE Intl. Conf. on Image Processing, 2000.
    Moghaddam B. et. al. “Defining image content with multiple regions-of-interest,” in


    IEEE Wrkshp on Content-Based Access of Image and Video Libraries, 1999.

                                                                                            56
References (Relevance Feedback)

    Rui, Y., et al., “Relevance Feedback: A Power Tool for


    Interactive Content-Based Image Retrieval,” In IEEE
    Trans. on Circuits and Video Technology, Vol.8, No.5,
    Sept. 1998
    Zhou, X. S., Petrovic, N. and Huang, T. S. “Comparing


    Discriminating Transformations and SVM for Learning
    during Multimedia Retrieval.” In Proceedings of ACM
    Multimedia ‘01, 2001.
    Ishikawa, Y., Subrammanya, R. and Faloutsos, C.,


    “MindReader: Query database through multiple
    examples,” In Proceedings of the 24th VLDB
    Conference, 1998.

                                                             57
References (User Interfaces and
    Visualizations)
    Nakazato, M. and Huang, T. S. “3D MARS: Immersive Virtual


    Reality for Content-Based Image Retrieval.“ In Proceedings of
    2001 IEEE International Conference on Multimedia and Expo
    (ICME2001), Tokyo, August 22-25, 2001
    Nakazato, M., Manola, L. and Huang, T.S., “ImageGrouper: Search,


    Annotate and Organize Images by Groups,” In Proc. of 5th Intl.
    Conf. On Visual Information Systems (VIS’02), 2002.
    Nakazato, M., Dagli C.K., and Huang T.S., “Evaluating Group-Based


    Relevance Feedback for Content-Based Image Retrieval,” In
    Proceedings of International Conference on Image Processing,
    2003.
    Santini, S. and Jain, R., “Integrated Browsing and Querying for


    Image Database,” IEEE Multimedia, Vol. 7, No. 3, 2000, page
    26-39.

                                                                        58
Similar Region Shape, Different
Contour




                                  59

Weitere ähnliche Inhalte

Was ist angesagt?

Computer Vision: Shape from Specularities and Motion
Computer Vision: Shape from Specularities and MotionComputer Vision: Shape from Specularities and Motion
Computer Vision: Shape from Specularities and Motion
Damian T. Gordon
 

Was ist angesagt? (20)

Image Segmentation
 Image Segmentation Image Segmentation
Image Segmentation
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
 
Image-to-Image Translation pix2pix
Image-to-Image Translation pix2pixImage-to-Image Translation pix2pix
Image-to-Image Translation pix2pix
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
 
Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrieval
 
Digital Image Fundamentals - II
Digital Image Fundamentals - IIDigital Image Fundamentals - II
Digital Image Fundamentals - II
 
Michal Erel's SIFT presentation
Michal Erel's SIFT presentationMichal Erel's SIFT presentation
Michal Erel's SIFT presentation
 
Canny Edge Detection
Canny Edge DetectionCanny Edge Detection
Canny Edge Detection
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
 
Object tracking
Object trackingObject tracking
Object tracking
 
IMAGE SEGMENTATION TECHNIQUES
IMAGE SEGMENTATION TECHNIQUESIMAGE SEGMENTATION TECHNIQUES
IMAGE SEGMENTATION TECHNIQUES
 
Chapter10 image segmentation
Chapter10 image segmentationChapter10 image segmentation
Chapter10 image segmentation
 
Digital Image Fundamentals
Digital Image FundamentalsDigital Image Fundamentals
Digital Image Fundamentals
 
Computer Vision: Shape from Specularities and Motion
Computer Vision: Shape from Specularities and MotionComputer Vision: Shape from Specularities and Motion
Computer Vision: Shape from Specularities and Motion
 
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic ArithmeticZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
 
Color image processing Presentation
Color image processing PresentationColor image processing Presentation
Color image processing Presentation
 
Digital Image Processing: Image Segmentation
Digital Image Processing: Image SegmentationDigital Image Processing: Image Segmentation
Digital Image Processing: Image Segmentation
 
Image Translation with GAN
Image Translation with GANImage Translation with GAN
Image Translation with GAN
 
Fundamental steps in image processing
Fundamental steps in image processingFundamental steps in image processing
Fundamental steps in image processing
 
Optic flow estimation with deep learning
Optic flow estimation with deep learningOptic flow estimation with deep learning
Optic flow estimation with deep learning
 

Ähnlich wie CBIR

Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Sunando Sengupta
 

Ähnlich wie CBIR (20)

Digital Image Processing
Digital Image ProcessingDigital Image Processing
Digital Image Processing
 
Tehzeeb Seminar presentation ppt0000.pptx
Tehzeeb Seminar presentation ppt0000.pptxTehzeeb Seminar presentation ppt0000.pptx
Tehzeeb Seminar presentation ppt0000.pptx
 
Image Processing with OpenCV
Image Processing with OpenCVImage Processing with OpenCV
Image Processing with OpenCV
 
4 image enhancement in spatial domain
4 image enhancement in spatial domain4 image enhancement in spatial domain
4 image enhancement in spatial domain
 
Lec13 stereo converted
Lec13 stereo convertedLec13 stereo converted
Lec13 stereo converted
 
Cbir ‐ features
Cbir ‐ featuresCbir ‐ features
Cbir ‐ features
 
PPT s06-machine vision-s2
PPT s06-machine vision-s2PPT s06-machine vision-s2
PPT s06-machine vision-s2
 
Java Based Quantitative Imaging Tool
Java Based Quantitative Imaging ToolJava Based Quantitative Imaging Tool
Java Based Quantitative Imaging Tool
 
09 cie552 image_featuresi
09 cie552 image_featuresi09 cie552 image_featuresi
09 cie552 image_featuresi
 
Analyzing color imaging failure on consumer-grade cameras
Analyzing color imaging failure on consumer-grade camerasAnalyzing color imaging failure on consumer-grade cameras
Analyzing color imaging failure on consumer-grade cameras
 
Computer Graphics
Computer GraphicsComputer Graphics
Computer Graphics
 
24th IP_Fundamentals.ppt
24th IP_Fundamentals.ppt24th IP_Fundamentals.ppt
24th IP_Fundamentals.ppt
 
Final image processing
Final image processingFinal image processing
Final image processing
 
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
Urban 3D Semantic Modelling Using Stereo Vision, ICRA 2013
 
Enhance and quantify Microstructure using Machine Learning
Enhance and quantify Microstructure using Machine LearningEnhance and quantify Microstructure using Machine Learning
Enhance and quantify Microstructure using Machine Learning
 
Spatial Filtering in intro image processingr
Spatial Filtering in intro image processingrSpatial Filtering in intro image processingr
Spatial Filtering in intro image processingr
 
Ch2
Ch2Ch2
Ch2
 
Scale and object aware image retargeting for thumbnail browsing
Scale and object aware image retargeting for thumbnail browsingScale and object aware image retargeting for thumbnail browsing
Scale and object aware image retargeting for thumbnail browsing
 
Overview of graphics systems
Overview of  graphics systemsOverview of  graphics systems
Overview of graphics systems
 
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
Searching Images with MPEG-7 (& MPEG-7 Like) Powered Localized dEscriptors (S...
 

Kürzlich hochgeladen

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 

CBIR

  • 1. Content-Based Image Retrieval Charlie Dagli and Thomas S. Huang {dagli,huang}@ifp.uiuc.edu 1
  • 2. Outline What is CBIR ?  Image Features  Feature Weighting and Relevance  Feedback User Interface and Visualization  2
  • 3. What is Content-based Image Retrieval (CBIR)? Image Search Systems that search  for images by image content <-> Keyword-based Image/Video Retrieval (ex. Google Image Search, YouTube) 3
  • 4. Applications of CBIR Consumer Digital Photo Albums  Digital Cameras  Flickr  Medical Images  Digital Museum  Trademarks Search  MPEG-7 Content Descriptors  4
  • 5. Basic Components of CBIR Feature Extractor  Create the metadata  Query Engine  Calculate similarity  User Interface  5
  • 6. How does CBIR work ? Extract Features from Images  Let the user do Query  Query by Sketch  Query by Keywords  Query by Example  Refine the result by Relevance Feedback  Give feedback to the previous result  6
  • 7. Query by Example Pick example images, then ask the system  to retrieve “similar” images. “Get similar images” CBIR Query Sample Results 7
  • 8. Relevance Feedback User gives a feedback to the query results  System recalculates feature weights  Query Feedback Feedback Initial sample 1st Result 2nd Result 8
  • 9. Basic Components of CBIR Feature Extractor  Create the metadata  Query Engine  Calculate similarity  User Interface  9
  • 10. Image Features (Metadata) Color  Texture  Structure  etc  10
  • 11. Color Features Which Color Space?  RGB, CMY, YCrCb, CIE, YIQ, HLS, …  Our Favorite is HSV  Designed to be similar to human perception  11
  • 12. HSV Color Space H (Hue)  Dominant color (spectral)  S (Saturation)  Amount of white  V (Value)  Brightness  How to Use This? 12
  • 13. Straightforward way to use HSV as color features Histogram for each H, S, and V  Then compare in each bin  Is this a good idea?  13
  • 14. Are these two that different? Histogram comparison is very  sensitive 14
  • 15. Color Moments [Stricker ‘95] For each image, the color distribution in  each of H, S and V is calculated 1st (mean), 2nd (var) and 3rd moment for HSV  N 1 ∑ pij Ei = N j=1 i : color channel {i=h,s,v} N 1 N = # of pixels in image ∑ (pij − Ei ) 2 )1 2 σi = ( N j=1 Total 9 features 1N si = ( ∑ (pij − Ei ) 3)1 3 N j=1 15
  • 16. Shape Features Region-Based Shape  Outer Boundary  Contour-Based Shape  Features of Contour  Edge-Based Shape  Ex. Histogram of edge length and  orientation 16
  • 17. Region-based vs. Contour-based Region-based  Suitable for Complex objects with  disjoint region Contour-based  Suitable when semantics are contained in  the contour 17
  • 18. Region-based vs. Contour-based Region Complexity Contour Complexity 18
  • 19. Region-based vs. Contour-based Good Examples for Region-based shape Good Examples for Contour-based shape 19
  • 20. Angular Radial Transformation (ART) [Kim’99] A Region-based shape  Calculate the coefficients based on image  intensities in polar coordinates (n<3, m<12) 2π 1 ∫ ∫ Fnm = Vnm (ρ,θ ) f (ρ,θ )ρdρdθ 0 o f (ρ,θ )L image intensity in polar coordinates Vnm (ρ,θ )L ART basis function Vnm (ρ,θ ) = 1 / 2π exp( jmθ )Rn (ρ)  1 n=0 Rn (ρ) =   2 cos(πnρ) n ≠ 0 Total 35 coefficients in 140 bits (4 bits/coeff) € 20
  • 21. Curvature Scale-Space (CSS) [Mokhtaarian ‘92] A contour-based shape  1) Apply lowpass filter repeatedly until concave contours smoothed out 2) “How contours are filtered” becomes the features Zero crossing in the curvature functions • after each application of the lowpass filter CSS Image • 21
  • 22. CSS Image Tracks zero-crossing locations of each concavity in the contour Contour Curvature B A C D 3 ITR F E 29 ITR B A E F 100 ITR 22 S
  • 23. CSS Features # of peaks in CSS images  Highest peak  Circularity (perimeter2/ area)  Eccentricity  Etc.  23
  • 24. Texture Features Wavelet-based Texture Features  [Smith’94] 24
  • 25. Wavelet Filter Bank Coarse Info (low freq) Wavelet Filter Original Image Detail (high freq) 25
  • 26. Texture Features from Wavelet Wavelet Original Feature Image Image Extraction f6,v6 f3,v3 f1,v1 Wavelet f5,v5 f4,v4 Filter f2,v2 f0,vo Bank Take the mean Decompose the and variance of images into each subband frequency subbands 26
  • 27. Other approaches: Region-Based Global features often times fail to  capture local content in an image GLOBAL DESCRIPTION {Green, Grassy, Hillside} color, texture, shape No sheep? No fence? No houses? 27
  • 28. Other approaches: Region-Based Segmentation-Based  Images are segmented by color/texture similarities:  Blobworld [Carson ‘99], Netra [Ma and Manjunath ‘99] Grid-Based  Images are partitioned, features are calculated from  blocks: [Tian ‘00],[Moghaddam ‘99] 28
  • 29. Other approaches: Region-Based Combine Grid and Segmentation: [Dagli and Huang, ‘04]  29
  • 30. Basic Components of CBIR Feature Extractor  Create the metadata  Query Engine  Calculate similarity  User Interface  30
  • 31. Now, We have many features (too many?) How to express visual “similarity”  with these features? 31
  • 32. Visual Similarity ? “Similarity” is Subjective and Context-  dependent. “Similarity” is High-level Concept.  Cars, Flowers, …  But, our features are Low-level features.  Semantic Gap!  32
  • 33. Which features are most important? Not all features are always important.  “Similarity” measure is always changing  The system has to weight features on the fly.  How ? 33
  • 34. Online Feature Weighting Approach #1 - Manual  Ask the user to specify number  “35% of color and 50% of texture…” Very difficult to determine the numbers  Approach #2 - Automatic  Learn feature weights from examples  Relevance Feedback  34
  • 35. Online Feature Weighting From Query Examples, the system  determines feature weighting matrix W CBIR Calculate W Query Result rr r rT rr distance( x, y ) = ( x − y ) W ( x − y ) 35
  • 36. How to Calculate W ? No Negative Examples (1-class)  Positive and Negative Examples (2-class)  One Positive and Many Negative classes  (1+x)-class Many Positive and Many Negative classes  (x+y)-class 36
  • 37. When there are only relevant images available… We want to give more weights to  common features among example images. Use the variance.  Features with low variance  -> Common features -> Give higher weight 37
  • 38. One Class Relevance Feedback in MARS [Rui ‘98] Calculates the Variance among relevant examples.  The inverse of variance becomes the weight of each  feature. This means “common features” between positive  examples have larger weights. 1/σ12 0   2 1/σ 2   W =  2 1/σ 3 W is a k x k diagonal matrix   O   2 0 1/σ k   38
  • 39. Relevance Feedback as Two-Class Problem (positive and negative) Fisherʼs Discriminant Analysis (FDA) Find a W that …  minimizes the scatter  of each class cluster (within scatter) positive maximizes the scatter  between the clusters negative (between scatter) 39
  • 40. Two-Class problem Target function  W T SBW W = argmax T W is full matrix W SW W  W SB LBetween Scatter Matrix SW LWithin Scatter Matrix 2 SW = ∑ ∑ (x j − mi )(x j − mi )T j ∈group # i i=1 SB = (m1 − m2 )(m1 − m2 )T € m1,m2 Lmean of each class 40
  • 41. Solution The problem is reduced to  generalized eigenvalue problem SB w i = λi SW w i 1/ 2 W = ΦΛ ΛLdiagonal matrix of eigenvalues ΦLeigenvectors 41
  • 42. From Two-class to (1+x)-class Positive examples are usually from  one class such as flower Negative examples can be from any  classes such as “car”, “elephant”, “orange”… It is not desirable to assume negative  images as one class. 42
  • 43. RF as (1+x)-Class Problem • Biased Discriminant Analysis [Zhou et al. ‘01] • Negative examples can be any images • Each negative image has its own group positive negative SW = ∑ (x − m)(x − m)T x ∈ positive SB = ∑ (x − m)(x − m)T x ∈negative mLmean of positive class The solution is similar to FDA 43
  • 44. RF as (x+y)-Class Problem Group BDA [Nakazato, Dagli ‘03]  Multiple Positive classes  Scattered Negative classes  positive negative 44
  • 45. RF as (x+y)-Class Problem Group BDA [Nakazato, Dagli ‘03]  Multiple Positive classes  Scattered Negative classes  positive negative SW = ∑ i ∑ x ∈i (x − mi )(x − mi )T SB = ∑ ∑ (y − mi )(y − mi )T i y ∈negative mi Lmean of positive class i 45
  • 46. Basic Components of CBIR Feature Extractor  Create the metadata  Query Engine  Calculate similarity  User Interface  46
  • 47. User Interface and Visualization Basic GUI  Direct Manipulation GUI  El Nino [UC San Diego]  Image Grouper [Nakazato and Huang]  3D Virtual Reality Display  47
  • 48. Traditional GUI for Relevance Feedback User selects  relevant images If good images  are found, add them When no more  images to add, the search converges Slider or Checkbox 48
  • 49. ImageGrouper [Nakazato and Huang] Query by Groups  Make a query by creating groups of images  Easier to try different combinations of  query sets (trial-and-Error Query) 49
  • 50. ImageGrouper Result View Pallete Panel 50
  • 51. Note Trial-and-Error Query is very important  because Image similarity is subjective and context-  dependent. In addition, we are using low-level image features.  (semantic gap) Thus, it is VERY difficult to express the user’s  concept by these features. 51
  • 52. Image Retrieval in 3D Image retrieval and browsing in 3D  Virtual Reality The user can see more images without  occlusion Query results can be displayed in  various criteria Results by Color features, by texture, by  combination of color and texture 52
  • 53. 3D MARS Texture Structure color Initial Display Result 53
  • 54. 3D MARS in CAVE™ Shuttered glasses for  immersive 3D experience Click and Drag images  by WAND Fly-through by  Joystick 54
  • 55. Demos Traditional GUI  IBM QBIC  • http://wwwqbic.almaden.ibm.com/ UIUC MARS  • http://chopin.ifp.uiuc.edu:8080 ImageGrouper  http://www.ifp.uiuc.edu/~nakazato/grouper  55
  • 56. References (Image Features) Bober, M., “MPEG-7 Visual Descriptors,” In IEEE Transactions on Circuits and  Systems for Video Technology, Vol. 11, No. 6, June 2001. Stricker, M. and Orengo, M., “Similarity of Color Images,” In Proceedings of SPIE,  Vol. 2420 (Storage and Retrieval of Image and Video Databases III), SPIE Press, Feb. 1995. Zhou, X. S. and Huang, T. S., “Edge-based structural feature for content-base  image retrieval,” Pattern Recognition Letters, Special issue on Image and Video Indexing, 2000. Smith, J. R. and Chang S-F. Transform features for texture classification and  discrimination in large image databases. In Proceedings of IEEE Intl. Conf. on Image Processing, 1994. Smith J. R. and Chang S-F. “Quad-Tree Segmentation for Texture-based Image  Query.” In Proceedings of ACM 2nd International Conference on Multimedia, 1994. Dagli, C. K. and Huang, T.S., “A Framework for Color Grid-Based Image Retrieval,”  In Proceedings of International Conference on Pattern Recognition, 2004. Tian, Q. et. al. “Combine user defined region-of-interest and spatial layout in image  retrieval,” in IEEE Intl. Conf. on Image Processing, 2000. Moghaddam B. et. al. “Defining image content with multiple regions-of-interest,” in  IEEE Wrkshp on Content-Based Access of Image and Video Libraries, 1999. 56
  • 57. References (Relevance Feedback) Rui, Y., et al., “Relevance Feedback: A Power Tool for  Interactive Content-Based Image Retrieval,” In IEEE Trans. on Circuits and Video Technology, Vol.8, No.5, Sept. 1998 Zhou, X. S., Petrovic, N. and Huang, T. S. “Comparing  Discriminating Transformations and SVM for Learning during Multimedia Retrieval.” In Proceedings of ACM Multimedia ‘01, 2001. Ishikawa, Y., Subrammanya, R. and Faloutsos, C.,  “MindReader: Query database through multiple examples,” In Proceedings of the 24th VLDB Conference, 1998. 57
  • 58. References (User Interfaces and Visualizations) Nakazato, M. and Huang, T. S. “3D MARS: Immersive Virtual  Reality for Content-Based Image Retrieval.“ In Proceedings of 2001 IEEE International Conference on Multimedia and Expo (ICME2001), Tokyo, August 22-25, 2001 Nakazato, M., Manola, L. and Huang, T.S., “ImageGrouper: Search,  Annotate and Organize Images by Groups,” In Proc. of 5th Intl. Conf. On Visual Information Systems (VIS’02), 2002. Nakazato, M., Dagli C.K., and Huang T.S., “Evaluating Group-Based  Relevance Feedback for Content-Based Image Retrieval,” In Proceedings of International Conference on Image Processing, 2003. Santini, S. and Jain, R., “Integrated Browsing and Querying for  Image Database,” IEEE Multimedia, Vol. 7, No. 3, 2000, page 26-39. 58
  • 59. Similar Region Shape, Different Contour 59