SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Downloaden Sie, um offline zu lesen
ICCV 2011




  Scale and Object Aware Image
Retargeting for Thumbnail Browsing

            Jin Sun, Haibin Ling
  Temple University, Philadelphia, PA, 19122
                By perillaroc
Introduction
• Image browsing tasks
• Tiny thumbnails: a fixed small size
• Scaling
      – bring difficulties in searching and recognition
• Automatic image retargeting methods
      – target size is comparable to that of the original image
• Several important issues for thumbnail
  browsing


2012/6/19                                                         2
Thumbnail Browsing
• Thumbnail scales
      – Studies have shown that scales can have
        significant effects on human visual perception
• Object completeness
      – Low-level gradient-based information
      – NOW! The object-level completeness
• Structure smoothness
      – The contamination caused by pixel removal
        methods, e.g. seam carving
2012/6/19                                                3
Scale and Object Aware Retargeting
• Thumbnail scales
      – An image perceived by human vision system
      – A new scale dependent saliency
• Object completeness
      – integrating the objectness measurement recently
        proposed by Alexe et al.
• Structure smoothness
      – Use the thin-plate-spline(TPS) as warping function
      – Use Cyclic seam Carving to guide the warping.
2012/6/19                                                    4
Scale and Object Aware Retargeting


• A continuous retargeting algorithm but uses
  discrete retargeting to guide the warping
  transformation.




2012/6/19                                       5
Overview : Task Formulation
• Retargeting problem:
                 J  F ( I , S ( I ))
• Seam carving(SC)
                  I (i, j ),    if j  seam(i )
              F 
                 sc

                  I (i, j  1), if j  seam(i )
                i, j


• First get retargeted images to a target size that is
  comparable to the original image size.
• Then shrink the retargeted images to get the final
  thumbnails.


2012/6/19                                                6
Overview : Framework Overview
•   Scale and object aware image retargeting(SOAP): F   so


•   Warping function: thin-plate-spline (TPS)
•   Landmark points : cyclic seam carving (CSC)
•   Scale dependent saliency S scale ( I )
•   Object awareness O( I )
•   Scale and object aware saliency
                  S so ( I )  S scale ( I )  O( I )
• SOAP
                   J  F ( I , S ( I ))
                             so        so



2012/6/19                                                7
Flow chart of the proposed method




2012/6/19                               8
Scale-dependent Saliency




•   Original image size
•   Display size                  so        s D
                          sd        , s p  d rp
•   Projection size              DPI          D
•   DPI(Dots Per Inch)

2012/6/19                                           9
Scale-dependent Saliency
• How well such a system preserves the image
  information?
• We want to make the foreground object/
  content/theme of the image as “clear” as
  possible
• According to the study, not all patterns in an
  image are recognizable by human



2012/6/19                                          10
Scale-dependent Saliency
• the perceived image, denoted as I p , can be derived as
                       I (i, j ), if sd / D  [ ,  ]
             I p ( I (i, j ))  
                                 I N (i, j ),   otherwise
• I N (i, j ) is the mean value of N-connected neighbors of pixel I(i,j)
• N is determined by display device specifications.
• κ and ρ are the lower and upper bounds(in cycles per degree)
  respectively of human visual acuity.
• κ and ρ define the limits at which the visual stimuli frequency
  becomes too low or too high to be recognized by human.
• We use κ = 0.0175 and ρ = 0.83.



2012/6/19                                                            11
Scale-dependent Saliency
• A pixel may become indistinguishable from its
  neighbors to a human observer.
• An image patch that was salient in the original image
  may not appear salient to a human observer.

• Scale dependent saliency




2012/6/19                                             12
Scale-dependent Saliency
• First, the original image is scaled in homogeneous
  into the final thumbnail size, i.e. the display size,
  60x60 pixels in our experiment.
• Then, the minimum recognizable pattern, denoted by
    
   sd is determined by Eqn.4.
• Finally, the scale dependent saliency    S scale ( I ) is defined
  as
                        S scale ( I )  S ( I p )

     where S(.) is the standard saliency (Itti 1998)


2012/6/19                                                        13
Scale-dependent Saliency
• the minimum recognizable pattern sd    
• In our monitor : 1680X1050@65Hz , 120 DPI
•   
   sd is in average 0.009 inches.
• The size is approximately the distance between two pixel lines
  on the screen, i.e. N = 4 in Eqn.5.
• As a result, in the final thumbnail four adjacent neighbors of
  one image pixel patch with value differences in color space
  within certain threshold will be assigned their mean value,
  which means those pixels are unable to be distinguished by
  human.
• Differences : 50 in our experiment


2012/6/19                                                      14
Scale-dependent Saliency




2012/6/19                              15
Objectness Measure
• Preserve object completeness as much as possible
• Alexe et al. What is an object? (CVPR,2010).
      – A novel objectness measure, which is trained to distinguish
        object windows from background ones.
      – For a rectangular window w = (i,j,w,h) with a top-left
        corner at (i,j), width w and height h, its objectness is
        defined as the probability p(w) that w contains an object.




2012/6/19                                                        16
Objectness Measure
• First sample nw windows W  {wi }iw1 for an input image
• Then calculate the objectness map O as the average
  objectness reponse at each pixel
                                    1
                       O(i, j )         , j )w pobj (w)
                                     wW (i


  Where   max O(i, j )
                i, j


• nw = 10000 in our experiments




2012/6/19                                                    17
Scale and Object Aware Saliency
            S S
            so     scale
                           (i, j ) O(i, j )




2012/6/19                                     18
Cyclic Seam Carving
• In many cases a seam has no choice but to cross objects due
  to the original definition of seam.
• Discontinuous seams
• Discontinuous seam-carving for video retargeting. CVPR, 2010




• Problem: object-structure damage, e.g. pixel shift problem

2012/6/19                                                      19
Cyclic Seam Carving
• Cyclic Seams: warp the image




2012/6/19                         20
Cyclic Seam Carving
• Energy function
              Escale    Esc  (1   )  S so
   where ρ is the weight and set to 0.3 .

• The improved energy is then combined with the CSC
  algorithm to provide landmark point pairs needed for
  estimating TPS warping.




2012/6/19                                            21
Cyclic Seam Carving




2012/6/19                         22
Image Warping Function
• Many discrete retargeting methods generate
  excellent results in general but they sometimes
  create serious artifacts when the target has a size
  much smaller than the input.

• We combine a continuous warping model with a
  discrete retargeting guidance.

• Thin-plate-spline(TPS) as our warping function


2012/6/19                                               23
Thin-plate-spline(TPS)
• Landmark points          P   pi      2
                                              , i  1,2,
                                                  and      , nl 
  Q  qi  2 , i  1,2, , nl  , where p is mapped to q.

• the TPS transformation T is defined as the
  transformation from P to Q that minimizes the
  regularized bending energy E(f) defined as
             E ( f )   qi  f (qi ) 
                                    2

                      i

                           2 f 2    2 f 2 2 f 2
                       ( 2 )  2(      )  ( 2 ) dxdy
                           x        xy      y

• The TPS warping is defined as
                      T  arg min f E ( f )

2012/6/19                                                           24
TPS Warping Function
• Landmark point pairs P and Q is derived from the CSC
  retarget algorithm
• A two-way solution
• First, We sample randomly a landmark set b P (Fig.
  6(a)) from original image and then trace their shifting
  in the CSC process.




2012/6/19                                              25
TPS Warping Function
• Then, point set Q is re-sampled uniformly on the
  target image, which is generated by CSC.
• Finally, a sample set P is generated by mapping Q to
  the original image using warping estimated by Q and
  P.




2012/6/19                                            26
Quantitative Experiments
• To find the thumbnail that matches the
  description
• From 10 x 10 image thumbnails
• Different methods
      – Scaling(SL)
      – Seam carving (SC)
      – Improved seam carving (ISC)
      – The proposed SOAR algorithm (SOAR)


2012/6/19                                    27
Quantitative Experiments




2012/6/19                              28
Quantitative Experiments




2012/6/19                              29
Qualitative Experiments
• In general our method performs the best regarding
  the (foreground) object size in the thumbnails.




2012/6/19                                             30
RetargetMe




2012/6/19    31
Limitations
• Saliency distribution is scattered




• Object is to big




2012/6/19                              32
Conclusion
• A scale and object aware image retargeting
  method for thumbnail browsing.
• Several new techniques
      – Scale dependent saliency
      – Objectness
      – Cyclic seam carving
      – A TPS-based continuous warping model




2012/6/19                                      33

Weitere ähnliche Inhalte

Was ist angesagt?

Unsupervised Change Detection in the Feature Space Using Kernels.pdf
Unsupervised Change Detection in the Feature Space Using Kernels.pdfUnsupervised Change Detection in the Feature Space Using Kernels.pdf
Unsupervised Change Detection in the Feature Space Using Kernels.pdfgrssieee
 
Elementary Landscape Decomposition of Combinatorial Optimization Problems
Elementary Landscape Decomposition of Combinatorial Optimization ProblemsElementary Landscape Decomposition of Combinatorial Optimization Problems
Elementary Landscape Decomposition of Combinatorial Optimization Problemsjfrchicanog
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)Jia-Bin Huang
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slidesCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slideszukun
 
Lec13 stereo converted
Lec13 stereo convertedLec13 stereo converted
Lec13 stereo convertedBaliThorat1
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 
Fcv bio cv_weiss
Fcv bio cv_weissFcv bio cv_weiss
Fcv bio cv_weisszukun
 
SIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation Slide
SIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation SlideSIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation Slide
SIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation SlideI-Chao Shen
 
IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb)
IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb) IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb)
IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb) Charles Deledalle
 
Image formation
Image formationImage formation
Image formationpotaters
 
Navidgator - Similarity Based Browsing for Image & Video Databases
Navidgator - Similarity Based Browsing for Image & Video DatabasesNavidgator - Similarity Based Browsing for Image & Video Databases
Navidgator - Similarity Based Browsing for Image & Video DatabasesDamian Borth
 
Metric Learning Survey Slides
Metric Learning Survey SlidesMetric Learning Survey Slides
Metric Learning Survey Slidesguestd8baf6
 
Recent Advances in Object-based Change Detection.pdf
Recent Advances in Object-based Change Detection.pdfRecent Advances in Object-based Change Detection.pdf
Recent Advances in Object-based Change Detection.pdfgrssieee
 
Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...
Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...
Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...Laurent Duval
 

Was ist angesagt? (18)

Seminar
SeminarSeminar
Seminar
 
Unsupervised Change Detection in the Feature Space Using Kernels.pdf
Unsupervised Change Detection in the Feature Space Using Kernels.pdfUnsupervised Change Detection in the Feature Space Using Kernels.pdf
Unsupervised Change Detection in the Feature Space Using Kernels.pdf
 
Elementary Landscape Decomposition of Combinatorial Optimization Problems
Elementary Landscape Decomposition of Combinatorial Optimization ProblemsElementary Landscape Decomposition of Combinatorial Optimization Problems
Elementary Landscape Decomposition of Combinatorial Optimization Problems
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slidesCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: additional slides
 
Lec13 stereo converted
Lec13 stereo convertedLec13 stereo converted
Lec13 stereo converted
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 
Fcv bio cv_weiss
Fcv bio cv_weissFcv bio cv_weiss
Fcv bio cv_weiss
 
SIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation Slide
SIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation SlideSIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation Slide
SIGGRAPH ASIA 2012 Stereoscopic Cloning Presentation Slide
 
IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb)
IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb) IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb)
IVR - Chapter 2 - Basics of filtering I: Spatial filters (25Mb)
 
Image formation
Image formationImage formation
Image formation
 
Lec10 alignment
Lec10 alignmentLec10 alignment
Lec10 alignment
 
Navidgator - Similarity Based Browsing for Image & Video Databases
Navidgator - Similarity Based Browsing for Image & Video DatabasesNavidgator - Similarity Based Browsing for Image & Video Databases
Navidgator - Similarity Based Browsing for Image & Video Databases
 
Metric Learning Survey Slides
Metric Learning Survey SlidesMetric Learning Survey Slides
Metric Learning Survey Slides
 
Test
TestTest
Test
 
Recent Advances in Object-based Change Detection.pdf
Recent Advances in Object-based Change Detection.pdfRecent Advances in Object-based Change Detection.pdf
Recent Advances in Object-based Change Detection.pdf
 
Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...
Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...
Ondelettes, représentations bidimensionnelles, multi-échelles et géométriques...
 

Ähnlich wie Scale and object aware image retargeting for thumbnail browsing

李俊良/Feature Engineering in Machine Learning
李俊良/Feature Engineering in Machine Learning李俊良/Feature Engineering in Machine Learning
李俊良/Feature Engineering in Machine Learning台灣資料科學年會
 
Introduction to Binocular Stereo in Computer Vision
Introduction to Binocular Stereo in Computer VisionIntroduction to Binocular Stereo in Computer Vision
Introduction to Binocular Stereo in Computer Visionothersk46
 
One-Pass Clustering Superpixels
One-Pass Clustering SuperpixelsOne-Pass Clustering Superpixels
One-Pass Clustering SuperpixelsKesavan Yogarajah
 
Caustic Object Construction Based on Multiple Caustic Patterns
Caustic Object Construction Based on Multiple Caustic PatternsCaustic Object Construction Based on Multiple Caustic Patterns
Caustic Object Construction Based on Multiple Caustic PatternsBudianto Tandianus
 
Iccv11 salientobjectdetection
Iccv11 salientobjectdetectionIccv11 salientobjectdetection
Iccv11 salientobjectdetectionJie Feng
 
De-convolution on Digital Images
De-convolution on Digital ImagesDe-convolution on Digital Images
De-convolution on Digital ImagesMd. Shohel Rana
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術CHENHuiMei
 
PR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorPR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorJinwon Lee
 
SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES
SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES
SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES Toru Tamaki
 
Computer Vision - Stereo Vision
Computer Vision - Stereo VisionComputer Vision - Stereo Vision
Computer Vision - Stereo VisionWael Badawy
 
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image TransformationDeep Learning JP
 
Imagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platformImagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platformRahat Yasir
 
Defect detection in circlips using image processing in ni lab view
Defect detection in circlips using image processing in ni lab viewDefect detection in circlips using image processing in ni lab view
Defect detection in circlips using image processing in ni lab viewSayali Bodhankar
 
Introduction to Computer Vision (uapycon 2017)
Introduction to Computer Vision (uapycon 2017)Introduction to Computer Vision (uapycon 2017)
Introduction to Computer Vision (uapycon 2017)Anton Kasyanov
 
6 large-scale-learning.pptx
6 large-scale-learning.pptx6 large-scale-learning.pptx
6 large-scale-learning.pptxmustafa sarac
 

Ähnlich wie Scale and object aware image retargeting for thumbnail browsing (20)

P1151139820
P1151139820P1151139820
P1151139820
 
李俊良/Feature Engineering in Machine Learning
李俊良/Feature Engineering in Machine Learning李俊良/Feature Engineering in Machine Learning
李俊良/Feature Engineering in Machine Learning
 
Introduction to Binocular Stereo in Computer Vision
Introduction to Binocular Stereo in Computer VisionIntroduction to Binocular Stereo in Computer Vision
Introduction to Binocular Stereo in Computer Vision
 
One-Pass Clustering Superpixels
One-Pass Clustering SuperpixelsOne-Pass Clustering Superpixels
One-Pass Clustering Superpixels
 
Caustic Object Construction Based on Multiple Caustic Patterns
Caustic Object Construction Based on Multiple Caustic PatternsCaustic Object Construction Based on Multiple Caustic Patterns
Caustic Object Construction Based on Multiple Caustic Patterns
 
CBIR
CBIRCBIR
CBIR
 
Iccv11 salientobjectdetection
Iccv11 salientobjectdetectionIccv11 salientobjectdetection
Iccv11 salientobjectdetection
 
De-convolution on Digital Images
De-convolution on Digital ImagesDe-convolution on Digital Images
De-convolution on Digital Images
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
 
PR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorPR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox Detector
 
SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES
SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES
SCALE RATIO ICP FOR 3D POINT CLOUDS WITH DIFFERENT SCALES
 
Computer Vision - Stereo Vision
Computer Vision - Stereo VisionComputer Vision - Stereo Vision
Computer Vision - Stereo Vision
 
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
 
project_PPT_final
project_PPT_finalproject_PPT_final
project_PPT_final
 
Imagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platformImagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platform
 
Class Weighted Convolutional Features for Image Retrieval
Class Weighted Convolutional Features for Image Retrieval Class Weighted Convolutional Features for Image Retrieval
Class Weighted Convolutional Features for Image Retrieval
 
Part2
Part2Part2
Part2
 
Defect detection in circlips using image processing in ni lab view
Defect detection in circlips using image processing in ni lab viewDefect detection in circlips using image processing in ni lab view
Defect detection in circlips using image processing in ni lab view
 
Introduction to Computer Vision (uapycon 2017)
Introduction to Computer Vision (uapycon 2017)Introduction to Computer Vision (uapycon 2017)
Introduction to Computer Vision (uapycon 2017)
 
6 large-scale-learning.pptx
6 large-scale-learning.pptx6 large-scale-learning.pptx
6 large-scale-learning.pptx
 

Kürzlich hochgeladen

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Kürzlich hochgeladen (20)

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

Scale and object aware image retargeting for thumbnail browsing

  • 1. ICCV 2011 Scale and Object Aware Image Retargeting for Thumbnail Browsing Jin Sun, Haibin Ling Temple University, Philadelphia, PA, 19122 By perillaroc
  • 2. Introduction • Image browsing tasks • Tiny thumbnails: a fixed small size • Scaling – bring difficulties in searching and recognition • Automatic image retargeting methods – target size is comparable to that of the original image • Several important issues for thumbnail browsing 2012/6/19 2
  • 3. Thumbnail Browsing • Thumbnail scales – Studies have shown that scales can have significant effects on human visual perception • Object completeness – Low-level gradient-based information – NOW! The object-level completeness • Structure smoothness – The contamination caused by pixel removal methods, e.g. seam carving 2012/6/19 3
  • 4. Scale and Object Aware Retargeting • Thumbnail scales – An image perceived by human vision system – A new scale dependent saliency • Object completeness – integrating the objectness measurement recently proposed by Alexe et al. • Structure smoothness – Use the thin-plate-spline(TPS) as warping function – Use Cyclic seam Carving to guide the warping. 2012/6/19 4
  • 5. Scale and Object Aware Retargeting • A continuous retargeting algorithm but uses discrete retargeting to guide the warping transformation. 2012/6/19 5
  • 6. Overview : Task Formulation • Retargeting problem: J  F ( I , S ( I )) • Seam carving(SC)  I (i, j ), if j  seam(i ) F  sc  I (i, j  1), if j  seam(i ) i, j • First get retargeted images to a target size that is comparable to the original image size. • Then shrink the retargeted images to get the final thumbnails. 2012/6/19 6
  • 7. Overview : Framework Overview • Scale and object aware image retargeting(SOAP): F so • Warping function: thin-plate-spline (TPS) • Landmark points : cyclic seam carving (CSC) • Scale dependent saliency S scale ( I ) • Object awareness O( I ) • Scale and object aware saliency S so ( I )  S scale ( I )  O( I ) • SOAP J  F ( I , S ( I )) so so 2012/6/19 7
  • 8. Flow chart of the proposed method 2012/6/19 8
  • 9. Scale-dependent Saliency • Original image size • Display size so s D sd  , s p  d rp • Projection size DPI D • DPI(Dots Per Inch) 2012/6/19 9
  • 10. Scale-dependent Saliency • How well such a system preserves the image information? • We want to make the foreground object/ content/theme of the image as “clear” as possible • According to the study, not all patterns in an image are recognizable by human 2012/6/19 10
  • 11. Scale-dependent Saliency • the perceived image, denoted as I p , can be derived as  I (i, j ), if sd / D  [ ,  ] I p ( I (i, j ))    I N (i, j ), otherwise • I N (i, j ) is the mean value of N-connected neighbors of pixel I(i,j) • N is determined by display device specifications. • κ and ρ are the lower and upper bounds(in cycles per degree) respectively of human visual acuity. • κ and ρ define the limits at which the visual stimuli frequency becomes too low or too high to be recognized by human. • We use κ = 0.0175 and ρ = 0.83. 2012/6/19 11
  • 12. Scale-dependent Saliency • A pixel may become indistinguishable from its neighbors to a human observer. • An image patch that was salient in the original image may not appear salient to a human observer. • Scale dependent saliency 2012/6/19 12
  • 13. Scale-dependent Saliency • First, the original image is scaled in homogeneous into the final thumbnail size, i.e. the display size, 60x60 pixels in our experiment. • Then, the minimum recognizable pattern, denoted by  sd is determined by Eqn.4. • Finally, the scale dependent saliency S scale ( I ) is defined as S scale ( I )  S ( I p ) where S(.) is the standard saliency (Itti 1998) 2012/6/19 13
  • 14. Scale-dependent Saliency • the minimum recognizable pattern sd  • In our monitor : 1680X1050@65Hz , 120 DPI •  sd is in average 0.009 inches. • The size is approximately the distance between two pixel lines on the screen, i.e. N = 4 in Eqn.5. • As a result, in the final thumbnail four adjacent neighbors of one image pixel patch with value differences in color space within certain threshold will be assigned their mean value, which means those pixels are unable to be distinguished by human. • Differences : 50 in our experiment 2012/6/19 14
  • 16. Objectness Measure • Preserve object completeness as much as possible • Alexe et al. What is an object? (CVPR,2010). – A novel objectness measure, which is trained to distinguish object windows from background ones. – For a rectangular window w = (i,j,w,h) with a top-left corner at (i,j), width w and height h, its objectness is defined as the probability p(w) that w contains an object. 2012/6/19 16
  • 17. Objectness Measure • First sample nw windows W  {wi }iw1 for an input image • Then calculate the objectness map O as the average objectness reponse at each pixel 1 O(i, j )  , j )w pobj (w)  wW (i Where   max O(i, j ) i, j • nw = 10000 in our experiments 2012/6/19 17
  • 18. Scale and Object Aware Saliency S S so scale (i, j ) O(i, j ) 2012/6/19 18
  • 19. Cyclic Seam Carving • In many cases a seam has no choice but to cross objects due to the original definition of seam. • Discontinuous seams • Discontinuous seam-carving for video retargeting. CVPR, 2010 • Problem: object-structure damage, e.g. pixel shift problem 2012/6/19 19
  • 20. Cyclic Seam Carving • Cyclic Seams: warp the image 2012/6/19 20
  • 21. Cyclic Seam Carving • Energy function Escale    Esc  (1   )  S so where ρ is the weight and set to 0.3 . • The improved energy is then combined with the CSC algorithm to provide landmark point pairs needed for estimating TPS warping. 2012/6/19 21
  • 23. Image Warping Function • Many discrete retargeting methods generate excellent results in general but they sometimes create serious artifacts when the target has a size much smaller than the input. • We combine a continuous warping model with a discrete retargeting guidance. • Thin-plate-spline(TPS) as our warping function 2012/6/19 23
  • 24. Thin-plate-spline(TPS) • Landmark points P   pi  2 , i  1,2, and , nl  Q  qi  2 , i  1,2, , nl  , where p is mapped to q. • the TPS transformation T is defined as the transformation from P to Q that minimizes the regularized bending energy E(f) defined as E ( f )   qi  f (qi )  2 i 2 f 2 2 f 2 2 f 2   ( 2 )  2( )  ( 2 ) dxdy x xy y • The TPS warping is defined as T  arg min f E ( f ) 2012/6/19 24
  • 25. TPS Warping Function • Landmark point pairs P and Q is derived from the CSC retarget algorithm • A two-way solution • First, We sample randomly a landmark set b P (Fig. 6(a)) from original image and then trace their shifting in the CSC process. 2012/6/19 25
  • 26. TPS Warping Function • Then, point set Q is re-sampled uniformly on the target image, which is generated by CSC. • Finally, a sample set P is generated by mapping Q to the original image using warping estimated by Q and P. 2012/6/19 26
  • 27. Quantitative Experiments • To find the thumbnail that matches the description • From 10 x 10 image thumbnails • Different methods – Scaling(SL) – Seam carving (SC) – Improved seam carving (ISC) – The proposed SOAR algorithm (SOAR) 2012/6/19 27
  • 30. Qualitative Experiments • In general our method performs the best regarding the (foreground) object size in the thumbnails. 2012/6/19 30
  • 32. Limitations • Saliency distribution is scattered • Object is to big 2012/6/19 32
  • 33. Conclusion • A scale and object aware image retargeting method for thumbnail browsing. • Several new techniques – Scale dependent saliency – Objectness – Cyclic seam carving – A TPS-based continuous warping model 2012/6/19 33