SlideShare ist ein Scribd-Unternehmen logo
1 von 19
TOWARDS DATA-DRIVEN ESTIMATION OF
   IMAGE TAG RELEVANCE USING VISUALLY
SIMILAR AND DISSIMILAR FOLKSONOMY IMAGES

                      ACM Multimedia 2012:
            Workshop on Socially-Aware Multimedia
                          Nara – Oct. 29, 2012


        Sihyoung Lee1, Wesley De Neve1,2, Yong Man Ro1


1
    Image and Video Systems Lab, Dept. of Electrical Engineering, KAIST
             2
               Multimedia Lab, ELIS, Ghent University - iMinds
2 /19



                    Outline

• Introduction
• Motivation
• Proposed Image Tag Relevance Estimation
• Experiments
• Conclusions
3 /19



                    Outline

• Introduction
• Motivation
• Proposed Image Tag Relevance Estimation
• Experiments
• Conclusions
4 /19



                     Introduction

• Increasing online availability of images
   – thanks to easy-to-use multimedia devices and online services
   – thanks to cheap storage and bandwidth
   – thanks to an increasing number of people going online


• Some statistics
   – every minute, over 2,500 images are uploaded to Flickr
   – every day, over 300 million photos are uploaded to Facebook


• How to effectively retrieve images for consumption
  purposes?
5 /19


   Problems in Image Folksonomies

• Most image search engines strongly depend on tags
• Non-relevant tags hinder effective consumption

                              Among the 60 images retrieved, only 20 images
                                          are related to ‘apple’
6 /19



                    Outline

• Introduction
• Motivation
• Proposed Image Tag Relevance Estimation
• Experiments
• Conclusions
7 /19


                         Motivation

• The correlation between visual and semantic similarity
   – is high for images that are semantically and visually distant
   – is lower for images that are semantically and visually close

    The probability of having semantically similar images in a set of
      visually similar images is lower than the probability of having
     semantically dissimilar images in a set of visually dissimilar images


• The above observation motivated us to develop a novel
  technique for tag relevance learning
   – takes advantage of both visually similar and dissimilar images
8 /19




• Introduction
• Motivation
• Proposed Image Tag Relevance Estimation
• Experiments
• Conclusions
9 /19

                                      Conceptual Illustration of
                                       The Proposed Method
                                                                           food

                            bicycle

                                                                                                                    sign



               solder



                                               desert
                                                                                                                           airplane
                                                                                         desert
                                                                                                                                                                              desert          bicycle

                                 desert
building
                                                                                                      bicycle


                                                                                                                                         Image tag relevance estimation
                            desert

                                                                                                                                 rifle                                        desert          bicycle
                                                                                                           desert
                                                                                                                                                                                Image tag relevance
                                                        desert, bicycle                                                                                                   using visually dissimilar images
 ship


                               desert                                                             desert



                                                                                                                              bicycle
                                          street                                    bicycle
           nature




                                                                                                                                                                              desert          bicycle
                                                                                                                                                                               Image tag relevance
                                                                                                               atomium                                                      using the proposed method
                        basketball



                                                                          bicycle
10 /19


                                                 Proposed Method

• Let r (i, t) be an image tag relevance learning function
  based on the proposed method, then it is defined as
   r ( i, t ) := rsimilar ( i, t , k ) − rdissimilar ( i, t , l )
                                                                                                             ∑ vote( j, t )
         – rsimilar ( i, t , k ) := nt [ N s ( i, k ) ] − nt [ N rand ( k ) ] = j=N∑i ,vote( j , t ) − k ⋅
                                                                                                             j=I

                                                                                   s( k)                            I

                                                                                                             ∑ vote( j , t )
         – r    dissimilar   ( i, t , u ) := n [ N ( i, l ) ] − n [ N ( l ) ] = ∑( vote( j, t ) − l ⋅
                                             t    d          t    rand
                                                                                                             j =I

                                                                                    )
                                                                               j = N d i ,l                         I
                                                                     where nt[∙] represents the number of images annotated with t,
                                                                          Ns(i,k) is a set of k images visually similar to i ,
                                                                          Nd(i,l) is a set of l images visually dissimilar to i , and
                                                                          Nrand(k) is a set of k randomly selected neighbors

                                 Relationship between rsimilar ( i, t , k ) , rdissimilar ( i, t , l ) , and r ( i, t )
                                  rsimilar ( i, t , k )                       rdissimilar ( i, t , l )                         r ( i, t )
t
  relevant
                                           +                                                  -                                   ++
t irrelevant                               -                                                  +                                   --
11 /19



                                    Rationale

• For trelevant relevant to the content of i,
    – P(trelevant|Ns(i,k)) is higher than P(trelevant|Nrand(k))
        rsimilar(trelevant,i,k) thus returns a positive value
    – P(trelevant|Nd(i,l)) is lower than P(trelevant|Nrand(l))
        rdissimilar(trelevant,i,l) thus returns a negative value


• For tirrelevant irrelevant to the content of i,
    – P(tirrelevant|Ns(i,k)) is lower than P(tirrelevant|Nrand(k))
        rsimilar(tirrelevant,i,k) thus returns a negative value
    – P(tirrelevant|Nd(i,l)) is higher than P(tirrelevant|Nrand(l))
        rdissimilar(tirrelevant,i,l) thus returns a positive value
12 /19



                    Outline

• Introduction
• Motivation
• The Proposed Image Tag Relevance Estimation
• Experiments
• Conclusions
13 /19



        Experimental Setup (1/2)

• Image set used: subset of MIRFlickr-1M
   – 100,000 images annotated with 1,130,342 tags
        by 13,343 users
        concept vocabulary of 159,300 unique tags
   – test set
        1,000 images annotated with at least four tags
        annotated with 24,474 tags
           • we manually classified 6,534 tags as correct
           • we manually classified 17,940 tags as noisy

• Image descriptor
   – Bag of Visual Words (BoVW)
        vocabulary size: 500
   – use of cosine similarity for measuring image similarity
14 /19



        Experimental Setup (2/2)

• Metrics used for evaluating the effectiveness of the
  proposed technique for image tag relevance estimation
   – for image tag refinement
             A
                 noise


      NL =
             A
             where NL (Noise Level) denotes the proportion of irrelevant tag assignments in the set of all tag assignments,
                  A is the set of tag assignments in an image folksonomy,
                  Anoise is the set of incorrect (noisy) tag assignments

   – for tag-based image retrieval
                           It            ∩ I t ,m
                              relevant        retrieved


      P @ m for t =                                       ,
                                         m
             where Tt         is the set of all folksonomy images relevant to t,
                         relevant

                  Tt,mrelevant is the set of the m topmost images that have been retrieved for t

                  (given the estimated tag relevance values)
Effectiveness of Image Tag Relevance                                                              15 /19


                          Estimation
                   for Image Tag Refinement
      • Effectiveness of image tag relevance estimation using
        visually similar and dissimilar images, compared to
        previous approaches
            – neighbor voting and a variant of neighbor voting estimate image
              tag relevance by only making use of visually similar images
 
                                                                    After image tag refinement
                                    Before

                              image tag refinement   Using visually similar images   Using the proposed technique


    Number of relevant tags
                                     6,534                      5,881                           5,881

Number of irrelevant tags
                                     17,940                     13,117                         12,094

              NL
                                     0.733                      0.690                           0.673
Effectiveness of Image Tag Relevance                                16 /19


                Estimation
       for Tag-based Image Retrieval
• Effectiveness of image tag relevance estimation using
  visually similar and dissimilar images, compared to
  previous approaches
   – neighbor voting and a variant of neighbor voting estimate image
     tag relevance by only making use of visually similar images
17 /19



                    Outline

• Introduction
• Motivation
• Proposed Image Tag Relevance Estimation
• Experiments
• Conclusions
18 /19



                     Conclusions

• We proposed an image tag relevance technique that
  makes use of both visually similar and dissimilar images
   – increases the difference in image tag relevance between tags
     relevant and tags not relevant with respect to a seed image
   – comes with a low increase in computational complexity
• The effectiveness of the proposed technique was
  confirmed using MIRFLICKR-25000 and MIRFLICKR-1M
   – by showing that the proposed technique allows increasing the
     effectiveness of tag refinement and tag-based image retrieval
• Future research
   – combining visual information and tag statistics comparing our
     data-driven approach with a classifier-based approach for
     detecting a number of predefined semantic concepts
Thank you! Any questions?

Weitere ähnliche Inhalte

Andere mochten auch

Face annotation for personal photos using collaborative face recognition in o...
Face annotation for personal photos using collaborative face recognition in o...Face annotation for personal photos using collaborative face recognition in o...
Face annotation for personal photos using collaborative face recognition in o...Wesley De Neve
 
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...Wesley De Neve
 
Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...
Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...
Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...Wesley De Neve
 
изменения и дополнения вкол.дог.
изменения и дополнения  вкол.дог.изменения и дополнения  вкол.дог.
изменения и дополнения вкол.дог.BAP27
 
M01 introduksi teori manajemen
M01 introduksi teori manajemenM01 introduksi teori manajemen
M01 introduksi teori manajemenCoky Fauzi Alfi
 
Cao tat thi chung
Cao tat thi chungCao tat thi chung
Cao tat thi chungHoan KL
 
STAR RUNNING CLUB
STAR RUNNING CLUBSTAR RUNNING CLUB
STAR RUNNING CLUBSubrina22
 
V730 network connection
V730 network connectionV730 network connection
V730 network connectionLệnh Xung
 
Will the real you please stand up
Will the real you please stand upWill the real you please stand up
Will the real you please stand upMary Rachel Abonado
 
Офисная Мини-АТС: Максимальная эффективность при минимуме затрат
Офисная Мини-АТС: Максимальная эффективность при минимуме затратОфисная Мини-АТС: Максимальная эффективность при минимуме затрат
Офисная Мини-АТС: Максимальная эффективность при минимуме затратtabtabus
 
Портфель читателя-исследователя апробация
Портфель читателя-исследователя апробацияПортфель читателя-исследователя апробация
Портфель читателя-исследователя апробацияШкольная лига РОСНАНО
 
BringTo (RPA) Automation Solution
BringTo (RPA) Automation SolutionBringTo (RPA) Automation Solution
BringTo (RPA) Automation SolutionAdam Wolff
 
Эффективные решения для развития бизнеса в Интернете
Эффективные решения для развития бизнеса в ИнтернетеЭффективные решения для развития бизнеса в Интернете
Эффективные решения для развития бизнеса в Интернетеtabtabus
 
Timeline
TimelineTimeline
Timelinescitso
 

Andere mochten auch (18)

Face annotation for personal photos using collaborative face recognition in o...
Face annotation for personal photos using collaborative face recognition in o...Face annotation for personal photos using collaborative face recognition in o...
Face annotation for personal photos using collaborative face recognition in o...
 
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...
Sub-sampled dictionaries for coarse-to-fine sparse representation-based human...
 
Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...
Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...
Opsporen van videokopieën met behulp van visuele vingerafdrukken en schuivend...
 
0499154500
04991545000499154500
0499154500
 
изменения и дополнения вкол.дог.
изменения и дополнения  вкол.дог.изменения и дополнения  вкол.дог.
изменения и дополнения вкол.дог.
 
M01 introduksi teori manajemen
M01 introduksi teori manajemenM01 introduksi teori manajemen
M01 introduksi teori manajemen
 
Cao tat thi chung
Cao tat thi chungCao tat thi chung
Cao tat thi chung
 
Downov sindrom
Downov sindromDownov sindrom
Downov sindrom
 
STAR RUNNING CLUB
STAR RUNNING CLUBSTAR RUNNING CLUB
STAR RUNNING CLUB
 
V730 network connection
V730 network connectionV730 network connection
V730 network connection
 
AUA Development Conference 2012 - Matthew Andrews
AUA Development Conference 2012 - Matthew AndrewsAUA Development Conference 2012 - Matthew Andrews
AUA Development Conference 2012 - Matthew Andrews
 
Will the real you please stand up
Will the real you please stand upWill the real you please stand up
Will the real you please stand up
 
Офисная Мини-АТС: Максимальная эффективность при минимуме затрат
Офисная Мини-АТС: Максимальная эффективность при минимуме затратОфисная Мини-АТС: Максимальная эффективность при минимуме затрат
Офисная Мини-АТС: Максимальная эффективность при минимуме затрат
 
Портфель читателя-исследователя апробация
Портфель читателя-исследователя апробацияПортфель читателя-исследователя апробация
Портфель читателя-исследователя апробация
 
BringTo (RPA) Automation Solution
BringTo (RPA) Automation SolutionBringTo (RPA) Automation Solution
BringTo (RPA) Automation Solution
 
Pay ateention
Pay ateentionPay ateention
Pay ateention
 
Эффективные решения для развития бизнеса в Интернете
Эффективные решения для развития бизнеса в ИнтернетеЭффективные решения для развития бизнеса в Интернете
Эффективные решения для развития бизнеса в Интернете
 
Timeline
TimelineTimeline
Timeline
 

Mehr von Wesley De Neve

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Wesley De Neve
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Wesley De Neve
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Wesley De Neve
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Wesley De Neve
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla SymposiumWesley De Neve
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101Wesley De Neve
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusWesley De Neve
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Wesley De Neve
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesWesley De Neve
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Wesley De Neve
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsWesley De Neve
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Wesley De Neve
 
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesGhent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesWesley De Neve
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Wesley De Neve
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...Wesley De Neve
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Wesley De Neve
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingWesley De Neve
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Wesley De Neve
 

Mehr von Wesley De Neve (20)

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...
 
Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...Investigating the biological relevance in trained embedding representations o...
Investigating the biological relevance in trained embedding representations o...
 
Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...Impact of adversarial examples on deep learning models for biomedical image s...
Impact of adversarial examples on deep learning models for biomedical image s...
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
 
The 5th Aslla Symposium
The 5th Aslla SymposiumThe 5th Aslla Symposium
The 5th Aslla Symposium
 
Ghent University Global Campus 101
Ghent University Global Campus 101Ghent University Global Campus 101
Ghent University Global Campus 101
 
Booklet for the First GUGC Research Symposium
Booklet for the First GUGC Research SymposiumBooklet for the First GUGC Research Symposium
Booklet for the First GUGC Research Symposium
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Center for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global CampusCenter for Biotech Data Science at Ghent University Global Campus
Center for Biotech Data Science at Ghent University Global Campus
 
Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...Learning biologically relevant features using convolutional neural networks f...
Learning biologically relevant features using convolutional neural networks f...
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniques
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 
GUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and BioinformaticsGUGC Info Session - Informatics and Bioinformatics
GUGC Info Session - Informatics and Bioinformatics
 
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...
 
Ghent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesGhent University and GUGC-K: Overview of Teaching and Research Activities
Ghent University and GUGC-K: Overview of Teaching and Research Activities
 
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...
 
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 Exploring Deep Machine Learning for Automatic Right Whale Recognition and No... Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...
 
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...
 
Towards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processingTowards using multimedia technology for biological data processing
Towards using multimedia technology for biological data processing
 
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...
 

Kürzlich hochgeladen

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 

Kürzlich hochgeladen (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 

Towards data driven estimation of image tag relevance using visually similar and dissimilar folksonomy images

  • 1. TOWARDS DATA-DRIVEN ESTIMATION OF IMAGE TAG RELEVANCE USING VISUALLY SIMILAR AND DISSIMILAR FOLKSONOMY IMAGES ACM Multimedia 2012: Workshop on Socially-Aware Multimedia Nara – Oct. 29, 2012 Sihyoung Lee1, Wesley De Neve1,2, Yong Man Ro1 1 Image and Video Systems Lab, Dept. of Electrical Engineering, KAIST 2 Multimedia Lab, ELIS, Ghent University - iMinds
  • 2. 2 /19 Outline • Introduction • Motivation • Proposed Image Tag Relevance Estimation • Experiments • Conclusions
  • 3. 3 /19 Outline • Introduction • Motivation • Proposed Image Tag Relevance Estimation • Experiments • Conclusions
  • 4. 4 /19 Introduction • Increasing online availability of images – thanks to easy-to-use multimedia devices and online services – thanks to cheap storage and bandwidth – thanks to an increasing number of people going online • Some statistics – every minute, over 2,500 images are uploaded to Flickr – every day, over 300 million photos are uploaded to Facebook • How to effectively retrieve images for consumption purposes?
  • 5. 5 /19 Problems in Image Folksonomies • Most image search engines strongly depend on tags • Non-relevant tags hinder effective consumption Among the 60 images retrieved, only 20 images are related to ‘apple’
  • 6. 6 /19 Outline • Introduction • Motivation • Proposed Image Tag Relevance Estimation • Experiments • Conclusions
  • 7. 7 /19 Motivation • The correlation between visual and semantic similarity – is high for images that are semantically and visually distant – is lower for images that are semantically and visually close  The probability of having semantically similar images in a set of visually similar images is lower than the probability of having semantically dissimilar images in a set of visually dissimilar images • The above observation motivated us to develop a novel technique for tag relevance learning – takes advantage of both visually similar and dissimilar images
  • 8. 8 /19 • Introduction • Motivation • Proposed Image Tag Relevance Estimation • Experiments • Conclusions
  • 9. 9 /19 Conceptual Illustration of The Proposed Method food bicycle sign solder desert airplane desert desert bicycle desert building bicycle Image tag relevance estimation desert rifle desert bicycle desert Image tag relevance desert, bicycle using visually dissimilar images ship desert desert bicycle street bicycle nature desert bicycle Image tag relevance atomium using the proposed method basketball bicycle
  • 10. 10 /19 Proposed Method • Let r (i, t) be an image tag relevance learning function based on the proposed method, then it is defined as r ( i, t ) := rsimilar ( i, t , k ) − rdissimilar ( i, t , l ) ∑ vote( j, t ) – rsimilar ( i, t , k ) := nt [ N s ( i, k ) ] − nt [ N rand ( k ) ] = j=N∑i ,vote( j , t ) − k ⋅ j=I s( k) I ∑ vote( j , t ) – r dissimilar ( i, t , u ) := n [ N ( i, l ) ] − n [ N ( l ) ] = ∑( vote( j, t ) − l ⋅ t d t rand j =I ) j = N d i ,l I where nt[∙] represents the number of images annotated with t, Ns(i,k) is a set of k images visually similar to i , Nd(i,l) is a set of l images visually dissimilar to i , and Nrand(k) is a set of k randomly selected neighbors Relationship between rsimilar ( i, t , k ) , rdissimilar ( i, t , l ) , and r ( i, t ) rsimilar ( i, t , k ) rdissimilar ( i, t , l ) r ( i, t ) t relevant + - ++ t irrelevant - + --
  • 11. 11 /19 Rationale • For trelevant relevant to the content of i, – P(trelevant|Ns(i,k)) is higher than P(trelevant|Nrand(k))  rsimilar(trelevant,i,k) thus returns a positive value – P(trelevant|Nd(i,l)) is lower than P(trelevant|Nrand(l))  rdissimilar(trelevant,i,l) thus returns a negative value • For tirrelevant irrelevant to the content of i, – P(tirrelevant|Ns(i,k)) is lower than P(tirrelevant|Nrand(k))  rsimilar(tirrelevant,i,k) thus returns a negative value – P(tirrelevant|Nd(i,l)) is higher than P(tirrelevant|Nrand(l))  rdissimilar(tirrelevant,i,l) thus returns a positive value
  • 12. 12 /19 Outline • Introduction • Motivation • The Proposed Image Tag Relevance Estimation • Experiments • Conclusions
  • 13. 13 /19 Experimental Setup (1/2) • Image set used: subset of MIRFlickr-1M – 100,000 images annotated with 1,130,342 tags  by 13,343 users  concept vocabulary of 159,300 unique tags – test set  1,000 images annotated with at least four tags  annotated with 24,474 tags • we manually classified 6,534 tags as correct • we manually classified 17,940 tags as noisy • Image descriptor – Bag of Visual Words (BoVW)  vocabulary size: 500 – use of cosine similarity for measuring image similarity
  • 14. 14 /19 Experimental Setup (2/2) • Metrics used for evaluating the effectiveness of the proposed technique for image tag relevance estimation – for image tag refinement A noise NL = A where NL (Noise Level) denotes the proportion of irrelevant tag assignments in the set of all tag assignments, A is the set of tag assignments in an image folksonomy, Anoise is the set of incorrect (noisy) tag assignments – for tag-based image retrieval It ∩ I t ,m relevant retrieved P @ m for t = , m where Tt is the set of all folksonomy images relevant to t, relevant Tt,mrelevant is the set of the m topmost images that have been retrieved for t (given the estimated tag relevance values)
  • 15. Effectiveness of Image Tag Relevance 15 /19 Estimation for Image Tag Refinement • Effectiveness of image tag relevance estimation using visually similar and dissimilar images, compared to previous approaches – neighbor voting and a variant of neighbor voting estimate image tag relevance by only making use of visually similar images   After image tag refinement Before image tag refinement Using visually similar images Using the proposed technique Number of relevant tags 6,534 5,881 5,881 Number of irrelevant tags 17,940 13,117 12,094 NL 0.733 0.690 0.673
  • 16. Effectiveness of Image Tag Relevance 16 /19 Estimation for Tag-based Image Retrieval • Effectiveness of image tag relevance estimation using visually similar and dissimilar images, compared to previous approaches – neighbor voting and a variant of neighbor voting estimate image tag relevance by only making use of visually similar images
  • 17. 17 /19 Outline • Introduction • Motivation • Proposed Image Tag Relevance Estimation • Experiments • Conclusions
  • 18. 18 /19 Conclusions • We proposed an image tag relevance technique that makes use of both visually similar and dissimilar images – increases the difference in image tag relevance between tags relevant and tags not relevant with respect to a seed image – comes with a low increase in computational complexity • The effectiveness of the proposed technique was confirmed using MIRFLICKR-25000 and MIRFLICKR-1M – by showing that the proposed technique allows increasing the effectiveness of tag refinement and tag-based image retrieval • Future research – combining visual information and tag statistics comparing our data-driven approach with a classifier-based approach for detecting a number of predefined semantic concepts
  • 19. Thank you! Any questions?