SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Downloaden Sie, um offline zu lesen
UNICAMP-UFMG at MediaEval 2012:
             Genre Tagging Task

                                    ´
Jurandy Almeida1 , Thiago Salles2 , Eder F. Martins2 , Ot´vio A. B. Penatti1 ,
                                                         a
  Ricardo da S. Torres , Marcos A. Gon¸alves , and Jussara M. Almeida2
                      1
                                        c    2


          1
              RECOD Lab, Institute of Computing, University of Campinas (UNICAMP)
                {jurandy.almeida, penatti, rtorres}@ic.unicamp.br
      2 Dept.   of Computer Science, Federal University of Minas Gerais (UFMG)
                {tsalles, ederfm, mgoncalv, jussara}@dcc.ufmg.br


                     MediaEval’12 – Pisa, Italy – October 4-5 – 2012
Genre Tagging Task at MediaEval 2012                                     2




  Automatically assign tags to Internet videos collected from blip.tv.



  The focus is on tags related to the category of the video.



  Each video belongs to only one category.
Available Resources                                                  3




              Table: Resources made available for the task.

      Resources            Textual                    Visual
        Used      Title, Tags, Descriptions           Videos
      Not Used    Tweets, ASR transcripts      Shots and Keyframes
The 26 Genre Tags                                           4




  Default Category          Documentary
  Technology
                            The Mainstream Media
  Music and Entertainment   Business
  Politics
                            Food & Drink
  Educational
                            Health
  Videoblogging             Conferences and Other Events
  Religion
                            Literature
  Movies and Television     The Environment
  Sports
                            Personal or Auto-biographical
  Art                       School and Education
  Comedy
                            Travel
  Gaming                    Web Development and Sites
  Citizen Journalism        Autos and Vehicles
es            s
                   5




                                                                                                                                 cl           te
                                                                                                                               hi          si
                                                                                                                           ve          nd           l
                                                                                                                     d              ta           ca
                                                                                                                an               en           hi
                                                                                                           s                   m          ap
                                                                                                     to                    p           gr
                                                                                               au                       lo                   io
                                                                                                                   ve                      -b
                                                                                                           de                       to
                                                                                                   eb                          au                  n
                                                                                               w                       or                       io
                                                                                                                al                         at
                                                                                                           on                       uc
                                                                                                     rs                      ed
                                                                                               pe                      d
                                                                                                                   an
                                                                                                           ol
                                                                                                     ho
                                                                                               sc
                                                                                                                                       t
                                                                                                      ve
                                                                                                           l                    en
                                                                                               tra                      nm                                  nt
                                                                                                                                                              s
                                                                                                                    ro
                                                                                                                vi                                       ve
                                                                                                         en                                            re
                                                                                               th
                                                                                                     e                                           he
                                                                                                                                            ot
Distribution of Genres (Development Set)




                                                                                                                   e
                                                                                                            ur                         d
                                                                                                         at                    an
                                                                                                   er                     s
                                                                                               lit                     ce
                                                                                                                en
                                                                                                           er                   k
                                                                                                     nf                     in
                                                                                               co                      dr
                                                                                                                   d
                                                                                                           an
                                                                                                     od
                                                                                               fo
                                                                                                           th
                                                                                                     al                                           ia
                                                                                               he                                           ed
                                                                                                                ss                      m
                                                                                                         ne                    am
                                                                                                     si
                                                                                               bu                       tre
                                                                                                                 ns
                                                                                                               ai
                                                                                                         m
                                                                                                     e                     ry
                                                                                               th                      ta
                                                                                                                en
                                                                                                           m
                                                                                                     cu
                                                                                               do
                                                                                                              g                       m
                                                                                                           in                    lis
                                                                                                     m                     na
                                                                                               ga                    ur
                                                                                                                jo
                                                                                                      en
                                                                                                   tiz
                                                                                               ci
                                                                                                                y
                                                                                                           ed
                                                                                                     m
                                                                                               co
                                                                                                     t
                                                                                               ar
                                                                                                           ts
                                                                                                     or                                    on
                                                                                               sp                                        si
                                                                                                               n                       vi
                                                                                                           io                     le
                                                                                                     lig                       te
                                                                                               re                        d
                                                                                                                   an
                                                                                                               s
                                                                                                       ie
                                                                                                     ov                     ng
                                                                                               m                   g   gi
                                                                                                                lo                                      t
                                                                                                         ob                                       en
                                                                                                   de                      l                     m
                                                                                               vi                    na                     in
                                                                                                           at
                                                                                                                io                   rta
                                                                                                                                 te
                                                                                                     uc                    en
                                                                                               ed                      d
                                                                                                                an
                                                                                                       ic
                                                                                                     us
                                                                                               m                     gy
                                                                                                                lo
                                                                                                           no
                                                                                                     ch
                                                                                               te
                                                                                                           s                        y
                                                                                                     lit
                                                                                                         ic                    or
                                                                                               po                          eg
                                                                                                                     at
                                                                                                                tc
                                                                                                           ul
                                                                                                     fa
                                                                                               de
                                           800

                                                 700

                                                       600

                                                             500

                                                                   400

                                                                         300

                                                                               200

                                                                                     100

                                                                                           0
                                                              # of objects
Proposed Framework                        6




It relies on two learning strategies:


 1. Meta-learning scheme for combining
    classifiers (stacked generalization)
    trained with metadata information
    model as a bag-of-words.

 2. Histogram of motion patterns joint
    with a KNN classifier for visual
    information.
Visual Approach (HMP)                                                        7




  Keyframes: Not used

  Applying an algorithm to compare video sequences.
       “Comparison of Video Sequences with Histograms of Motion Patterns”.
       J. Almeida, N. J. Leite, and R. S. Torres.
       IEEE Conference on Image Processing (ICIP) 2011.

  It relies on three steps:
   1. partial decoding;
   2. feature extraction;
   3. signature generation.
Visual Approach (HMP)                                                                                                                 8




      Video Frames




                                                                        Motion Pattern           1    1             2   1
                                                                      0101100110010011
                                                                                                 2    1             0   3
                                                                                                Temporal        Spatial


                                            Histogram Distribution
    1: Partial Decoding                                                                        3: Signature Generation


                                    DC coefficient
                                                                                           Time Series of Macroblocks
                                            Pixel Block
                                                                                         106 111          91   94           90   93
                             Macroblock                    2: Feature Extraction
                                                                                         100    88        95   90           96   91
                                                                                     Previous             Current           Next


          I-frames


[Almeida et al., Comparison of Video Sequences with Histograms of Motion Patterns. ICIP 2011]
Textual Approach (BoW)                                     9




  Title, tags, and description used as textual features.

  Only metadata in English.

  Porter Stemming algorithm to remove affixes.

  Stop words removed.

  Bag-of-Words using T F xIDF as weights.
The Classifiers                                                                             10




  KNN for visual features

  Stacked generalization for textual features
      Level 0
         ◮   K-Nearest Neighbors (KNN): using the k nearest training examples to assign a
             class to test example.
         ◮   Na¨ Bayes (NB): a probabilistic learning method that aims at inferring a model
                ıve
             for each class by assigning to a test example the class associated with the most
             probable model that would have generated it.
         ◮   Random Forests (RF) a variation of decision tree’s bagging, in which an ensemble
             of de-correlated decision trees is learned.

      Level 1
         ◮   Support Vector Machine (SVM): a classifier that aims at finding an optimal
             separating hyperplane between the positive and negative training documents,
             maximizing the distance (margin) to the closest points from either class.
Stacked Generalization                                   11




             Figure: Stacked Generalization framework.
Experimental Protocol                          12




  5-fold cross validation

  Development set size: 5,288 videos (∼33%)

  Test set size: 10,161 videos (∼67%)

  Vocabulary of 54,796 terms

  Mean of 31.03 terms per video

  Total number of visual features: 4,888

  Mean of 3,703.59 visual features per video
Experimental Results                                                13




                    Table: MAP for each submitted run.

       Experiment                    Input                 MAP
         Run 1            audio/visual only, no ASR        0.1238
         Run 4        everything allowed, no uploader ID   0.2112
14




                                                                                                         s
                                                                                                      ite l
                                                                                            es s ca
                                                                                         cl d hi
                                                                                      hi an p
                                                                                    ve nt ra
                                                                                  d e iog
                                                                                an pm b
                                                                              s lo o− n
                                                                           to ve ut tio
                                                                      au de a a    or uc
                                                                         eb al d
                                                                      w on d e
                                                                           rs an                             s
                                                                      pe ol               t               nt
                           run1
                           run4




                                                                           ho         en               ve
                                                                      sc el nm                      re
                                                                           v iro                he
                                                                      tra env                ot
                                                                                        nd
                                                                         e e a                            Figure: AP per class achieved in each run.
                                                                      th tur es
                                                                             a c
                                                                         er n k
                                                                      lit ere drin
                                                                           nf d
                                                                      co an                       ia
                                                                         od                   ed
                                                                      fo lth               m
                                                                           a s am
                                                                      he s e  ne tr
                                                                           si ins
                                                                      bu ma tary sm
                                                                         e       n li
                                                                      th me rna
                                                                           cu jou
                                                                      do en
                                                                         tiz
                                                                      ci ing
                                                                           m
                                                                      ga edy
                                                                           m
                                                                      co                          n
                                                                          t                    io
                                                                      ar ts                 is
                                                                           or            ev
                                                                                      el
                                                                      sp ics d t
                                                                           lit an
                                                                      po ies
Experimental Results




                                                                         ov           g
                                                                                                    en
                                                                                                       t
                                                                      m ion gin                  m
                                                                          lig og             in
                                                                      re obl al rta
                                                                                  n e
                                                                         de o nt
                                                                      vi cati d e
                                                                           u n
                                                                      ed ic a y y
                                                                                        r
                                                                         us og o
                                                                      m nol teg
                                                                         ch ca
                                                                      te ult
                                                                           fa
                                                                      de
                       1




                                  0.8




                                         0.6




                                                     0.4




                                                            0.2




                                                                  0
                                        Average Precision
Experimental Results                                                              15




Visual features
perform better in most imbalanced classes like “food and drink” (AP = 0.372),
“school and education” (AP = 0.509), and “autos and vehicles” (AP = 0.692).


Textual features
perform better in the more frequent classes in development set as “default category”
(AP = 0.860) and “business” (AP = 0.557).
Conclusion                                                                          16




Remarks
We used different learning strategies for each data modality: video similarity for
processing visual content and an ensemble of classifiers for text-processing.


Findings
Obtained results demonstrate that the proposed framework is promising. By
combining textual and visual features, we can make a contribution to better results.


Future work
The investigation of learning strategies for combining features from different
modalities and considering other information sources, such as ASR transcripts, to
include more features semantically related to each category.
Acknowledgements                                  17




  Organizers of Tagging Task and MediaEval 2012


  Brazilian funding agencies
      FAPEMIG, FAPESP, CAPES, and CNPq

Weitere ähnliche Inhalte

Was ist angesagt?

Social Media Strategies For Business
Social Media Strategies For BusinessSocial Media Strategies For Business
Social Media Strategies For BusinessTWO Social
 
2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San Leandro2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San LeandroSan Leandro Honda
 
Roleplaying in virtual worlds
Roleplaying in virtual worldsRoleplaying in virtual worlds
Roleplaying in virtual worldsDoris Molero
 
2010 Honda Insight Hybrid Boston
2010 Honda  Insight Hybrid Boston2010 Honda  Insight Hybrid Boston
2010 Honda Insight Hybrid BostonAtamian Honda
 
2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los AngelesMiller Honda
 
2010 Honda Insight Hybrid San Fernando
2010 Honda Insight Hybrid San Fernando2010 Honda Insight Hybrid San Fernando
2010 Honda Insight Hybrid San FernandoMiller Honda
 
2010 Honda Insight Hybrid
2010 Honda Insight Hybrid2010 Honda Insight Hybrid
2010 Honda Insight HybridBell Honda
 
KEC Green Games Catalog 3
KEC Green Games Catalog 3KEC Green Games Catalog 3
KEC Green Games Catalog 3KEC
 
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Andrea Wiggins
 
Open Source Success: jQuery
Open Source Success: jQueryOpen Source Success: jQuery
Open Source Success: jQueryjeresig
 
Recruitment And Social Media
Recruitment And Social MediaRecruitment And Social Media
Recruitment And Social MediaTWO Social
 
Prom proposals
Prom proposalsProm proposals
Prom proposalsTomRaski
 
Ws5.1.3 finished
Ws5.1.3 finishedWs5.1.3 finished
Ws5.1.3 finishedhumaira28
 

Was ist angesagt? (19)

Social Media Strategies For Business
Social Media Strategies For BusinessSocial Media Strategies For Business
Social Media Strategies For Business
 
2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San Leandro2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San Leandro
 
1
11
1
 
Roleplaying in virtual worlds
Roleplaying in virtual worldsRoleplaying in virtual worlds
Roleplaying in virtual worlds
 
2010 Honda Insight Hybrid Boston
2010 Honda  Insight Hybrid Boston2010 Honda  Insight Hybrid Boston
2010 Honda Insight Hybrid Boston
 
2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles
 
Kiss the BRD Good-Bye
Kiss the BRD Good-ByeKiss the BRD Good-Bye
Kiss the BRD Good-Bye
 
2010 Honda Insight Hybrid San Fernando
2010 Honda Insight Hybrid San Fernando2010 Honda Insight Hybrid San Fernando
2010 Honda Insight Hybrid San Fernando
 
2010 Honda Insight Hybrid
2010 Honda Insight Hybrid2010 Honda Insight Hybrid
2010 Honda Insight Hybrid
 
Moodle 2.0 themes
Moodle 2.0 themesMoodle 2.0 themes
Moodle 2.0 themes
 
3iying Final Heidi
3iying Final Heidi3iying Final Heidi
3iying Final Heidi
 
KEC Green Games Catalog 3
KEC Green Games Catalog 3KEC Green Games Catalog 3
KEC Green Games Catalog 3
 
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
Crowdsourcing Citizen Science Data Quality with a Human-Computer Learning Net...
 
Open Source Success: jQuery
Open Source Success: jQueryOpen Source Success: jQuery
Open Source Success: jQuery
 
Recruitment And Social Media
Recruitment And Social MediaRecruitment And Social Media
Recruitment And Social Media
 
Developing and Assessing Teacher Effectiveness
Developing and Assessing Teacher EffectivenessDeveloping and Assessing Teacher Effectiveness
Developing and Assessing Teacher Effectiveness
 
Prom proposals
Prom proposalsProm proposals
Prom proposals
 
Presentation
PresentationPresentation
Presentation
 
Ws5.1.3 finished
Ws5.1.3 finishedWs5.1.3 finished
Ws5.1.3 finished
 

Ähnlich wie UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task

Addressing cardiovascular disease at EU level: tangible plans for the future
Addressing cardiovascular disease at EU level: tangible plans for the futureAddressing cardiovascular disease at EU level: tangible plans for the future
Addressing cardiovascular disease at EU level: tangible plans for the futurePlan de Calidad para el SNS
 
2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia Beach2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia BeachCheckered Flag Honda
 
COH Online- The future of screening for distress in cancer settings (February11)
COH Online- The future of screening for distress in cancer settings (February11)COH Online- The future of screening for distress in cancer settings (February11)
COH Online- The future of screening for distress in cancer settings (February11)Alex J Mitchell
 
2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los AngelesMiller Honda Van Nuys
 
Save Publishing
Save PublishingSave Publishing
Save PublishingPaul Ford
 
"Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No...
"Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No..."Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No...
"Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No...IKT-Norge
 
Making social media work for you | StreetGames National Conference 2013
Making social media work for you | StreetGames National Conference 2013Making social media work for you | StreetGames National Conference 2013
Making social media work for you | StreetGames National Conference 2013StreetGames
 
Media – data analysis
Media – data analysisMedia – data analysis
Media – data analysislennylavers
 
Media – data analysis
Media – data analysisMedia – data analysis
Media – data analysislennylavers
 
Geography of time and touchpoints time use planning 2012
Geography of time and touchpoints time use planning 2012Geography of time and touchpoints time use planning 2012
Geography of time and touchpoints time use planning 2012Posterscope
 
Conversation Clusters: Grouping Conversation Through Human Computer Dialog
Conversation Clusters: Grouping Conversation Through Human Computer DialogConversation Clusters: Grouping Conversation Through Human Computer Dialog
Conversation Clusters: Grouping Conversation Through Human Computer DialogTony Bergstrom
 
Social media strategies for business
Social media strategies for businessSocial media strategies for business
Social media strategies for businessRichard Spencer
 
Ipsos store brands muscle in May 2012
Ipsos store brands muscle in May 2012Ipsos store brands muscle in May 2012
Ipsos store brands muscle in May 2012Damjana Kocjanc
 
2010 Honda Insight Hybrid Omaha Nebraska
2010 Honda Insight Hybrid Omaha Nebraska2010 Honda Insight Hybrid Omaha Nebraska
2010 Honda Insight Hybrid Omaha NebraskaHonda Cars of Bellevue
 
2010 insight-hybrid-brochure-honda-katy-houston-tx
2010 insight-hybrid-brochure-honda-katy-houston-tx2010 insight-hybrid-brochure-honda-katy-houston-tx
2010 insight-hybrid-brochure-honda-katy-houston-txHonda Cars of Katy
 
2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of Temecula
 2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of Temecula 2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of Temecula
2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of TemeculaDCH Honda of Temecula
 

Ähnlich wie UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task (20)

Addressing cardiovascular disease at EU level: tangible plans for the future
Addressing cardiovascular disease at EU level: tangible plans for the futureAddressing cardiovascular disease at EU level: tangible plans for the future
Addressing cardiovascular disease at EU level: tangible plans for the future
 
2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia Beach2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia Beach
 
Hit a Grand Slam with Legal Research
Hit a Grand Slam with Legal ResearchHit a Grand Slam with Legal Research
Hit a Grand Slam with Legal Research
 
COH Online- The future of screening for distress in cancer settings (February11)
COH Online- The future of screening for distress in cancer settings (February11)COH Online- The future of screening for distress in cancer settings (February11)
COH Online- The future of screening for distress in cancer settings (February11)
 
2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles
 
Who Wants To Be A Librarian
Who Wants To Be A LibrarianWho Wants To Be A Librarian
Who Wants To Be A Librarian
 
Save Publishing
Save PublishingSave Publishing
Save Publishing
 
"Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No...
"Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No..."Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No...
"Ukraine and Global Sourcing", The Ambassador of Ukraine to the Kingdom of No...
 
Making social media work for you | StreetGames National Conference 2013
Making social media work for you | StreetGames National Conference 2013Making social media work for you | StreetGames National Conference 2013
Making social media work for you | StreetGames National Conference 2013
 
Group L
Group LGroup L
Group L
 
Media – data analysis
Media – data analysisMedia – data analysis
Media – data analysis
 
Media – data analysis
Media – data analysisMedia – data analysis
Media – data analysis
 
Geography of time and touchpoints time use planning 2012
Geography of time and touchpoints time use planning 2012Geography of time and touchpoints time use planning 2012
Geography of time and touchpoints time use planning 2012
 
Conversation Clusters: Grouping Conversation Through Human Computer Dialog
Conversation Clusters: Grouping Conversation Through Human Computer DialogConversation Clusters: Grouping Conversation Through Human Computer Dialog
Conversation Clusters: Grouping Conversation Through Human Computer Dialog
 
Social media strategies for business
Social media strategies for businessSocial media strategies for business
Social media strategies for business
 
Ipsos store brands muscle in May 2012
Ipsos store brands muscle in May 2012Ipsos store brands muscle in May 2012
Ipsos store brands muscle in May 2012
 
Google
GoogleGoogle
Google
 
2010 Honda Insight Hybrid Omaha Nebraska
2010 Honda Insight Hybrid Omaha Nebraska2010 Honda Insight Hybrid Omaha Nebraska
2010 Honda Insight Hybrid Omaha Nebraska
 
2010 insight-hybrid-brochure-honda-katy-houston-tx
2010 insight-hybrid-brochure-honda-katy-houston-tx2010 insight-hybrid-brochure-honda-katy-houston-tx
2010 insight-hybrid-brochure-honda-katy-houston-tx
 
2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of Temecula
 2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of Temecula 2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of Temecula
2010 Honda Insight Hybrid Sedan Brochure | DCH Honda of Temecula
 

Mehr von MediaEval2012

MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval2012
 
A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding MediaEval2012
 
Brave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music TaggingBrave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music TaggingMediaEval2012
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012MediaEval2012
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskMediaEval2012
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskMediaEval2012
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...MediaEval2012
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account MatchingMediaEval2012
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsMediaEval2012
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskMediaEval2012
 
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...MediaEval2012
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval2012
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...MediaEval2012
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...MediaEval2012
 
The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioMediaEval2012
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskMediaEval2012
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodMediaEval2012
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...MediaEval2012
 

Mehr von MediaEval2012 (20)

MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 Opening
 
Closing
ClosingClosing
Closing
 
A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding
 
Brave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music TaggingBrave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music Tagging
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking Task
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account Matching
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and Onwards
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy Task
 
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
 
mevd2012 esra_
 mevd2012 esra_ mevd2012 esra_
mevd2012 esra_
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
 
The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes Detectio
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic method
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
 

Kürzlich hochgeladen

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Kürzlich hochgeladen (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task

  • 1. UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task ´ Jurandy Almeida1 , Thiago Salles2 , Eder F. Martins2 , Ot´vio A. B. Penatti1 , a Ricardo da S. Torres , Marcos A. Gon¸alves , and Jussara M. Almeida2 1 c 2 1 RECOD Lab, Institute of Computing, University of Campinas (UNICAMP) {jurandy.almeida, penatti, rtorres}@ic.unicamp.br 2 Dept. of Computer Science, Federal University of Minas Gerais (UFMG) {tsalles, ederfm, mgoncalv, jussara}@dcc.ufmg.br MediaEval’12 – Pisa, Italy – October 4-5 – 2012
  • 2. Genre Tagging Task at MediaEval 2012 2 Automatically assign tags to Internet videos collected from blip.tv. The focus is on tags related to the category of the video. Each video belongs to only one category.
  • 3. Available Resources 3 Table: Resources made available for the task. Resources Textual Visual Used Title, Tags, Descriptions Videos Not Used Tweets, ASR transcripts Shots and Keyframes
  • 4. The 26 Genre Tags 4 Default Category Documentary Technology The Mainstream Media Music and Entertainment Business Politics Food & Drink Educational Health Videoblogging Conferences and Other Events Religion Literature Movies and Television The Environment Sports Personal or Auto-biographical Art School and Education Comedy Travel Gaming Web Development and Sites Citizen Journalism Autos and Vehicles
  • 5. es s 5 cl te hi si ve nd l d ta ca an en hi s m ap to p gr au lo io ve -b de to eb au n w or io al at on uc rs ed pe d an ol ho sc t ve l en tra nm nt s ro vi ve en re th e he ot Distribution of Genres (Development Set) e ur d at an er s lit ce en er k nf in co dr d an od fo th al ia he ed ss m ne am si bu tre ns ai m e ry th ta en m cu do g m in lis m na ga ur jo en tiz ci y ed m co t ar ts or on sp si n vi io le lig te re d an s ie ov ng m g gi lo t ob en de l m vi na in at io rta te uc en ed d an ic us m gy lo no ch te s y lit ic or po eg at tc ul fa de 800 700 600 500 400 300 200 100 0 # of objects
  • 6. Proposed Framework 6 It relies on two learning strategies: 1. Meta-learning scheme for combining classifiers (stacked generalization) trained with metadata information model as a bag-of-words. 2. Histogram of motion patterns joint with a KNN classifier for visual information.
  • 7. Visual Approach (HMP) 7 Keyframes: Not used Applying an algorithm to compare video sequences. “Comparison of Video Sequences with Histograms of Motion Patterns”. J. Almeida, N. J. Leite, and R. S. Torres. IEEE Conference on Image Processing (ICIP) 2011. It relies on three steps: 1. partial decoding; 2. feature extraction; 3. signature generation.
  • 8. Visual Approach (HMP) 8 Video Frames Motion Pattern 1 1 2 1 0101100110010011 2 1 0 3 Temporal Spatial Histogram Distribution 1: Partial Decoding 3: Signature Generation DC coefficient Time Series of Macroblocks Pixel Block 106 111 91 94 90 93 Macroblock 2: Feature Extraction 100 88 95 90 96 91 Previous Current Next I-frames [Almeida et al., Comparison of Video Sequences with Histograms of Motion Patterns. ICIP 2011]
  • 9. Textual Approach (BoW) 9 Title, tags, and description used as textual features. Only metadata in English. Porter Stemming algorithm to remove affixes. Stop words removed. Bag-of-Words using T F xIDF as weights.
  • 10. The Classifiers 10 KNN for visual features Stacked generalization for textual features Level 0 ◮ K-Nearest Neighbors (KNN): using the k nearest training examples to assign a class to test example. ◮ Na¨ Bayes (NB): a probabilistic learning method that aims at inferring a model ıve for each class by assigning to a test example the class associated with the most probable model that would have generated it. ◮ Random Forests (RF) a variation of decision tree’s bagging, in which an ensemble of de-correlated decision trees is learned. Level 1 ◮ Support Vector Machine (SVM): a classifier that aims at finding an optimal separating hyperplane between the positive and negative training documents, maximizing the distance (margin) to the closest points from either class.
  • 11. Stacked Generalization 11 Figure: Stacked Generalization framework.
  • 12. Experimental Protocol 12 5-fold cross validation Development set size: 5,288 videos (∼33%) Test set size: 10,161 videos (∼67%) Vocabulary of 54,796 terms Mean of 31.03 terms per video Total number of visual features: 4,888 Mean of 3,703.59 visual features per video
  • 13. Experimental Results 13 Table: MAP for each submitted run. Experiment Input MAP Run 1 audio/visual only, no ASR 0.1238 Run 4 everything allowed, no uploader ID 0.2112
  • 14. 14 s ite l es s ca cl d hi hi an p ve nt ra d e iog an pm b s lo o− n to ve ut tio au de a a or uc eb al d w on d e rs an s pe ol t nt run1 run4 ho en ve sc el nm re v iro he tra env ot nd e e a Figure: AP per class achieved in each run. th tur es a c er n k lit ere drin nf d co an ia od ed fo lth m a s am he s e ne tr si ins bu ma tary sm e n li th me rna cu jou do en tiz ci ing m ga edy m co n t io ar ts is or ev el sp ics d t lit an po ies Experimental Results ov g en t m ion gin m lig og in re obl al rta n e de o nt vi cati d e u n ed ic a y y r us og o m nol teg ch ca te ult fa de 1 0.8 0.6 0.4 0.2 0 Average Precision
  • 15. Experimental Results 15 Visual features perform better in most imbalanced classes like “food and drink” (AP = 0.372), “school and education” (AP = 0.509), and “autos and vehicles” (AP = 0.692). Textual features perform better in the more frequent classes in development set as “default category” (AP = 0.860) and “business” (AP = 0.557).
  • 16. Conclusion 16 Remarks We used different learning strategies for each data modality: video similarity for processing visual content and an ensemble of classifiers for text-processing. Findings Obtained results demonstrate that the proposed framework is promising. By combining textual and visual features, we can make a contribution to better results. Future work The investigation of learning strategies for combining features from different modalities and considering other information sources, such as ASR transcripts, to include more features semantically related to each category.
  • 17. Acknowledgements 17 Organizers of Tagging Task and MediaEval 2012 Brazilian funding agencies FAPEMIG, FAPESP, CAPES, and CNPq