SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Multimodal Music Tagging Task


Nicola Orio – University of Padova
Cynthia C. S. Liem – Delft University of Technology
Geoffroy Peeters – UMR STMS IRCAM-CNRS, Paris
Markus Schedl – Johannes Kepler University, Linz

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   1
Multimodal music tagging
• Definition
    • Songs of a commercial music library need to be categorized
      according to their usage in TV and radio broadcasts (e.g.
      soundtracks, jingles)

• Practical motivation
    • The search for suitable music for video productions is a
      major activity for professionals and lay users alike
         • Collaborative filtering systems are taking their role
             • Notwithstanding their known limitations: long-tail, cold start…
    • Annotating professional music libraries is another important
      professional activity

MediaEval, Pisa 05/10/2012    MusiClef: Multimodal Music Tagging Task            2
Human assessment




 Different sources of information are routinely exploited
 by professionals to overcome limitations of individual media
MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   3
Goals of MusiClef
• To focus evaluation on professional application scenarios
    • Textual description of music items

• To grant replication of experiments and results
    • Feature extraction phase is crucial – released features
      computed with public, open-source library (MIRToolbox)

• To promote the exploitation of multimodal sources of
  information
    • Content (audio) + Context (tags & webpages)

• To disseminate music related initiatives
    • Outside the music information retrieval community
MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   4
Evaluation initiatives – 1
• MIREX (since 2004)
    • Community-based selection of tasks
         • Many tasks address audio feature extraction algorithms
    • Participants submit algorithms that are run by organizers
         • Music files are not shared with participants

• Million Song Dataset (since 2011)
    • Task on music recommendation proposed by organizers
    • Audio features are computed using proprietary algorithms
         • Only features are shared with participants


MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   5
Evaluation initiatives – 2

• Quaero-Eval (since 2012)
         • Tasks agreed with participants
             • Strategies to grant public access to evaluation results
    • Participants run training experiments on a shared repository
         • Runs on test set made by the organizers




MediaEval, Pisa 05/10/2012     MusiClef: Multimodal Music Tagging Task   6
Test collection – 1
• Individual songs of pop and rock music
    • 1355 songs (from 218 artists)
    • train (975) and test (380) split

• Social tags
    • Gathered from Last.fm API

• Multilingual sets of Web pages related to artists+albums
    • Mined querying Google

• Acoustic features: MFCC (using MIRToolbox) with a
  window length of 200ms and 50% overlap

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   7
Test collection – 2
• Test collection created starting from the “500 Greatest
  Songs of All Time” (Rolling Stone)
    • Expected high number of social tags and web pages

• Ground truth created by experts in the domain
    • 355 tags selected (167 genre, 288 usage)
         • Tags associated to less than 20 songs were discarded

• Reference implementation in Matlab
    • Participants has an example to run a complete experiment
    • Code for the evaluation made already available

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   8
Evaluation measures

• Standard IR measures

•   Accuracy
•   Precision
•   Recall
•   Specificity
•   F-measure



MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   9
Examining tags more closely
• Some tags are more equal than others…


                   hard rock                                                      ballroom
                                              melancholic
       travel
                               countryside
                                                                         bright

• Thus, we propose to also analyze results employing a
  higher-level tag categorization

MediaEval, Pisa 05/10/2012     MusiClef: Multimodal Music Tagging Task                       10
Tag categorization – 1
• Affective, mood-related aspects:
    • activity: the amount of perceived music
      activity, without implying strong positive or
      negative affective qualities (e.g.
      'fast', 'mellow', 'lazy')
    • affective state: affective qualities that can only be
      connected and attributed to living beings (e.g.
      'aggressive', 'hopeful')
    • atmosphere: affective qualities that can be
      connected to environments (e.g.
      'chaotic', 'intimate').

MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   11
Tag categorization – 2
• Situation, time and space aspects of the music:
    • Physical situation: concrete physical environments
      (e.g. 'city', 'night').
    • Occasion: implications of time and space, typically
      connected to social events (e.g. 'holiday', 'glamour').
• Sociocultural genre (e.g. 'new wave', 'r&b', 'punk')
• Sound qualities:
    • timbral aspects (e.g. 'acoustic', 'bright')
    • temporal aspects (e.g. 'beat', 'groove').
• Other (e.g. 'catchy', 'evocative').
MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   12
Reference implementation
• Made in MATLAB and released publicly
• Simple and straightforward approaches:
    • Individual GMMs for audio, user tags, web pages
    • Tagging process: 1-NN qualification using symmetrized KL


• Scenarios tested:
    • Audio, user tags, web pages individually
    • Majority vote
    • Union


MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   13
Baseline results – 1
• Evaluation of the submitted runs and of the reference
  implementation
    • Results with different modalities over the full dataset

strategy          accuracy           recall           precision         specificity    f-measure
audio                        0.894            0.148           0.127            0.939        0.126
tags                         0.898            0.061           0.039            0.942        0.037
web pages                    0.897            0.050           0.007            0.954        0.011
majority                     0.880            0.123           0.086            0.922        0.086
union                        0.824            0.240           0.115            0.845        0.134


MediaEval, Pisa 05/10/2012           MusiClef: Multimodal Music Tagging Task                  14
Baseline results – 2
                                                                       1. activity, energy
                                                                       2. affective state
                                                                       3. atmosphere
                                                                       4. other
                                                                       5. situation: occasion
                                                                       6. situation: physical
                                                                       7. sociocultural genre
                                                                       8. sound: temporal
                                                                       9: sound: timbral




MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task                    15
Participation

• Initially a lot of interest - about 8 explicitly interested
  parties
• But ultimately just one participant (LUTIN UserLab)
    • Aggregation of estimators
• Currently investigating what happened to the 7 others
    • So far, it appears ISMIR 2012 was inconveniently close
    • The 3 other MusiClef co-organizers will discuss this there



MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   16
Conclusions

• We established a multimodal music tagging benchmark task
• Special effort in facilitating deeper tag analysis
• We would like a 2013 multimodal music benchmark task
    • Depending on survey input
    • Depending on your input




MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   17
Thank you for your attention!


For contact and more information: musiclef@dei.unipd.it




MediaEval, Pisa 05/10/2012   MusiClef: Multimodal Music Tagging Task   18

Weitere ähnliche Inhalte

Andere mochten auch

The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioMediaEval2012
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodMediaEval2012
 
Mentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and VideoMentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and VideoGrow America
 
Event Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED TaskEvent Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED TaskMediaEval2012
 
Week 2 discussion 2
Week 2 discussion 2Week 2 discussion 2
Week 2 discussion 2LILBIT2012
 
תחרות אלוף הידע
תחרות אלוף הידעתחרות אלוף הידע
תחרות אלוף הידעsabal1
 
John Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An EntrepreneurJohn Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An EntrepreneurGrow America
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...MediaEval2012
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskMediaEval2012
 
MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval2012
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...MediaEval2012
 
Secrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace KleinSecrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace KleinGrow America
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskMediaEval2012
 
Simha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published VersionSimha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published VersionPrithvi Simha
 
When Ideas and Opportunities Collide
When Ideas and Opportunities CollideWhen Ideas and Opportunities Collide
When Ideas and Opportunities CollideGrow America
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012MediaEval2012
 

Andere mochten auch (18)

The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes Detectio
 
Simha_RP
Simha_RPSimha_RP
Simha_RP
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic method
 
Mentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and VideoMentor Strategy Session: Business Plan and Video
Mentor Strategy Session: Business Plan and Video
 
Event Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED TaskEvent Detection via LDA for the MediaEval2012 SED Task
Event Detection via LDA for the MediaEval2012 SED Task
 
Week 2 discussion 2
Week 2 discussion 2Week 2 discussion 2
Week 2 discussion 2
 
תחרות אלוף הידע
תחרות אלוף הידעתחרות אלוף הידע
תחרות אלוף הידע
 
John Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An EntrepreneurJohn Richards: My Life Lessons As An Entrepreneur
John Richards: My Life Lessons As An Entrepreneur
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
 
MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 Opening
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
 
Thotcon2013
Thotcon2013Thotcon2013
Thotcon2013
 
Secrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace KleinSecrets of Storytelling by Candace Klein
Secrets of Storytelling by Candace Klein
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy Task
 
Simha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published VersionSimha_23_REFFIT_Biochar_ICT_Published Version
Simha_23_REFFIT_Biochar_ICT_Published Version
 
When Ideas and Opportunities Collide
When Ideas and Opportunities CollideWhen Ideas and Opportunities Collide
When Ideas and Opportunities Collide
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012
 

Ähnlich wie Multimodal Music Tagging Task Overview

Music Recommendation 2018
Music Recommendation 2018Music Recommendation 2018
Music Recommendation 2018Fabien Gouyon
 
Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Karthik Murugesan
 
Learning how to learn, with Software Carpentry
Learning how to learn, with Software CarpentryLearning how to learn, with Software Carpentry
Learning how to learn, with Software CarpentrySoundSoftware ac.uk
 
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS Gemma San Cornelio Esquerdo
 
David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610David L Page
 
David L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdfDavid L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdfDavid L Page
 
Introduction to Fast by Professor Mark Sandler
Introduction to Fast by  Professor Mark SandlerIntroduction to Fast by  Professor Mark Sandler
Introduction to Fast by Professor Mark SandlerFASTIMPACT
 
Mlapv
MlapvMlapv
Mlapvhhs
 
Copy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentationCopy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentationRouAnnAdobasNavarroz1
 
Authentic materials 2018 copy ppt
Authentic materials 2018 copy pptAuthentic materials 2018 copy ppt
Authentic materials 2018 copy ppthhs
 
Music and Media Communications
Music and Media CommunicationsMusic and Media Communications
Music and Media CommunicationsCHTCT
 
Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCNathanBowen8
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....JemeloSipsip1
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....ronald775612
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....EdwardEpan1
 
Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)Jon Rosewell
 
Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...Jisc
 

Ähnlich wie Multimodal Music Tagging Task Overview (20)

Music Recommendation 2018
Music Recommendation 2018Music Recommendation 2018
Music Recommendation 2018
 
Trends in Music Recommendations 2018
Trends in Music Recommendations 2018Trends in Music Recommendations 2018
Trends in Music Recommendations 2018
 
Learning how to learn, with Software Carpentry
Learning how to learn, with Software CarpentryLearning how to learn, with Software Carpentry
Learning how to learn, with Software Carpentry
 
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
BEING LUCKY. TRANSMEDIA AND CO-CREATION PRACTICES IN MUSIC VIDEO-CLIPS
 
David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610David L Page DCI KKP622 mid Project 1 report.20160610
David L Page DCI KKP622 mid Project 1 report.20160610
 
M13517
M13517M13517
M13517
 
David L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdfDavid L Page DCI Project 1 Seminar Presentation.20180816.pdf
David L Page DCI Project 1 Seminar Presentation.20180816.pdf
 
Introduction to Fast by Professor Mark Sandler
Introduction to Fast by  Professor Mark SandlerIntroduction to Fast by  Professor Mark Sandler
Introduction to Fast by Professor Mark Sandler
 
MULHER@AVI2012
MULHER@AVI2012MULHER@AVI2012
MULHER@AVI2012
 
Mlapv
MlapvMlapv
Mlapv
 
Copy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentationCopy of 3INFOR~1.powerpoint presentation
Copy of 3INFOR~1.powerpoint presentation
 
Authentic materials 2018 copy ppt
Authentic materials 2018 copy pptAuthentic materials 2018 copy ppt
Authentic materials 2018 copy ppt
 
Music and Media Communications
Music and Media CommunicationsMusic and Media Communications
Music and Media Communications
 
Mobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSCMobile Phone Instruments, the Possibilities of Networks, and OSC
Mobile Phone Instruments, the Possibilities of Networks, and OSC
 
08b final event_experimente
08b final event_experimente08b final event_experimente
08b final event_experimente
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....
 
3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....3. Information Literacy - Information Literacy and Performance Task- Project....
3. Information Literacy - Information Literacy and Performance Task- Project....
 
Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)Badges for Nature (HASTAC/DML proposal)
Badges for Nature (HASTAC/DML proposal)
 
Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...Sound Matters: a framework for the creative use and re-use of sound: field re...
Sound Matters: a framework for the creative use and re-use of sound: field re...
 

Mehr von MediaEval2012

A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding MediaEval2012
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskMediaEval2012
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...MediaEval2012
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account MatchingMediaEval2012
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsMediaEval2012
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval2012
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...MediaEval2012
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskMediaEval2012
 
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...MediaEval2012
 
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...MediaEval2012
 
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging TaskUNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging TaskMediaEval2012
 
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...MediaEval2012
 
ARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video ClassificationARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video ClassificationMediaEval2012
 
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...MediaEval2012
 
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual CuesKIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual CuesMediaEval2012
 
Overview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging TaskOverview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging TaskMediaEval2012
 
CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012MediaEval2012
 
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search TaskThe TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search TaskMediaEval2012
 

Mehr von MediaEval2012 (19)

A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking Task
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account Matching
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and Onwards
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
 
mevd2012 esra_
 mevd2012 esra_ mevd2012 esra_
mevd2012 esra_
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
 
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
 
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Tr...
 
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging TaskUNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
UNICAMP-UFMG at MediaEval 2012: Genre Tagging Task
 
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
TUD at MediaEval 2012 genre tagging task: Multi-modality video categorization...
 
ARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video ClassificationARF @ MediaEval 2012: Multimodal Video Classification
ARF @ MediaEval 2012: Multimodal Video Classification
 
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
TUB @ MediaEval 2012 Tagging Task: Feature Selection Methods for Bag-of-(visu...
 
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual CuesKIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
KIT at MediaEval 2012 – Content–based Genre Classification with Visual Cues
 
Overview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging TaskOverview of the MediaEval 2012 Tagging Task
Overview of the MediaEval 2012 Tagging Task
 
CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012CUHK System for the Spoken Web Search task at Mediaeval 2012
CUHK System for the Spoken Web Search task at Mediaeval 2012
 
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search TaskThe TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task
 

Kürzlich hochgeladen

Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 

Kürzlich hochgeladen (20)

Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 

Multimodal Music Tagging Task Overview

  • 1. Multimodal Music Tagging Task Nicola Orio – University of Padova Cynthia C. S. Liem – Delft University of Technology Geoffroy Peeters – UMR STMS IRCAM-CNRS, Paris Markus Schedl – Johannes Kepler University, Linz MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 1
  • 2. Multimodal music tagging • Definition • Songs of a commercial music library need to be categorized according to their usage in TV and radio broadcasts (e.g. soundtracks, jingles) • Practical motivation • The search for suitable music for video productions is a major activity for professionals and lay users alike • Collaborative filtering systems are taking their role • Notwithstanding their known limitations: long-tail, cold start… • Annotating professional music libraries is another important professional activity MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 2
  • 3. Human assessment Different sources of information are routinely exploited by professionals to overcome limitations of individual media MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 3
  • 4. Goals of MusiClef • To focus evaluation on professional application scenarios • Textual description of music items • To grant replication of experiments and results • Feature extraction phase is crucial – released features computed with public, open-source library (MIRToolbox) • To promote the exploitation of multimodal sources of information • Content (audio) + Context (tags & webpages) • To disseminate music related initiatives • Outside the music information retrieval community MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 4
  • 5. Evaluation initiatives – 1 • MIREX (since 2004) • Community-based selection of tasks • Many tasks address audio feature extraction algorithms • Participants submit algorithms that are run by organizers • Music files are not shared with participants • Million Song Dataset (since 2011) • Task on music recommendation proposed by organizers • Audio features are computed using proprietary algorithms • Only features are shared with participants MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 5
  • 6. Evaluation initiatives – 2 • Quaero-Eval (since 2012) • Tasks agreed with participants • Strategies to grant public access to evaluation results • Participants run training experiments on a shared repository • Runs on test set made by the organizers MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 6
  • 7. Test collection – 1 • Individual songs of pop and rock music • 1355 songs (from 218 artists) • train (975) and test (380) split • Social tags • Gathered from Last.fm API • Multilingual sets of Web pages related to artists+albums • Mined querying Google • Acoustic features: MFCC (using MIRToolbox) with a window length of 200ms and 50% overlap MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 7
  • 8. Test collection – 2 • Test collection created starting from the “500 Greatest Songs of All Time” (Rolling Stone) • Expected high number of social tags and web pages • Ground truth created by experts in the domain • 355 tags selected (167 genre, 288 usage) • Tags associated to less than 20 songs were discarded • Reference implementation in Matlab • Participants has an example to run a complete experiment • Code for the evaluation made already available MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 8
  • 9. Evaluation measures • Standard IR measures • Accuracy • Precision • Recall • Specificity • F-measure MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 9
  • 10. Examining tags more closely • Some tags are more equal than others… hard rock ballroom melancholic travel countryside bright • Thus, we propose to also analyze results employing a higher-level tag categorization MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 10
  • 11. Tag categorization – 1 • Affective, mood-related aspects: • activity: the amount of perceived music activity, without implying strong positive or negative affective qualities (e.g. 'fast', 'mellow', 'lazy') • affective state: affective qualities that can only be connected and attributed to living beings (e.g. 'aggressive', 'hopeful') • atmosphere: affective qualities that can be connected to environments (e.g. 'chaotic', 'intimate'). MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 11
  • 12. Tag categorization – 2 • Situation, time and space aspects of the music: • Physical situation: concrete physical environments (e.g. 'city', 'night'). • Occasion: implications of time and space, typically connected to social events (e.g. 'holiday', 'glamour'). • Sociocultural genre (e.g. 'new wave', 'r&b', 'punk') • Sound qualities: • timbral aspects (e.g. 'acoustic', 'bright') • temporal aspects (e.g. 'beat', 'groove'). • Other (e.g. 'catchy', 'evocative'). MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 12
  • 13. Reference implementation • Made in MATLAB and released publicly • Simple and straightforward approaches: • Individual GMMs for audio, user tags, web pages • Tagging process: 1-NN qualification using symmetrized KL • Scenarios tested: • Audio, user tags, web pages individually • Majority vote • Union MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 13
  • 14. Baseline results – 1 • Evaluation of the submitted runs and of the reference implementation • Results with different modalities over the full dataset strategy accuracy recall precision specificity f-measure audio 0.894 0.148 0.127 0.939 0.126 tags 0.898 0.061 0.039 0.942 0.037 web pages 0.897 0.050 0.007 0.954 0.011 majority 0.880 0.123 0.086 0.922 0.086 union 0.824 0.240 0.115 0.845 0.134 MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 14
  • 15. Baseline results – 2 1. activity, energy 2. affective state 3. atmosphere 4. other 5. situation: occasion 6. situation: physical 7. sociocultural genre 8. sound: temporal 9: sound: timbral MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 15
  • 16. Participation • Initially a lot of interest - about 8 explicitly interested parties • But ultimately just one participant (LUTIN UserLab) • Aggregation of estimators • Currently investigating what happened to the 7 others • So far, it appears ISMIR 2012 was inconveniently close • The 3 other MusiClef co-organizers will discuss this there MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 16
  • 17. Conclusions • We established a multimodal music tagging benchmark task • Special effort in facilitating deeper tag analysis • We would like a 2013 multimodal music benchmark task • Depending on survey input • Depending on your input MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 17
  • 18. Thank you for your attention! For contact and more information: musiclef@dei.unipd.it MediaEval, Pisa 05/10/2012 MusiClef: Multimodal Music Tagging Task 18