MediaEval 2016 - Emotion in Music Task: Lessons Learned
1. Emotion in Music Task: Lessons Learned
Anna Aljanaki1 Yi-Hsuan Yang2
Mohammad Soleymani1
1University of Geneva, Switzerland
2Academia Sinica, Taiwan
20-21 October, MediaEval 2016
2. Emotion in Music Task
2013 — Emotion in Music Brave New Task.
Organized by M. Soleymani, M.N. Caro, E.M. Schmidt and
Y.-H. Yang
2 subtasks - dynamic (per-second) music emotion
recognition and song-level emotion recognition
3 participating teams
3. Emotion in Music Task
Focused on audio analysis (optionally, with metadata)
Most attention was paid to recognizing how emotion changes over time
Used the valence/arousal model
7. Emotion in Music Task
2014 — Emotion in Music Task, Second Edition
Organized by A. Aljanaki, Y.-H. Yang, M. Soleymani
2 tasks - dynamic (per-second) music emotion recognition
and feature design
7 participating teams
2015 — Emotion in Music Task, Third Edition.
Organized by A. Aljanaki, Y.-H. Yang, M. Soleymani
1 task - dynamic (per-second) music emotion recognition,
with three submission types: feature sets, predictions on
baseline features, and predictions on custom features
11 participating teams
8. Quality of the annotations
Year                     2013         2014         2015
Total length             9h 18min     12h 30min    3h 46min
Cronbach's α, arousal    .28 ± 0.28   .31 ± 0.30   .66 ± 0.26
GAM R², arousal          .13 ± 0.10   .14 ± 0.11   .44 ± 0.19
Cronbach's α, valence    .28 ± 0.29   .20 ± 0.24   .51 ± 0.35
GAM R², valence          .13 ± 0.10   .10 ± 0.08   .37 ± 0.21
9. Quality of the annotations
2013 & 2014 – 45-second excerpts; 2015 – full songs.
2013 & 2014 – Amazon Mechanical Turk workers; 2015 – both
lab and AMT workers.
2015 – introduced a preliminary listening step.
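The consistency figures above can be reproduced from the raw annotation curves. As a rough sketch (the function name and array layout are illustrative assumptions, not the task's released code), Cronbach's α treats each annotator as an "item" and each per-second frame as an observation:

```python
import numpy as np

def cronbach_alpha(ratings):
    """Cronbach's alpha for inter-rater consistency.

    ratings: 2-D array of shape (n_raters, n_frames) -- each row is
    one annotator's per-second valence or arousal curve for a song.
    """
    ratings = np.asarray(ratings, dtype=float)
    k = ratings.shape[0]                         # number of raters
    item_vars = ratings.var(axis=1, ddof=1)      # per-rater variance
    total_var = ratings.sum(axis=0).var(ddof=1)  # variance of summed curve
    return k / (k - 1) * (1.0 - item_vars.sum() / total_var)
```

Identical raters give α = 1; the low 2013/2014 values in the table mean individual curves shared little variance until the 2015 protocol changes.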
16. Continuous annotation problems
Annotations exhibit a reaction time: before listeners can judge
the emotional content of the music, they need to listen to it for
some time.
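One common way to quantify such a lag (a hypothetical sketch, not the task's evaluation code; the signal names and frame rate are assumptions) is to shift the annotation curve against an audio feature curve and keep the shift that maximizes Pearson correlation:

```python
import numpy as np

def best_lag(feature, annotation, max_lag=10):
    """Estimate annotator reaction time, in frames, as the shift of
    the annotation curve that best correlates with a feature curve.

    feature, annotation: 1-D arrays sampled at the same rate
    (e.g. one value per second); max_lag: largest shift to try.
    """
    best, best_r = 0, -np.inf
    for lag in range(max_lag + 1):
        if lag == 0:
            a, b = feature, annotation
        else:
            # annotation[lag:] is compared against feature[:-lag],
            # i.e. the annotation is assumed to trail the audio
            a, b = feature[:-lag], annotation[lag:]
        r = np.corrcoef(a, b)[0, 1]
        if r > best_r:
            best, best_r = lag, r
    return best
```

The estimated lag can then be subtracted from the annotations before training, so models are not penalized for the annotators' delay.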
17. Continuous annotation problems
There is a scaling problem – the unit of emotional expression
can be a structural section, a phrase, or a single note.
22. Possible solutions and modifications
Change the task from emotion tracking to dynamics
tracking (diminuendo, crescendo, rallentando)
Change the data collection interface
Find a practical task where continuous tracking is
necessary:
Retrieval by an emotional trajectory
Thumbnailing
Emotion prediction from physiological signals and audio
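Retrieval by an emotional trajectory can be prototyped directly on per-second valence/arousal curves. A minimal sketch (the song names, trajectory shapes, and plain Euclidean distance are all illustrative assumptions; a real system might use dynamic time warping to tolerate timing differences):

```python
import numpy as np

def retrieve_by_trajectory(query, catalog, top_k=3):
    """Rank songs by how closely their valence/arousal trajectories
    match a query trajectory.

    query   -- (n_frames, 2) array of per-second (valence, arousal)
    catalog -- dict mapping song id to a trajectory of the same shape
               (trajectories assumed resampled to a common length)
    """
    dists = {song: np.linalg.norm(traj - query)
             for song, traj in catalog.items()}
    return sorted(dists, key=dists.get)[:top_k]
```

For example, a query trajectory with steadily rising arousal would rank a song with a matching build-up above one that fades out.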
23. Acknowledgements
We thank Erik M. Schmidt, Mike N. Caro, Cheng-Ya Sha,
Alexander Lansky, Sung-Yen Liu and Eduardo Coutinho for
their contributions to the task's development, and the anonymous
Turkers for their work.