SlideShare ist ein Scribd-Unternehmen logo
1 von 9
Downloaden Sie, um offline zu lesen
MediaEval 2018 AcousticBrainz Genre Task:
A baseline combining deep feature embeddings
across datasets
Sergio Oramas (Pandora Media Inc.)
Dmitry Bogdanov, Alastair Porter (Universitat Pompeu Fabra)
Features
● Use all features
● One-hot encode categorical features (key, scale)
● Standardize (zero mean, unit variance)
● In total 2669 input features
Single Dataset Model Architecture (Task 1)
● Basic DNN, only one hidden layer
● One model per dataset
● Model as feature extractor once trained
● Intermediate layer as a new 256-dim
feature embedding of each track
Fusion Architecture (Task 2)
● Obtain 4 feature embeddings per track
● Apply l2-norm to each one
● Concatenate embeddings
● Train fusion network
Validation Results (ROC AUC)
Discogs
Discogs + AllMusic
Discogs + AllMusic + Lastfm
Discogs + AllMusic + Lastfm + Tagtraum
0.7592
0.8713
0.8814
0.8863
● Macro ROC AUC improves combining feature embeddings
Maximizing macro F-score
● Network output: within [0,1]
● Plug-in rule approach
● Individual threshold for each genre/subgenre label
● The problem (Task 1): some labels are always
predicted
○ Infrequent labels + Uninformative classifier
○ We decided to keep those for simplicity
○ ~30 subgenres always predicted for Allmusic
○ This hurts the per-recording micro F-score
Test Results (Precision, Recall, F-score)
Conclusions
● Making use of all four datasets improves performance
● Our approach: Combine feature embeddings learnt by weak neural networks
○ More deep feature embeddings = Better predictions
○ Early fusion = No need to work on genre mapping
● What to optimize?
○ Task 1: Optimizing for per-label (macro) F-score worsens per-track micro F-score
● We ignored genre-subgenre hierarchy
○ Inheriting parent genres lead to worse performance scores
Future work
● ISMIR 2019
● Better individual NN models
○ More complex architectures (e.g., melbaseline 2018 or JKU 2017)
● How useful are different datasets for stacking?
○ What matters? (Size, taxonomy, source of annotations?)
○ Run an experiment on the intersection of all four datasets

Weitere ähnliche Inhalte

Mehr von multimediaeval

Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Imagesmultimediaeval
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matchingmultimediaeval
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...multimediaeval
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...multimediaeval
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...multimediaeval
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentationmultimediaeval
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detectionmultimediaeval
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...multimediaeval
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attentionmultimediaeval
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...multimediaeval
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...multimediaeval
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...multimediaeval
 
Personal Air Quality Index Prediction Using Inverse Distance Weighting Method
Personal Air Quality Index Prediction Using Inverse Distance Weighting MethodPersonal Air Quality Index Prediction Using Inverse Distance Weighting Method
Personal Air Quality Index Prediction Using Inverse Distance Weighting Methodmultimediaeval
 
Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...
Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...
Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...multimediaeval
 
Ensemble based method for the classification of flooding event using social m...
Ensemble based method for the classification of flooding event using social m...Ensemble based method for the classification of flooding event using social m...
Ensemble based method for the classification of flooding event using social m...multimediaeval
 
Flood Detection via Twitter Streams using Textual and Visual Features
Flood Detection via Twitter Streams using Textual and Visual FeaturesFlood Detection via Twitter Streams using Textual and Visual Features
Flood Detection via Twitter Streams using Textual and Visual Featuresmultimediaeval
 
Floods Detection in Twitter Text and Images
Floods Detection in Twitter Text and ImagesFloods Detection in Twitter Text and Images
Floods Detection in Twitter Text and Imagesmultimediaeval
 
Flood Detection in Twitter Using a Novel Learning Method for Neural Networks
Flood Detection in Twitter Using a Novel Learning Method for Neural NetworksFlood Detection in Twitter Using a Novel Learning Method for Neural Networks
Flood Detection in Twitter Using a Novel Learning Method for Neural Networksmultimediaeval
 
The Flood-related Multimedia Task at MediaEval 2020
The Flood-related Multimedia Task at MediaEval 2020The Flood-related Multimedia Task at MediaEval 2020
The Flood-related Multimedia Task at MediaEval 2020multimediaeval
 
From Graph Convolutional Networks to Time Series
From Graph Convolutional Networks to Time SeriesFrom Graph Convolutional Networks to Time Series
From Graph Convolutional Networks to Time Seriesmultimediaeval
 

Mehr von multimediaeval (20)

Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 
Personal Air Quality Index Prediction Using Inverse Distance Weighting Method
Personal Air Quality Index Prediction Using Inverse Distance Weighting MethodPersonal Air Quality Index Prediction Using Inverse Distance Weighting Method
Personal Air Quality Index Prediction Using Inverse Distance Weighting Method
 
Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...
Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...
Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health...
 
Ensemble based method for the classification of flooding event using social m...
Ensemble based method for the classification of flooding event using social m...Ensemble based method for the classification of flooding event using social m...
Ensemble based method for the classification of flooding event using social m...
 
Flood Detection via Twitter Streams using Textual and Visual Features
Flood Detection via Twitter Streams using Textual and Visual FeaturesFlood Detection via Twitter Streams using Textual and Visual Features
Flood Detection via Twitter Streams using Textual and Visual Features
 
Floods Detection in Twitter Text and Images
Floods Detection in Twitter Text and ImagesFloods Detection in Twitter Text and Images
Floods Detection in Twitter Text and Images
 
Flood Detection in Twitter Using a Novel Learning Method for Neural Networks
Flood Detection in Twitter Using a Novel Learning Method for Neural NetworksFlood Detection in Twitter Using a Novel Learning Method for Neural Networks
Flood Detection in Twitter Using a Novel Learning Method for Neural Networks
 
The Flood-related Multimedia Task at MediaEval 2020
The Flood-related Multimedia Task at MediaEval 2020The Flood-related Multimedia Task at MediaEval 2020
The Flood-related Multimedia Task at MediaEval 2020
 
From Graph Convolutional Networks to Time Series
From Graph Convolutional Networks to Time SeriesFrom Graph Convolutional Networks to Time Series
From Graph Convolutional Networks to Time Series
 

Kürzlich hochgeladen

Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Silpa
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Silpa
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfSumit Kumar yadav
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsSérgio Sacani
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Silpa
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLkantirani197
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptxArvind Kumar
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxSilpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 

Kürzlich hochgeladen (20)

Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 

MediaEval 2018: AcousticBrainz Genre Task: baseline combining deep feature embeddings across datasets

  • 1. MediaEval 2018 AcousticBrainz Genre Task: A baseline combining deep feature embeddings across datasets Sergio Oramas (Pandora Media Inc.) Dmitry Bogdanov, Alastair Porter (Universitat Pompeu Fabra)
  • 2. Features ● Use all features ● One-hot encode categorical features (key, scale) ● Standardize (zero mean, unit variance) ● In total 2669 input features
  • 3. Single Dataset Model Architecture (Task 1) ● Basic DNN, only one hidden layer ● One model per dataset ● Model as feature extractor once trained ● Intermediate layer as a new 256-dim feature embedding of each track
  • 4. Fusion Architecture (Task 2) ● Obtain 4 feature embeddings per track ● Apply l2-norm to each one ● Concatenate embeddings ● Train fusion network
  • 5. Validation Results (ROC AUC) Discogs Discogs + AllMusic Discogs + AllMusic + Lastfm Discogs + AllMusic + Lastfm + Tagtraum 0.7592 0.8713 0.8814 0.8863 ● Macro ROC AUC improves combining feature embeddings
  • 6. Maximizing macro F-score ● Network output: within [0,1] ● Plug-in rule approach ● Individual threshold for each genre/subgenre label ● The problem (Task 1): some labels are always predicted ○ Infrequent labels + Uninformative classifier ○ We decided to keep those for simplicity ○ ~30 subgenres always predicted for Allmusic ○ This hurts the per-recording micro F-score
  • 7. Test Results (Precision, Recall, F-score)
  • 8. Conclusions ● Making use of all four datasets improves performance ● Our approach: Combine feature embeddings learnt by weak neural networks ○ More deep feature embeddings = Better predictions ○ Early fusion = No need to work on genre mapping ● What to optimize? ○ Task 1: Optimizing for per-label (macro) F-score worsens per-track micro F-score ● We ignored genre-subgenre hierarchy ○ Inheriting parent genres lead to worse performance scores
  • 9. Future work ● ISMIR 2019 ● Better individual NN models ○ More complex architectures (e.g., melbaseline 2018 or JKU 2017) ● How useful are different datasets for stacking? ○ What matters? (Size, taxonomy, source of annotations?) ○ Run an experiment on the intersection of all four datasets