MediaEval2020
Predicting Media Memorability
Task Overview
Alba García Seco de Herrera, Rukiye Savran Kiziltepe, Jon Chamberlain, Mihai Gabriel Constantin,
Claire-Hélène Demarty, Faiyaz Doctor, Bogdan Ionescu, Alan Smeaton
Presentation Video
Task Description
Goal: predicting how memorable a video is to viewers
15/12/2020 MediaEval2020 2
• Automatically predicting short-term and
long-term memorability
• TRECVid 2019 Video to Text dataset [1]
• Sound and more action
1. Awad, G., Butt, A.A., Lee, Y., Fiscus, J., Godil, A., Delgado, A., Smeaton, A.F. and Graham, Y., Trecvid 2019:
An evaluation campaign to benchmark video activity detection, video captioning and matching, and video
search & retrieval. 2019.
Annotation Tool
• Short-term memorability: a few minutes after memorization
• Long-term memorability: 24–72 hours later
Romain Cohendet, Claire-Hélène Demarty, Ngoc Duong, and Martin Engilberge. VideoMem: Constructing, Analyzing, Predicting Short-term and Long-term Video Memorability. Proceedings of the IEEE
International Conference on Computer Vision. 2019.
Video Memorability Game
Annotation Protocol
Step 1 (180 videos)
• 40 targets – repeated after a few minutes
• 60 fillers – non target videos
• 20 vigilance fillers – repeated quickly to monitor attention
Step 2 (120 videos)
• 40 targets – randomly chosen from non-vigilance fillers
• 80 fillers – randomly chosen new videos
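The per-step video counts can be checked with a little arithmetic. The sketch below assumes that each target and each vigilance filler in step 1 is shown twice (first showing plus repeat) and that every video in step 2 is shown once; that reading is an assumption, chosen because it reproduces the 180 and 120 totals on the slide. The function and dictionary names are illustrative only.

```python
# Counts taken from the slide; the "shown twice" rule is an assumption.
STEP1 = {"targets": 40, "fillers": 60, "vigilance_fillers": 20}
STEP2 = {"targets": 40, "fillers": 80}


def step1_viewings(counts):
    # Targets and vigilance fillers are assumed to appear twice
    # (first showing plus repeat); plain fillers appear once.
    return (2 * counts["targets"]
            + counts["fillers"]
            + 2 * counts["vigilance_fillers"])


def step2_viewings(counts):
    # Every step-2 video is assumed to be shown once: targets were
    # already seen in step 1, fillers are new.
    return counts["targets"] + counts["fillers"]


print(step1_viewings(STEP1))  # 180
print(step2_viewings(STEP2))  # 120
```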
Dataset Description
• TRECVid 2019
(Video to Text)
• 1500 videos
• 1000 training set
• 500 test set
Dataset Description
• AlexNetFC7
• HOG
• HSVHist
• RGBHist
• LBP
• VGGFC7
• C3D
• Text descriptions
• Annotations
• Response time
• Key press
• Video position
Short-term memorability score
Long-term memorability score
Examples (Low Short-term and Long-term Memorability)
• At football game, the ball is kicked past end zone and
woman is knocked down from her knees
• football player are playing at a football field.
• At a college football game, during a kickoff, the kicker
kicks the ball over the endzone and hits a spectator
in the face while they are trying to catch it.
• a person is injured when the football player kicked a
ball across a field during a game
• Football kicks football during a day game and a
cheerleader tries to catch it and ball hits her in the
head.
Examples (High Short-term and Long-term Memorability)
• Two boys wearing white shirts on playground swings
• Two young men, are on a swing and yell, outdoors.
Results (Mean Spearman's Rank Correlation Scores)
• 14 teams registered
• 9 teams submitted 28 runs
• 8 papers
• Spearman’s rank correlation
Statistic    Short-term                Long-term
             Spearman  Pearson  MSE    Spearman  Pearson  MSE
Mean         0.058     0.066    0.013  0.036     0.043    0.051
Variance     0.002     0.002    0.000  0.002     0.001    0.000
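The three columns reported for each subtask (Spearman, Pearson, MSE) can be computed as in the following sketch. It is a plain-Python illustration, not the task's official scoring script; the `truth` and `pred` values are made-up numbers used only to exercise the functions.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def mse(x, y):
    """Mean squared error between two equal-length lists."""
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)


# Hypothetical ground-truth and predicted memorability scores.
truth = [0.95, 0.80, 0.85, 0.70, 0.90]
pred = [0.90, 0.85, 0.80, 0.75, 0.88]

print(round(spearman(truth, pred), 3))  # 0.9
print(round(mse(truth, pred), 5))       # 0.00208
```

Spearman depends only on the ordering of the predictions, so a run can rank videos well while its MSE, which depends on the absolute values, stays unremarkable, and vice versa.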
Results (Official Results on Test-set for Teams’ all runs)
                                  Short-term                Long-term
Team         Run                  Spearman  Pearson  MSE    Spearman  Pearson  MSE
CUC_DMT      run1-required        0.06      0.055    0.01   0.049     0.05     0.05
DCU-Audio    run1-required        0.054     0.044    0.01   0.113     0.121    0.05
DCU-Audio    run2-required        0.05      0.072    0.01   0.059     0.071    0.05
DCU-Audio    run3-required        -         -        -      0.109     0.119    0.05
DCU-Audio    run4-required        0.076     0.092    0.01   0.041     0.058    0.05
DCU-Audio    memento10k           0.137     0.13     0.01   -         -        -
DCU@ML-Labs  run1-required        0.034     0.078    0.1    -0.01     0.022    0.09
Essex-NLIP   HSV-Run1             0.042     0.042    0.01   0.032     0.016    0.05
Essex-NLIP   RGB-Run2             -0.003    -0.026   0.01   0.043     0.042    0.04
Essex-NLIP   RGB-Run3             -0.015    -0.012   0.01   0.032     0.037    0.04
Essex-NLIP   RGB-HSV-Run4         -0.022    -0.001   0.01   -0.017    -0.012   0.04
Essex-NLIP   Score-Run5           0.02      0.054    0.01   -0.054    -0.036   0.05
GTH-UPM      run1-required        0.016     0.011    0.01   -0.041    -0.028   0.05
KT-UPB       run0-required        0.007     0.029    0.01   0.028     0.033    0.05
KT-UPB       run1-required        -0.01     -0.019   0.01   0.012     0.021    0.05
KT-UPB       run2-required        0.053     0.085    0.01   0.037     0.033    0.05
KT-UPB       run3-required        0.05      0.053    0.01   0.014     0.017    0.05
MeMAD        run1-audiovisual     0.099     0.09     0.01   0.077     0.085    0.06
MeMAD        run2-vilbert         0.098     0.085    0.01   -0.017    0.011    0.06
MeMAD        run3-text            0.073     0.091    0.01   0.019     0.049    0.06
MeMAD        run4-all-SLT         0.101     0.09     0.01   0.078     0.085    0.06
MeMAD        run5-all-required    0.101     0.09     0.01   0.067     0.066    0.05
MG-UCB       run1-required        0.136     0.145    0.01   0.012     0.012    0.05
MG-UCB       run7                 0.102     0.127    0.01   0.056     0.059    0.04
MG-UCB       run8                 0.091     0.095    0.01   0.077     0.068    0.05
MG-UCB       run9                 0.085     0.124    0.01   0.044     0.048    0.05
MG-UCB       run42                0.116     0.144    0.01   0.076     0.069    0.05
MMSys        run                  0.007     0.01     0.01   0.048     0.032    0.05
Results (Official Results on Test-set for Teams’ best runs, Short-term)
Team         Run                              Short-term  Approach
DCU-Audio    memento10k                       0.137       Audio Gestalt => Multimodal Deep Learning-based Late Fusion (Memento10K)
MG-UCB       run1-required                    0.136       Visual, Audio, Textual, Visiolinguistic Features => Weighted Average
MeMAD        run4-all-SLT, run5-all-required  0.101       Visual, Audio, Textual => SVR, BR, GRU => Weighted Late Fusion
CUC_DMT      run1-required                    0.06        Multi-level Encoding and Captions => Gradient Boosting, Random Forest, Neural Network
KT-UPB       run2-required                    0.053       C3D => Random Forest
Essex-NLIP   HSV-Run1                         0.042       HSV => Random Forest
DCU@ML-Labs  run1-required                    0.034       C3D => SemNET (Memento10K)
GTH-UPM      run1-required                    0.016       Multimodal Late Fusion of Self-Attention => SVR => Bidirectional LSTM
MMSys        run                              0.007       -
Results (Official Results on Test-set for Teams’ best runs, Long-term)
Team         Run            Long-term  Approach
DCU-Audio    run1-required  0.113      Audio Gestalt => Multimodal Deep Learning-based Late Fusion (Memento10K)
MeMAD        run4-all-SLT   0.078      Visual, Audio, Textual, Visiolinguistic Features => Weighted Average
MG-UCB       run8           0.077      Visual, Audio, Textual => SVR, BR, GRU => Weighted Late Fusion
CUC_DMT      run1-required  0.049      Multi-level Encoding and Captions => Gradient Boosting, Random Forest, Neural Network
MMSys        run            0.048      -
Essex-NLIP   RGB-Run2       0.043      RGB => Random Forest
KT-UPB       run2-required  0.037      C3D => Random Forest
DCU@ML-Labs  run1-required  -0.01      C3D => SemNET (Memento10K)
GTH-UPM      run1-required  -0.041     Multimodal Late Fusion of Self-Attention => SVR => Bidirectional LSTM
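Several of the approaches listed for the best runs end in a weighted average or weighted late fusion of per-modality predictions. The sketch below shows only the general idea; it is not any team's implementation, and the modality names, scores and weights are hypothetical.

```python
def weighted_late_fusion(predictions, weights):
    """Fuse per-modality score lists into one list by weighted average.

    predictions: dict mapping a modality name to a list of scores,
                 one score per video, all lists the same length.
    weights:     dict mapping the same modality names to non-negative weights.
    """
    total = sum(weights[name] for name in predictions)
    n_videos = len(next(iter(predictions.values())))
    fused = []
    for i in range(n_videos):
        score = sum(weights[name] * scores[i]
                    for name, scores in predictions.items())
        # Dividing by the weight total keeps the fused score on the
        # same scale as the inputs even if weights do not sum to 1.
        fused.append(score / total)
    return fused


# Hypothetical predictions for two videos from three single-modality models.
predictions = {
    "visual": [0.80, 0.60],
    "audio": [0.90, 0.70],
    "text": [0.70, 0.50],
}
weights = {"visual": 0.5, "audio": 0.25, "text": 0.25}

fused = weighted_late_fusion(predictions, weights)
print([round(s, 3) for s in fused])  # [0.8, 0.6]
```

In practice the weights would be tuned on held-out data, for example by choosing the combination that maximises Spearman's rank correlation on a validation split.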
Conclusion
• Short-term memorability – better results
• Long-term memorability – results slightly lower
• The best results:
• DCU-Audio (0.137; 0.113)
• MG-UCB (0.136; 0.077)
• MeMAD (0.101; 0.078)
• Audio and captions
• Fusion
• Deep learning techniques
• More annotations
THANK YOU!

Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxGENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 

Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a Video Memorable?

  • 1. MediaEval2020 Predicting Media Memorability: Task Overview
      Alba García Seco de Herrera, Rukiye Savran Kiziltepe, Jon Chamberlain, Mihai Gabriel Constantin, Claire-Hélène Demarty, Faiyaz Doctor, Bogdan Ionescu, Alan Smeaton
  • 2. Task Description
      Goal: predicting how memorable a video is to viewers
      • Automatically predicting short-term and long-term memorability
      • TRECVid 2019 Video to Text dataset [1]
      • Sound and more action
      [1] Awad, G., Butt, A.A., Lee, Y., Fiscus, J., Godil, A., Delgado, A., Smeaton, A.F. and Graham, Y. TRECVID 2019: An evaluation campaign to benchmark video activity detection, video captioning and matching, and video search & retrieval. 2019.
  • 3. Annotation Tool: Video Memorability Game
      • Short-term memorability: a few minutes after memorization
      • Long-term memorability: 24 to 72 hours later
      Source: Romain Cohendet, Claire-Hélène Demarty, Ngoc Duong, and Martin Engilberge. VideoMem: Constructing, Analyzing, Predicting Short-term and Long-term Video Memorability. Proceedings of the IEEE International Conference on Computer Vision. 2019.
  • 4. Annotation Protocol (Cohendet et al., 2019)
      Step 1 (180 videos)
      • 40 targets: repeated after a few minutes
      • 60 fillers: non-target videos
      • 20 vigilance fillers: repeated quickly to monitor attention
      Step 2 (120 videos)
      • 40 targets: randomly chosen from non-vigilance fillers
      • 80 fillers: randomly chosen new videos
  • 5. Dataset Description
      • TRECVid 2019 (Video to Text)
      • 1500 videos
      • 1000 training set
      • 500 test set
  • 6. Dataset Description
      • AlexNetFC7
      • HOG
      • HSVHist
      • RGBHist
      • LBP
      • VGGFC7
      • C3D
      • Text descriptions
      • Annotations
      • Response time
      • Key press
      • Video position
      Short-term memorability score
      Long-term memorability score
  • 7. Examples (Low Short-term and Long-term Memorability)
      • At football game, the ball is kicked past end zone and woman is knocked down from her knees
      • football player are playing at a football field.
      • At a college football game, during a kickoff, the kicker kicks the ball over the endzone and hits a spectator in the face while they are trying to catch it.
      • a person is injured when the football player kicked a ball across a field during a game
      • Football kicks football during a day game and a cheerleader tries to catch it and ball hits her in the head.
  • 8. Examples (High Short-term and Long-term Memorability)
      • Two boys wearing white shirts on playground swings
      • Two young men, are on a swing and yell, outdoors.
  • 9. Results (Mean Spearman's Rank Correlation Scores)
      • 14 teams registered
      • 9 teams submitted 28 runs
      • 8 papers
      • Spearman's rank correlation

                    Short-term                    Long-term
                    Spearman  Pearson   MSE       Spearman  Pearson   MSE
      Mean          0.058     0.066     0.013     0.036     0.043     0.051
      Variance      0.002     0.002     0.000     0.002     0.001     0.000
  • 10. Results (official test-set results for all runs of each team)

                                        Short-term                    Long-term
      Team          Run                 Spearman  Pearson   MSE       Spearman  Pearson   MSE
      CUC_DMT       run1-required       0.06      0.055     0.01      0.049     0.05      0.05
      DCU-Audio     run1-required       0.054     0.044     0.01      0.113     0.121     0.05
      DCU-Audio     run2-required       0.05      0.072     0.01      0.059     0.071     0.05
      DCU-Audio     run3-required       -         -         -         0.109     0.119     0.05
      DCU-Audio     run4-required       0.076     0.092     0.01      0.041     0.058     0.05
      DCU-Audio     memento10k          0.137     0.13      0.01      -         -         -
      DCU@ML-Labs   run1-required       0.034     0.078     0.1       -0.01     0.022     0.09
      Essex-NLIP    HSV-Run1            0.042     0.042     0.01      0.032     0.016     0.05
      Essex-NLIP    RGB-Run2            -0.003    -0.026    0.01      0.043     0.042     0.04
      Essex-NLIP    RGB-Run3            -0.015    -0.012    0.01      0.032     0.037     0.04
      Essex-NLIP    RGB-HSV-Run4        -0.022    -0.001    0.01      -0.017    -0.012    0.04
      Essex-NLIP    Score-Run5          0.02      0.054     0.01      -0.054    -0.036    0.05
      GTH-UPM       run1-required       0.016     0.011     0.01      -0.041    -0.028    0.05
      KT-UPB        run0-required       0.007     0.029     0.01      0.028     0.033     0.05
      KT-UPB        run1-required       -0.01     -0.019    0.01      0.012     0.021     0.05
      KT-UPB        run2-required       0.053     0.085     0.01      0.037     0.033     0.05
      KT-UPB        run3-required       0.05      0.053     0.01      0.014     0.017     0.05
      MeMAD         run1-audiovisual    0.099     0.09      0.01      0.077     0.085     0.06
      MeMAD         run2-vilbert        0.098     0.085     0.01      -0.017    0.011     0.06
      MeMAD         run3-text           0.073     0.091     0.01      0.019     0.049     0.06
      MeMAD         run4-all-SLT        0.101     0.09      0.01      0.078     0.085     0.06
      MeMAD         run5-all-required   0.101     0.09      0.01      0.067     0.066     0.05
      MG-UCB        run1-required       0.136     0.145     0.01      0.012     0.012     0.05
      MG-UCB        run7                0.102     0.127     0.01      0.056     0.059     0.04
      MG-UCB        run8                0.091     0.095     0.01      0.077     0.068     0.05
      MG-UCB        run9                0.085     0.124     0.01      0.044     0.048     0.05
      MG-UCB        run42               0.116     0.144     0.01      0.076     0.069     0.05
      MMSys         run                 0.007     0.01      0.01      0.048     0.032     0.05
  • 11. Results (official test-set results for each team's best run, short-term)

      Team          Run                                   Short-term  Approach
      DCU-Audio     memento10k                            0.137       Audio Gestalt => Multimodal Deep Learning-based Late Fusion (Memento10k)
      MG-UCB        run1-required                         0.136       Visual, Audio, Textual, Visiolinguistic Features => Weighted Average
      MeMAD         run4-all-SLT and run5-all-required    0.101       Visual, Audio, Textual => SVR, BR, GRU => Weighted Late Fusion
      CUC_DMT       run1-required                         0.06        Multi-level Encoding and Captions => Gradient Boosting, Random Forest, Neural Network
      KT-UPB        run2-required                         0.053       C3D => Random Forest
      Essex-NLIP    HSV-Run1                              0.042       HSV => Random Forest
      DCU@ML-Labs   run1-required                         0.034       C3D => SemNET (Memento10k)
      GTH-UPM       run1-required                         0.016       Multimodal Late Fusion of Self-Attention => SVR => Bidirectional LSTM
      MMSys         run                                   0.007       -
  • 12. Results (official test-set results for each team's best run, long-term)

      Team          Run               Long-term   Approach
      DCU-Audio     run1-required     0.113       Audio Gestalt => Multimodal Deep Learning-based Late Fusion (Memento10k)
      MeMAD         run4-all-SLT      0.078       Visual, Audio, Textual, Visiolinguistic Features => Weighted Average
      MG-UCB        run8              0.077       Visual, Audio, Textual => SVR, BR, GRU => Weighted Late Fusion
      CUC_DMT       run1-required     0.049       Multi-level Encoding and Captions => Gradient Boosting, Random Forest, Neural Network
      MMSys         run               0.048       -
      Essex-NLIP    RGB-Run2          0.043       RGB => Random Forest
      KT-UPB        run2-required     0.037       C3D => Random Forest
      DCU@ML-Labs   run1-required     -0.01       C3D => SemNET (Memento10k)
      GTH-UPM       run1-required     -0.041      Multimodal Late Fusion of Self-Attention => SVR => Bidirectional LSTM
  • 13. Conclusion
      • Short-term memorability: better results
      • Long-term memorability: results slightly lower
      • The best results (short-term; long-term):
          • DCU-Audio (0.137; 0.113)
          • MG-UCB (0.136; 0.077)
          • MeMAD (0.101; 0.078)
      • Audio and captions
      • Fusion
      • Deep learning techniques
      • More annotations
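
The results slides report three measures per run: Spearman's rank correlation (the official one), Pearson correlation, and MSE. As a minimal sketch of how such scores could be computed, and not the organisers' evaluation script, the plain-Python example below compares predicted memorability scores with ground-truth scores. The function names and the five example scores are hypothetical and are not taken from the task data.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def mse(x, y):
    """Mean squared error between two equal-length sequences."""
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)


# Hypothetical scores for five videos (illustration only).
ground_truth = [0.95, 0.80, 0.70, 0.90, 0.60]
predicted = [0.85, 0.75, 0.80, 0.90, 0.65]

print("Spearman:", round(spearman(ground_truth, predicted), 3))
print("Pearson: ", round(pearson(ground_truth, predicted), 3))
print("MSE:     ", round(mse(ground_truth, predicted), 4))
```

Because Spearman's correlation depends only on the ordering of the videos, a run can rank videos reasonably well and still have a poor MSE, or the reverse, which is one reason the three columns in the results tables do not always move together.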