SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Downloaden Sie, um offline zu lesen
Medieval :
Fine grained sport action recognition.
Application to Table Tennis.
Pierre-Etienne Martin
Univ. Bordeaux, LaBRI
Jenny Benois-Pineau
Univ. Bordeaux, LaBRI
Renaud Péteri
Univ. La Rochelle, MIA
Julien Morlier
Univ. Bordeaux, Bordeaux
IMS
1
Goal of sport video analysis
2
Improve athletes performances for teachers and athletes
through tools
Sport video analysis today: improving performance of athletes, efficient coaching, study of
competitors
MediaEval 2018 EURECOM, Sophia Antipolis
1- Goal
- Extract strokes in the temporal dimension
- Classify the strokes
MediaEval 2018 EURECOM, Sophia Antipolis3
Offensive Forehand Loop
Input
Output
t
[1] H. Bilen, B. Fernando, E. Gavves, and A. Vedaldi, “Action recognition with dynamic image networks,” CoRR, vol. abs/1612.00738, 2016.
[2] J. Carreira and, A. Zisserman, “Quo vadis, action recognition? A new model and the kinetics dataset,” CoRR, vol. abs/1705.07750, 2017.
[3] G. Varol, I. Laptev, and C. Schmid, “Long-term temporal convolutions for action recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 40, no. 6, pp.
1510–1517, 2018.
Use of Dynamic Images[1]
Very deep 3D CNN[2]
Long-term Temporal Convolutions[3]
2 - Related Work
4 MediaEval 2018 EURECOM, Sophia Antipolis
Popular Corpus : UCF-101
Corpus TTStroke-21
5
129 videos at 120 fps
1 387 / 1 074 annotations before / after filtering for 20 classes
Total of 1 048 strokes extracted
Handedness video player:
Left 28
Right 101
Annotation length:
Min - 76 frames (0.63s)
Max - 272 frames (2.27s)
Mean - 173 frames +- 43.76 (1.45s +- 0.36)
Actions length:
Min - 99 frames (0.82s)
Max - 276 frames (2.30s)
Mean - 174 frames +- 43.14 (1.46s +- 0.36)
Acquisition
Annotation platform
Samples TTStroke-21
MIA/ULr, LABRI/UBx, IMS/UBx
CRISP project
In a continuous completion process: recording
and annotation sessions 1/month MediaEval 2018 EURECOM, Sophia Antipolis
[4] C. Liu, “Beyond pixels: Exploring new representations and applications for motion analysis,” Ph.D. dissertation, Massachusetts Institute of Technology, 5
2009.
[5] Z. Zivkovic and F. van der Heijden, “Efficient adaptive density estimation per image pixel for the task of background subtraction,” Pattern Recognition
Letters, vol. 27, no. 7, pp. 773–780, 2006.
[6] P-E. Martin, J. Benois-Pineau, R. Péteri and J.Morlier. Sport Action Recognition with Siamese Spatio-Temporal CNNs: Application to Table Tennis.
Proceedings of the International Conference on Content-Based Multimedia Indexing (CBMI), 2018, La Rochelle, France
Our Research [6] (1)
6 MediaEval 2018 EURECOM, Sophia Antipolis
Original Frame Motion estimation[4]
Foreground estimation[5]
Foreground Motion
Nous ne pouvons pas afficher cette image pour l’instant.
Principle : extract moving sportsmen first, then classify the action
7 MediaEval 2018 EURECOM, Sophia Antipolis
Our Research (2)
Spatial Segmentation using foreground motion
Final segmentation
Smoothing over temporal dimension
using gaussian kernel of size 40 and
standard deviation 4.44.
Xmax
XgXroi
8
Siamese Spatio-Temporal Convolutional Neural Network
(W,H,T) = (100,120,120)
Very deep 3D CNN[1]
[7] J. Carreira and, A. Zisserman, “Quo vadis, action recognition? A new model and the kinetics dataset,” CoRR, vol. abs/1705.07750, 2017.
Our Research(3)
MediaEval 2018 EURECOM, Sophia Antipolis
Proposed tasks for MediaEval(1)
9
Offensive Forehand Loop
Input
- Task n°1 : Stroke recognition with temporal boundaries known
Output
Given a set of clips with annotated boundaries of strokes: recognize each stroke temporally
segmented accordingly to the given taxonomy of 21 classes.
Dataset splitting: Train 80% Test 20%
Evaluation metric : Accuracy
Tool provided : xml reader for annotation extraction
Find the class of each temporal segment
MediaEval 2018 EURECOM, Sophia Antipolis
- Task n°2 : Stroke recognition with temporal decision allowing 10% of error range on temporal borders
Offensive Forehand Loop
Input
t
Output
Proposed tasks for MediaEval(2)
Given a set of clips with annotated boundaries of strokes recognize each stroke NOT temporally
segmented accordingly to the given taxonomy.
Dataset splitting : Train 80% Test 20%
Evaluation metric : Accuracy
Tool provided : xml reader for annotation extraction
Perform on test set whch is neither temporally segmented or
labelled.
t1 t2
MediaEval 2018 EURECOM, Sophia Antipolis10
Proposed tasks for MediaEval
➔ Data formats tools supplied:
➔ - a .dtd file for the expected stroke output xml file
➔ - allows validation of xml files with simple tools “eclipse”
➔ <!ELEMENT stroke(TimeStart, TimeEnd,Label) EMPTY >
➔ ....
➔ <!ELEMENT Lablel (#PCDATA) REQUIRED>
MediaEval 2018 EURECOM, Sophia Antipolis11
➔ Your Questions, please
MediaEval 2018 EURECOM, Sophia Antipolis12

Weitere ähnliche Inhalte

Was ist angesagt?

Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...Lazaros Tsochatzidis
 
Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsKonstantinos Zagoris
 
Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...
Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...
Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...Gyutaek Oh
 
A vision-based uncut crop edge detection method for automated guidance of hea...
A vision-based uncut crop edge detection method for automated guidance of hea...A vision-based uncut crop edge detection method for automated guidance of hea...
A vision-based uncut crop edge detection method for automated guidance of hea...Institute of Agricultural Machinery, NARO
 
A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...Institute of Agricultural Machinery, NARO
 
Lec13: Clustering Based Medical Image Segmentation Methods
Lec13: Clustering Based Medical Image Segmentation MethodsLec13: Clustering Based Medical Image Segmentation Methods
Lec13: Clustering Based Medical Image Segmentation MethodsUlaş Bağcı
 
Segmentation problems in medical images
Segmentation problems in medical imagesSegmentation problems in medical images
Segmentation problems in medical imagesJimin Lee
 
Introduction to deep learning and recent research topics in medical field
Introduction to deep learning and recent research topics in medical fieldIntroduction to deep learning and recent research topics in medical field
Introduction to deep learning and recent research topics in medical fieldJimin Lee
 
Multi-energy Bone Subtraction in Chest Radiography by Eigenvalue Decomposition
Multi-energy Bone Subtraction in Chest Radiography by Eigenvalue DecompositionMulti-energy Bone Subtraction in Chest Radiography by Eigenvalue Decomposition
Multi-energy Bone Subtraction in Chest Radiography by Eigenvalue DecompositionBoahKim2
 

Was ist angesagt? (9)

Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...
 
Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammograms
 
Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...
Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...
Unpaired MR Motion Artifact Deep Learning Using Outlier-Rejecting Bootstrap A...
 
A vision-based uncut crop edge detection method for automated guidance of hea...
A vision-based uncut crop edge detection method for automated guidance of hea...A vision-based uncut crop edge detection method for automated guidance of hea...
A vision-based uncut crop edge detection method for automated guidance of hea...
 
A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...A multi-sensor based uncut crop edge detection method for head-feeding combin...
A multi-sensor based uncut crop edge detection method for head-feeding combin...
 
Lec13: Clustering Based Medical Image Segmentation Methods
Lec13: Clustering Based Medical Image Segmentation MethodsLec13: Clustering Based Medical Image Segmentation Methods
Lec13: Clustering Based Medical Image Segmentation Methods
 
Segmentation problems in medical images
Segmentation problems in medical imagesSegmentation problems in medical images
Segmentation problems in medical images
 
Introduction to deep learning and recent research topics in medical field
Introduction to deep learning and recent research topics in medical fieldIntroduction to deep learning and recent research topics in medical field
Introduction to deep learning and recent research topics in medical field
 
Multi-energy Bone Subtraction in Chest Radiography by Eigenvalue Decomposition
Multi-energy Bone Subtraction in Chest Radiography by Eigenvalue DecompositionMulti-energy Bone Subtraction in Chest Radiography by Eigenvalue Decomposition
Multi-energy Bone Subtraction in Chest Radiography by Eigenvalue Decomposition
 

Ähnlich wie MediaEval 2018: Fine grained sport action recognition: Application to table tennis

Siamese Spatio-temporal convolutional neural network for stroke classificatio...
Siamese Spatio-temporal convolutional neural network for stroke classificatio...Siamese Spatio-temporal convolutional neural network for stroke classificatio...
Siamese Spatio-temporal convolutional neural network for stroke classificatio...multimediaeval
 
Fractional step discriminant pruning
Fractional step discriminant pruningFractional step discriminant pruning
Fractional step discriminant pruningVasileiosMezaris
 
Robust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector MachineRobust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector MachineIRJET Journal
 
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET Journal
 
IRJET- Survey on Detection of Crime
IRJET-  	  Survey on Detection of CrimeIRJET-  	  Survey on Detection of Crime
IRJET- Survey on Detection of CrimeIRJET Journal
 
IRJET- Design an Approach for Prediction of Human Activity Recognition us...
IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...
IRJET- Design an Approach for Prediction of Human Activity Recognition us...IRJET Journal
 
IRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image ProcessingIRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image ProcessingIRJET Journal
 
International Journal of Image Processing (IJIP) Volume (4) Issue (1)
International Journal of Image Processing (IJIP) Volume (4) Issue (1)International Journal of Image Processing (IJIP) Volume (4) Issue (1)
International Journal of Image Processing (IJIP) Volume (4) Issue (1)CSCJournals
 
A Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesA Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesCSCJournals
 
VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...
VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...
VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...sugiuralab
 
Crowd Density Estimation Using Base Line Filtering
Crowd Density Estimation Using Base Line FilteringCrowd Density Estimation Using Base Line Filtering
Crowd Density Estimation Using Base Line Filteringpaperpublications3
 
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...klschoef
 
TVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesTVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesNEERAJ BAGHEL
 
Fast Human Detection in Surveillance Video
Fast Human Detection in Surveillance VideoFast Human Detection in Surveillance Video
Fast Human Detection in Surveillance VideoIOSR Journals
 
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)XAIC
 
Real-Time Face-Age-Gender Detection System
Real-Time Face-Age-Gender Detection SystemReal-Time Face-Age-Gender Detection System
Real-Time Face-Age-Gender Detection SystemIRJET Journal
 
Image annotation - Segmentation & Annotation
Image annotation - Segmentation & AnnotationImage annotation - Segmentation & Annotation
Image annotation - Segmentation & AnnotationTaposh Roy
 
Biomedcompcenter. Main Projects
Biomedcompcenter. Main ProjectsBiomedcompcenter. Main Projects
Biomedcompcenter. Main ProjectsTurlapov
 
Research overview
Research overviewResearch overview
Research overviewdagunisa
 

Ähnlich wie MediaEval 2018: Fine grained sport action recognition: Application to table tennis (20)

Siamese Spatio-temporal convolutional neural network for stroke classificatio...
Siamese Spatio-temporal convolutional neural network for stroke classificatio...Siamese Spatio-temporal convolutional neural network for stroke classificatio...
Siamese Spatio-temporal convolutional neural network for stroke classificatio...
 
Fractional step discriminant pruning
Fractional step discriminant pruningFractional step discriminant pruning
Fractional step discriminant pruning
 
Robust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector MachineRobust Tracking Via Feature Mapping Method and Support Vector Machine
Robust Tracking Via Feature Mapping Method and Support Vector Machine
 
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
 
IRJET- Survey on Detection of Crime
IRJET-  	  Survey on Detection of CrimeIRJET-  	  Survey on Detection of Crime
IRJET- Survey on Detection of Crime
 
IRJET- Design an Approach for Prediction of Human Activity Recognition us...
IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...
IRJET- Design an Approach for Prediction of Human Activity Recognition us...
 
IRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image ProcessingIRJET - Automatic Attendance Provision using Image Processing
IRJET - Automatic Attendance Provision using Image Processing
 
International Journal of Image Processing (IJIP) Volume (4) Issue (1)
International Journal of Image Processing (IJIP) Volume (4) Issue (1)International Journal of Image Processing (IJIP) Volume (4) Issue (1)
International Journal of Image Processing (IJIP) Volume (4) Issue (1)
 
Kb gait-recognition
Kb gait-recognitionKb gait-recognition
Kb gait-recognition
 
A Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal FeaturesA Framework for Human Action Detection via Extraction of Multimodal Features
A Framework for Human Action Detection via Extraction of Multimodal Features
 
VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...
VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...
VoLearn: A Cross-Modal Operable Motion-Learning System Combined with Virtual ...
 
Crowd Density Estimation Using Base Line Filtering
Crowd Density Estimation Using Base Line FilteringCrowd Density Estimation Using Base Line Filtering
Crowd Density Estimation Using Base Line Filtering
 
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
 
TVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using TitlesTVSum: Summarizing Web Videos Using Titles
TVSum: Summarizing Web Videos Using Titles
 
Fast Human Detection in Surveillance Video
Fast Human Detection in Surveillance VideoFast Human Detection in Surveillance Video
Fast Human Detection in Surveillance Video
 
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
파이콘 한국 2019 튜토리얼 - 설명가능인공지능이란? (Part 1)
 
Real-Time Face-Age-Gender Detection System
Real-Time Face-Age-Gender Detection SystemReal-Time Face-Age-Gender Detection System
Real-Time Face-Age-Gender Detection System
 
Image annotation - Segmentation & Annotation
Image annotation - Segmentation & AnnotationImage annotation - Segmentation & Annotation
Image annotation - Segmentation & Annotation
 
Biomedcompcenter. Main Projects
Biomedcompcenter. Main ProjectsBiomedcompcenter. Main Projects
Biomedcompcenter. Main Projects
 
Research overview
Research overviewResearch overview
Research overview
 

Mehr von multimediaeval

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...multimediaeval
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...multimediaeval
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...multimediaeval
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...multimediaeval
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Taskmultimediaeval
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...multimediaeval
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimatormultimediaeval
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...multimediaeval
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Imagesmultimediaeval
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matchingmultimediaeval
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...multimediaeval
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...multimediaeval
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...multimediaeval
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentationmultimediaeval
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detectionmultimediaeval
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...multimediaeval
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attentionmultimediaeval
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...multimediaeval
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...multimediaeval
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...multimediaeval
 

Mehr von multimediaeval (20)

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 

Kürzlich hochgeladen

Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfrohankumarsinghrore1
 
Stages in the normal growth curve
Stages in the normal growth curveStages in the normal growth curve
Stages in the normal growth curveAreesha Ahmad
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Silpa
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptxSilpa
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingadibshanto115
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 

Kürzlich hochgeladen (20)

Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
Stages in the normal growth curve
Stages in the normal growth curveStages in the normal growth curve
Stages in the normal growth curve
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 

MediaEval 2018: Fine grained sport action recognition: Application to table tennis

  • 1. Medieval : Fine grained sport action recognition. Application to Table Tennis. Pierre-Etienne Martin Univ. Bordeaux, LaBRI Jenny Benois-Pineau Univ. Bordeaux, LaBRI Renaud Péteri Univ. La Rochelle, MIA Julien Morlier Univ. Bordeaux, Bordeaux IMS 1
  • 2. Goal of sport video analysis 2 Improve athletes performances for teachers and athletes through tools Sport video analysis today: improving performance of athletes, efficient coaching, study of competitors MediaEval 2018 EURECOM, Sophia Antipolis
  • 3. 1- Goal - Extract strokes in the temporal dimension - Classify the strokes MediaEval 2018 EURECOM, Sophia Antipolis3 Offensive Forehand Loop Input Output t
  • 4. [1] H. Bilen, B. Fernando, E. Gavves, and A. Vedaldi, “Action recognition with dynamic image networks,” CoRR, vol. abs/1612.00738, 2016. [2] J. Carreira and, A. Zisserman, “Quo vadis, action recognition? A new model and the kinetics dataset,” CoRR, vol. abs/1705.07750, 2017. [3] G. Varol, I. Laptev, and C. Schmid, “Long-term temporal convolutions for action recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 40, no. 6, pp. 1510–1517, 2018. Use of Dynamic Images[1] Very deep 3D CNN[2] Long-term Temporal Convolutions[3] 2 - Related Work 4 MediaEval 2018 EURECOM, Sophia Antipolis Popular Corpus : UCF-101
  • 5. Corpus TTStroke-21 5 129 videos at 120 fps 1 387 / 1 074 annotations before / after filtering for 20 classes Total of 1 048 strokes extracted Handedness video player: Left 28 Right 101 Annotation length: Min - 76 frames (0.63s) Max - 272 frames (2.27s) Mean - 173 frames +- 43.76 (1.45s +- 0.36) Actions length: Min - 99 frames (0.82s) Max - 276 frames (2.30s) Mean - 174 frames +- 43.14 (1.46s +- 0.36) Acquisition Annotation platform Samples TTStroke-21 MIA/ULr, LABRI/UBx, IMS/UBx CRISP project In a continuous completion process: recording and annotation sessions 1/month MediaEval 2018 EURECOM, Sophia Antipolis
  • 6. [4] C. Liu, “Beyond pixels: Exploring new representations and applications for motion analysis,” Ph.D. dissertation, Massachusetts Institute of Technology, 5 2009. [5] Z. Zivkovic and F. van der Heijden, “Efficient adaptive density estimation per image pixel for the task of background subtraction,” Pattern Recognition Letters, vol. 27, no. 7, pp. 773–780, 2006. [6] P-E. Martin, J. Benois-Pineau, R. Péteri and J.Morlier. Sport Action Recognition with Siamese Spatio-Temporal CNNs: Application to Table Tennis. Proceedings of the International Conference on Content-Based Multimedia Indexing (CBMI), 2018, La Rochelle, France Our Research [6] (1) 6 MediaEval 2018 EURECOM, Sophia Antipolis Original Frame Motion estimation[4] Foreground estimation[5] Foreground Motion Nous ne pouvons pas afficher cette image pour l’instant. Principle : extract moving sportsmen first, then classify the action
  • 7. 7 MediaEval 2018 EURECOM, Sophia Antipolis Our Research (2) Spatial Segmentation using foreground motion Final segmentation Smoothing over temporal dimension using gaussian kernel of size 40 and standard deviation 4.44. Xmax XgXroi
  • 8. 8 Siamese Spatio-Temporal Convolutional Neural Network (W,H,T) = (100,120,120) Very deep 3D CNN[1] [7] J. Carreira and, A. Zisserman, “Quo vadis, action recognition? A new model and the kinetics dataset,” CoRR, vol. abs/1705.07750, 2017. Our Research(3) MediaEval 2018 EURECOM, Sophia Antipolis
  • 9. Proposed tasks for MediaEval(1) 9 Offensive Forehand Loop Input - Task n°1 : Stroke recognition with temporal boundaries known Output Given a set of clips with annotated boundaries of strokes: recognize each stroke temporally segmented accordingly to the given taxonomy of 21 classes. Dataset splitting: Train 80% Test 20% Evaluation metric : Accuracy Tool provided : xml reader for annotation extraction Find the class of each temporal segment MediaEval 2018 EURECOM, Sophia Antipolis
  • 10. - Task n°2 : Stroke recognition with temporal decision allowing 10% of error range on temporal borders Offensive Forehand Loop Input t Output Proposed tasks for MediaEval(2) Given a set of clips with annotated boundaries of strokes recognize each stroke NOT temporally segmented accordingly to the given taxonomy. Dataset splitting : Train 80% Test 20% Evaluation metric : Accuracy Tool provided : xml reader for annotation extraction Perform on test set whch is neither temporally segmented or labelled. t1 t2 MediaEval 2018 EURECOM, Sophia Antipolis10
  • 11. Proposed tasks for MediaEval ➔ Data formats tools supplied: ➔ - a .dtd file for the expected stroke output xml file ➔ - allows validation of xml files with simple tools “eclipse” ➔ <!ELEMENT stroke(TimeStart, TimeEnd,Label) EMPTY > ➔ .... ➔ <!ELEMENT Lablel (#PCDATA) REQUIRED> MediaEval 2018 EURECOM, Sophia Antipolis11
  • 12. ➔ Your Questions, please MediaEval 2018 EURECOM, Sophia Antipolis12