SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
A fast smart-cropping method and dataset for
video retargeting
Konstantinos Apostolidis, Vasileios Mezaris
Information Technologies Institute (ITI), CERTH
Thessaloniki, Greece
This work was supported by the H2020 project ReTV (grant agreement No 780656)
Problem Statement
● Traditional TV and desktop computer monitors: landscape aspect ratios (16:9 or 4:3)
● Nowadays, mobile devices use different aspect ratios
● Video sharing platforms dictate the use of specific video aspect ratios
● Existing videos would have to be transformed to comply with their specifications
● Straightforward approaches for transforming a video to a different aspect ratio:
○ static cropping of content
○ padding the frames with black borders
➢significant loss of visual content that might even be in the center of
attention
➢shrinks the original video by introducing large borders in the output video
Problem Statement
● The results of such simple approaches are often unsatisfactory
● Common video aspect ratio transformation methods of the literature:
○ Warping
○ Seam carving
➢Both introduce distortions and may alter the semantics of the video
Problem Statement
1. SmartVidCrop:
○ New video cropping method that minimizes the loss of semantically important
visual content
○ Selects the main focus out of the multiple possible salient regions of the video
○ Introduces a new filtering-through-clustering processing step
2. RetargetVid:
○ First publicly available benchmark dataset for video retargeting
○ Ground-truth video cropping results for 200 videos
○ Each video was annotated by 6 human annotators
○ Two target aspect ratios
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
ow
oh
fh
fw
ow/oh = 16/9
fw/fh = 4/5
fh = oh
fw < ow
fh
ow
oh
fw
Original frame
Final frame
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
ow/oh = 4/5
fw/fh = 16/9
fh < oh
fw = ow
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
yi
xi
t
Time-series of center of mass
displacement
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
t
Shot transitions Video
frames
Shot #1
Shot #2
Shot #3
Proposed Method
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
Smoothed time-series
Inferred time-series
t
Smoothed time-series of center of mass
displacement
● Read the first out of every five videos frames
● Remove borders
● Calculate crop window dimensions
● Saliency detection
● Thresholding
● Filtering-through-clustering procedure
● Center of mass
● Shot detection
● Time-series smoothing
Proposed Method
RetargetVid dataset
● Motivation: Objective comparisons of video retargeting methods are difficult and
scarce in the literature
● Construction of a dataset for the task of video aspect ratio transformation
● Publicly available annotations
● 200 videos from the publicly available videos of the DHF1k dataset
● Annotations by 6 human subjects
● 1:3 and 3:1 target aspect ratios
● Dataset presents a good balance between inter-annotator agreement and diversity
Results
● Google AutoFlip: a cropping method for video retargeting with available source code
● We utilize the constructed RetargetVid dataset to compare our method to AutoFlip
Method Worst Best Mean t
Results for 1:3 target aspect ratio
AutoFlip 50.0 52.1 50.8 40
Ours (w/o clustering step) 45.3 47.4 46.1 17
SmartCrop 48.6 50.9 49.9 19
Results for 3:1 target aspect ratio
AutoFlip 70.9 74.7 72.2 41
Ours (w/o clustering step) 66.3 72.9 68.1 17
SmartCrop 70.1 73.6 71.4 20
Results
RetargetVid dataset and SmartVidCrop
● Source code of SmartVidCrop
● Ground-truth annotations for the RetargetVid dataset
● Updated results of our SmartVidCrop method, achieved by further optimizing it
are publicly available at:
https://github.com/bmezaris/RetargetVid
Try it yourself at:
Contacts:
Vasileios Mezaris, bmezaris@iti.gr
Konstantinos Apostolidis, kapost@iti.gr
This work was supported by the H2020 project ReTV (grant agreement No 780656)
http://multimedia2.iti.gr/videosmartcropping/service/start.html
Or ask us about the underlying algorithms
and how these can be integrated in your system

Weitere ähnliche Inhalte

Mehr von VasileiosMezaris

Combining textual and visual features for Ad-hoc Video Search
Combining textual and visual features for Ad-hoc Video SearchCombining textual and visual features for Ad-hoc Video Search
Combining textual and visual features for Ad-hoc Video SearchVasileiosMezaris
 
Explaining the decisions of image/video classifiers
Explaining the decisions of image/video classifiersExplaining the decisions of image/video classifiers
Explaining the decisions of image/video classifiersVasileiosMezaris
 
Learning visual explanations for DCNN-based image classifiers using an attent...
Learning visual explanations for DCNN-based image classifiers using an attent...Learning visual explanations for DCNN-based image classifiers using an attent...
Learning visual explanations for DCNN-based image classifiers using an attent...VasileiosMezaris
 
Are all combinations equal? Combining textual and visual features with multi...
Are all combinations equal?  Combining textual and visual features with multi...Are all combinations equal?  Combining textual and visual features with multi...
Are all combinations equal? Combining textual and visual features with multi...VasileiosMezaris
 
CA-SUM Video Summarization
CA-SUM Video SummarizationCA-SUM Video Summarization
CA-SUM Video SummarizationVasileiosMezaris
 
Video smart cropping web application
Video smart cropping web applicationVideo smart cropping web application
Video smart cropping web applicationVasileiosMezaris
 
PGL SUM Video Summarization
PGL SUM Video SummarizationPGL SUM Video Summarization
PGL SUM Video SummarizationVasileiosMezaris
 
Hard-Negatives Selection Strategy for Cross-Modal Retrieval
Hard-Negatives Selection Strategy for Cross-Modal RetrievalHard-Negatives Selection Strategy for Cross-Modal Retrieval
Hard-Negatives Selection Strategy for Cross-Modal RetrievalVasileiosMezaris
 
Misinformation on the internet: Video and AI
Misinformation on the internet: Video and AIMisinformation on the internet: Video and AI
Misinformation on the internet: Video and AIVasileiosMezaris
 
PoR_evaluation_measure_acm_mm_2020
PoR_evaluation_measure_acm_mm_2020PoR_evaluation_measure_acm_mm_2020
PoR_evaluation_measure_acm_mm_2020VasileiosMezaris
 
GAN-based video summarization
GAN-based video summarizationGAN-based video summarization
GAN-based video summarizationVasileiosMezaris
 
Migration-related video retrieval
Migration-related video retrievalMigration-related video retrieval
Migration-related video retrievalVasileiosMezaris
 
Fractional step discriminant pruning
Fractional step discriminant pruningFractional step discriminant pruning
Fractional step discriminant pruningVasileiosMezaris
 
Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1VasileiosMezaris
 
Video, AI and News: video analysis and verification technologies for supporti...
Video, AI and News: video analysis and verification technologies for supporti...Video, AI and News: video analysis and verification technologies for supporti...
Video, AI and News: video analysis and verification technologies for supporti...VasileiosMezaris
 
Unsupervised Video Summarization via Attention-Driven Adversarial Learning
Unsupervised Video Summarization via Attention-Driven Adversarial LearningUnsupervised Video Summarization via Attention-Driven Adversarial Learning
Unsupervised Video Summarization via Attention-Driven Adversarial LearningVasileiosMezaris
 
Subclass deep neural networks
Subclass deep neural networksSubclass deep neural networks
Subclass deep neural networksVasileiosMezaris
 
Video & AI: capabilities and limitations of AI in detecting video manipulations
Video & AI: capabilities and limitations of AI in detecting video manipulationsVideo & AI: capabilities and limitations of AI in detecting video manipulations
Video & AI: capabilities and limitations of AI in detecting video manipulationsVasileiosMezaris
 

Mehr von VasileiosMezaris (20)

Combining textual and visual features for Ad-hoc Video Search
Combining textual and visual features for Ad-hoc Video SearchCombining textual and visual features for Ad-hoc Video Search
Combining textual and visual features for Ad-hoc Video Search
 
Explaining the decisions of image/video classifiers
Explaining the decisions of image/video classifiersExplaining the decisions of image/video classifiers
Explaining the decisions of image/video classifiers
 
Learning visual explanations for DCNN-based image classifiers using an attent...
Learning visual explanations for DCNN-based image classifiers using an attent...Learning visual explanations for DCNN-based image classifiers using an attent...
Learning visual explanations for DCNN-based image classifiers using an attent...
 
Are all combinations equal? Combining textual and visual features with multi...
Are all combinations equal?  Combining textual and visual features with multi...Are all combinations equal?  Combining textual and visual features with multi...
Are all combinations equal? Combining textual and visual features with multi...
 
CA-SUM Video Summarization
CA-SUM Video SummarizationCA-SUM Video Summarization
CA-SUM Video Summarization
 
Video smart cropping web application
Video smart cropping web applicationVideo smart cropping web application
Video smart cropping web application
 
PGL SUM Video Summarization
PGL SUM Video SummarizationPGL SUM Video Summarization
PGL SUM Video Summarization
 
Video Thumbnail Selector
Video Thumbnail SelectorVideo Thumbnail Selector
Video Thumbnail Selector
 
Hard-Negatives Selection Strategy for Cross-Modal Retrieval
Hard-Negatives Selection Strategy for Cross-Modal RetrievalHard-Negatives Selection Strategy for Cross-Modal Retrieval
Hard-Negatives Selection Strategy for Cross-Modal Retrieval
 
Misinformation on the internet: Video and AI
Misinformation on the internet: Video and AIMisinformation on the internet: Video and AI
Misinformation on the internet: Video and AI
 
LSTM Structured Pruning
LSTM Structured PruningLSTM Structured Pruning
LSTM Structured Pruning
 
PoR_evaluation_measure_acm_mm_2020
PoR_evaluation_measure_acm_mm_2020PoR_evaluation_measure_acm_mm_2020
PoR_evaluation_measure_acm_mm_2020
 
GAN-based video summarization
GAN-based video summarizationGAN-based video summarization
GAN-based video summarization
 
Migration-related video retrieval
Migration-related video retrievalMigration-related video retrieval
Migration-related video retrieval
 
Fractional step discriminant pruning
Fractional step discriminant pruningFractional step discriminant pruning
Fractional step discriminant pruning
 
Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1Icme2020 tutorial video_summarization_part1
Icme2020 tutorial video_summarization_part1
 
Video, AI and News: video analysis and verification technologies for supporti...
Video, AI and News: video analysis and verification technologies for supporti...Video, AI and News: video analysis and verification technologies for supporti...
Video, AI and News: video analysis and verification technologies for supporti...
 
Unsupervised Video Summarization via Attention-Driven Adversarial Learning
Unsupervised Video Summarization via Attention-Driven Adversarial LearningUnsupervised Video Summarization via Attention-Driven Adversarial Learning
Unsupervised Video Summarization via Attention-Driven Adversarial Learning
 
Subclass deep neural networks
Subclass deep neural networksSubclass deep neural networks
Subclass deep neural networks
 
Video & AI: capabilities and limitations of AI in detecting video manipulations
Video & AI: capabilities and limitations of AI in detecting video manipulationsVideo & AI: capabilities and limitations of AI in detecting video manipulations
Video & AI: capabilities and limitations of AI in detecting video manipulations
 

Kürzlich hochgeladen

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 

Kürzlich hochgeladen (20)

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 

A Fast Smart-Cropping Method and Dataset for Video Retargeting

  • 1. A fast smart-cropping method and dataset for video retargeting Konstantinos Apostolidis, Vasileios Mezaris Information Technologies Institute (ITI), CERTH Thessaloniki, Greece This work was supported by the H2020 project ReTV (grant agreement No 780656)
  • 2. Problem Statement ● Traditional TV and desktop computer monitors: landscape aspect ratios (16:9 or 4:3) ● Nowadays, mobile devices use different aspect ratios ● Video sharing platforms dictate the use of specific video aspect ratios ● Existing videos would have to be transformed to comply with their specifications
  • 3. ● Straightforward approaches for transforming a video to a different aspect ratio: ○ static cropping of content ○ padding the frames with black borders ➢significant loss of visual content that might even be in the center of attention ➢shrinks the original video by introducing large borders in the output video Problem Statement ● The results of such simple approaches are often unsatisfactory ● Common video aspect ratio transformation methods of the literature: ○ Warping ○ Seam carving ➢Both introduce distortions and may alter the semantics of the video
  • 4. Problem Statement 1. SmartVidCrop: ○ New video cropping method that minimizes the loss of semantically important visual content ○ Selects the main focus out of the multiple possible salient regions of the video ○ Introduces a new filtering-through-clustering processing step 2. RetargetVid: ○ First publicly available benchmark dataset for video retargeting ○ Ground-truth video cropping results for 200 videos ○ Each video was annotated by 6 human annotators ○ Two target aspect ratios
  • 5. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing
  • 6. ow oh fh fw ow/oh = 16/9 fw/fh = 4/5 fh = oh fw < ow fh ow oh fw Original frame Final frame Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing ow/oh = 4/5 fw/fh = 16/9 fh < oh fw = ow
  • 7. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing
  • 8. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing
  • 9. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing
  • 10. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing
  • 11. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing yi xi t Time-series of center of mass displacement
  • 12. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing t Shot transitions Video frames Shot #1 Shot #2 Shot #3
  • 13. Proposed Method ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing Smoothed time-series Inferred time-series t Smoothed time-series of center of mass displacement
  • 14. ● Read the first out of every five videos frames ● Remove borders ● Calculate crop window dimensions ● Saliency detection ● Thresholding ● Filtering-through-clustering procedure ● Center of mass ● Shot detection ● Time-series smoothing Proposed Method
  • 15. RetargetVid dataset ● Motivation: Objective comparisons of video retargeting methods are difficult and scarce in the literature ● Construction of a dataset for the task of video aspect ratio transformation ● Publicly available annotations ● 200 videos from the publicly available videos of the DHF1k dataset ● Annotations by 6 human subjects ● 1:3 and 3:1 target aspect ratios ● Dataset presents a good balance between inter-annotator agreement and diversity
  • 16. Results ● Google AutoFlip: a cropping method for video retargeting with available source code ● We utilize the constructed RetargetVid dataset to compare our method to AutoFlip
  • 17. Method Worst Best Mean t Results for 1:3 target aspect ratio AutoFlip 50.0 52.1 50.8 40 Ours (w/o clustering step) 45.3 47.4 46.1 17 SmartCrop 48.6 50.9 49.9 19 Results for 3:1 target aspect ratio AutoFlip 70.9 74.7 72.2 41 Ours (w/o clustering step) 66.3 72.9 68.1 17 SmartCrop 70.1 73.6 71.4 20 Results
  • 18. RetargetVid dataset and SmartVidCrop ● Source code of SmartVidCrop ● Ground-truth annotations for the RetargetVid dataset ● Updated results of our SmartVidCrop method, achieved by further optimizing it are publicly available at: https://github.com/bmezaris/RetargetVid
  • 19. Try it yourself at: Contacts: Vasileios Mezaris, bmezaris@iti.gr Konstantinos Apostolidis, kapost@iti.gr This work was supported by the H2020 project ReTV (grant agreement No 780656) http://multimedia2.iti.gr/videosmartcropping/service/start.html Or ask us about the underlying algorithms and how these can be integrated in your system