SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Detection of Violent Scenes using Affective Features




Esra Acar
Competence Center Information Retrieval and Machine Learning




               4. October 2012
▶ Motivation
▶ Background
▶ The Method
   Audio Features
   Visual Features
▶ Results & Discussion
▶ Conclusions & Future Work




          4. October 2012   Detection of Violent Scenes using Affective Features   2
▶ The MediaEval 2012 Affect Task aims at detecting violent
  segments in movies.

▶ A recent work on horror scene recognition detects horror
  scenes by affect-related features.

▶ We investigate whether
   affect-related features provide good representation of
    violence, and
   making abstractions from low-level features is better than
    directly using low-level data.



          4. October 2012   Detection of Violent Scenes using Affective Features   3
▶ The affective content of a video corresponds to
   the intensity (i.e. arousal), and
   the type (i.e. valence) of emotion
  expected to arise in the user while watching that video.

▶ Recent research efforts propose methods to map low-level
  features to high-level emotions.

▶ Film-makers intend to elicit some particular emotions (i.e.
  expected emotions) in the audience.
▶ When we refer to violence as an expected emotion in
  videos, affect-related features are applicable for violence
  detection.

           4. October 2012   Detection of Violent Scenes using Affective Features   4
▶ The method uses affect-related audio and visual features to
  represent violence.

▶ Low-level audio and visual features are extracted.
▶ Mid-level audio features are generated based on the low-
  level ones.

▶ The audio and visual features are then fused at the feature-
  level and a two-class SVM is trained.




          4. October 2012   Detection of Violent Scenes using Affective Features   5
▶ Affect-related audio features used in the work are:
   Audio energy
          related to the arousal aspect.
          high/low energy corresponds to high/low emotion intensity.
          used for vocal emotion detection.
      Mel-Frequency Cepstral Coefficients (MFCC)
          related to the arousal aspect.
          works well for the detection of excitement/non-excitement.
      Pitch
          related to the valence aspect.
          significant for emotion detection in speech and music.




           4. October 2012   Detection of Violent Scenes using Affective Features   6
▶   Each video shot has different numbers of audio energy, pitch and
    MFCC feature vectors (due to varying shot durations).
▶   Audio representations are obtained by computing mean and
    standard deviation for these audio features.

▶   Abstraction for MFCC:
     MFCC-based Bag of Audio Words (BoAW) approach is chosen to
       generate mid-level audio representations.
     Two different audio vocabularies are constructed: violence and
       non-violence vocabularies (by k-means clustering).
     MFCC of violent/non-violent movie segments are used to
       construct violence/non-violence words.
     Violence and non-violence word occurrences within a video shot
       are represented by a BoAW histogram.



            4. October 2012   Detection of Violent Scenes using Affective Features   7
▶ Average motion
   related to the arousal aspect.
   Motion vectors are computed using block-based motion
     estimation.
   Average motion is found as the average magnitude of all
     motion vectors.

▶ We compute average motion around the keyframe of video
  shots.




          4. October 2012   Detection of Violent Scenes using Affective Features   8
▶ The performance of our method was assessed on 3
  Hollywood movies (evaluation criteria: MAP at 100).

▶ We submitted five runs:
   r1-low-level: low-level audio and visual features,
   Runs based on mid-level audio and low-level visual features
         r2-mid-level-100k: 100k samples for dictionary construction,
         r3-mid-level-300k: 300k samples for dictionary construction,
         r4-mid-level-300k-default: 300k samples for dictionary
          construction + SVM default parameters, and
         r5-mid-level-500k: 500k samples for dictionary construction.




           4. October 2012   Detection of Violent Scenes using Affective Features   9
Table 1 – Precision, Recall and F-measure at shot level
        Run                                    AED-P           AED-R             AED-F
        r1-low-level                             0.141           0.597           0.2287
        r2-mid-level-100k                        0.140           0.629           0.2285
        r3-mid-level-300k                        0.144           0.625           0.2337
        r4-mid-level-300k-default                0.190           0.627           0.2971
        r5-mid-level-500k                        0.154           0.603           0.2457

                   Table 2 – Mean Average Precision (MAP) values at 20 and 100
        Run                                      MAP at 20               MAP at 100
        r1-low-level                                0.2132                   0.18502
        r2-mid-level-100k                           0.2037                   0.14492
        r3-mid-level-300k                           0.3593                   0.18538
        r4-mid-level-300k-default                   0.1547                   0.15083
        r5-mid-level-500k                            0.15                    0.11527
▶   Slightly better performance is achieved with mid-level representations compared
    to the low-level one.
▶   Using affect-related features to describe violence needs some improvements
    (especially the visual part).
               4. October 2012         Detection of Violent Scenes using Affective Features   10
▶ The aim of this work was to investigate whether affect-
  related features are well-suited to describe violence.
▶ Affect-related audio and visual features are merged in a
  supervised manner using SVM.

▶ Our main finding is that more sophisticated affect-related
  features are necessary to describe the content of videos
  (especially the visual part).
▶ Our next step in this work is to use
   mid-level features such as human facial features, and
   more sophisticated motion descriptors such as Lagrangian
      measures
  for video content representation.


          4. October 2012   Detection of Violent Scenes using Affective Features   11
Thank you!


                  Questions?



4. October 2012    Detection of Violent Scenes using Affective Features   12

Weitere Àhnliche Inhalte

Ähnlich wie mevd2012 esra_

[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...
[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...
[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...
YONG ZHENG
 
Human Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-PosterHuman Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-Poster
nikhilus85
 
Thesis_Oral
Thesis_OralThesis_Oral
Thesis_Oral
FanDong Wei
 

Ähnlich wie mevd2012 esra_ (20)

TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...
TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...
TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...
 
TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...
TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...
TARGET DETECTION AND CLASSIFICATION PERFORMANCE ENHANCEMENT USING SUPERRESOLU...
 
MUSIC RECOMMENDATION THROUGH FACE RECOGNITION AND EMOTION DETECTION
MUSIC RECOMMENDATION THROUGH FACE RECOGNITION AND EMOTION DETECTIONMUSIC RECOMMENDATION THROUGH FACE RECOGNITION AND EMOTION DETECTION
MUSIC RECOMMENDATION THROUGH FACE RECOGNITION AND EMOTION DETECTION
 
A survey on coding binary visual features extracted from video sequences
A survey on coding binary visual features extracted from video sequencesA survey on coding binary visual features extracted from video sequences
A survey on coding binary visual features extracted from video sequences
 
IMAGE RESOLUTION ENHANCEMENT BY USING SWT AND DWT
IMAGE RESOLUTION ENHANCEMENT BY USING SWT AND DWTIMAGE RESOLUTION ENHANCEMENT BY USING SWT AND DWT
IMAGE RESOLUTION ENHANCEMENT BY USING SWT AND DWT
 
C04841417
C04841417C04841417
C04841417
 
Quality assessment of stereoscopic 3 d image compression by binocular integra...
Quality assessment of stereoscopic 3 d image compression by binocular integra...Quality assessment of stereoscopic 3 d image compression by binocular integra...
Quality assessment of stereoscopic 3 d image compression by binocular integra...
 
Ee 417 Senior Design
Ee 417 Senior DesignEe 417 Senior Design
Ee 417 Senior Design
 
[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...
[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...
[CARS2012@RecSys]Optimal Feature Selection for Context-Aware Recommendation u...
 
An Effective Attendance Management System using Face Recognition
An Effective Attendance Management System using Face RecognitionAn Effective Attendance Management System using Face Recognition
An Effective Attendance Management System using Face Recognition
 
Applying Media Content Analysis to the Production of Musical Videos as Summar...
Applying Media Content Analysis to the Production of Musical Videos as Summar...Applying Media Content Analysis to the Production of Musical Videos as Summar...
Applying Media Content Analysis to the Production of Musical Videos as Summar...
 
Human Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-PosterHuman Action Recognition Based on Spacio-temporal features-Poster
Human Action Recognition Based on Spacio-temporal features-Poster
 
IRJET- Heuristic Approach for Low Light Image Enhancement using Deep Learning
IRJET- Heuristic Approach for Low Light Image Enhancement using Deep LearningIRJET- Heuristic Approach for Low Light Image Enhancement using Deep Learning
IRJET- Heuristic Approach for Low Light Image Enhancement using Deep Learning
 
IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...
 
IRJET- Face Spoofing Detection Based on Texture Analysis and Color Space Conv...
IRJET- Face Spoofing Detection Based on Texture Analysis and Color Space Conv...IRJET- Face Spoofing Detection Based on Texture Analysis and Color Space Conv...
IRJET- Face Spoofing Detection Based on Texture Analysis and Color Space Conv...
 
Quality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and FutureQuality of Multimedia Experience: Past, Present and Future
Quality of Multimedia Experience: Past, Present and Future
 
Study of Various Edge Detection Techniques and Implementation of Real Time Fr...
Study of Various Edge Detection Techniques and Implementation of Real Time Fr...Study of Various Edge Detection Techniques and Implementation of Real Time Fr...
Study of Various Edge Detection Techniques and Implementation of Real Time Fr...
 
IRJET- SEPD Technique for Removal of Salt and Pepper Noise in Digital Images
IRJET- SEPD Technique for Removal of Salt and Pepper Noise in Digital ImagesIRJET- SEPD Technique for Removal of Salt and Pepper Noise in Digital Images
IRJET- SEPD Technique for Removal of Salt and Pepper Noise in Digital Images
 
IRJET- A Review on Various Restoration Techniques in Digital Image Processing
IRJET- A Review on Various Restoration Techniques in Digital Image ProcessingIRJET- A Review on Various Restoration Techniques in Digital Image Processing
IRJET- A Review on Various Restoration Techniques in Digital Image Processing
 
Thesis_Oral
Thesis_OralThesis_Oral
Thesis_Oral
 

Mehr von MediaEval2012

MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 Opening
MediaEval2012
 
A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding
MediaEval2012
 
Brave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music TaggingBrave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music Tagging
MediaEval2012
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012
MediaEval2012
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking Task
MediaEval2012
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
MediaEval2012
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
MediaEval2012
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account Matching
MediaEval2012
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and Onwards
MediaEval2012
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy Task
MediaEval2012
 
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval2012
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval2012
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
MediaEval2012
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
MediaEval2012
 
The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes Detectio
MediaEval2012
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
MediaEval2012
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic method
MediaEval2012
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
MediaEval2012
 
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
MediaEval2012
 

Mehr von MediaEval2012 (20)

MediaEval 2012 Opening
MediaEval 2012 OpeningMediaEval 2012 Opening
MediaEval 2012 Opening
 
Closing
ClosingClosing
Closing
 
A Multimodal Approach for Video Geocoding
A Multimodal Approach for   Video Geocoding A Multimodal Approach for   Video Geocoding
A Multimodal Approach for Video Geocoding
 
Brave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music TaggingBrave New Task: Musiclef Multimodal Music Tagging
Brave New Task: Musiclef Multimodal Music Tagging
 
Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012Search and Hyperlinking Task at MediaEval 2012
Search and Hyperlinking Task at MediaEval 2012
 
CUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking TaskCUNI at MediaEval 2012: Search and Hyperlinking Task
CUNI at MediaEval 2012: Search and Hyperlinking Task
 
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking TaskDCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
DCU Search Runs at MediaEval 2012: Search and Hyperlinking Task
 
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
Ghent University-IBBT at MediaEval 2012 Search and Hyperlinking: Semantic Sim...
 
Brave New Task: User Account Matching
Brave New Task: User Account MatchingBrave New Task: User Account Matching
Brave New Task: User Account Matching
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and Onwards
 
Overview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy TaskOverview of MediaEval 2012 Visual Privacy Task
Overview of MediaEval 2012 Visual Privacy Task
 
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
MediaEval 2012 Visual Privacy Task: Privacy and Intelligibility through Pixel...
 
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
MediaEval 2012 Visual Privacy Task: Applying Transform-domain Scrambling to A...
 
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature...
 
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene...
 
The MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes DetectioThe MediaEval 2012 Affect Task: Violent Scenes Detectio
The MediaEval 2012 Affect Task: Violent Scenes Detectio
 
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect TaskNII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task
 
LIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic methodLIG at MediaEval 2012 affect task: use of a generic method
LIG at MediaEval 2012 affect task: use of a generic method
 
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
Violence Detection in Video by Large Scale Multi-Scale Local Binary Pattern D...
 
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
ARF @ MediaEval 2012: An Uninformed Approach to Violence Detection in Hollywo...
 

KĂŒrzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

KĂŒrzlich hochgeladen (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Navi Mumbai Call Girls đŸ„° 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls đŸ„° 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls đŸ„° 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls đŸ„° 8617370543 Service Offer VIP Hot Model
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

mevd2012 esra_

  • 1. Detection of Violent Scenes using Affective Features Esra Acar Competence Center Information Retrieval and Machine Learning 4. October 2012
  • 2. ▶ Motivation ▶ Background ▶ The Method  Audio Features  Visual Features ▶ Results & Discussion ▶ Conclusions & Future Work 4. October 2012 Detection of Violent Scenes using Affective Features 2
  • 3. ▶ The MediaEval 2012 Affect Task aims at detecting violent segments in movies. ▶ A recent work on horror scene recognition detects horror scenes by affect-related features. ▶ We investigate whether  affect-related features provide good representation of violence, and  making abstractions from low-level features is better than directly using low-level data. 4. October 2012 Detection of Violent Scenes using Affective Features 3
  • 4. ▶ The affective content of a video corresponds to  the intensity (i.e. arousal), and  the type (i.e. valence) of emotion expected to arise in the user while watching that video. ▶ Recent research efforts propose methods to map low-level features to high-level emotions. ▶ Film-makers intend to elicit some particular emotions (i.e. expected emotions) in the audience. ▶ When we refer to violence as an expected emotion in videos, affect-related features are applicable for violence detection. 4. October 2012 Detection of Violent Scenes using Affective Features 4
  • 5. ▶ The method uses affect-related audio and visual features to represent violence. ▶ Low-level audio and visual features are extracted. ▶ Mid-level audio features are generated based on the low- level ones. ▶ The audio and visual features are then fused at the feature- level and a two-class SVM is trained. 4. October 2012 Detection of Violent Scenes using Affective Features 5
  • 6. ▶ Affect-related audio features used in the work are:  Audio energy  related to the arousal aspect.  high/low energy corresponds to high/low emotion intensity.  used for vocal emotion detection.  Mel-Frequency Cepstral Coefficients (MFCC)  related to the arousal aspect.  works well for the detection of excitement/non-excitement.  Pitch  related to the valence aspect.  significant for emotion detection in speech and music. 4. October 2012 Detection of Violent Scenes using Affective Features 6
  • 7. ▶ Each video shot has different numbers of audio energy, pitch and MFCC feature vectors (due to varying shot durations). ▶ Audio representations are obtained by computing mean and standard deviation for these audio features. ▶ Abstraction for MFCC:  MFCC-based Bag of Audio Words (BoAW) approach is chosen to generate mid-level audio representations.  Two different audio vocabularies are constructed: violence and non-violence vocabularies (by k-means clustering).  MFCC of violent/non-violent movie segments are used to construct violence/non-violence words.  Violence and non-violence word occurrences within a video shot are represented by a BoAW histogram. 4. October 2012 Detection of Violent Scenes using Affective Features 7
  • 8. ▶ Average motion  related to the arousal aspect.  Motion vectors are computed using block-based motion estimation.  Average motion is found as the average magnitude of all motion vectors. ▶ We compute average motion around the keyframe of video shots. 4. October 2012 Detection of Violent Scenes using Affective Features 8
  • 9. ▶ The performance of our method was assessed on 3 Hollywood movies (evaluation criteria: MAP at 100). ▶ We submitted five runs:  r1-low-level: low-level audio and visual features,  Runs based on mid-level audio and low-level visual features  r2-mid-level-100k: 100k samples for dictionary construction,  r3-mid-level-300k: 300k samples for dictionary construction,  r4-mid-level-300k-default: 300k samples for dictionary construction + SVM default parameters, and  r5-mid-level-500k: 500k samples for dictionary construction. 4. October 2012 Detection of Violent Scenes using Affective Features 9
  • 10. Table 1 – Precision, Recall and F-measure at shot level Run AED-P AED-R AED-F r1-low-level 0.141 0.597 0.2287 r2-mid-level-100k 0.140 0.629 0.2285 r3-mid-level-300k 0.144 0.625 0.2337 r4-mid-level-300k-default 0.190 0.627 0.2971 r5-mid-level-500k 0.154 0.603 0.2457 Table 2 – Mean Average Precision (MAP) values at 20 and 100 Run MAP at 20 MAP at 100 r1-low-level 0.2132 0.18502 r2-mid-level-100k 0.2037 0.14492 r3-mid-level-300k 0.3593 0.18538 r4-mid-level-300k-default 0.1547 0.15083 r5-mid-level-500k 0.15 0.11527 ▶ Slightly better performance is achieved with mid-level representations compared to the low-level one. ▶ Using affect-related features to describe violence needs some improvements (especially the visual part). 4. October 2012 Detection of Violent Scenes using Affective Features 10
  • 11. ▶ The aim of this work was to investigate whether affect- related features are well-suited to describe violence. ▶ Affect-related audio and visual features are merged in a supervised manner using SVM. ▶ Our main finding is that more sophisticated affect-related features are necessary to describe the content of videos (especially the visual part). ▶ Our next step in this work is to use  mid-level features such as human facial features, and  more sophisticated motion descriptors such as Lagrangian measures for video content representation. 4. October 2012 Detection of Violent Scenes using Affective Features 11
  • 12. Thank you! Questions? 4. October 2012 Detection of Violent Scenes using Affective Features 12