SlideShare ist ein Scribd-Unternehmen logo
1 von 44
Human Action Recognition Using 3D
Joint Information and Pyramidal
HOOFD Features
MSc Thesis by
Barış Can Üstündağ
Thesis Advisor: Prof. Dr. Mustafa Ünel
Buraya görseller eklenecek
• Introduction to Human Action Recognition
– Motivation, Applications
– Related Work
• Human Action Recognition Using 3D Joint Information and HOOFD
Features
– Acquiring Depth Data
– Feature Extraction
• 3D Joints
• HOOFD
– Feature Representation
– Classification
• Experiments
– Datasets
• MSR Action 3D Dataset
• MSR Action Pairs Dataset
• MSRC-12 Gesture Dataset
• Conclusions & Future Work
Outline
• Introduction to Human Action Recognition
– Motivation, Applications
– Related Work
• Human Action Recognition Using 3D Joint Information and HOOFD
Features
– Acquiring Depth Data
– Feature Extraction
• 3D Joints
• HOOFD
– Feature Representation
– Classification
• Experiments
– Datasets
• MSR Action 3D Dataset
• MSR Action Pairs Dataset
• MSRC-12 Gesture Dataset
• Conclusions & Future Work
Outline
• Motion Perception
– Gunnar Johansson [1971]
• Sequence of images for
Human Motion Analysis
• ‘Moving Light Displays’
enable identification of
people and gender
• Motion Capture [2014]
– Dawn of the Planet of
the Apes
Motivation
• Vast amount of Data
YouTube
• More than 34K hours of video
uploaded every day
Surveillance Cameras
• ~30 M cameras in the US
• ~700K video hours every day
Motivation
• Video Categorization
Movies TV YouTube
Motivation
• Video Categorization
– How many human-pixels are there?
Movies TV YouTube
Motivation
• Video Categorization
– How many human-pixels are there?
Movies TV YouTube
35% 34% 40%
Motivation
• Rehabilitation
– 15M people suffer fom
stroke every year
– Automated systems
– Gamification
Motivation - Application
• Release of Low-cost Depth Cameras
– Kinect (2010)
– Google Tango (developers only, 2014)
– Leap Motion (2013)
• Effective and robust performance
given
– Complex background
– Challenging viewpoints
– Occlusions
Motivation – Why depth?
Google Tango
Leap Motion
Intensity
Based
• Extraction of Cuboids
• Motion History
Images, Motion
Energy Images
Depth
Map
Based
• Depth Motion Maps
• Histogram of
Oriented 4D Normals
Skeletal
Data
Based
• SMIJ – Sequence of
most informative
Joints
• HOJ3D – Histogram
of 3D Joint Locations
Related Work
Related Work
• Extraction of Cuboids,
Dollar et al. [CVPR, 2005]
• Motion History Images
Motion Energy Images,
Gorelick et al. [PAMI, 2007]
Intensity
Based
Related Work
• Histogram of Oriented
4D Normals (HON4D)
Oreifej et al. [CVPR, 2013]
• Depth Motion Maps,
Yang et al. [JRTIP, 2012]
Depth Map
Based
Related Work
• Sequence of Most Informative Joints (SMIJ),
Ofli et al. [CVIU, 2013]
• View Invariant Human
Action Recognition
Using Histogram of
3D Joints,
Xia et al. [CVPR, 2012]
Skeletal Data
Based
• Introduction to Human Action Recognition
– Motivation, Applications
– Related Work
• Human Action Recognition Using 3D Joint Information and HOOFD
Features
– Acquiring Depth Data
– Feature Extraction
• 3D Joints
• HOOFD
– Feature Representation
– Classification
• Experiments
– Datasets
• MSR Action 3D Dataset
• MSR Action Pairs Dataset
• MSRC-12 Gesture Dataset
• Conclusions & Future Work
Outline
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
Human Action Recognition Using 3D Joint Information and HOOFD Features
• Depth
Acquisition
• Formation of
shadows
• Eliminating the
noise
• 3D Joints
• HOOFD
• Signal Warping
• Pyramidal HOOFD
Features
• Naive Bayes
• Support Vector
Machines
• Kinect
– Depth data acquisition is
accomplised by using ‘Light
Coding’ Method
• In order to process the depth data
in any application
– Formation of shadows
– Eliminating the noise
Acquiring Depth
Data
Feature
Extraction
Feature
Representation
Classification
• Shadows
– Generated by the foreground objects
• Noise
– Rough object boundaries caused gaps
and holes on depth data
• Bilateral Filter
Space term Range term
Acquiring Depth
Data
Feature
Extraction
Feature
Representation
Classification
• Joint Features
– 20 Joints are provided
by Kinect SDK
– 10 Joint Angles and their
derivatives calculated:
T
kk
k
1

• Joint Features
– Mapped to spherical
Coordinates
– Origin is aligned to
the hip center
– Radius parameter is
discarded
Acquiring
Depth Data
Feature Extraction
Feature
Representation
Classification
• Histogram of Oriented Optical Flows
from Depth (HOOFD)
Acquiring
Depth Data
Feature Extraction
Feature
Representation
Classification
- Optical Flow from Depth Data
•Mapping of depth data to intensity image
•Depth values (z) represented as intensity (I)
•Optical flow field which is invariant to sudden change of brightness
- Optical Flow
• 2D displacement of pixel patches on the
image plane
• Brightness Constancy Equation
• Linearizing assuming small (u,v) using Taylor
Series Expansion
• Histogram of Oriented Optical Flows
from Depth (HOOFD)
),(),,( , ttyyxxItyxI 
0),,(),(),,(),(),,(  tyxIyxvtyxIyxutyxI tyx
t
x
yxu


),(
t
y
yxv


),(
• Optical Flow – Lucas Kanade Method
• Apply it within a local patch
• Minimize using Least-Squares method
 

yx
tyx IvtyxIutyxIvuE
,
2
),,(),,(),(


























ty
tx
yyx
yxx
II
II
v
u
III
III
2
2
bA u

  bAAA TT 1
u



• Optical Flow – Horn Schunk Method
• Assumption: global smoothness in the flow over the whole image
    dydxvvuuE
D
yxyxs   2222Smoothness error:
  dydxIvIuIE
D
tyxc   2Error in brightness
constancy equation
sc EE Minimize:
• Histogram of Oriented Optical Flow from
Depth
• Binning according to:
– Primary Angle between the flow vector and the horizontal axis
– Magnitude of the flow vector
• Orientation & Magnitude images
Histogram Binning example with bin size = 4
)(tan 1
u
v
 22
vuM 
• Signal Warping
– If it is a longer action instance -> Discard frames
– If it is a shorter action instance -> Replicate and
insert frames
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
• Pyramidal HOOFD Features
– Histogram of Oriented Optical Flow from Depth
After obtaining optical flows patches
1. Patches are extracted around each joint
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
• Pyramidal HOOFD Features
– Histogram of Oriented Optical Flow from Depth
After obtaining optical flows patches
1. Patches are extracted around each joint
2. HOOFDs are calculated in a pyramidal
fashion
Level 2
Level 3
Level 1
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
Level 2
Level 3
Level 1
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
Level 2
Level 3
Level 1
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
• Supervised learning
methods
– Training examples are
attached to known classes
• Spam filtering on an e-mail
client
– Examples: Naive Bayes,
Support Vector Machines
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
• Naive Bayes Classifier
– Independence assumption between features
• For example: a car ‘Volkswagen’ with a red color
and 17 inch wheels and these features contribute
independently to classify that this car is a
‘Volkswagen’
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
• Support Vector Machines
– Calculates the choice of the most optimal
hyperplane that defines the decision boundary
between two classes
Acquiring
Depth Data
Feature
Extraction
Feature
Representation
Classification
• Introduction to Human Action Recognition
– Motivation, Applications
– Related Work
• Action Recognition Using 3D Joint Information and HOOFD Features
– Acquiring Depth Data
– Feature Extraction
• 3D Joints
• HOOFD
– Feature Representation
– Classification
• Experiments
– Datasets
• MSR Action 3D Dataset
• MSR Action Pairs Dataset
• MSRC-12 Gesture Dataset
• Conclusions & Future Work
Outline
• Datasets
– MSR Action 3D
• 10 Subjects
• 20 Actions
– MSR Pairs 3D
• 10 Subjects
• 12 Actions
– MSRC-12 Gesture
• 30 Subjects
• 12 Actions
Experiments
Experiment - 1
Settings
• Dataset: MSRC-12 Gesture
• Feature: Joint Features
• Ratio:
• Leave-one-subject-out-cross-valuation
• 50% Training 50% Test
• 75% Training 25% Test
Experiment - 1
Experiment - 1
Experiment - 2
Settings
• Feature: HOOFD Features
• Dataset: MSR Action 3D
• Ratio: 50% Training 50% Test
Experiment - 2
Settings
• Feature: HOOFD Features
• Dataset: MSR Action 3D
• Ratio: 50% Training 50% Test
Experiment - 2
Settings
• Feature: HOOFD Features
• Dataset: MSR Action 3D
• Ratio: 50% Training 50% Test
Smash Action Forward Punch Action
Experiment - 3
Settings
• Feature: HOOFD Features
• Dataset: MSR Action Pairs
• Ratio: 50% Training 50% Test
Conclusion & Future Work
• We developed a novel human action recognition framework by fusing 3D Joint information and
HOOFD features
• We proposed a new feature called Histogram of Oriented Optical Flow from Depth (HOOFD)
• Several experiments with publicly available datasets were conducted to assess the performance of
the proposed technique.
• Comparison with state-of-the-art algorithms show the success of our algorithm.
• As future work,
– Potential of HOOFD will be fully explored
– Different popular classification approaches will be employed (Bag of Words, Random Forest, Boosted Trees)
Thank You ...
???

Weitere ähnliche Inhalte

Was ist angesagt?

Flow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionFlow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionIRJET Journal
 
Stereo Vision Human Motion Detection and Tracking in Uncontrolled Environment
Stereo Vision Human Motion Detection and Tracking in Uncontrolled EnvironmentStereo Vision Human Motion Detection and Tracking in Uncontrolled Environment
Stereo Vision Human Motion Detection and Tracking in Uncontrolled EnvironmentTELKOMNIKA JOURNAL
 
【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder Dataset
【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder Dataset【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder Dataset
【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder DatasetHirokatsu Kataoka
 
motion and feature based person tracking in survillance videos
motion and feature based person tracking in survillance videosmotion and feature based person tracking in survillance videos
motion and feature based person tracking in survillance videosshiva kumar cheruku
 
Camera calibration technique
Camera calibration techniqueCamera calibration technique
Camera calibration techniqueKrzysztof Wegner
 
ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...
ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...
ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...cscpconf
 
Trajectory Based Unusual Human Movement Identification for ATM System
	 Trajectory Based Unusual Human Movement Identification for ATM System	 Trajectory Based Unusual Human Movement Identification for ATM System
Trajectory Based Unusual Human Movement Identification for ATM SystemIRJET Journal
 
【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...
【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...
【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...Hirokatsu Kataoka
 
C0365025029
C0365025029C0365025029
C0365025029theijes
 
Geometric wavelet transform for optical flow estimation algorithm
Geometric wavelet transform for optical flow estimation algorithmGeometric wavelet transform for optical flow estimation algorithm
Geometric wavelet transform for optical flow estimation algorithmijcga
 
【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...
【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...
【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...Hirokatsu Kataoka
 
Iris & Peri-ocular Recognition
Iris & Peri-ocular RecognitionIris & Peri-ocular Recognition
Iris & Peri-ocular RecognitionShashank Dhariwal
 
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTORCHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTORsipij
 
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Hirokatsu Kataoka
 
Pose Machine
Pose MachinePose Machine
Pose MachineWei Yang
 
Development of Human Tracking in Video Surveillance System for Activity Anal...
Development of Human Tracking in Video Surveillance System  for Activity Anal...Development of Human Tracking in Video Surveillance System  for Activity Anal...
Development of Human Tracking in Video Surveillance System for Activity Anal...IOSR Journals
 
Comparison of Some Motion Detection Methods in cases of Single and Multiple M...
Comparison of Some Motion Detection Methods in cases of Single and Multiple M...Comparison of Some Motion Detection Methods in cases of Single and Multiple M...
Comparison of Some Motion Detection Methods in cases of Single and Multiple M...CSCJournals
 

Was ist angesagt? (20)

Flow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action RecognitionFlow Trajectory Approach for Human Action Recognition
Flow Trajectory Approach for Human Action Recognition
 
Background subtraction
Background subtractionBackground subtraction
Background subtraction
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 
Stereo Vision Human Motion Detection and Tracking in Uncontrolled Environment
Stereo Vision Human Motion Detection and Tracking in Uncontrolled EnvironmentStereo Vision Human Motion Detection and Tracking in Uncontrolled Environment
Stereo Vision Human Motion Detection and Tracking in Uncontrolled Environment
 
【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder Dataset
【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder Dataset【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder Dataset
【ITSC2015】Fine-grained Walking Activity Recognition via Driving Recorder Dataset
 
motion and feature based person tracking in survillance videos
motion and feature based person tracking in survillance videosmotion and feature based person tracking in survillance videos
motion and feature based person tracking in survillance videos
 
Camera calibration technique
Camera calibration techniqueCamera calibration technique
Camera calibration technique
 
ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...
ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...
ADAPTIVE, SCALABLE, TRANSFORMDOMAIN GLOBAL MOTION ESTIMATION FOR VIDEO STABIL...
 
Trajectory Based Unusual Human Movement Identification for ATM System
	 Trajectory Based Unusual Human Movement Identification for ATM System	 Trajectory Based Unusual Human Movement Identification for ATM System
Trajectory Based Unusual Human Movement Identification for ATM System
 
【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...
【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...
【BMVC2016】Recognition of Transitional Action for Short-Term Action Prediction...
 
C0365025029
C0365025029C0365025029
C0365025029
 
Geometric wavelet transform for optical flow estimation algorithm
Geometric wavelet transform for optical flow estimation algorithmGeometric wavelet transform for optical flow estimation algorithm
Geometric wavelet transform for optical flow estimation algorithm
 
【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...
【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...
【ISVC2015】Evaluation of Vision-based Human Activity Recognition in Dense Traj...
 
Iris & Peri-ocular Recognition
Iris & Peri-ocular RecognitionIris & Peri-ocular Recognition
Iris & Peri-ocular Recognition
 
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTORCHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
 
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
Extended Co-occurrence HOG with Dense Trajectories for Fine-grained Activity ...
 
Pose Machine
Pose MachinePose Machine
Pose Machine
 
Background Subtraction Algorithm for Moving Object Detection Using Denoising ...
Background Subtraction Algorithm for Moving Object Detection Using Denoising ...Background Subtraction Algorithm for Moving Object Detection Using Denoising ...
Background Subtraction Algorithm for Moving Object Detection Using Denoising ...
 
Development of Human Tracking in Video Surveillance System for Activity Anal...
Development of Human Tracking in Video Surveillance System  for Activity Anal...Development of Human Tracking in Video Surveillance System  for Activity Anal...
Development of Human Tracking in Video Surveillance System for Activity Anal...
 
Comparison of Some Motion Detection Methods in cases of Single and Multiple M...
Comparison of Some Motion Detection Methods in cases of Single and Multiple M...Comparison of Some Motion Detection Methods in cases of Single and Multiple M...
Comparison of Some Motion Detection Methods in cases of Single and Multiple M...
 

Andere mochten auch

An Application of Video Segmentation Using Optical Flows
An Application of Video Segmentation Using Optical FlowsAn Application of Video Segmentation Using Optical Flows
An Application of Video Segmentation Using Optical FlowsYusuf Uzun
 
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Francisco (Paco) Florez-Revuelta
 
Silhouette Rimko
Silhouette RimkoSilhouette Rimko
Silhouette RimkoRimkoc
 
cvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introductioncvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introductionzukun
 
"Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t...
"Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t..."Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t...
"Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t...Edge AI and Vision Alliance
 
Human activity recognition
Human activity recognition Human activity recognition
Human activity recognition srikanthgadam
 
Types of Joint Present in the vertebral column (simplified version)
Types of Joint Present in the vertebral column (simplified version)Types of Joint Present in the vertebral column (simplified version)
Types of Joint Present in the vertebral column (simplified version)Quan Fu Gan
 
Surgical anatomy of knee joint
Surgical anatomy of knee jointSurgical anatomy of knee joint
Surgical anatomy of knee jointTausifahmed786
 
Chapter 8 Joints of the Skeletal System
Chapter 8   Joints of the Skeletal SystemChapter 8   Joints of the Skeletal System
Chapter 8 Joints of the Skeletal Systemsgossett5757
 
Chapter 7 Skeletal System
Chapter 7   Skeletal SystemChapter 7   Skeletal System
Chapter 7 Skeletal Systemsgossett5757
 
Google Presentation
Google PresentationGoogle Presentation
Google Presentationguesta599e2
 
Slideshare Powerpoint presentation
Slideshare Powerpoint presentationSlideshare Powerpoint presentation
Slideshare Powerpoint presentationelliehood
 

Andere mochten auch (15)

An Application of Video Segmentation Using Optical Flows
An Application of Video Segmentation Using Optical FlowsAn Application of Video Segmentation Using Optical Flows
An Application of Video Segmentation Using Optical Flows
 
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
 
Silhouette Rimko
Silhouette RimkoSilhouette Rimko
Silhouette Rimko
 
cvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introductioncvpr2011: human activity recognition - part 1: introduction
cvpr2011: human activity recognition - part 1: introduction
 
Articulations
ArticulationsArticulations
Articulations
 
Human anatomy articulations
Human anatomy articulationsHuman anatomy articulations
Human anatomy articulations
 
"Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t...
"Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t..."Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t...
"Embedded Lucas-Kanade Tracking: How it Works, How to Implement It, and How t...
 
Human activity recognition
Human activity recognition Human activity recognition
Human activity recognition
 
Types of Joint Present in the vertebral column (simplified version)
Types of Joint Present in the vertebral column (simplified version)Types of Joint Present in the vertebral column (simplified version)
Types of Joint Present in the vertebral column (simplified version)
 
Surgical anatomy of knee joint
Surgical anatomy of knee jointSurgical anatomy of knee joint
Surgical anatomy of knee joint
 
Video Inpainting detection using inconsistencies in optical Flow
Video Inpainting detection using inconsistencies in optical FlowVideo Inpainting detection using inconsistencies in optical Flow
Video Inpainting detection using inconsistencies in optical Flow
 
Chapter 8 Joints of the Skeletal System
Chapter 8   Joints of the Skeletal SystemChapter 8   Joints of the Skeletal System
Chapter 8 Joints of the Skeletal System
 
Chapter 7 Skeletal System
Chapter 7   Skeletal SystemChapter 7   Skeletal System
Chapter 7 Skeletal System
 
Google Presentation
Google PresentationGoogle Presentation
Google Presentation
 
Slideshare Powerpoint presentation
Slideshare Powerpoint presentationSlideshare Powerpoint presentation
Slideshare Powerpoint presentation
 

Ähnlich wie Human Action Recognition Using 3D Joint Information and HOOFD Features

TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverviewMotaz El-Saban
 
Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillanceAshfaqul Haque John
 
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality ivaderivader
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...Lucas Jellema
 
Lab-Forming Fields and Field-Forming Labs
Lab-Forming Fields and Field-Forming LabsLab-Forming Fields and Field-Forming Labs
Lab-Forming Fields and Field-Forming LabsKurata Takeshi
 
Soundarya m.sc
Soundarya m.scSoundarya m.sc
Soundarya m.scsowfi
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision Chen Sagiv
 
Depth-Based Real Time Head Motion Tracking Using 3D Template Matching
Depth-Based Real Time Head Motion Tracking Using 3D Template MatchingDepth-Based Real Time Head Motion Tracking Using 3D Template Matching
Depth-Based Real Time Head Motion Tracking Using 3D Template Matching愚 屠
 
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONHOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONcsandit
 
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONHOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONcscpconf
 
Data quality evaluation & orbit identification from scatterometer
Data quality evaluation & orbit identification from scatterometerData quality evaluation & orbit identification from scatterometer
Data quality evaluation & orbit identification from scatterometerMudit Dholakia
 
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...Fan Yang
 
Techniques for Organization and Visualization of Community Photo Collections
Techniques for Organization and Visualization of Community Photo CollectionsTechniques for Organization and Visualization of Community Photo Collections
Techniques for Organization and Visualization of Community Photo CollectionsKumar Srijan
 
PEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake dataPEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake dataAmit Chourasia
 
RS in the context of Big Data-v4
RS in the context of Big Data-v4RS in the context of Big Data-v4
RS in the context of Big Data-v4Khadija Atiya
 
Fusion of Multi-MAV Data
Fusion of Multi-MAV DataFusion of Multi-MAV Data
Fusion of Multi-MAV DataDariolakis
 

Ähnlich wie Human Action Recognition Using 3D Joint Information and HOOFD Features (20)

Dip lect1-sent
Dip lect1-sentDip lect1-sent
Dip lect1-sent
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverview
 
Moving object detection in video surveillance
Moving object detection in video surveillanceMoving object detection in video surveillance
Moving object detection in video surveillance
 
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
 
I0343065072
I0343065072I0343065072
I0343065072
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
 
Lab-Forming Fields and Field-Forming Labs
Lab-Forming Fields and Field-Forming LabsLab-Forming Fields and Field-Forming Labs
Lab-Forming Fields and Field-Forming Labs
 
Soundarya m.sc
Soundarya m.scSoundarya m.sc
Soundarya m.sc
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 
Depth-Based Real Time Head Motion Tracking Using 3D Template Matching
Depth-Based Real Time Head Motion Tracking Using 3D Template MatchingDepth-Based Real Time Head Motion Tracking Using 3D Template Matching
Depth-Based Real Time Head Motion Tracking Using 3D Template Matching
 
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONHOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
 
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTIONHOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
HOL, GDCT AND LDCT FOR PEDESTRIAN DETECTION
 
Data quality evaluation & orbit identification from scatterometer
Data quality evaluation & orbit identification from scatterometerData quality evaluation & orbit identification from scatterometer
Data quality evaluation & orbit identification from scatterometer
 
Spatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understandingSpatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understanding
 
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Pan...
 
PPT s01-machine vision-s2
PPT s01-machine vision-s2PPT s01-machine vision-s2
PPT s01-machine vision-s2
 
Techniques for Organization and Visualization of Community Photo Collections
Techniques for Organization and Visualization of Community Photo CollectionsTechniques for Organization and Visualization of Community Photo Collections
Techniques for Organization and Visualization of Community Photo Collections
 
PEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake dataPEARC17: Visual exploration and analysis of time series earthquake data
PEARC17: Visual exploration and analysis of time series earthquake data
 
RS in the context of Big Data-v4
RS in the context of Big Data-v4RS in the context of Big Data-v4
RS in the context of Big Data-v4
 
Fusion of Multi-MAV Data
Fusion of Multi-MAV DataFusion of Multi-MAV Data
Fusion of Multi-MAV Data
 

Kürzlich hochgeladen

Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...kalichargn70th171
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024Mind IT Systems
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfproinshot.com
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdfPearlKirahMaeRagusta1
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 

Kürzlich hochgeladen (20)

Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 

Human Action Recognition Using 3D Joint Information and HOOFD Features

  • 1. Human Action Recognition Using 3D Joint Information and Pyramidal HOOFD Features MSc Thesis by Barış Can Üstündağ Thesis Advisor: Prof. Dr. Mustafa Ünel Buraya görseller eklenecek
  • 2. • Introduction to Human Action Recognition – Motivation, Applications – Related Work • Human Action Recognition Using 3D Joint Information and HOOFD Features – Acquiring Depth Data – Feature Extraction • 3D Joints • HOOFD – Feature Representation – Classification • Experiments – Datasets • MSR Action 3D Dataset • MSR Action Pairs Dataset • MSRC-12 Gesture Dataset • Conclusions & Future Work Outline
  • 3. • Introduction to Human Action Recognition – Motivation, Applications – Related Work • Human Action Recognition Using 3D Joint Information and HOOFD Features – Acquiring Depth Data – Feature Extraction • 3D Joints • HOOFD – Feature Representation – Classification • Experiments – Datasets • MSR Action 3D Dataset • MSR Action Pairs Dataset • MSRC-12 Gesture Dataset • Conclusions & Future Work Outline
  • 4. • Motion Perception – Gunnar Johansson [1971] • Sequence of images for Human Motion Analysis • ‘Moving Light Displays’ enable identification of people and gender • Motion Capture [2014] – Dawn of the Planet of the Apes Motivation
  • 5. • Vast amount of Data YouTube • More than 34K hours of video uploaded every day Surveillance Cameras • ~30 M cameras in the US • ~700K video hours every day Motivation
  • 6. • Video Categorization Movies TV YouTube Motivation
  • 7. • Video Categorization – How many human-pixels are there? Movies TV YouTube Motivation
  • 8. • Video Categorization – How many human-pixels are there? Movies TV YouTube 35% 34% 40% Motivation
  • 9. • Rehabilitation – 15M people suffer fom stroke every year – Automated systems – Gamification Motivation - Application
  • 10. • Release of Low-cost Depth Cameras – Kinect (2010) – Google Tango (developers only, 2014) – Leap Motion (2013) • Effective and robust performance given – Complex background – Challenging viewpoints – Occlusions Motivation – Why depth? Google Tango Leap Motion
  • 11. Intensity Based • Extraction of Cuboids • Motion History Images, Motion Energy Images Depth Map Based • Depth Motion Maps • Histogram of Oriented 4D Normals Skeletal Data Based • SMIJ – Sequence of most informative Joints • HOJ3D – Histogram of 3D Joint Locations Related Work
  • 12. Related Work • Extraction of Cuboids, Dollar et al. [CVPR, 2005] • Motion History Images Motion Energy Images, Gorelick et al. [PAMI, 2007] Intensity Based
  • 13. Related Work • Histogram of Oriented 4D Normals (HON4D) Oreifej et al. [CVPR, 2013] • Depth Motion Maps, Yang et al. [JRTIP, 2012] Depth Map Based
  • 14. Related Work • Sequence of Most Informative Joints (SMIJ), Ofli et al. [CVIU, 2013] • View Invariant Human Action Recognition Using Histogram of 3D Joints, Xia et al. [CVPR, 2012] Skeletal Data Based
  • 15. • Introduction to Human Action Recognition – Motivation, Applications – Related Work • Human Action Recognition Using 3D Joint Information and HOOFD Features – Acquiring Depth Data – Feature Extraction • 3D Joints • HOOFD – Feature Representation – Classification • Experiments – Datasets • MSR Action 3D Dataset • MSR Action Pairs Dataset • MSRC-12 Gesture Dataset • Conclusions & Future Work Outline
  • 16. Acquiring Depth Data Feature Extraction Feature Representation Classification Human Action Recognition Using 3D Joint Information and HOOFD Features • Depth Acquisition • Formation of shadows • Eliminating the noise • 3D Joints • HOOFD • Signal Warping • Pyramidal HOOFD Features • Naive Bayes • Support Vector Machines
  • 17. • Kinect – Depth data acquisition is accomplised by using ‘Light Coding’ Method • In order to process the depth data in any application – Formation of shadows – Eliminating the noise Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 18. • Shadows – Generated by the foreground objects • Noise – Rough object boundaries caused gaps and holes on depth data • Bilateral Filter Space term Range term Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 19. • Joint Features – 20 Joints are provided by Kinect SDK – 10 Joint Angles and their derivatives calculated: T kk k 1 
  • 20. • Joint Features – Mapped to spherical Coordinates – Origin is aligned to the hip center – Radius parameter is discarded Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 21. • Histogram of Oriented Optical Flows from Depth (HOOFD) Acquiring Depth Data Feature Extraction Feature Representation Classification - Optical Flow from Depth Data •Mapping of depth data to intensity image •Depth values (z) represented as intensity (I) •Optical flow field which is invariant to sudden change of brightness
  • 22. - Optical Flow • 2D displacement of pixel patches on the image plane • Brightness Constancy Equation • Linearizing assuming small (u,v) using Taylor Series Expansion • Histogram of Oriented Optical Flows from Depth (HOOFD) ),(),,( , ttyyxxItyxI  0),,(),(),,(),(),,(  tyxIyxvtyxIyxutyxI tyx t x yxu   ),( t y yxv   ),(
  • 23. • Optical Flow – Lucas Kanade Method • Apply it within a local patch • Minimize using Least-Squares method    yx tyx IvtyxIutyxIvuE , 2 ),,(),,(),(                           ty tx yyx yxx II II v u III III 2 2 bA u    bAAA TT 1 u   
  • 24. • Optical Flow – Horn Schunk Method • Assumption: global smoothness in the flow over the whole image     dydxvvuuE D yxyxs   2222Smoothness error:   dydxIvIuIE D tyxc   2Error in brightness constancy equation sc EE Minimize:
  • 25. • Histogram of Oriented Optical Flow from Depth • Binning according to: – Primary Angle between the flow vector and the horizontal axis – Magnitude of the flow vector • Orientation & Magnitude images Histogram Binning example with bin size = 4 )(tan 1 u v  22 vuM 
  • 26. • Signal Warping – If it is a longer action instance -> Discard frames – If it is a shorter action instance -> Replicate and insert frames Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 27. • Pyramidal HOOFD Features – Histogram of Oriented Optical Flow from Depth After obtaining optical flows patches 1. Patches are extracted around each joint Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 28. • Pyramidal HOOFD Features – Histogram of Oriented Optical Flow from Depth After obtaining optical flows patches 1. Patches are extracted around each joint 2. HOOFDs are calculated in a pyramidal fashion Level 2 Level 3 Level 1 Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 29. Level 2 Level 3 Level 1 Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 30. Level 2 Level 3 Level 1 Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 31. • Supervised learning methods – Training examples are attached to known classes • Spam filtering on an e-mail client – Examples: Naive Bayes, Support Vector Machines Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 32. • Naive Bayes Classifier – Independence assumption between features • For example: a car ‘Volkswagen’ with a red color and 17 inch wheels and these features contribute independently to classify that this car is a ‘Volkswagen’ Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 33. • Support Vector Machines – Calculates the choice of the most optimal hyperplane that defines the decision boundary between two classes Acquiring Depth Data Feature Extraction Feature Representation Classification
  • 34. • Introduction to Human Action Recognition – Motivation, Applications – Related Work • Action Recognition Using 3D Joint Information and HOOFD Features – Acquiring Depth Data – Feature Extraction • 3D Joints • HOOFD – Feature Representation – Classification • Experiments – Datasets • MSR Action 3D Dataset • MSR Action Pairs Dataset • MSRC-12 Gesture Dataset • Conclusions & Future Work Outline
  • 35. • Datasets – MSR Action 3D • 10 Subjects • 20 Actions – MSR Pairs 3D • 10 Subjects • 12 Actions – MSRC-12 Gesture • 30 Subjects • 12 Actions Experiments
  • 36. Experiment - 1 Settings • Dataset: MSRC-12 Gesture • Feature: Joint Features • Ratio: • Leave-one-subject-out-cross-valuation • 50% Training 50% Test • 75% Training 25% Test
  • 39. Experiment - 2 Settings • Feature: HOOFD Features • Dataset: MSR Action 3D • Ratio: 50% Training 50% Test
  • 40. Experiment - 2 Settings • Feature: HOOFD Features • Dataset: MSR Action 3D • Ratio: 50% Training 50% Test
  • 41. Experiment - 2 Settings • Feature: HOOFD Features • Dataset: MSR Action 3D • Ratio: 50% Training 50% Test Smash Action Forward Punch Action
  • 42. Experiment - 3 Settings • Feature: HOOFD Features • Dataset: MSR Action Pairs • Ratio: 50% Training 50% Test
  • 43. Conclusion & Future Work • We developed a novel human action recognition framework by fusing 3D Joint information and HOOFD features • We proposed a new feature called Histogram of Oriented Optical Flow from Depth (HOOFD) • Several experiments with publicly available datasets were conducted to assess the performance of the proposed technique. • Comparison with state-of-the-art algorithms show the success of our algorithm. • As future work, – Potential of HOOFD will be fully explored – Different popular classification approaches will be employed (Bag of Words, Random Forest, Boosted Trees)

Hinweis der Redaktion

  1. Brightness values of individual pixels on a local patch are preserved. By linearizing the equation around I(x,y,t) using Taylor series expansion we obtained the second equation
  2. Even though we assumed that the equation is equal to 0, practically it is not. We then discretize the equation and applied it within a local patch and we acquired this cost function Minimizing this function using least squares gives us the optical flow vectors as a result
  3. However in the literature there is also another method proposed by Horn and Schunk, which introduced a global smoothness constraid over the whole image. This is a useful method to correct errors that is caused by the gaps and holes on depth data. Smoothness is introduced by minimizing the velocities, optical flow vectors
  4. HON4D: To make the descriptors more discriminative, they quantized the 4D space using the vertices of a polychoron Dictionary Learning – Group Sparsity Geometric Constraint with Temporal Pyramid Matching