SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
Continuous Human Action Recognition 
in Ambient Assisted Living Scenarios 
International Workshop on Enhanced 
Living Environments (ELEMENT 2014) 
Würzburg, Germany 
Alexandros Andre Chaaraoui 
Francisco Flórez-Revuelta
Human action recognition in AAL 
Human action recognition with a bag of key poses 
Continuous human action recognition 
Experimentation 
Overview
Architecture for AAL 
Camera 1 
Motion 
Detection 
Human 
Behaviour 
Analysis 
Multi-view Human Behaviour Analysis 
Privacy 
Reasoning 
System 
Alarm Actuators 
Camera 2 
Motion 
Detection 
Human 
Behaviour 
Analysis 
Camera N 
Motion 
Detection 
Human 
Behaviour 
Analysis 
... 
Setup and Profiles 
DB (Activities, 
Inhabitants, 
Objects, ...) 
Log 
Event 
Long-term 
analysis 
... 
... 
Environmental 
Sensor Information 
Caregiver
Human behaviour analysis
Bag of key poses
Use with RGB and RGB-D data
Results with RGB data 
Weizmann 
MuHAVi-­‐8 
MuHAVi-­‐14 
DHA 
IXMAS
Results with RGB-D data
Original method 
Chaaraoui, A.A.; Climent-Pérez, P.; Flórez-Revuelta, F.: Silhouette-based Human Action Recognition 
using Sequences of Key Poses, Pattern Recognition Letters, 34(15):1799–1807, 2013. 
Multi-view action recognition 
Chaaraoui, A.A.; Climent-Pérez, P.; Flórez-Revuelta, F.: An Efficient Approach for Multi-view Human 
Action Recognition Based on Bag-of-Key-Poses, Lecture Notes in Computer Science, 7559:29-40, 
2012. 
Evolutionary optimisation 
Chaaraoui, A.A.; Flórez-Revuelta, F.: Optimizing human action recognition based on a cooperative 
coevolutionary algorithm, Engineering Applications of Artificial Intelligence, Volume 31:116–125, 2014. 
Incremental learning 
Chaaraoui, A.A.; Flórez-Revuelta, F.: Adaptive Human Action Recognition With an Evolving Bag of Key 
Poses, IEEE Transactions on Autonomous Mental Development, 6(2):139-152, 2014 
Use of RGB-D data 
Chaaraoui, A.A.; Padilla-López, J.R.; Climent-Pérez, P.; Flórez-Revuelta, F.: Evolutionary joint selection 
to improve human action recognition with RGB-D devices, Expert Systems with Applications, 41(3): 
786-794, 2014. 
More information
The previous methods work with pre-segmented sequences 
However, accurate recognition and outstanding temporal performance led us to 
extend it for continuous scenarios 
A sliding and growing window is used to process the continuous stream at 
different overlapping locations and scales 
A null class is considered in order to discard unknown actions and avoid false 
positives. This class corresponds to all the behaviours that may be observed 
and have not been modelled during the learning 
Continuous human action recognition is performed by detecting and classifying 
action zones 
Continuous recognition
Action sequences may contain irrelevant segments which are common among 
actions and therefore ambiguous for classication 
Action zones = most discriminative segments with respect to the other action 
classes in the course of an action 
Action zones are shorter than the original sequences. Then, the matching time 
will be signicantly reduced 
This will allow to consider a larger number of sliding windows at every moment 
Action zones
The bag of key poses is built similarly to the original method 
The discrimination value of each key pose wkp is obtained: 
For each training sequence of action class a and specific temporal instant t: 
1. For each action class a, the nearest neighbour key pose kpa(t) is obtained 
2. The raw class evidence values for all the classes 
Learning of action zones
The bag of key poses is built similarly to the original method 
The discrimination value of each key pose wkp is obtained: 
For each training sequence of action class a and specific temporal instant t: 
1. For each action class a, the nearest neighbour key pose kpa(t) is obtained 
2. The raw class evidence values for all the classes 
Learning of action zones
The bag of key poses is built similarly to the original method 
The discrimination value of each key pose wkp is obtained: 
For each training sequence of action class a and specific temporal instant t: 
1. For each action class a, the nearest neighbour key pose kpa(t) is obtained 
2. The raw class evidence values for all the classes 
3. Normalisation is applied with respect to the highest value observed: 
Learning of action zones
4. Gaussian smoothing is performed centered in the current frame, considering 
only the frames from a temporal instant u ≤ t 
5. The final class evidence H (t ) is obtained by attenuating the resulting value: 
Learning of action zones
4. Gaussian smoothing is performed centered in the current frame, considering 
only the frames from a temporal instant u ≤ t 
5. The final class evidence H (t ) is obtained by attenuating the resulting value: 
Action zones are detected by defining thresholds HT1(t), HT2(t),…, HTA(t) 
So, for a sequence belonging to an action class, the action zone is determined 
by the frames where 
Learning of action zones
Then, the action zones for every learning sequence constitute the knowledge 
base 
A sliding and growing window is used to process the continuous stream at 
different overlapping locations and scales 
These segments of key poses are compared with the learned action zones 
using DTW 
In some cases, even the nearest key pose is very different to the input frame 
Therefore, a set of threshold parameters DT1, DT2, …, DTA indicate the highest 
allowed distance to trigger the recognition 
If the match is not good enough, the frame is labelled as null class 
Continuous recognition
Two sets of parameters to establish: HT1(t), HT2(t),…, HTA(t) and DT1, DT2, …, 
DTA 
Set with an evolutionary algorithm that finds the best performing combination for 
both sets 
Comparison with the ground truth is obtained at segment level 
An action must be recognised with a delay lower than τ frames 
Experimentation
Validation with the multi-view IXMAS and the single-view Weizmann datasets 
The windows grows in 5-frame steps and when lengthmax is reached, it slides 10 
frames 
A delayed recognition is accepted for τ = 60 frames ≅ 2 seconds 
Experimentation 
NOTE: 
Approach 1: Use of action zones 
Approach 2: Use of the whole sequences
Francisco (Paco) Flórez-Revuelta 
F.Florez@kingston.ac.uk @fflorezrevuelta 
www.dtic.ua.es/~florez franciscoflorezrevuelta

Weitere ähnliche Inhalte

Andere mochten auch

Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature Francisco (Paco) Florez-Revuelta
 
Wellness check-up for your business plans should include complete mind and bo...
Wellness check-up for your business plans should include complete mind and bo...Wellness check-up for your business plans should include complete mind and bo...
Wellness check-up for your business plans should include complete mind and bo...James Feldman
 
Speculative design
Speculative design Speculative design
Speculative design egoodman
 
Activity recognition for video surveillance
Activity recognition for video surveillanceActivity recognition for video surveillance
Activity recognition for video surveillancevariable_orr
 
Speculative Design - First presentation [27-02]
Speculative Design - First presentation [27-02]Speculative Design - First presentation [27-02]
Speculative Design - First presentation [27-02]Adriano Mescia
 
Wireless Body Area network
Wireless Body Area networkWireless Body Area network
Wireless Body Area networkRajeev N
 
PERSON DE-IDENTIFICATION IN VIDEOS ppt
PERSON DE-IDENTIFICATION IN VIDEOS  pptPERSON DE-IDENTIFICATION IN VIDEOS  ppt
PERSON DE-IDENTIFICATION IN VIDEOS pptsravya raju
 
Wireless Body Area Networking
Wireless Body Area NetworkingWireless Body Area Networking
Wireless Body Area Networkingsubhradeep mitra
 
Wireless Body Area Networks
Wireless Body Area Networks Wireless Body Area Networks
Wireless Body Area Networks Musfiqur Rahman
 
Wireless Body Area Networks for healthcare (Wban)
Wireless Body Area Networks  for healthcare (Wban)Wireless Body Area Networks  for healthcare (Wban)
Wireless Body Area Networks for healthcare (Wban)no0orcom
 
Child Internet Safety summit: online safety and inspection
Child Internet Safety summit: online safety and inspectionChild Internet Safety summit: online safety and inspection
Child Internet Safety summit: online safety and inspectionOfsted
 
Wireless body area network
Wireless body area networkWireless body area network
Wireless body area networkPratik Gondaliya
 
wireless body area networks(WBAN)
wireless body area networks(WBAN)wireless body area networks(WBAN)
wireless body area networks(WBAN)kramsheeda
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCEAswinraj Manickam
 
Moving object detection
Moving object detectionMoving object detection
Moving object detectionManav Mittal
 

Andere mochten auch (18)

Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature Human action recognition optimization based on evolutionary feature
Human action recognition optimization based on evolutionary feature
 
Wellness check-up for your business plans should include complete mind and bo...
Wellness check-up for your business plans should include complete mind and bo...Wellness check-up for your business plans should include complete mind and bo...
Wellness check-up for your business plans should include complete mind and bo...
 
Speculative design
Speculative design Speculative design
Speculative design
 
Activity recognition for video surveillance
Activity recognition for video surveillanceActivity recognition for video surveillance
Activity recognition for video surveillance
 
Speculative Design - First presentation [27-02]
Speculative Design - First presentation [27-02]Speculative Design - First presentation [27-02]
Speculative Design - First presentation [27-02]
 
Wban
WbanWban
Wban
 
Wireless Body Area network
Wireless Body Area networkWireless Body Area network
Wireless Body Area network
 
PERSON DE-IDENTIFICATION IN VIDEOS ppt
PERSON DE-IDENTIFICATION IN VIDEOS  pptPERSON DE-IDENTIFICATION IN VIDEOS  ppt
PERSON DE-IDENTIFICATION IN VIDEOS ppt
 
Wireless Body Area Networking
Wireless Body Area NetworkingWireless Body Area Networking
Wireless Body Area Networking
 
Wireless Body Area Networks
Wireless Body Area Networks Wireless Body Area Networks
Wireless Body Area Networks
 
Wireless Body Area Networks for healthcare (Wban)
Wireless Body Area Networks  for healthcare (Wban)Wireless Body Area Networks  for healthcare (Wban)
Wireless Body Area Networks for healthcare (Wban)
 
Child Internet Safety summit: online safety and inspection
Child Internet Safety summit: online safety and inspectionChild Internet Safety summit: online safety and inspection
Child Internet Safety summit: online safety and inspection
 
Wireless body area network
Wireless body area networkWireless body area network
Wireless body area network
 
Body Area Network
Body Area NetworkBody Area Network
Body Area Network
 
wireless body area networks(WBAN)
wireless body area networks(WBAN)wireless body area networks(WBAN)
wireless body area networks(WBAN)
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 
Surveillance
SurveillanceSurveillance
Surveillance
 

Ähnlich wie Continuous human action recognition in ambient assisted living scenarios

Multi-Layer Perceptrons
Multi-Layer PerceptronsMulti-Layer Perceptrons
Multi-Layer PerceptronsESCOM
 
Deep Transfer Learning: A new deep learning glitch classification method for ...
Deep Transfer Learning: A new deep learning glitch classification method for ...Deep Transfer Learning: A new deep learning glitch classification method for ...
Deep Transfer Learning: A new deep learning glitch classification method for ...Daniel George
 
Lecture 7: Recurrent Neural Networks
Lecture 7: Recurrent Neural NetworksLecture 7: Recurrent Neural Networks
Lecture 7: Recurrent Neural NetworksSang Jun Lee
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzerbutest
 
Evaluating Classification Algorithms Applied To Data Streams Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams   Esteban DonatoEvaluating Classification Algorithms Applied To Data Streams   Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams Esteban DonatoEsteban Donato
 
14889574 dl ml RNN Deeplearning MMMm.ppt
14889574 dl ml RNN Deeplearning MMMm.ppt14889574 dl ml RNN Deeplearning MMMm.ppt
14889574 dl ml RNN Deeplearning MMMm.pptManiMaran230751
 
Temporal Hypermap Theory and Application
Temporal Hypermap Theory and ApplicationTemporal Hypermap Theory and Application
Temporal Hypermap Theory and ApplicationAbel Nyamapfene
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learningJunaid Bhat
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.pptyang947066
 
EE660_Report_YaxinLiu_8448347171
EE660_Report_YaxinLiu_8448347171EE660_Report_YaxinLiu_8448347171
EE660_Report_YaxinLiu_8448347171Yaxin Liu
 
Machine learning and Neural Networks
Machine learning and Neural NetworksMachine learning and Neural Networks
Machine learning and Neural Networksbutest
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierNeha Kulkarni
 
journal paper publication
journal paper publicationjournal paper publication
journal paper publicationchaitanya451336
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautzbutest
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautzbutest
 
KNN.pptx
KNN.pptxKNN.pptx
KNN.pptxdfgd7
 
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...sherinmm
 
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...sherinmm
 

Ähnlich wie Continuous human action recognition in ambient assisted living scenarios (20)

Multi-Layer Perceptrons
Multi-Layer PerceptronsMulti-Layer Perceptrons
Multi-Layer Perceptrons
 
Deep Transfer Learning: A new deep learning glitch classification method for ...
Deep Transfer Learning: A new deep learning glitch classification method for ...Deep Transfer Learning: A new deep learning glitch classification method for ...
Deep Transfer Learning: A new deep learning glitch classification method for ...
 
Lecture 7: Recurrent Neural Networks
Lecture 7: Recurrent Neural NetworksLecture 7: Recurrent Neural Networks
Lecture 7: Recurrent Neural Networks
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
 
Evaluating Classification Algorithms Applied To Data Streams Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams   Esteban DonatoEvaluating Classification Algorithms Applied To Data Streams   Esteban Donato
Evaluating Classification Algorithms Applied To Data Streams Esteban Donato
 
14889574 dl ml RNN Deeplearning MMMm.ppt
14889574 dl ml RNN Deeplearning MMMm.ppt14889574 dl ml RNN Deeplearning MMMm.ppt
14889574 dl ml RNN Deeplearning MMMm.ppt
 
Temporal Hypermap Theory and Application
Temporal Hypermap Theory and ApplicationTemporal Hypermap Theory and Application
Temporal Hypermap Theory and Application
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.ppt
 
EE660_Report_YaxinLiu_8448347171
EE660_Report_YaxinLiu_8448347171EE660_Report_YaxinLiu_8448347171
EE660_Report_YaxinLiu_8448347171
 
Machine learning and Neural Networks
Machine learning and Neural NetworksMachine learning and Neural Networks
Machine learning and Neural Networks
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor Classifier
 
nnml.ppt
nnml.pptnnml.ppt
nnml.ppt
 
journal paper publication
journal paper publicationjournal paper publication
journal paper publication
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
 
KNN.pptx
KNN.pptxKNN.pptx
KNN.pptx
 
KNN.pptx
KNN.pptxKNN.pptx
KNN.pptx
 
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
 
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
 

Mehr von Francisco (Paco) Florez-Revuelta

Visual monitoring of people in private spaces. From the “Big Brother” to the...
Visual monitoring of people in private spaces.  From the “Big Brother” to the...Visual monitoring of people in private spaces.  From the “Big Brother” to the...
Visual monitoring of people in private spaces. From the “Big Brother” to the...Francisco (Paco) Florez-Revuelta
 
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmTopology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmFrancisco (Paco) Florez-Revuelta
 
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...Francisco (Paco) Florez-Revuelta
 
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...Francisco (Paco) Florez-Revuelta
 
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014Francisco (Paco) Florez-Revuelta
 
Evolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortionsEvolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortionsFrancisco (Paco) Florez-Revuelta
 
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Francisco (Paco) Florez-Revuelta
 
Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...Francisco (Paco) Florez-Revuelta
 
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...Francisco (Paco) Florez-Revuelta
 
Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...Francisco (Paco) Florez-Revuelta
 
Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...Francisco (Paco) Florez-Revuelta
 

Mehr von Francisco (Paco) Florez-Revuelta (16)

AAL Forum 2016
AAL Forum 2016AAL Forum 2016
AAL Forum 2016
 
Visual monitoring of people in private spaces
Visual monitoring of people in private spacesVisual monitoring of people in private spaces
Visual monitoring of people in private spaces
 
Visual monitoring of people in private spaces. From the “Big Brother” to the...
Visual monitoring of people in private spaces.  From the “Big Brother” to the...Visual monitoring of people in private spaces.  From the “Big Brother” to the...
Visual monitoring of people in private spaces. From the “Big Brother” to the...
 
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary AlgorithmTopology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
Topology-Preserving Ordering of the RGB Space with an Evolutionary Algorithm
 
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
Using Wavelet Transform to Disaggregate Electrical Power Consumption into the...
 
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
A Multiple Kernel Learning Based Fusion Framework for Real-Time Multi-View Ac...
 
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
Visual Privacy by Context: A Level-Based Visualisation Scheme - UCAmI 2014
 
Vision-based monitoring of people in private spaces
Vision-based monitoring of people in private spacesVision-based monitoring of people in private spaces
Vision-based monitoring of people in private spaces
 
Evolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortionsEvolutionary algorithm for dense pixel matching in presence of distortions
Evolutionary algorithm for dense pixel matching in presence of distortions
 
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
Fusion of Skeletal and Silhouette-based Features for Human Action Recognition...
 
Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...Analysis of the topology preservation of accelerated Growing Neural Gas in th...
Analysis of the topology preservation of accelerated Growing Neural Gas in th...
 
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
Producto Topográfico Geodésico: Mejora para medir la preservación de la topol...
 
Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...Specific crossover and mutation operators for a grouping problem based on int...
Specific crossover and mutation operators for a grouping problem based on int...
 
Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...Ambient Assisted Living. ICT for the support of the daily living of elderly a...
Ambient Assisted Living. ICT for the support of the daily living of elderly a...
 
Use of Kinect in a Smart Home
Use of Kinect in a Smart HomeUse of Kinect in a Smart Home
Use of Kinect in a Smart Home
 
Metaltic
MetalticMetaltic
Metaltic
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 

Kürzlich hochgeladen (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Continuous human action recognition in ambient assisted living scenarios

  • 1. Continuous Human Action Recognition in Ambient Assisted Living Scenarios International Workshop on Enhanced Living Environments (ELEMENT 2014) Würzburg, Germany Alexandros Andre Chaaraoui Francisco Flórez-Revuelta
  • 2. Human action recognition in AAL Human action recognition with a bag of key poses Continuous human action recognition Experimentation Overview
  • 3. Architecture for AAL Camera 1 Motion Detection Human Behaviour Analysis Multi-view Human Behaviour Analysis Privacy Reasoning System Alarm Actuators Camera 2 Motion Detection Human Behaviour Analysis Camera N Motion Detection Human Behaviour Analysis ... Setup and Profiles DB (Activities, Inhabitants, Objects, ...) Log Event Long-term analysis ... ... Environmental Sensor Information Caregiver
  • 5. Bag of key poses
  • 6. Use with RGB and RGB-D data
  • 7. Results with RGB data Weizmann MuHAVi-­‐8 MuHAVi-­‐14 DHA IXMAS
  • 9. Original method Chaaraoui, A.A.; Climent-Pérez, P.; Flórez-Revuelta, F.: Silhouette-based Human Action Recognition using Sequences of Key Poses, Pattern Recognition Letters, 34(15):1799–1807, 2013. Multi-view action recognition Chaaraoui, A.A.; Climent-Pérez, P.; Flórez-Revuelta, F.: An Efficient Approach for Multi-view Human Action Recognition Based on Bag-of-Key-Poses, Lecture Notes in Computer Science, 7559:29-40, 2012. Evolutionary optimisation Chaaraoui, A.A.; Flórez-Revuelta, F.: Optimizing human action recognition based on a cooperative coevolutionary algorithm, Engineering Applications of Artificial Intelligence, Volume 31:116–125, 2014. Incremental learning Chaaraoui, A.A.; Flórez-Revuelta, F.: Adaptive Human Action Recognition With an Evolving Bag of Key Poses, IEEE Transactions on Autonomous Mental Development, 6(2):139-152, 2014 Use of RGB-D data Chaaraoui, A.A.; Padilla-López, J.R.; Climent-Pérez, P.; Flórez-Revuelta, F.: Evolutionary joint selection to improve human action recognition with RGB-D devices, Expert Systems with Applications, 41(3): 786-794, 2014. More information
  • 10. The previous methods work with pre-segmented sequences However, accurate recognition and outstanding temporal performance led us to extend it for continuous scenarios A sliding and growing window is used to process the continuous stream at different overlapping locations and scales A null class is considered in order to discard unknown actions and avoid false positives. This class corresponds to all the behaviours that may be observed and have not been modelled during the learning Continuous human action recognition is performed by detecting and classifying action zones Continuous recognition
  • 11. Action sequences may contain irrelevant segments which are common among actions and therefore ambiguous for classication Action zones = most discriminative segments with respect to the other action classes in the course of an action Action zones are shorter than the original sequences. Then, the matching time will be signicantly reduced This will allow to consider a larger number of sliding windows at every moment Action zones
  • 12. The bag of key poses is built similarly to the original method The discrimination value of each key pose wkp is obtained: For each training sequence of action class a and specific temporal instant t: 1. For each action class a, the nearest neighbour key pose kpa(t) is obtained 2. The raw class evidence values for all the classes Learning of action zones
  • 13. The bag of key poses is built similarly to the original method The discrimination value of each key pose wkp is obtained: For each training sequence of action class a and specific temporal instant t: 1. For each action class a, the nearest neighbour key pose kpa(t) is obtained 2. The raw class evidence values for all the classes Learning of action zones
  • 14. The bag of key poses is built similarly to the original method The discrimination value of each key pose wkp is obtained: For each training sequence of action class a and specific temporal instant t: 1. For each action class a, the nearest neighbour key pose kpa(t) is obtained 2. The raw class evidence values for all the classes 3. Normalisation is applied with respect to the highest value observed: Learning of action zones
  • 15. 4. Gaussian smoothing is performed centered in the current frame, considering only the frames from a temporal instant u ≤ t 5. The final class evidence H (t ) is obtained by attenuating the resulting value: Learning of action zones
  • 16. 4. Gaussian smoothing is performed centered in the current frame, considering only the frames from a temporal instant u ≤ t 5. The final class evidence H (t ) is obtained by attenuating the resulting value: Action zones are detected by defining thresholds HT1(t), HT2(t),…, HTA(t) So, for a sequence belonging to an action class, the action zone is determined by the frames where Learning of action zones
  • 17. Then, the action zones for every learning sequence constitute the knowledge base A sliding and growing window is used to process the continuous stream at different overlapping locations and scales These segments of key poses are compared with the learned action zones using DTW In some cases, even the nearest key pose is very different to the input frame Therefore, a set of threshold parameters DT1, DT2, …, DTA indicate the highest allowed distance to trigger the recognition If the match is not good enough, the frame is labelled as null class Continuous recognition
  • 18. Two sets of parameters to establish: HT1(t), HT2(t),…, HTA(t) and DT1, DT2, …, DTA Set with an evolutionary algorithm that finds the best performing combination for both sets Comparison with the ground truth is obtained at segment level An action must be recognised with a delay lower than τ frames Experimentation
  • 19. Validation with the multi-view IXMAS and the single-view Weizmann datasets The windows grows in 5-frame steps and when lengthmax is reached, it slides 10 frames A delayed recognition is accepted for τ = 60 frames ≅ 2 seconds Experimentation NOTE: Approach 1: Use of action zones Approach 2: Use of the whole sequences
  • 20. Francisco (Paco) Flórez-Revuelta F.Florez@kingston.ac.uk @fflorezrevuelta www.dtic.ua.es/~florez franciscoflorezrevuelta