SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Using Neural Networks and 3D sensors data to model 
LIBRAS gestures recognition 
Gabriel S. P Moreira - Gustavo R. Matuck - Osamu Saotome - Adilson M. da Cunha 
ITA – Brazilian Aeronautics Institute of Technology
Introduction 
 1,799,885 citizens in the country having great difficulty to hear 
(CENSUS 2010) 
 The Brazilian Signs Language (LIBRAS) is the sign language used 
by the majority of the deaf people 
 Second official language by the Brazilian courts, on laws 10.436 
(2002) and 5.626 (2005) 
 This research is an initial investigation of LIBRAS recognition 
using 3D sensors and neural networks
Related Research 
 Main approaches of hand tracking systems: 
 Data-glove-based - electromechanical gloves with sensors 
 Vision-based - continuous image frames from regular RGB or 3D sensors 
 Main machine learning techniques: 
 Artificial Neural Networks (ANN) 
 Hidden Markov Models (HMM) 
 Usage of HMM to recognize 47 Libras gestures types, captured with regular RGB 
cameras [Souza et al. 2007]. 
 3D hand and gesture recognition system using Microsoft Kinect sensor, for 
dynamic gestures, and HD color sensor, for static gestures recognition [Caputo et 
al. 2012] 
 A real-time system to recognize a set of Libras alphabet (only static gestures), 
using MS Kinect and neural networks [Anjo et al. 2012].
LIBRAS alphabet 
 All alphabet signs are made with just one hand. 
 20 (twenty) static gestures and 6 (six) dynamic gestures (H, J, K, X, Y, Z) 
executed with movement.
Case Study 
 Six (6) people volunteered to record the signs of the LIBRAS complete alphabet, 
divided in three groups: deaf people (2), LIBRAS teachers (2), and students (2) 
 LIBRAS recognition process
Case Study – Data Acquisition 
 Gestures recorded using: 
 COTS 3D sensor Creative Senz3D GestureCam™ [Creative 2013] 
 Intel® Skeletal Hand Tracking Library (Experimental Release) [Intel 2013] 
 In this investigation, for each frame, it was recorded the absolute 3D coordinates (X, 
Y, and Z) of 5 fingertips and the center of the hand palm, based on the sensor 
distance. That resulted in 18 attributes with continuous position of hand and fingers. 
Creative Senz3D GestureCam 
Hand Tracking
Case Study – Pre-Processing (1/2) 
 Normalization to Hand Relative Coordinates – Absolute 
fingertips coordinates converted to relative coordinates, in 
relation to hand’s center; 
 Training Samples Transformations - There were only 11 samples 
for each alphabet letter. Therefore, a strategy was created and 
implemented to generate new samples based on training 
samples, by rotation and scaling geometric transformations.
Case Study – Pre-Processing (2/2) 
 Frames Sampling – Because gesture number frames varied with 
gesture duration and camera FPS rate, this processes selected 
18 equidistant frames for each gesture each frame containing 
all finger tips 3D coordinates, at some point in time.
Case Study – Neural Network Model 
 Multi-Layer Perceptron (MLP), with back-propagation learning, 
sigmoidal function activation and Sum Squared Error (SSE) to measure 
the learning performance. 
 MLP Architecture: 
 Input Layer (18 relative fingertips and hand 3D coords x 18 sampled frames for 
each gesture = 324 neurons) 
 Hidden Layer (tested from 50 to 400, stepped by 50) 
 Output Layer (26 neurons – the alphabet letters)
Case Study – Results Analysis 
 Dataset split: Training (64%), Validation (18%), and test (18%) sets 
 The best MLP Network configuration correctly classified 61.54% (32/52) of 
unseen data during the network training (test set). Its architecture was 
composed of 200 neurons on the hidden layer. 
 Alphabet test sets classification results
Limitations 
 Difficulty was to find people capable of accurately 
recording LIBRAS gestures, especially deaf people and 
teachers 
 The chosen 3D sensor presented the following limitations 
 An instability of finger tracking on experimental library (Intel® 
Skeletal Hand Tracking Library) 
 The variability of frames per seconds rate may have introduced a 
bias when comparing samples, because of the temporality of 
gestures
Conclusion 
 Initial investigation of LIBRAS recognition, using 3D sensor data 
and Multi-Layer Perceptron. 
 Six LIBRAS teachers, students, and deaf people have volunteered 
for recording the alphabet gestures. 
 Some strategies were developed for pre-processing gestures 
data , to deal with temporality of gestures, normalizing 
coordinates, and geometric transformations (rotation and 
scaling). 
 The best model correctly classified 61.53% patterns of the test 
set. 
 MLP network was not as effective as expected when trained with 
few samples and operating over noisy gestures data
Recommendations and Sugestions 
 Improvements in classification models can be evaluated 
using more representative samples of gestures data. 
 As a natural continuation of this research, still on its 
beginning phase, the authors intend to explore other 3D 
sensors, pre-processing approaches, and learning models, 
like Support Vector Machines and Hidden Markov Models.
Thank you! 
Gabriel S. P Moreira - Gustavo R. Matuck - Osamu Saotome - Adilson M. da Cunha 
ITA – Brazilian Aeronautics Institute of Technology 
gspmoreira@gmail.com

Weitere ähnliche Inhalte

Was ist angesagt?

VLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKSVLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKSMohan Moki
 
text detection and reading of product label for blind persons
 text detection and reading of product label for blind persons text detection and reading of product label for blind persons
text detection and reading of product label for blind personsVivek Chamorshikar
 
Deep learning: the final frontier for time series analysis and signal process...
Deep learning: the final frontier for time series analysis and signal process...Deep learning: the final frontier for time series analysis and signal process...
Deep learning: the final frontier for time series analysis and signal process...Alex Honchar
 
Curriculum Vitae
Curriculum VitaeCurriculum Vitae
Curriculum Vitaebutest
 
Introduction to Segmentation in Computer vision
Introduction to Segmentation in Computer vision Introduction to Segmentation in Computer vision
Introduction to Segmentation in Computer vision ParrotAI
 
Interactive Wall (Multi Touch Interactive Surface)
Interactive Wall (Multi Touch Interactive Surface)Interactive Wall (Multi Touch Interactive Surface)
Interactive Wall (Multi Touch Interactive Surface)alaxandre
 
Brain-inspired equivalence structure extraction technique for generating fr...
Brain-inspired equivalence structure extraction technique for generating fr...Brain-inspired equivalence structure extraction technique for generating fr...
Brain-inspired equivalence structure extraction technique for generating fr...ドワンゴ 人工知能研究所
 
Motion Object Detection Using BGS Technique
Motion Object Detection Using BGS TechniqueMotion Object Detection Using BGS Technique
Motion Object Detection Using BGS Techniqueijcoa
 

Was ist angesagt? (9)

VLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKSVLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKS
 
text detection and reading of product label for blind persons
 text detection and reading of product label for blind persons text detection and reading of product label for blind persons
text detection and reading of product label for blind persons
 
Deep learning: the final frontier for time series analysis and signal process...
Deep learning: the final frontier for time series analysis and signal process...Deep learning: the final frontier for time series analysis and signal process...
Deep learning: the final frontier for time series analysis and signal process...
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Curriculum Vitae
Curriculum VitaeCurriculum Vitae
Curriculum Vitae
 
Introduction to Segmentation in Computer vision
Introduction to Segmentation in Computer vision Introduction to Segmentation in Computer vision
Introduction to Segmentation in Computer vision
 
Interactive Wall (Multi Touch Interactive Surface)
Interactive Wall (Multi Touch Interactive Surface)Interactive Wall (Multi Touch Interactive Surface)
Interactive Wall (Multi Touch Interactive Surface)
 
Brain-inspired equivalence structure extraction technique for generating fr...
Brain-inspired equivalence structure extraction technique for generating fr...Brain-inspired equivalence structure extraction technique for generating fr...
Brain-inspired equivalence structure extraction technique for generating fr...
 
Motion Object Detection Using BGS Technique
Motion Object Detection Using BGS TechniqueMotion Object Detection Using BGS Technique
Motion Object Detection Using BGS Technique
 

Ähnlich wie Using Neural Networks and 3D sensors data to model LIBRAS gestures recognition - KDMile 2014

Kinect Sensor based Indian Sign Language Detection with Voice Extraction
Kinect Sensor based Indian Sign Language Detection with Voice ExtractionKinect Sensor based Indian Sign Language Detection with Voice Extraction
Kinect Sensor based Indian Sign Language Detection with Voice ExtractionIJCSIS Research Publications
 
Review on Hand Gesture Recognition
Review on Hand Gesture RecognitionReview on Hand Gesture Recognition
Review on Hand Gesture Recognitiondbpublications
 
Real Time Sign Language Detection
Real Time Sign Language DetectionReal Time Sign Language Detection
Real Time Sign Language DetectionIRJET Journal
 
Analysis of Inertial Sensor Data Using Trajectory Recognition Algorithm
Analysis of Inertial Sensor Data Using Trajectory Recognition AlgorithmAnalysis of Inertial Sensor Data Using Trajectory Recognition Algorithm
Analysis of Inertial Sensor Data Using Trajectory Recognition Algorithmijcisjournal
 
Iaetsd appearance based american sign language recognition
Iaetsd appearance based  american sign language recognitionIaetsd appearance based  american sign language recognition
Iaetsd appearance based american sign language recognitionIaetsd Iaetsd
 
Final Thesis Presentation
Final Thesis PresentationFinal Thesis Presentation
Final Thesis PresentationSajid Rasheed
 
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
Speech Recognition using HMM & GMM Models: A Review on Techniques and ApproachesSpeech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approachesijsrd.com
 
Real time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVMReal time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVMijtsrd
 
Smart hand gestures recognition using K-NN based algorithm for video annotati...
Smart hand gestures recognition using K-NN based algorithm for video annotati...Smart hand gestures recognition using K-NN based algorithm for video annotati...
Smart hand gestures recognition using K-NN based algorithm for video annotati...nooriasukmaningtyas
 
Short-term hand gestures recognition based on electromyography signals
Short-term hand gestures recognition based on electromyography signalsShort-term hand gestures recognition based on electromyography signals
Short-term hand gestures recognition based on electromyography signalsIAESIJAI
 
Deep convolutional neural network for hand sign language recognition using mo...
Deep convolutional neural network for hand sign language recognition using mo...Deep convolutional neural network for hand sign language recognition using mo...
Deep convolutional neural network for hand sign language recognition using mo...journalBEEI
 
Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...
Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...
Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...CSCJournals
 
Hand geometry recognition: an approach for closed and separated fingers
Hand geometry recognition: an approach for closed and  separated fingersHand geometry recognition: an approach for closed and  separated fingers
Hand geometry recognition: an approach for closed and separated fingersIJECEIAES
 
Segmentation, tracking and feature extraction
Segmentation, tracking and feature extractionSegmentation, tracking and feature extraction
Segmentation, tracking and feature extractionijcsa
 
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...Waqas Tariq
 
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...IJERD Editor
 
Development of Sign Signal Translation System Based on Altera’s FPGA DE2 Board
Development of Sign Signal Translation System Based on Altera’s FPGA DE2 BoardDevelopment of Sign Signal Translation System Based on Altera’s FPGA DE2 Board
Development of Sign Signal Translation System Based on Altera’s FPGA DE2 BoardWaqas Tariq
 

Ähnlich wie Using Neural Networks and 3D sensors data to model LIBRAS gestures recognition - KDMile 2014 (20)

Kinect Sensor based Indian Sign Language Detection with Voice Extraction
Kinect Sensor based Indian Sign Language Detection with Voice ExtractionKinect Sensor based Indian Sign Language Detection with Voice Extraction
Kinect Sensor based Indian Sign Language Detection with Voice Extraction
 
Review on Hand Gesture Recognition
Review on Hand Gesture RecognitionReview on Hand Gesture Recognition
Review on Hand Gesture Recognition
 
Real Time Sign Language Detection
Real Time Sign Language DetectionReal Time Sign Language Detection
Real Time Sign Language Detection
 
gesture-recognition
gesture-recognitiongesture-recognition
gesture-recognition
 
Analysis of Inertial Sensor Data Using Trajectory Recognition Algorithm
Analysis of Inertial Sensor Data Using Trajectory Recognition AlgorithmAnalysis of Inertial Sensor Data Using Trajectory Recognition Algorithm
Analysis of Inertial Sensor Data Using Trajectory Recognition Algorithm
 
Iaetsd appearance based american sign language recognition
Iaetsd appearance based  american sign language recognitionIaetsd appearance based  american sign language recognition
Iaetsd appearance based american sign language recognition
 
final ppt
final pptfinal ppt
final ppt
 
Final Thesis Presentation
Final Thesis PresentationFinal Thesis Presentation
Final Thesis Presentation
 
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
Speech Recognition using HMM & GMM Models: A Review on Techniques and ApproachesSpeech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
Speech Recognition using HMM & GMM Models: A Review on Techniques and Approaches
 
Real time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVMReal time Myanmar Sign Language Recognition System using PCA and SVM
Real time Myanmar Sign Language Recognition System using PCA and SVM
 
Finger tracking
Finger trackingFinger tracking
Finger tracking
 
Smart hand gestures recognition using K-NN based algorithm for video annotati...
Smart hand gestures recognition using K-NN based algorithm for video annotati...Smart hand gestures recognition using K-NN based algorithm for video annotati...
Smart hand gestures recognition using K-NN based algorithm for video annotati...
 
Short-term hand gestures recognition based on electromyography signals
Short-term hand gestures recognition based on electromyography signalsShort-term hand gestures recognition based on electromyography signals
Short-term hand gestures recognition based on electromyography signals
 
Deep convolutional neural network for hand sign language recognition using mo...
Deep convolutional neural network for hand sign language recognition using mo...Deep convolutional neural network for hand sign language recognition using mo...
Deep convolutional neural network for hand sign language recognition using mo...
 
Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...
Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...
Using The Hausdorff Algorithm to Enhance Kinect's Recognition of Arabic Sign ...
 
Hand geometry recognition: an approach for closed and separated fingers
Hand geometry recognition: an approach for closed and  separated fingersHand geometry recognition: an approach for closed and  separated fingers
Hand geometry recognition: an approach for closed and separated fingers
 
Segmentation, tracking and feature extraction
Segmentation, tracking and feature extractionSegmentation, tracking and feature extraction
Segmentation, tracking and feature extraction
 
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
Hand Segmentation Techniques to Hand Gesture Recognition for Natural Human Co...
 
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
A Novel Framework For Numerical Character Recognition With Zoning Distance Fe...
 
Development of Sign Signal Translation System Based on Altera’s FPGA DE2 Board
Development of Sign Signal Translation System Based on Altera’s FPGA DE2 BoardDevelopment of Sign Signal Translation System Based on Altera’s FPGA DE2 Board
Development of Sign Signal Translation System Based on Altera’s FPGA DE2 Board
 

Mehr von Gabriel Moreira

[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...
[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...
[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...Gabriel Moreira
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...Gabriel Moreira
 
Deep Learning for Recommender Systems @ TDC SP 2019
Deep Learning for Recommender Systems @ TDC SP 2019Deep Learning for Recommender Systems @ TDC SP 2019
Deep Learning for Recommender Systems @ TDC SP 2019Gabriel Moreira
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...Gabriel Moreira
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceGabriel Moreira
 
Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação Gabriel Moreira
 
Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018Gabriel Moreira
 
CI&T Tech Summit 2017 - Machine Learning para Sistemas de Recomendação
CI&T Tech Summit 2017 - Machine Learning para Sistemas de RecomendaçãoCI&T Tech Summit 2017 - Machine Learning para Sistemas de Recomendação
CI&T Tech Summit 2017 - Machine Learning para Sistemas de RecomendaçãoGabriel Moreira
 
Feature Engineering - Getting most out of data for predictive models - TDC 2017
Feature Engineering - Getting most out of data for predictive models - TDC 2017Feature Engineering - Getting most out of data for predictive models - TDC 2017
Feature Engineering - Getting most out of data for predictive models - TDC 2017Gabriel Moreira
 
Feature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive modelsFeature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive modelsGabriel Moreira
 
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...Gabriel Moreira
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsGabriel Moreira
 
Python for Data Science - Python Brasil 11 (2015)
Python for Data Science - Python Brasil 11 (2015)Python for Data Science - Python Brasil 11 (2015)
Python for Data Science - Python Brasil 11 (2015)Gabriel Moreira
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Gabriel Moreira
 
Developing GeoGames for Education with Kinect and Android for ArcGIS Runtime
Developing GeoGames for Education with Kinect and Android for ArcGIS RuntimeDeveloping GeoGames for Education with Kinect and Android for ArcGIS Runtime
Developing GeoGames for Education with Kinect and Android for ArcGIS RuntimeGabriel Moreira
 
Dojo Imagem de Android - 19/06/2012
Dojo Imagem de Android - 19/06/2012Dojo Imagem de Android - 19/06/2012
Dojo Imagem de Android - 19/06/2012Gabriel Moreira
 
Agile Testing e outros amendoins
Agile Testing e outros amendoinsAgile Testing e outros amendoins
Agile Testing e outros amendoinsGabriel Moreira
 
ArcGIS Runtime For Android
ArcGIS Runtime For AndroidArcGIS Runtime For Android
ArcGIS Runtime For AndroidGabriel Moreira
 

Mehr von Gabriel Moreira (20)

[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...
[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...
[Phd Thesis Defense] CHAMELEON: A Deep Learning Meta-Architecture for News Re...
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
 
Deep Learning for Recommender Systems @ TDC SP 2019
Deep Learning for Recommender Systems @ TDC SP 2019Deep Learning for Recommender Systems @ TDC SP 2019
Deep Learning for Recommender Systems @ TDC SP 2019
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação
 
Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018Deep Recommender Systems - PAPIs.io LATAM 2018
Deep Recommender Systems - PAPIs.io LATAM 2018
 
CI&T Tech Summit 2017 - Machine Learning para Sistemas de Recomendação
CI&T Tech Summit 2017 - Machine Learning para Sistemas de RecomendaçãoCI&T Tech Summit 2017 - Machine Learning para Sistemas de Recomendação
CI&T Tech Summit 2017 - Machine Learning para Sistemas de Recomendação
 
Feature Engineering - Getting most out of data for predictive models - TDC 2017
Feature Engineering - Getting most out of data for predictive models - TDC 2017Feature Engineering - Getting most out of data for predictive models - TDC 2017
Feature Engineering - Getting most out of data for predictive models - TDC 2017
 
Feature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive modelsFeature Engineering - Getting most out of data for predictive models
Feature Engineering - Getting most out of data for predictive models
 
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
 
lsrs15_ciandt
lsrs15_ciandtlsrs15_ciandt
lsrs15_ciandt
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender Systems
 
Python for Data Science - Python Brasil 11 (2015)
Python for Data Science - Python Brasil 11 (2015)Python for Data Science - Python Brasil 11 (2015)
Python for Data Science - Python Brasil 11 (2015)
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015
 
Python for Data Science
Python for Data SciencePython for Data Science
Python for Data Science
 
Developing GeoGames for Education with Kinect and Android for ArcGIS Runtime
Developing GeoGames for Education with Kinect and Android for ArcGIS RuntimeDeveloping GeoGames for Education with Kinect and Android for ArcGIS Runtime
Developing GeoGames for Education with Kinect and Android for ArcGIS Runtime
 
Dojo Imagem de Android - 19/06/2012
Dojo Imagem de Android - 19/06/2012Dojo Imagem de Android - 19/06/2012
Dojo Imagem de Android - 19/06/2012
 
Agile Testing e outros amendoins
Agile Testing e outros amendoinsAgile Testing e outros amendoins
Agile Testing e outros amendoins
 
ArcGIS Runtime For Android
ArcGIS Runtime For AndroidArcGIS Runtime For Android
ArcGIS Runtime For Android
 

Kürzlich hochgeladen

React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 

Kürzlich hochgeladen (20)

React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 

Using Neural Networks and 3D sensors data to model LIBRAS gestures recognition - KDMile 2014

  • 1. Using Neural Networks and 3D sensors data to model LIBRAS gestures recognition Gabriel S. P Moreira - Gustavo R. Matuck - Osamu Saotome - Adilson M. da Cunha ITA – Brazilian Aeronautics Institute of Technology
  • 2. Introduction  1,799,885 citizens in the country having great difficulty to hear (CENSUS 2010)  The Brazilian Signs Language (LIBRAS) is the sign language used by the majority of the deaf people  Second official language by the Brazilian courts, on laws 10.436 (2002) and 5.626 (2005)  This research is an initial investigation of LIBRAS recognition using 3D sensors and neural networks
  • 3. Related Research  Main approaches of hand tracking systems:  Data-glove-based - electromechanical gloves with sensors  Vision-based - continuous image frames from regular RGB or 3D sensors  Main machine learning techniques:  Artificial Neural Networks (ANN)  Hidden Markov Models (HMM)  Usage of HMM to recognize 47 Libras gestures types, captured with regular RGB cameras [Souza et al. 2007].  3D hand and gesture recognition system using Microsoft Kinect sensor, for dynamic gestures, and HD color sensor, for static gestures recognition [Caputo et al. 2012]  A real-time system to recognize a set of Libras alphabet (only static gestures), using MS Kinect and neural networks [Anjo et al. 2012].
  • 4. LIBRAS alphabet  All alphabet signs are made with just one hand.  20 (twenty) static gestures and 6 (six) dynamic gestures (H, J, K, X, Y, Z) executed with movement.
  • 5. Case Study  Six (6) people volunteered to record the signs of the LIBRAS complete alphabet, divided in three groups: deaf people (2), LIBRAS teachers (2), and students (2)  LIBRAS recognition process
  • 6. Case Study – Data Acquisition  Gestures recorded using:  COTS 3D sensor Creative Senz3D GestureCam™ [Creative 2013]  Intel® Skeletal Hand Tracking Library (Experimental Release) [Intel 2013]  In this investigation, for each frame, it was recorded the absolute 3D coordinates (X, Y, and Z) of 5 fingertips and the center of the hand palm, based on the sensor distance. That resulted in 18 attributes with continuous position of hand and fingers. Creative Senz3D GestureCam Hand Tracking
  • 7. Case Study – Pre-Processing (1/2)  Normalization to Hand Relative Coordinates – Absolute fingertips coordinates converted to relative coordinates, in relation to hand’s center;  Training Samples Transformations - There were only 11 samples for each alphabet letter. Therefore, a strategy was created and implemented to generate new samples based on training samples, by rotation and scaling geometric transformations.
  • 8. Case Study – Pre-Processing (2/2)  Frames Sampling – Because gesture number frames varied with gesture duration and camera FPS rate, this processes selected 18 equidistant frames for each gesture each frame containing all finger tips 3D coordinates, at some point in time.
  • 9. Case Study – Neural Network Model  Multi-Layer Perceptron (MLP), with back-propagation learning, sigmoidal function activation and Sum Squared Error (SSE) to measure the learning performance.  MLP Architecture:  Input Layer (18 relative fingertips and hand 3D coords x 18 sampled frames for each gesture = 324 neurons)  Hidden Layer (tested from 50 to 400, stepped by 50)  Output Layer (26 neurons – the alphabet letters)
  • 10. Case Study – Results Analysis  Dataset split: Training (64%), Validation (18%), and test (18%) sets  The best MLP Network configuration correctly classified 61.54% (32/52) of unseen data during the network training (test set). Its architecture was composed of 200 neurons on the hidden layer.  Alphabet test sets classification results
  • 11. Limitations  Difficulty was to find people capable of accurately recording LIBRAS gestures, especially deaf people and teachers  The chosen 3D sensor presented the following limitations  An instability of finger tracking on experimental library (Intel® Skeletal Hand Tracking Library)  The variability of frames per seconds rate may have introduced a bias when comparing samples, because of the temporality of gestures
  • 12. Conclusion  Initial investigation of LIBRAS recognition, using 3D sensor data and Multi-Layer Perceptron.  Six LIBRAS teachers, students, and deaf people have volunteered for recording the alphabet gestures.  Some strategies were developed for pre-processing gestures data , to deal with temporality of gestures, normalizing coordinates, and geometric transformations (rotation and scaling).  The best model correctly classified 61.53% patterns of the test set.  MLP network was not as effective as expected when trained with few samples and operating over noisy gestures data
  • 13. Recommendations and Sugestions  Improvements in classification models can be evaluated using more representative samples of gestures data.  As a natural continuation of this research, still on its beginning phase, the authors intend to explore other 3D sensors, pre-processing approaches, and learning models, like Support Vector Machines and Hidden Markov Models.
  • 14. Thank you! Gabriel S. P Moreira - Gustavo R. Matuck - Osamu Saotome - Adilson M. da Cunha ITA – Brazilian Aeronautics Institute of Technology gspmoreira@gmail.com

Hinweis der Redaktion

  1. The Brazilian 2010 Census depicted 1,799,885 citizens in the country having great difficulty to hear. Signs languages are the natural languages of deaf communities. The Brazilian Signs Language (Linguagem Brasileira de Sinais - LIBRAS) is the sign language used by the majority of the deaf people in the Brazilian urban regions, and was standardized and recognized as the second official language by the Brazilian courts, on laws 10.436 (2002) and 5.626 (2005) [Decree 2005]. For many deaf-and-dumb people in Brazil, LIBRAS is the only way they can communicate. But, unfortunately, very few hearing people are knowledgeable and skilled on LIBRAS. That leads difficulties for deaf people to be assisted on hospitals, schools and universities, and government services. This paper describes an investigation about the recognition of LIBRAS alphabet signs gestures, what would allow the spelling of Portuguese words. For the recognition, it was used a 3D sensor to capture hand and fingers coordinates, some pre-processing strategies, and neural networks to classify gestures. Section 2 presents a brief literature research. Section 3 describes the conducted investigation, including data collection, pre-processing, learning models training and result analysis, followed by limitations and conclusions, in sections 4 and 5.
  2. Many attempts have been made to recognize signs languages gestures, generally represented by hands postures and gestures, and translate them to spoken languages letters, words, and expressions. The two major classes of hand tracking systems are: (1) data-glove-based, which requires electromechanical gloves with sensors; and (2) vision-based, which works with continuous image frames [Pistori and Neto 2004], usually regular RGB cameras or 3D sensors (like Microsoft Kinect [Kulshreshth et al. 2005]). For recognition models, some studies have been using machine-learning techniques, mostly Neural Networks and Hidden Markov Models (HMM). In [Caputo et al. 2012], it is presented a 3D hand and gesture recognition system using Microsoft Kinect sensor, for dynamic gestures, and HD color sensor, for static gestures recognition. In [Kulshreshth et al. 2013], a real-time finger tracking technique was developed, using Kinect sensor signals, based on feature vectors calculation, using Fourier descriptors of equidistant points. Researches addressing the Brazilian Signs Language have been conducted in the last years. In [Anjo et al. 2012], it is presented a real-time system to recognize a set of Libras alphabet (only static gestures), using MS Kinect and neural networks. In [Souza et al. 2007], authors use HMM to recognize 47 Libras gestures types, captured with regular RGB cameras. In this article, we report the usage of vision based finger tracking, using Creative Senz3D GestureCam™ [Creative 2013], a set of pre-processing approaches, and neural network models to recognize all letters from Brazilian Signs Language (LIBRAS) alphabet, represented by static and dynamic gestures.
  3. The LIBRAS alphabet letters are represented by 20 (twenty) static gestures and 6 (six) dynamic gestures (H, J, K, X, Y, Z) executed with movement. All alphabet signs are made with just one hand. This investigation considers only one-hand's data for recording and recognition.
  4. Six (6) people volunteered to record LIBRAS alphabet signs, divided in three groups: deaf people (2), LIBRAS teachers (2), and students (2). For all of them, it was recorded two samples of all alphabets, except for one person, which had only one recorded sample. That resulted in 11 complete alphabet samples (268 letter gestures). The LIBRAS signs recognition process stages are Data Acquisition, Pre-processing, Models Training, and Results Analysis, as shown in Figure 1, as described in the following sections.
  5. In this stage, the gestures were recorded by using the COTS 3D sensor Creative Senz3D GestureCam™ [Creative 2013], and the Intel® Skeletal Hand Tracking Library (Experimental Release) [Intel 2013] for finger tracking, based on tracking approaches presented in [Melax et al. 2013]. In each frame, the library provided the full 6 Degrees Of Freedom (6-DOF) position and the orientation of 17 bones of a hand. The number of frames per second (fps) rate varied from 20-30 during recording. In this investigation, for each frame, it was recorded the absolute 3D coordinates (X, Y, and Z) of 5 fingertips and the center of the hand palm, based on the sensor distance, That resulted in 18 attributes with continuous position of hands and fingers.
  6. Normalization to Hand Relative Coordinates – Absolute fingertips coordinates converted to relative coordinates, in relation to hand’s center; Training Samples Transformations - There were only 11 samples for each alphabet letter. Therefore, a strategy was created and implemented to generate new samples based on training samples, by rotation and scaling geometric transformations; and Frames Sampling – where, for each gesture, it was recorded a different number of frames, because gesture time length varies, depending on alphabet letter and person style. The used sensors frames per second (fps) rate also varied, ranging from 20 to 30 fps. So, as the input for neural network model was fixed, it was necessary to normalize the input size, sampling a fixed number of frames for each gesture. This process selected 18 equidistant frames for each gesture  , each frame containing all finger tips 3D coordinates, at some point in time.
  7. Normalization to Hand Relative Coordinates – Absolute fingertips coordinates converted to relative coordinates, in relation to hand’s center; Training Samples Transformations - There were only 11 samples for each alphabet letter. Therefore, a strategy was created and implemented to generate new samples based on training samples, by rotation and scaling geometric transformations; and Frames Sampling – where, for each gesture, it was recorded a different number of frames, because gesture time length varies, depending on alphabet letter and person style. The used sensors frames per second (fps) rate also varied, ranging from 20 to 30 fps. So, as the input for neural network model was fixed, it was necessary to normalize the input size, sampling a fixed number of frames for each gesture. This process selected 18 equidistant frames for each gesture  , each frame containing all finger tips 3D coordinates, at some point in time.
  8. The neural network used in this investigation was the traditional Multi-Layer Perceptron (MLP), with the back-propagation learning algorithm [Montini 2013]. All neurons used in the hidden and output layers had sigmoidal function activation. The Sum Squared Error (SSE) parameter was used to measure the learning performance.   On this investigation, the MLP Network was composed by the input layer (where the processed data sensor were applied), one hidden layer (tested with 50 to 400 neurons, stepped by 50), and the output layer (with 26 neurons - the alphabet letters). Mathematically, only one hidden layer of neurons is enough to perform nonlinear mapping. In order to provide a better way to train the neural network, a cross-validation process and a momentum parameter were applied. All data obtained from gestures signals were divided into training (64%), validation   (18%), and test sets (18%). All range data used by the MLP Network is between [0,1]. Several simulations were performed   with different MLP Neural Networks configurations.  Mathematically, only one hidden layer of neurons is enough to perform nonlinear mapping The stimulus between the neurons and the input data is measured by equation (a). By the neuron output signal (Y) showed in equation (b), the sigmoidal function was applied on the hidden and output neurons. The network error and the weights adaptation were measured by equations (c) and (d), respectively.
  9. In order to provide a better way to train the neural network, a cross-validation process and a momentum parameter were applied. All data obtained from gestures signals were divided into training (64%), validation   (18%), and test sets (18%). All range data used by the MLP Network is between [0,1]. Several simulations were performed   with different MLP Neural Networks configurations.  Mathematically, only one hidden layer of neurons is enough to perform nonlinear mapping After many simulations , the best MLP Network configuration correctly classified 61.54%   (32/52) of unseen data during the network training (test set). Its architecture was composed of 200 neurons on the hidden layer. This MLP Network training process was conducted for 1,000 epochs, after then the learning model started overfitting the sample data. Learning rate chosen by authors started with the value of 0.2  , and decreased as epochs. The momentum parameter was used to improve the learning performance, avoiding local minima in the error surface. Table 1 shows the classification results for the test set (two samples of each letter), presenting the MLP Neural Network difficulties to recognize some letters like “f”, “k”, “t”, “x”, and “z”. On the other hand, the neural model correctly classified 10 letters with 100% of success (“a”, “c”, “d”, “j”, “l”, “m”, “o”, “u”, “v”, and “w”).  
  10. In this investigation, an initial research was described for recognizing the Brazilian signs language LIBRAS (LInguagem BRAsileira de Sinais) alphabet using Neural Networks over visual 3D sensor data. Six LIBRAS teachers, students, and deaf people have volunteered for recording the alphabet gestures. Some strategies were developed for pre-processing gestures data  , to deal with temporality of gestures (frames sampling), normalizing coordinates, and increasing training samples, using geometric transformations (rotation and scaling). Neural networks were assessed for gestures recognition, with different settings, training epochs, learning rates, and momentum  factor. The best model correctly classified 61.53% patterns of the test set. Analyzing the results from this investigation, it can be observed that the MLP network was not as effective as expected when operating over noisy gestures data. Improvements in recognition models can be evaluated using more representative samples of gestures data. As a natural continuation of this research, still on its beginning phase, the authors intend to explore other 3D sensors, pre-processing approaches, and learning models, like Support Vector Machines and Hidden Markov Models.  R1 - What kind of strategies were developed? The authors commented about fixing 18 frames but they did not justify this choice.  Já está escrito ao longo da frase a estratégia de pré-processamento dos dados.....  R1 - You did not comment anything about momentum factor
  11. Improvements in recognition models can be evaluated using more representative samples of gestures data. As a natural continuation of this research, still on its beginning phase, the authors intend to explore other 3D sensors, pre-processing approaches, and learning models, like Support Vector Machines and Hidden Markov Models.