SlideShare ist ein Scribd-Unternehmen logo
1 von 11
Gaze-Net: Appearance-Based Gaze
Estimation using Capsule Networks
Bhanuka Mahanama(@mahanama94)
Yasith Jayawardana (@yasithmilinda)
Sampath Jayarathna (@openmaze)
Department of Computer Science
Old Dominion University
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Outline
● Introduction
● Related work
● Approach
● Proposed Architecture
● Experiments and Results
● Conclusion
2/11
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Introduction
● Gaze Estimation Applications
○ Physiological studies
○ Human-computer interaction
● Modern methods
○ Convolution Neural Networks
○ Facial Region
○ Ocular Region
3/11
Appearance based-multi user eye tracking
(https://mgaze.nirds.cs.odu.edu/)
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Related Work
● Estimation methods
○ Fixed head-pose - early methods (Sewell et al.[2010])
○ Variable head pose
■ Explicit pose data (Zhang et al.[2015])
■ Implicit pose (Zhang et al.[2016], Krafka et al.[2017])
● Training methods
○ Data driven (Zhang et al.[2015, 2016])
○ User specific (Kassner et al. [2014], Huang et al.[2014], Papoutsaki et al.[2016])
4/11
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Approach
● Two-step approach
○ Classify
○ Estimate
● Classification
○ Convolution NN
○ Capsule Network
● Estimation
○ Fully connected
● Regularization
○ Reconstruction
○ Estimation error
5/11
Left Top Middle Top Right Top
Left Bottom Middle Bottom Right Bottom
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Training and Testing
● Training
○ MPIIGaze dataset (200,000+ images)
■ https://arxiv.org/abs/1711.09017
● Testing
○ MPIIGaze dataset
○ Columbia Gaze dataset (~5000 images)
6/11
MPIIGaze Dataset: Raw images
(https://www.mpi-inf.mpg.de/)
MPIIGaze Dataset: Processed
images
(https://www.mpi-inf.mpg.de/)
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Experiments
● Metrics
○ Accuracy - Gaze categorization
○ Mean Absolute Error - Gaze estimation
● Experiment conditions
○ No regularization
○ Gaze estimation regularization
○ Image Reconstruction
○ Estimation + Reconstruction
7/11
Accuracy MAE (Estimation)
No Regularization 67.15 -
Image
Reconstruction
65.97 -
Gaze Error 63.98 2.88
Gaze Error +
Reconstruction
62.67 2.84
Figure 2: Comparison of MPIIGaze image
reconstructionwith the original images.Œ
top row shows the reconstructed images,
and the bottŠom row shows the original
images.
Table 1: Classi€cation Accuracy (ACC) and Mean
AbsoluteError (MAE) of Gaze Estimation for each
Regularizationmethod.
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Transfer Learning
● Transfer Learning
○ Knowledge from one problem on another
● Dataset
○ Columbia Gaze Dataset
○ Ocular region extracted using PoseNet
■ PoseNet: Real-time pose estimation model
■ https://github.com/tensorflow/tfjs-
models/tree/master/posenet
○ Per participant experiments
8/11
Processed images from Columbia Gaze
Dataset
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Transfer Learning - Experiments
● Conditions
○ No retraining
○ Retraining estimation
network
9/11
MAE
(Estimation)
No Retraining 10.04
Retraining Estimation
Network
5.92
Table 2: Mean Absolute Error (MAE) of
gaze estimation be-fore and a…er training
on Columbia Gaze Dataset.
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Discussion
● Gaze estimation with ocular
images
○ Decoding head pose
○ Decoding eye rotation
● Transfer learning for
personalizing
○ Generalized model from
larger dataset
○ Personalized from a smaller
dataset
10/11
Figure 3: Dimension perturbations.Each row shows the
reconstruction when one of the 16 dimensions in the
GazeCaps output is tweaked by intervals of 0.125 in the
range[−0.25,0.25]
Gaze-Net: Appearance Based Gaze Estimation@NirdsLab
Questions?
● Ocular images are sufficient for
○ Decoding facial orientation
○ Eye rotation
○ Estimating gaze
● Transfer learning
○ Better performing personalized models
● More info
○ MGaze: https://mgaze.nirds.cs.odu.edu/
○ Research Group: @NirdsLab
○ Homepage: https://www.cs.odu.edu/~bhanuka/
○ Twitter: @mahanama94, @yasithmilinda, @openmaze
11/11

Weitere ähnliche Inhalte

Was ist angesagt?

Ultrafast Optical signal processing
Ultrafast Optical signal processingUltrafast Optical signal processing
Ultrafast Optical signal processingHossein Babashah
 
J. Park, AAAI 2022, MLILAB, KAIST AI
J. Park, AAAI 2022, MLILAB, KAIST AIJ. Park, AAAI 2022, MLILAB, KAIST AI
J. Park, AAAI 2022, MLILAB, KAIST AIMLILAB
 
Shallow Dense Network for Effective Image Classification
Shallow Dense Network for Effective Image ClassificationShallow Dense Network for Effective Image Classification
Shallow Dense Network for Effective Image ClassificationA. Hasib Uddin
 
Numerical Integral using NNI
Numerical Integral using NNINumerical Integral using NNI
Numerical Integral using NNIFahmeen Mazhar
 
Feature disentanglement in generating a three dimensional structure from a tw...
Feature disentanglement in generating a three dimensional structure from a tw...Feature disentanglement in generating a three dimensional structure from a tw...
Feature disentanglement in generating a three dimensional structure from a tw...Chung Hyung Jin
 
CenterForDomainSpecificComputing-Poster
CenterForDomainSpecificComputing-PosterCenterForDomainSpecificComputing-Poster
CenterForDomainSpecificComputing-PosterYunming Zhang
 
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHMJOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHMmailjkb
 
Matlab reversible watermarking based on invariant image classification and d...
Matlab  reversible watermarking based on invariant image classification and d...Matlab  reversible watermarking based on invariant image classification and d...
Matlab reversible watermarking based on invariant image classification and d...Ecway Technologies
 

Was ist angesagt? (8)

Ultrafast Optical signal processing
Ultrafast Optical signal processingUltrafast Optical signal processing
Ultrafast Optical signal processing
 
J. Park, AAAI 2022, MLILAB, KAIST AI
J. Park, AAAI 2022, MLILAB, KAIST AIJ. Park, AAAI 2022, MLILAB, KAIST AI
J. Park, AAAI 2022, MLILAB, KAIST AI
 
Shallow Dense Network for Effective Image Classification
Shallow Dense Network for Effective Image ClassificationShallow Dense Network for Effective Image Classification
Shallow Dense Network for Effective Image Classification
 
Numerical Integral using NNI
Numerical Integral using NNINumerical Integral using NNI
Numerical Integral using NNI
 
Feature disentanglement in generating a three dimensional structure from a tw...
Feature disentanglement in generating a three dimensional structure from a tw...Feature disentanglement in generating a three dimensional structure from a tw...
Feature disentanglement in generating a three dimensional structure from a tw...
 
CenterForDomainSpecificComputing-Poster
CenterForDomainSpecificComputing-PosterCenterForDomainSpecificComputing-Poster
CenterForDomainSpecificComputing-Poster
 
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHMJOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
 
Matlab reversible watermarking based on invariant image classification and d...
Matlab  reversible watermarking based on invariant image classification and d...Matlab  reversible watermarking based on invariant image classification and d...
Matlab reversible watermarking based on invariant image classification and d...
 

Ähnlich wie Gaze-Net: Appearance-Based Gaze Estimation using CapsuleNetworks

Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative AttributesVikas Jain
 
Neural networks for semantic gaze analysis in xr settings
Neural networks for semantic gaze analysis in xr settingsNeural networks for semantic gaze analysis in xr settings
Neural networks for semantic gaze analysis in xr settingsJaey Jeong
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Pedro Lopes
 
Prepare for the final thesis presentation
Prepare for the final thesis presentationPrepare for the final thesis presentation
Prepare for the final thesis presentationnaoki0625
 
Supervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking systemSupervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking systemMarsan Ma
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisNaeem Shehzad
 
Unsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimationUnsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimationJaey Jeong
 
Parallel Computing Application
Parallel Computing ApplicationParallel Computing Application
Parallel Computing Applicationhanis salwan
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...Jinwon Lee
 
Identifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep LearningIdentifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep LearningSoumyadeep Debnath
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Predictionmiyurud
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsScott Clark
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsSigOpt
 
Deep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural NetworksDeep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural NetworksSangwoo Mo
 
Web Traffic Time Series Forecasting
Web Traffic  Time Series ForecastingWeb Traffic  Time Series Forecasting
Web Traffic Time Series ForecastingBillTubbs
 
Ai based glaucoma detection using deep learning
Ai based glaucoma detection using deep learningAi based glaucoma detection using deep learning
Ai based glaucoma detection using deep learningjaijoy6
 
PAISS (PRAIRIE AI Summer School) Digest July 2018
PAISS (PRAIRIE AI Summer School) Digest July 2018 PAISS (PRAIRIE AI Summer School) Digest July 2018
PAISS (PRAIRIE AI Summer School) Digest July 2018 Natalia Díaz Rodríguez
 
K-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective BackgroundK-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective BackgroundIJCSIS Research Publications
 

Ähnlich wie Gaze-Net: Appearance-Based Gaze Estimation using CapsuleNetworks (20)

Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
 
Learning with Relative Attributes
Learning with Relative AttributesLearning with Relative Attributes
Learning with Relative Attributes
 
Neural networks for semantic gaze analysis in xr settings
Neural networks for semantic gaze analysis in xr settingsNeural networks for semantic gaze analysis in xr settings
Neural networks for semantic gaze analysis in xr settings
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013
 
Prepare for the final thesis presentation
Prepare for the final thesis presentationPrepare for the final thesis presentation
Prepare for the final thesis presentation
 
Supervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking systemSupervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking system
 
neuralAC
neuralACneuralAC
neuralAC
 
Decomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesisDecomposing image generation into layout priction and conditional synthesis
Decomposing image generation into layout priction and conditional synthesis
 
Unsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimationUnsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimation
 
Parallel Computing Application
Parallel Computing ApplicationParallel Computing Application
Parallel Computing Application
 
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
PR-330: How To Train Your ViT? Data, Augmentation, and Regularization in Visi...
 
Identifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep LearningIdentifying Land Patterns from Satellite Images using Deep Learning
Identifying Land Patterns from Satellite Images using Deep Learning
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning Models
 
Using Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning ModelsUsing Bayesian Optimization to Tune Machine Learning Models
Using Bayesian Optimization to Tune Machine Learning Models
 
Deep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural NetworksDeep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural Networks
 
Web Traffic Time Series Forecasting
Web Traffic  Time Series ForecastingWeb Traffic  Time Series Forecasting
Web Traffic Time Series Forecasting
 
Ai based glaucoma detection using deep learning
Ai based glaucoma detection using deep learningAi based glaucoma detection using deep learning
Ai based glaucoma detection using deep learning
 
PAISS (PRAIRIE AI Summer School) Digest July 2018
PAISS (PRAIRIE AI Summer School) Digest July 2018 PAISS (PRAIRIE AI Summer School) Digest July 2018
PAISS (PRAIRIE AI Summer School) Digest July 2018
 
K-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective BackgroundK-Means Clustering in Moving Objects Extraction with Selective Background
K-Means Clustering in Moving Objects Extraction with Selective Background
 

Kürzlich hochgeladen

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Kürzlich hochgeladen (20)

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Gaze-Net: Appearance-Based Gaze Estimation using CapsuleNetworks

  • 1. Gaze-Net: Appearance-Based Gaze Estimation using Capsule Networks Bhanuka Mahanama(@mahanama94) Yasith Jayawardana (@yasithmilinda) Sampath Jayarathna (@openmaze) Department of Computer Science Old Dominion University
  • 2. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Outline ● Introduction ● Related work ● Approach ● Proposed Architecture ● Experiments and Results ● Conclusion 2/11
  • 3. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Introduction ● Gaze Estimation Applications ○ Physiological studies ○ Human-computer interaction ● Modern methods ○ Convolution Neural Networks ○ Facial Region ○ Ocular Region 3/11 Appearance based-multi user eye tracking (https://mgaze.nirds.cs.odu.edu/)
  • 4. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Related Work ● Estimation methods ○ Fixed head-pose - early methods (Sewell et al.[2010]) ○ Variable head pose ■ Explicit pose data (Zhang et al.[2015]) ■ Implicit pose (Zhang et al.[2016], Krafka et al.[2017]) ● Training methods ○ Data driven (Zhang et al.[2015, 2016]) ○ User specific (Kassner et al. [2014], Huang et al.[2014], Papoutsaki et al.[2016]) 4/11
  • 5. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Approach ● Two-step approach ○ Classify ○ Estimate ● Classification ○ Convolution NN ○ Capsule Network ● Estimation ○ Fully connected ● Regularization ○ Reconstruction ○ Estimation error 5/11 Left Top Middle Top Right Top Left Bottom Middle Bottom Right Bottom
  • 6. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Training and Testing ● Training ○ MPIIGaze dataset (200,000+ images) ■ https://arxiv.org/abs/1711.09017 ● Testing ○ MPIIGaze dataset ○ Columbia Gaze dataset (~5000 images) 6/11 MPIIGaze Dataset: Raw images (https://www.mpi-inf.mpg.de/) MPIIGaze Dataset: Processed images (https://www.mpi-inf.mpg.de/)
  • 7. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Experiments ● Metrics ○ Accuracy - Gaze categorization ○ Mean Absolute Error - Gaze estimation ● Experiment conditions ○ No regularization ○ Gaze estimation regularization ○ Image Reconstruction ○ Estimation + Reconstruction 7/11 Accuracy MAE (Estimation) No Regularization 67.15 - Image Reconstruction 65.97 - Gaze Error 63.98 2.88 Gaze Error + Reconstruction 62.67 2.84 Figure 2: Comparison of MPIIGaze image reconstructionwith the original images.Œ top row shows the reconstructed images, and the bottŠom row shows the original images. Table 1: Classi€cation Accuracy (ACC) and Mean AbsoluteError (MAE) of Gaze Estimation for each Regularizationmethod.
  • 8. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Transfer Learning ● Transfer Learning ○ Knowledge from one problem on another ● Dataset ○ Columbia Gaze Dataset ○ Ocular region extracted using PoseNet ■ PoseNet: Real-time pose estimation model ■ https://github.com/tensorflow/tfjs- models/tree/master/posenet ○ Per participant experiments 8/11 Processed images from Columbia Gaze Dataset
  • 9. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Transfer Learning - Experiments ● Conditions ○ No retraining ○ Retraining estimation network 9/11 MAE (Estimation) No Retraining 10.04 Retraining Estimation Network 5.92 Table 2: Mean Absolute Error (MAE) of gaze estimation be-fore and a…er training on Columbia Gaze Dataset.
  • 10. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Discussion ● Gaze estimation with ocular images ○ Decoding head pose ○ Decoding eye rotation ● Transfer learning for personalizing ○ Generalized model from larger dataset ○ Personalized from a smaller dataset 10/11 Figure 3: Dimension perturbations.Each row shows the reconstruction when one of the 16 dimensions in the GazeCaps output is tweaked by intervals of 0.125 in the range[−0.25,0.25]
  • 11. Gaze-Net: Appearance Based Gaze Estimation@NirdsLab Questions? ● Ocular images are sufficient for ○ Decoding facial orientation ○ Eye rotation ○ Estimating gaze ● Transfer learning ○ Better performing personalized models ● More info ○ MGaze: https://mgaze.nirds.cs.odu.edu/ ○ Research Group: @NirdsLab ○ Homepage: https://www.cs.odu.edu/~bhanuka/ ○ Twitter: @mahanama94, @yasithmilinda, @openmaze 11/11