Bio-inspired Active Vision
Martin Peniak, Davide Marocco, Ron Babich and John Tran
Outline

• Introduction
• Classical vision vs. active vision
    Show a few examples, e.g. template matching, face detection, etc.
    Pros & cons
    Explain what active perception is (use current + Davide's slides)
    Active vision: many possibilities, but we like GA+NN. Why?
        Based on a neural network and a genetic algorithm
        What is a neural network?
        What is a genetic algorithm?
    Pros & cons
        Little computation required thanks to the neural network architecture; fast
        No external representation needed, since the GA evolves the NN weights
        Generality, invariance, and application in multiple domains
        Cons: more research needed; so far applied in limited domains
• Previous research by Floreano, Davide, me (ESA)
    Show videos and pictures
• Describe work done at NVIDIA
• Future work: AXA grant
GPU Computing Lab
Traditional Computer Vision




Traditional Computer Vision
“Teaching a computer to classify objects has proved much harder than was originally anticipated”
Thomas Serre - Center for Biological and Computational Learning at MIT



    Specific template or computational
    representation is required to allow object
    recognition

    Must be flexible enough to account for all
    kinds of variations




Biological Vision
“Researchers have been interested for years in trying to copy biological vision systems,
simply because they are so good” ~ David Hogg - computer vision expert at Leeds University, UK


   Highly optimized over millions of years of
   evolution, developing complex neural structures
   to represent and process stimuli

   Superiority of biological vision systems is only
   partially understood

   Hardware architecture and the style of
   computation in nervous systems are
   fundamentally different
Biological Vision




Seeing is a way of acting




Active Vision

• Inspired by the vision systems of natural organisms, which have
  been evolving for millions of years

• In contrast to standard computer vision systems, biological
  organisms actively interact with the world in order to make sense
  of it

• Humans and other animals do not view a scene with a fixed,
  steady gaze. Instead, they actively explore interesting parts of the
  scene through rapid saccadic eye movements
Creating Active Vision Systems
Evolutionary Robotics Approach



Evolutionary Robotics

• New technique for the automatic creation of autonomous robots

• Inspired by the Darwinian principle of selective reproduction of
  the fittest

• Views robots as autonomous artificial organisms that develop
  their own skills in close interaction with the environment and
  without human intervention

• Drawing heavily on biology and ethology, it uses the tools of
  neural networks, genetic algorithms, dynamic systems, and
  biomorphic engineering
Genetic algorithms (GAs) are adaptive heuristic search algorithms premised on the evolutionary ideas of natural selection and genetics. GAs are designed to simulate the processes in natural systems that are necessary for evolution.

[Diagram: GA cycle with the stages Population (Chromosomes), Evaluation (Fitness), Selection (Mating Pool), and Genetic operators]

Artificial neural networks (ANNs) are very powerful brain-inspired computational models, which have been used in many different areas such as engineering, medicine, finance, and many others.
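
The GA stages named on this slide (population, evaluation, selection, genetic operators) can be sketched as a short loop. This is a toy illustration only, not the implementation used in this work: the `closeness` fitness function, the half-population mating pool, the one-point crossover, and the Gaussian mutation step are all assumptions chosen for brevity.

```python
import random


def evolve(fitness, genome_len, pop_size=20, generations=50,
           mutation_rate=0.1, seed=0):
    """Toy genetic algorithm: evaluate, select, recombine, mutate."""
    rng = random.Random(seed)
    # Population: each chromosome is a list of real-valued genes.
    population = [[rng.uniform(-1, 1) for _ in range(genome_len)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Evaluation: rank chromosomes by fitness, best first.
        ranked = sorted(population, key=fitness, reverse=True)
        # Selection: the better half forms the mating pool (assumption).
        pool = ranked[:pop_size // 2]
        # Genetic operators: one-point crossover, then Gaussian mutation.
        children = []
        while len(children) < pop_size:
            a, b = rng.sample(pool, 2)
            cut = rng.randrange(1, genome_len)
            child = a[:cut] + b[cut:]
            child = [g + rng.gauss(0, 0.1) if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        population = children
    return max(population, key=fitness)


def closeness(genome):
    """Hypothetical fitness: highest when every gene equals 0.5."""
    return -sum((g - 0.5) ** 2 for g in genome)


best = evolve(closeness, genome_len=4)
```

In an evolved neural controller, the genes would encode the network's connection weights and the fitness would come from the network's behavior rather than from a closed-form function.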
Related Research
Mars Rover obstacle avoidance (Peniak et al.)




Related Research
Koala robot obstacle avoidance (Marocco et al.)




Related Research
Autonomous driving car (Floreano et al.)




Related Research
Object recognition (Floreano et al.)




Going Further
Designing active vision system for real-world object recognition



Task

Design active vision system that can learn to recognize the
following objects




Method

• Evolution of the active vision system for real-world object recognition
      training the system in parallel on multiple objects viewed from many different angles and under different lighting conditions

• Amsterdam Library of Object Images (ALOI)
      provides a color image collection of one thousand small objects
      recorded for scientific purposes
      systematically varied viewing angle, illumination angle, and illumination color

• Active Vision Training
      trained on a set of objects from the ALOI library
      each genotype is evaluated over multiple trials with randomly rotated objects under varying lighting conditions
      evolutionary pressure is provided by a fitness function that evaluates the overall success or failure of the object classification
      trained on an increasingly large number of objects

• Active Vision Testing
      robustness and resilience of recognition on the dataset
      generalization to previously unseen instances of the learned objects




Experimental Setup

• Recurrent Neural Network
    Inputs: 8x8 neurons for the retina, 2 neurons for proprioception (x, y position)
    No hidden neurons
    Outputs: 5 object recognition neurons, 2 neurons to move the retina (16 px max)
• Genetic Algorithm
    Generations: 10000
    Number of individuals: 100
    Number of trials: 36 + 16 (object rotations + varying lighting conditions)
    Mutation probability: 10%
    Reproduction: the best 20% of individuals create the new population
    Elitism used (the best individual is preserved)
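
One way to read the reproduction settings above is as truncation selection with elitism. The sketch below is a hedged interpretation, not the authors' code: the slide does not say whether the 10% mutation probability applies per gene, how large a mutation is, or how many weights a genome holds, so `MUTATION_SIGMA`, `GENOME_LEN`, the per-gene reading of the mutation probability, and the placeholder fitness are assumptions.

```python
import random

# Values taken from the slide.
POP_SIZE = 100            # number of individuals
PARENT_FRACTION = 0.20    # best 20% create the new population
MUTATION_PROB = 0.10      # mutation probability (applied per gene here: assumption)
N_TRIALS = 36 + 16        # object rotations + lighting conditions

# Assumptions, not given in the slides.
MUTATION_SIGMA = 0.1      # hypothetical mutation step size
GENOME_LEN = 10           # hypothetical; the real weight count is not stated


def next_generation(population, fitnesses, rng):
    """Build a new population from the best 20%, keeping the best individual."""
    order = sorted(range(len(population)),
                   key=lambda i: fitnesses[i], reverse=True)
    n_parents = max(1, int(len(population) * PARENT_FRACTION))
    parents = [population[i] for i in order[:n_parents]]

    new_population = [list(parents[0])]   # elitism: unmutated copy of the best
    while len(new_population) < len(population):
        parent = rng.choice(parents)
        child = [g + rng.gauss(0.0, MUTATION_SIGMA)
                 if rng.random() < MUTATION_PROB else g
                 for g in parent]
        new_population.append(child)
    return new_population


rng = random.Random(1)
population = [[rng.uniform(-1, 1) for _ in range(GENOME_LEN)]
              for _ in range(POP_SIZE)]
fitnesses = [sum(genome) for genome in population]   # placeholder fitness
new_population = next_generation(population, fitnesses, rng)
```

In the real setup, each entry of `fitnesses` would be an average over the 52 trials rather than a sum of genes.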


Experimental Setup

• Each individual (neural network) could freely move the retina and
  read input from the source image (128x128) for 20 steps

• At each step, the neural network controlled the behavior of the
  system (retina position) and provided a recognition output

• The recognition output neuron with the highest activation was
  taken as the network’s guess about what the object was
    Fitness function = number of correct answers / total number of steps
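
A single trial as described above can be sketched as follows. Several details are assumptions rather than facts from the slides: the retina is modelled as a contiguous 8x8 pixel patch, it starts at the image centre, activations use `tanh`, a bias input is added, and the recurrent connections are omitted because their wiring is not specified.

```python
import math
import random

IMG, RETINA, MAX_MOVE, STEPS, N_CLASSES = 128, 8, 16, 20, 5


def run_trial(weights, image, label):
    """Return fitness = correct guesses / total steps for one trial."""
    x = y = (IMG - RETINA) // 2          # start at the centre (assumption)
    correct = 0
    for _ in range(STEPS):
        # 64 retina inputs + 2 proprioceptive inputs + 1 bias input.
        patch = [image[y + r][x + c]
                 for r in range(RETINA) for c in range(RETINA)]
        inputs = patch + [x / (IMG - RETINA), y / (IMG - RETINA), 1.0]
        outputs = [math.tanh(sum(w * v for w, v in zip(row, inputs)))
                   for row in weights]
        # The first 5 outputs vote on the object; the highest activation wins.
        guess = max(range(N_CLASSES), key=lambda k: outputs[k])
        correct += (guess == label)
        # The last 2 outputs move the retina by at most 16 px per axis,
        # clamped so the patch stays inside the image.
        x = min(max(x + round(outputs[5] * MAX_MOVE), 0), IMG - RETINA)
        y = min(max(y + round(outputs[6] * MAX_MOVE), 0), IMG - RETINA)
    return correct / STEPS


rng = random.Random(0)
image = [[rng.random() for _ in range(IMG)] for _ in range(IMG)]
weights = [[rng.uniform(-1, 1) for _ in range(RETINA * RETINA + 3)]
           for _ in range(N_CLASSES + 2)]
fitness = run_trial(weights, image, label=2)
```

Because every step counts toward the score, a network that needs a few steps to settle on the right answer scores below 1.0 even if its final guess is correct.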


GPU Accelerating GA and ANN

• GPUs were used to accelerate:
    Evolutionary process – parallel execution of trials
    Neural Network – parallel calculation of neural activities
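
Trials can run in parallel because each one depends only on the genotype and its own object view. The sketch below shows that independence with a CPU thread pool as a stand-in; it is not GPU code, and the slides do not describe the actual kernel design. `toy_trial` is a hypothetical placeholder for a real trial.

```python
from concurrent.futures import ThreadPoolExecutor

N_TRIALS = 36 + 16  # trials per genotype, from the experimental setup


def evaluate_genotype(genotype, run_trial, n_trials=N_TRIALS, max_workers=4):
    """Mean fitness of one genotype over independent, concurrently run trials."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        scores = list(pool.map(lambda t: run_trial(genotype, t),
                               range(n_trials)))
    return sum(scores) / len(scores)


def toy_trial(genotype, trial_index):
    """Hypothetical stand-in for a real trial: succeeds on even-numbered trials."""
    return 1.0 if trial_index % 2 == 0 else 0.0


mean_fitness = evaluate_genotype([0.0] * 10, toy_trial)
```

On a GPU the same structure would map each trial, and each neuron's activation within a trial, to its own thread.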




Results

• Fitness cannot reach 1.0, since it takes a few time steps to recognize an object
• All objects are correctly classified by the end of each test

[Figure: best fitness and average fitness (y-axis: fitness, 0 to 0.9) plotted against generations (x-axis: 0 to 10000)]
Evolved Behavior




Future Work




"Imagination is the highest form of research"
                                 Albert Einstein




      Thank you!





Bio-inspired Active Vision System

  • 1. Bio-inspired Active Vision Martin Peniak, Davide Marocco, Ron Babich and John Tran
  • 2. Outline Introduction Classical vision vs. active vision Show few examples, e.g. template matching, face detection etc. Pros&Cons Explain what is active perception (use current + davides slides) Active Vision – many possibilities but we like GA+NN, why? Based on neural network and genetic algorithm What is neural network? What is genetic algorithm? Pros&Cons Little computation required due to neural network architecture, fast No external representation needed thanks to GA evolving nn weights Generality, invariance, and application in multiple domains Cons: need more research, so far applied in limited domains Previous research by Floreano, Davide, me (ESA) Show videos and pictures Describe work done at NVIDIA Future work: AXA grant 2
  • 5. Traditional Computer Vision “Teaching a computer to classify objects has proved much harder than was originally anticipated” Thomas Serre - Center for Biological and Computational Learning at MIT Specific template or computational representation is required to allow object recognition Must be flexible enough to account with all kinds of variations 5
  • 6. Biological Vision “Researchers have been interested for years in trying to copy biological vision systems, simply because they are so good” ~ David Hogg - computer vision expert at Leeds University, UK Highly optimized over millions of years of evolution, developing complex neural structures to represent and process stimuli Superiority of biological vision systems is only partially understood Hardware architecture and the style of computation in nervous systems are fundamentally different 6
  • 8. Seeing is a way of acting 8
  • 9. Active Vision Inspired by the vision systems of natural organisms that have been evolving for millions of years In contrast to standard computer vision systems, biological organisms actively interact with the world in order to make sense of it Humans and also other animals do not look at a scene in fixed steadiness. Instead, they actively explore interesting parts of the scene by rapid saccadic movements 9
  • 10. Creating Active Vision Systems: Evolutionary Robotics Approach
  • 11. Evolutionary Robotics. A new technique for the automatic creation of autonomous robots. Inspired by the Darwinian principle of selective reproduction of the fittest. Views robots as autonomous artificial organisms that develop their own skills in close interaction with the environment and without human intervention. Drawing heavily on biology and ethology, it uses the tools of neural networks, genetic algorithms, dynamic systems, and biomorphic engineering.
  • 12. Genetic algorithms (GAs) are adaptive heuristic search algorithms premised on the evolutionary ideas of natural selection and genetics. The basic concept of GAs is ... designed to simulate processes in natural systems necessary for evolution. [Diagram: GA cycle of Population (Chromosomes), Evaluation (Fitness), Selection (Mating Pool), and Genetic operators.] Artificial neural networks (ANNs) are very powerful brain-inspired computational models, which have been used in many different areas such as engineering, medicine, finance, and many others.
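The cycle named on this slide (population, evaluation, selection, genetic operators) can be sketched as a toy loop. This is a minimal illustration and not the authors' implementation: the bit-string chromosomes, the `evolve` function, its default parameters, and the "count the ones" fitness are all assumptions chosen to keep the example short.

```python
import random

def evolve(fitness, genome_len, pop_size=20, generations=30,
           mutation_prob=0.1, seed=0):
    """Toy generational GA: evaluate, select a mating pool, mutate."""
    rng = random.Random(seed)
    # Population: each chromosome is a list of bits (an assumed encoding).
    population = [[rng.randint(0, 1) for _ in range(genome_len)]
                  for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        # Evaluation: rank chromosomes by fitness, best first.
        ranked = sorted(population, key=fitness, reverse=True)
        history.append(fitness(ranked[0]))
        # Selection: the top fifth forms the mating pool.
        pool = ranked[:max(1, pop_size // 5)]
        # Elitism: the best chromosome is copied unchanged.
        children = [ranked[0][:]]
        # Genetic operator: copy a parent, flipping each bit with mutation_prob.
        while len(children) < pop_size:
            parent = rng.choice(pool)
            children.append([1 - g if rng.random() < mutation_prob else g
                             for g in parent])
        population = children
    return history

# Hypothetical fitness: the number of ones in the chromosome.
history = evolve(fitness=sum, genome_len=16)
```

Because the best chromosome is always carried over, the best fitness recorded in `history` never decreases from one generation to the next.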
  • 13. Related Research: Mars Rover obstacle avoidance (Peniak et al.)
  • 14. Related Research: Koala robot obstacle avoidance (Marocco et al.)
  • 15. Related Research: Autonomous driving car (Floreano et al.)
  • 16. Related Research: Object recognition (Floreano et al.)
  • 17. Going Further: Designing an active vision system for real-world object recognition
  • 18. Task: Design an active vision system that can learn to recognize the following objects
  • 19. Method. Evolution of the active vision system for real-world object recognition: training the system in a parallel manner on multiple objects viewed from many different angles and under different lighting conditions. Amsterdam Library of Object Images (ALOI): provides a color image collection of one thousand small objects recorded for scientific purposes, with systematically varied viewing angle, illumination angle, and illumination color. Active vision training: trained on a set of objects from the ALOI library; each genotype is evaluated during multiple trials with different randomly rotated objects and under varying lighting conditions; evolutionary pressure is provided by a fitness function that evaluates the overall success or failure of the object classification; trained on an increasingly larger number of objects. Active vision testing: robustness and resiliency of recognition on the dataset; generalization to previously unseen instances of the learned objects.
  • 20. Experimental Setup. Recurrent neural network. Inputs: 8x8 neurons for the retina, 2 neurons for proprioception (x, y position). No hidden neurons. Outputs: 5 object recognition neurons, 2 neurons to move the retina (16 px max). Genetic algorithm. Generations: 10000. Number of individuals: 100. Number of trials: 36+16 (object rotations + varying lighting conditions). Mutation probability: 10%. Reproduction: the best 20% of individuals create the new population. Elitism used (the best individual is preserved).
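To make the network dimensions concrete, here is a rough sketch of one update step. The layer sizes come from the slide; everything else is an assumption made for illustration: the recurrence is modelled as output-to-output connections, the activation is logistic, each output has a bias, and the genome is laid out as input weights, then recurrent weights, then biases. The names `step` and `GENOME_LEN` are hypothetical.

```python
import math
import random

N_RETINA = 8 * 8      # retina neurons (from the slide)
N_PROPRIO = 2         # x, y position of the retina (from the slide)
N_CLASSES = 5         # object recognition neurons (from the slide)
N_MOVE = 2            # retina movement neurons (from the slide)
N_IN = N_RETINA + N_PROPRIO
N_OUT = N_CLASSES + N_MOVE

# Assumed genome layout: input weights, recurrent weights, then biases.
GENOME_LEN = N_OUT * N_IN + N_OUT * N_OUT + N_OUT

def step(genome, retina, proprio, prev_out):
    """One network update; returns the new output activations in (0, 1)."""
    inputs = list(retina) + list(proprio)
    out = []
    for j in range(N_OUT):
        w_in = genome[j * N_IN:(j + 1) * N_IN]
        rec_start = N_OUT * N_IN + j * N_OUT
        w_rec = genome[rec_start:rec_start + N_OUT]
        bias = genome[N_OUT * N_IN + N_OUT * N_OUT + j]
        total = bias
        total += sum(w * x for w, x in zip(w_in, inputs))
        # Assumed recurrence: previous outputs feed back into every output.
        total += sum(w * y for w, y in zip(w_rec, prev_out))
        out.append(1.0 / (1.0 + math.exp(-total)))  # logistic (assumed)
    return out

rng = random.Random(0)
genome = [rng.uniform(-1.0, 1.0) for _ in range(GENOME_LEN)]
outputs = step(genome, [0.5] * N_RETINA, [0.0, 0.0], [0.0] * N_OUT)
# The recognition neuron with the highest activation is the guess.
guess = max(range(N_CLASSES), key=lambda k: outputs[k])
```

Under these assumptions the genome holds 7 × 66 + 7 × 7 + 7 = 518 real values, which is the vector the genetic algorithm would mutate.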
  • 21. Experimental Setup. Each individual (neural network) could freely move the retina and read the input from the source image (128x128) for 20 steps. At each step, the neural network controlled the behavior of the system (retina position) and provided a recognition output. The recognition output neuron with the highest activation was taken as the network’s guess about what the object was. Fitness function = number of correct answers / number of total steps.
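The trial described on this slide can be sketched as a short loop. The image size, retina size, step count, and movement limit are taken from the slides; the centred start position, the plain 8x8 crop used as retina input, and the `make_slow_controller` stand-in (which replaces the evolved network) are assumptions for illustration only.

```python
IMAGE_SIZE = 128   # source image is 128x128 (from the slide)
RETINA = 8         # 8x8 retina (from the slide)
STEPS = 20         # steps per trial (from the slide)
MAX_MOVE = 16      # maximum retina displacement per step, in pixels

def clamp(v, lo, hi):
    return max(lo, min(hi, v))

def run_trial(controller, image, label):
    """Return the fraction of steps on which the controller named the object."""
    x = y = (IMAGE_SIZE - RETINA) // 2   # assumed start: image centre
    correct = 0
    for _ in range(STEPS):
        # Assumed sampling: the retina reads a plain 8x8 crop of the image.
        patch = [row[x:x + RETINA] for row in image[y:y + RETINA]]
        guess, dx, dy = controller(patch, x, y)
        correct += int(guess == label)
        # Move the retina, limiting the step and keeping it inside the image.
        x = clamp(x + clamp(dx, -MAX_MOVE, MAX_MOVE), 0, IMAGE_SIZE - RETINA)
        y = clamp(y + clamp(dy, -MAX_MOVE, MAX_MOVE), 0, IMAGE_SIZE - RETINA)
    return correct / STEPS

def make_slow_controller(true_label, warmup):
    """Hypothetical controller: wrong for `warmup` steps, then always right."""
    state = {"t": 0}
    def controller(patch, x, y):
        state["t"] += 1
        guess = true_label if state["t"] > warmup else (true_label + 1) % 5
        return guess, 4, -4
    return controller

image = [[0] * IMAGE_SIZE for _ in range(IMAGE_SIZE)]
fitness = run_trial(make_slow_controller(true_label=2, warmup=2), image, label=2)
```

With two wrong answers before the controller settles, the trial scores 18 / 20 = 0.9. Any controller that needs a few steps of exploration before it can answer is therefore bounded below 1.0 by this fitness function.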
  • 22. GPU Accelerating GA and ANN. GPUs were used to accelerate: the evolutionary process (parallel execution of trials) and the neural network (parallel calculation of neural activities).
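The slides do not show any GPU code, so the following is only a CPU-side sketch of why the trials parallelise well: every (individual, trial) pair can be evaluated independently of the others. The thread pool stands in for the GPU, `evaluate` is a hypothetical placeholder for running one trial, and averaging the trial scores into a per-individual fitness is an assumption, since the slides do not say how trials are combined.

```python
from concurrent.futures import ThreadPoolExecutor

N_INDIVIDUALS = 100       # from the Experimental Setup slide
N_TRIALS = 36 + 16        # object rotations + lighting conditions

def evaluate(job):
    """Hypothetical stand-in for one trial; returns a score in [0, 1]."""
    individual, trial = job
    return ((individual * 31 + trial * 17) % 21) / 20

jobs = [(i, t) for i in range(N_INDIVIDUALS) for t in range(N_TRIALS)]

# Every (individual, trial) pair is independent, so all can run concurrently.
with ThreadPoolExecutor(max_workers=8) as pool:
    scores = list(pool.map(evaluate, jobs))

# Assumed aggregation: average the trial scores of each individual.
fitness = [sum(scores[i * N_TRIALS:(i + 1) * N_TRIALS]) / N_TRIALS
           for i in range(N_INDIVIDUALS)]
```

With the parameters from the setup slide, one generation consists of 100 × 52 = 5200 independent trials, each lasting 20 network steps.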
  • 23. Results. Fitness cannot reach 1.0, since it takes a few time-steps to recognize an object. All objects are correctly classified at the end of each test. [Plot: best fitness and average fitness versus generations (0 to 10000); fitness axis from 0 to 0.9.]
  • 26. "Imagination is the highest form of research" (Albert Einstein). Thank you!