Academic Course: 10 On-line adaptation, learning, evolution

•

1 gefällt mir•963 views

FET AWARE project - Self Awareness in Autonomic Systems

By Gusz Eiben & Mark Hoogendoorn

Technologie Bildung

Designed by Gusz Eiben & Mark Hoogendoorn
Outline
• Population-based Adaptive Systems
• Types of adaptation: evolution, individual
(lifetime) learning, social learning
• Machine learning
• Reinforcement learning
• Off-line vs. on-line adaptation

Designed by Gusz Eiben & Mark Hoogendoorn
Population-based Adaptive Systems
PAS have two essential features
•They consist of a group of basic units that can
perform actions, e.g., computation,
communication, interaction, etc.
•The ability to adapt at
– individual level (modify agent ) and/or
– group level (add/remove agent).

Designed by Gusz Eiben & Mark Hoogendoorn
Types of adaptation
• Evolutionary learning (EL): Changes at population
level (assumed non-Lamarckian)
• Lifetime learning (LL): Changes at agent level
– Individual learning (IL): adaptation autonomously
through a purely internal procedure
– Social learning (SL): adaptation through interaction
/communication

Designed by Gusz Eiben & Mark Hoogendoorn
Taxonomy of adaptation
Adaptation
Evolutionary
Learning
Lifetime
Learning
Individual
Learning
Social
Learning

Designed by Gusz Eiben & Mark Hoogendoorn
Taxonomy of adaptation 2
Adaptation
Evolutionary
Learning
Lifetime
Learning
Individual
Learning
Social
Learning
Learning
Evolution

Designed by Gusz Eiben & Mark Hoogendoorn
Adaptation ≠ operation
• Operation: controller is being used
– Sensory inputs  outputs (motor, comm. device)
– Robot behavior changes, not the controller
• Adaptation: controller is being changed
– Present controller  new controller
– Uses utility/reward/fitness info
– It may require
• One single robot – learning
• More robots – evolution, social learning
• Adaptation + operation = generate + test
• Off-line (initial controller design, before start) vs. on-line (after
start)

Designed by Gusz Eiben & Mark Hoogendoorn
Genotype
Developmental
Engine(decoder)
Genetic operators:
mutation & xover
Learning
operators
Robot
behavior
State of the
environment
Phenotype =
controller
Reward
Fitness
Selection
operators

Designed by Gusz Eiben & Mark Hoogendoorn
Phenotype
Genotype
Developmental
Engine(decoder)
Genetic operators:
mutation & xover
Learning
operators
Robot
behavior
State of the
environment
Reward
Fitness
Selection
operators
controllershape

Designed by Gusz Eiben & Mark Hoogendoorn
Evolutionary loop
Genotype
DevelopmentalEngine
Genetic operators:
mutation & xover
Learning operator(s)
Robot
behavior
Changes in
environment
Controller =
phenotype
Reward
Fitness
Selection
operator(s)

Designed by Gusz Eiben & Mark Hoogendoorn
Learning loop
Genotype
DevelopmentalEngine
Genetic operators:
mutation & xover
Learning operator(s)
Robot
behavior
Changes in
environment
Controller =
phenotype
Reward
Fitness
Selection
operator(s)

Designed by Gusz Eiben & Mark Hoogendoorn
ENVIRONMENTAGENT
Reward r(t)
State s(t)
Action a(t)

Designed by Gusz Eiben & Mark Hoogendoorn
Reinforcement learning
Agent in situation/state st chooses action at
World changes to situation/state st+1
Agent perceives situation st+1 and gets reward rt+1
Telling the agent what to do is its
POLICY πt(s, a) = P r{at = a|st = s}
Given the situation at time t is s, the policy gives the probability the agent’s
action will be a.
For example: πt(s, goforward) = 0.5, πt(s, gobackward) = 0.5.
Reinforcement learning ⇒ Get/ﬁnd/learn the policy

Designed by Gusz Eiben & Mark Hoogendoorn
Further reading
• Evert Haasdijk and A.E. Eiben and Alan F.T.
Winfield, Individual Social and Evolutionary
Adaptation in Collective Systems , Serge
Kernbach (eds.) , Handbook of Collective
Robotics , Pan Stanford , 2011

Weitere ähnliche Inhalte

Mehr von FET AWARE project - Self Awareness in Autonomic Systems

Academic Course: 02 Self-organization and emergence in networked systemsFET AWARE project - Self Awareness in Autonomic Systems

Academic Course: 01 Self-awarenesss and Computational Self-awarenessFET AWARE project - Self Awareness in Autonomic Systems

Awareness: Layman Seminar SlidesFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 04 Awareness ApplicationsFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 03 Awareness SimulationFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 02 Awareness PropertiesFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 01 Awareness OverviewFET AWARE project - Self Awareness in Autonomic Systems

Robot Swarms as Ensembles of Cooperating Components - Matthias HolzlFET AWARE project - Self Awareness in Autonomic Systems

Towards Systematically Engineering Ensembles - Martin WirsingFET AWARE project - Self Awareness in Autonomic Systems

Capturing the Immune System: From the wet-lab to the robot, building better ...FET AWARE project - Self Awareness in Autonomic Systems

Underwater search and rescue in swarm robotics - Mark Read FET AWARE project - Self Awareness in Autonomic Systems

Computational Self-awareness in Smart-Camera Networks - Lukas EsterleFET AWARE project - Self Awareness in Autonomic Systems

Why Robots may need to be self-‐aware, before we can really trust them - Ala...FET AWARE project - Self Awareness in Autonomic Systems

Morphogenetic Engineering: Reconciling Architecture and Self-Organization Thr...FET AWARE project - Self Awareness in Autonomic Systems

Ensemble-oriented programming of self-adaptive systems - Michele LoretiFET AWARE project - Self Awareness in Autonomic Systems

Self-awareness and Adaptive Technologies: the Future of Operating Systems? FET AWARE project - Self Awareness in Autonomic Systems

EnhancingWeb Process Self-Awareness with Context-Aware Service CompositionFET AWARE project - Self Awareness in Autonomic Systems

Testing cooperative autonomous systems for unwanted emergent behaviour and da...FET AWARE project - Self Awareness in Autonomic Systems

Enduring Institutions and Self-Organising Trust-Adaptive Systems for an Open ...FET AWARE project - Self Awareness in Autonomic Systems

SmartContent: A self protecting and context aware active contentFET AWARE project - Self Awareness in Autonomic Systems

Mehr von FET AWARE project - Self Awareness in Autonomic Systems (20)

Academic Course: 02 Self-organization and emergence in networked systems

Academic Course: 01 Self-awarenesss and Computational Self-awareness

Awareness: Layman Seminar Slides

Industry Training: 04 Awareness Applications

Industry Training: 03 Awareness Simulation

Industry Training: 02 Awareness Properties

Industry Training: 01 Awareness Overview

Robot Swarms as Ensembles of Cooperating Components - Matthias Holzl

Towards Systematically Engineering Ensembles - Martin Wirsing

Capturing the Immune System: From the wet-lab to the robot, building better ...

Underwater search and rescue in swarm robotics - Mark Read

Computational Self-awareness in Smart-Camera Networks - Lukas Esterle

Why Robots may need to be self-‐aware, before we can really trust them - Ala...

Morphogenetic Engineering: Reconciling Architecture and Self-Organization Thr...

Ensemble-oriented programming of self-adaptive systems - Michele Loreti

Self-awareness and Adaptive Technologies: the Future of Operating Systems?

EnhancingWeb Process Self-Awareness with Context-Aware Service Composition

Testing cooperative autonomous systems for unwanted emergent behaviour and da...

Enduring Institutions and Self-Organising Trust-Adaptive Systems for an Open ...

SmartContent: A self protecting and context aware active content

Kürzlich hochgeladen

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

Scaling API-first – The story of a global engineering organizationRadu Cotescu

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Artificial Intelligence: Facts and MythsJoaquim Jorge

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

🐬 The future of MySQL is Postgres 🐘RTylerCroy

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Kürzlich hochgeladen (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Scaling API-first – The story of a global engineering organization

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Driving Behavioral Change for Information Management through Data-Driven Gree...

What Are The Drone Anti-jamming Systems Technology?

Powerful Google developer tools for immediate impact! (2023-24 C)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Boost PC performance: How more available memory can improve productivity

Advantages of Hiring UIUX Design Service Providers for Your Business

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Data Cloud, More than a CDP by Matt Robison

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

How to Troubleshoot Apps for the Modern Connected Worker

Artificial Intelligence: Facts and Myths

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

🐬 The future of MySQL is Postgres 🐘

AWS Community Day CPH - Three problems of Terraform

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Academic Course: 10 On-line adaptation, learning, evolution

1. Designed by Gusz Eiben & Mark Hoogendoorn On-line adaptation, learning, evolution

2. Designed by Gusz Eiben & Mark Hoogendoorn Outline • Population-based Adaptive Systems • Types of adaptation: evolution, individual (lifetime) learning, social learning • Machine learning • Reinforcement learning • Off-line vs. on-line adaptation

3. Designed by Gusz Eiben & Mark Hoogendoorn Population-based Adaptive Systems PAS have two essential features •They consist of a group of basic units that can perform actions, e.g., computation, communication, interaction, etc. •The ability to adapt at – individual level (modify agent ) and/or – group level (add/remove agent).

4. Designed by Gusz Eiben & Mark Hoogendoorn Types of adaptation • Evolutionary learning (EL): Changes at population level (assumed non-Lamarckian) • Lifetime learning (LL): Changes at agent level – Individual learning (IL): adaptation autonomously through a purely internal procedure – Social learning (SL): adaptation through interaction /communication

5. Designed by Gusz Eiben & Mark Hoogendoorn Taxonomy of adaptation Adaptation Evolutionary Learning Lifetime Learning Individual Learning Social Learning

6. Designed by Gusz Eiben & Mark Hoogendoorn Taxonomy of adaptation 2 Adaptation Evolutionary Learning Lifetime Learning Individual Learning Social Learning Learning Evolution

7. Designed by Gusz Eiben & Mark Hoogendoorn Adaptation ≠ operation • Operation: controller is being used – Sensory inputs  outputs (motor, comm. device) – Robot behavior changes, not the controller • Adaptation: controller is being changed – Present controller  new controller – Uses utility/reward/fitness info – It may require • One single robot – learning • More robots – evolution, social learning • Adaptation + operation = generate + test • Off-line (initial controller design, before start) vs. on-line (after start)

8. Designed by Gusz Eiben & Mark Hoogendoorn Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Phenotype = controller Reward Fitness Selection operators

9. Designed by Gusz Eiben & Mark Hoogendoorn Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Phenotype = controller Reward Fitness Selection operators

10. Designed by Gusz Eiben & Mark Hoogendoorn Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Reward Fitness Selection operators Phenotype controllershape

11. Designed by Gusz Eiben & Mark Hoogendoorn Phenotype Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Reward Fitness Selection operators controllershape

12. Designed by Gusz Eiben & Mark Hoogendoorn Evolutionary loop Genotype DevelopmentalEngine Genetic operators: mutation & xover Learning operator(s) Robot behavior Changes in environment Controller = phenotype Reward Fitness Selection operator(s)

13. Designed by Gusz Eiben & Mark Hoogendoorn Learning loop Genotype DevelopmentalEngine Genetic operators: mutation & xover Learning operator(s) Robot behavior Changes in environment Controller = phenotype Reward Fitness Selection operator(s)

14. Designed by Gusz Eiben & Mark Hoogendoorn ENVIRONMENTAGENT Reward r(t) State s(t) Action a(t)

15. Designed by Gusz Eiben & Mark Hoogendoorn Reinforcement learning Agent in situation/state st chooses action at World changes to situation/state st+1 Agent perceives situation st+1 and gets reward rt+1 Telling the agent what to do is its POLICY πt(s, a) = P r{at = a|st = s} Given the situation at time t is s, the policy gives the probability the agent’s action will be a. For example: πt(s, goforward) = 0.5, πt(s, gobackward) = 0.5. Reinforcement learning ⇒ Get/ﬁnd/learn the policy

16. Designed by Gusz Eiben & Mark Hoogendoorn Further reading • Evert Haasdijk and A.E. Eiben and Alan F.T. Winfield, Individual Social and Evolutionary Adaptation in Collective Systems , Serge Kernbach (eds.) , Handbook of Collective Robotics , Pan Stanford , 2011

Academic Course: 10 On-line adaptation, learning, evolution

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Mehr von FET AWARE project - Self Awareness in Autonomic Systems

Mehr von FET AWARE project - Self Awareness in Autonomic Systems (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Academic Course: 10 On-line adaptation, learning, evolution