SlideShare ist ein Scribd-Unternehmen logo
1 von 36
CSE 3311-Artificial Intelligence
2
Agents
 An agent is anything that can be viewed as
perceiving its environment through sensors and
acting upon that environment through actuators
 Human agent:
 Sensors: eyes, ears, and other organs
 Actuators: hands, legs, mouth, and other body parts
 Robotic agent:
 Sensors: cameras and infrared range finders
 Actuators: various motors
3
Agents
 The agent function maps from percept histories to
actions:
 [f: P*  A]
 Percept history is also known as percept sequence.
4
Agents
Agent
?
Sensors
Actuators
Environment
Percepts
Actions
5
Vacuum-cleaner world
 Percepts: location and contents, e.g.,
[A,Dirty]
 Actions: Left, Right, Suck, NoOp
A vacuum-cleaner agent
6
Rational agents
 An agent should strive to "do the right thing", based on
 what it can perceive and
 the actions it can perform
 The right action is the one that will cause the agent to be most
successful
 Performance measure: An objective criterion for success of an
agent's behavior
 E.g., performance measure of a vacuum-cleaner agent could be
 amount of dirt cleaned up,
 amount of time taken,
 amount of electricity consumed,
 amount of noise generated, etc.
7
Rational agents
 Rational Agent:
 Given:
 Possible percept sequence,
 The evidence provided by the percept sequence and
 Whatever built-in knowledge the agent has.
 Aim: For each possible percept sequence a rational
agent should select an action
 Subject to: maximize its performance measure.
8
Rational and Omniscience Agent
 Omniscience agent knows the actual outcome of its
actions, and can act accordingly.
 Omniscience agent is impossible.
 Aim: For each possible percept sequence a rational
agent should select an action
 In summary, rational at any time depends on 4
things
 The performance measure (degree of success)
 Percept sequence (perception history)
 Knowledge about the environment
 The performed actions
9
Ideal Rational agents
 For each possible percept sequence, an ideal
rational agent should do whatever action is
expected to maximize its performance
measure, on the basis of the evidence
provided by the percept sequence and
whatever built-in knowledge the agent has.
10
Mapping (Percept seq to actions )
 A rule transfer percept sequences to action.
 Very long list.
 Mapping describe agents then ideal mapping
describe ideal agents
 Square root problem (Program and mapping
table e.g).
11
Autonomy
 Agent actions are completely based on the
built-in knowledge (mapping table) then the
agent called lacks autonomy. (e.g. clock)
 Behavior can be based on both
 Its own experience
 Built-in-knowledge
 Truly autonomous intelligent agent should be
avoid unsuccessful result (e.g. dung beetle).
12
Rational agents
 Rationality is distinct from omniscience (all-
knowing with infinite knowledge)
 Agents can perform actions in order to modify
future percepts so as to obtain useful
information (information gathering,
exploration)
 An agent is autonomous if its behavior is
determined by its own experience (with ability
to learn and adapt)
13
PEAS
 Task environment – are essentially the ‘problems’ to
which rational agents are ‘solutions’.
 Task environment specified by PEAS:
 Performance measure
 Environment
 Actuators
 Sensors
 Must first specify the setting for intelligent agent
design
14
PEAS: Example 1
 Agent: Automated taxi driver
 Performance measure: Safe, fast, legal, comfortable
trip, maximize profits
 Environment: Roads, other traffic, pedestrians,
customers
 Actuators: Steering wheel, accelerator, brake,
signal, horn
 Sensors: Cameras, sonar, speedometer, GPS,
odometer, engine sensors, keyboard
15
PEAS : Example 2
 Agent: Medical diagnosis system
 Performance measure: Healthy patient,
minimize costs, lawsuits
 Environment: Patient, hospital, staff
 Actuators: Screen display (questions, tests,
diagnoses, treatments, referrals)
 Sensors: Keyboard (entry of symptoms,
findings, patient's answers)
16
PEAS : Example 3
 Agent: Part-picking robot
 Performance measure: Percentage of parts in
correct bins
 Environment: Conveyor belt with parts, bins
 Actuators: Jointed arm and hand
 Sensors: Camera, joint angle sensors
17
PEAS : Example 4
 Agent: Interactive English tutor
 Performance measure: Maximize student's
score on test
 Environment: Set of students
 Actuators: Screen display (exercises,
suggestions, corrections)
 Sensors: Keyboard
18
Environment types
 Fully observable (vs. partially observable): An agent's sensors
give it access to the complete state of the environment at each
point in time.
 Deterministic (vs. stochastic): The next state of the environment
is completely determined by the current state and the action
executed by the agent. (If the environment is deterministic except
for the actions of other agents, then the environment is strategic)
 Episodic (vs. sequential): The agent's experience is divided into
atomic "episodes" (each episode consists of the agent perceiving
and then performing a single action), and the choice of action in
each episode depends only on the episode itself.
19
Environment types
 Static (vs. dynamic): The environment is unchanged
while an agent is deliberating.
 The environment is semi dynamic if the environment itself
does not change with the passage of time but the agent's
performance score does.
 Discrete (vs. continuous): A limited number of
distinct, clearly defined percepts and actions.
 Single agent (vs. multiagent): An agent operating by
itself in an environment.
20
Environment types
Chess with Chess without Taxi driving
a clock a clock
Fully observable Yes Yes No
Deterministic Strategic Strategic No
Episodic No No No
Static Semi Yes No
Discrete Yes Yes No
Single agent No No No
 The environment type largely determines the agent design
 The real world is (of course) partially observable, stochastic, sequential,
dynamic, continuous, multi-agent
21
Agents and Environments
 The agent function maps from percept histories to actions:
 [f: P*  A]
 The agent program runs on the physical architecture to produce f
 agent = architecture + program
22
Agent functions and programs
 An agent is completely specified by the agent
function mapping percept sequences to
actions
 An agent function (or a small equivalence
class) is rational
 Aim: find a way to implement the rational
agent function concisely
23
Table-lookup agent
 Drawbacks:
 Huge table
 Take a long time to build the table
 No autonomy
 Even with learning, need a long time to learn the table entries
24
Agent types
 Four basic types in order of increasing
generality:
 Simple reflex agents
 Model-based reflex agents
 Goal-based agents
 Utility-based agents
25
Simple reflex agents
 These agents select actions on the basis of current
percept, ignoring the rest of the percept history.
 Agent program for a vacuum-cleaner agent
26
Simple reflex agents
27
Simple reflex agents
 This types of agents are very simple to implement.
 They work properly if the correct decision can be made on the
basis of current percept only i.e. if the environment is fully
observable.
 Example of vacuum cleaning agent with a dirt sensor only
28
Model-based reflex agents
 To handle partial observability it is effective to keep track of the
world it can’t see now.
 i.e. an agent should maintain an internal state representing a
model of the world. The state changes through percepts and
actions.
 Example:
 Changing state by percept: An overtaking car will generally be
closer to behind than it was a moment ago.
 Changing state by action: If the car moves to the right lane there is
a gap in the place before.
29
Model-based reflex agents
30
Model-based reflex agents
31
Goal-based agents
 The state of environment is not always sufficient to decide
what to do.
 Example: At a road junction, the taxi can turn left, turn right,
or go straight. The percept and internal state says it’s a
junction but which way to go depend on the goal (what way
is the passenger’s destination).
32
Goal-based agents
33
Utility-based agents
 Goals sometimes are not enough to generate high quality
behavior.
 For example, out of many possible routes a taxi can follow,
some are quicker, safer, more reliable and cheaper than
others.
 The degree of happiness while achieving a goal is
represented by utility.
 A utility function maps a state (or a sequence of states) onto
a real number which describes the degree of happiness.
34
Utility-based agents
35
Learning agents
 A learning agent has a performance element that
decides which actions to take,
 and a learning element that can change the performance
element to be more efficient as the agents learns.
 A critic component is used to evaluate how well the agent
is doing and provide feedback to the learning component,
 and a problem generator that can deviate from the usual
routine and explore new possibilities.
36
Learning agents

Weitere ähnliche Inhalte

Ähnlich wie Slide01 - Intelligent Agents.ppt

Ähnlich wie Slide01 - Intelligent Agents.ppt (20)

m2-agents.ppt
m2-agents.pptm2-agents.ppt
m2-agents.ppt
 
Artificial Intelligent Agents
Artificial Intelligent AgentsArtificial Intelligent Agents
Artificial Intelligent Agents
 
agents.pdf
agents.pdfagents.pdf
agents.pdf
 
Week 2.pdf
Week 2.pdfWeek 2.pdf
Week 2.pdf
 
Lecture 2 Agents.pptx
Lecture 2 Agents.pptxLecture 2 Agents.pptx
Lecture 2 Agents.pptx
 
Agents-and-Problem-Solving-20022024-094442am.pdf
Agents-and-Problem-Solving-20022024-094442am.pdfAgents-and-Problem-Solving-20022024-094442am.pdf
Agents-and-Problem-Solving-20022024-094442am.pdf
 
Infosec
InfosecInfosec
Infosec
 
AI_02_Intelligent Agents.pptx
AI_02_Intelligent Agents.pptxAI_02_Intelligent Agents.pptx
AI_02_Intelligent Agents.pptx
 
Week 3.pdf
Week 3.pdfWeek 3.pdf
Week 3.pdf
 
introduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.pptintroduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.ppt
 
Agents_AI.ppt
Agents_AI.pptAgents_AI.ppt
Agents_AI.ppt
 
Unit-1.pptx
Unit-1.pptxUnit-1.pptx
Unit-1.pptx
 
Lecture 02-agents
Lecture 02-agentsLecture 02-agents
Lecture 02-agents
 
Unit 1.ppt
Unit 1.pptUnit 1.ppt
Unit 1.ppt
 
Artificial intelligence introduction
Artificial intelligence introductionArtificial intelligence introduction
Artificial intelligence introduction
 
Lec 2 agents
Lec 2 agentsLec 2 agents
Lec 2 agents
 
Lec 2-agents
Lec 2-agentsLec 2-agents
Lec 2-agents
 
AI_Ch2.pptx
AI_Ch2.pptxAI_Ch2.pptx
AI_Ch2.pptx
 
AI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial IntelligenceAI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial Intelligence
 
m2-agents.pptx
m2-agents.pptxm2-agents.pptx
m2-agents.pptx
 

Kürzlich hochgeladen

PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineeringssuserb3a23b
 

Kürzlich hochgeladen (20)

PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
Odoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting ServiceOdoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting Service
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineering
 

Slide01 - Intelligent Agents.ppt

  • 2. 2 Agents  An agent is anything that can be viewed as perceiving its environment through sensors and acting upon that environment through actuators  Human agent:  Sensors: eyes, ears, and other organs  Actuators: hands, legs, mouth, and other body parts  Robotic agent:  Sensors: cameras and infrared range finders  Actuators: various motors
  • 3. 3 Agents  The agent function maps from percept histories to actions:  [f: P*  A]  Percept history is also known as percept sequence.
  • 5. 5 Vacuum-cleaner world  Percepts: location and contents, e.g., [A,Dirty]  Actions: Left, Right, Suck, NoOp A vacuum-cleaner agent
  • 6. 6 Rational agents  An agent should strive to "do the right thing", based on  what it can perceive and  the actions it can perform  The right action is the one that will cause the agent to be most successful  Performance measure: An objective criterion for success of an agent's behavior  E.g., performance measure of a vacuum-cleaner agent could be  amount of dirt cleaned up,  amount of time taken,  amount of electricity consumed,  amount of noise generated, etc.
  • 7. 7 Rational agents  Rational Agent:  Given:  Possible percept sequence,  The evidence provided by the percept sequence and  Whatever built-in knowledge the agent has.  Aim: For each possible percept sequence a rational agent should select an action  Subject to: maximize its performance measure.
  • 8. 8 Rational and Omniscience Agent  Omniscience agent knows the actual outcome of its actions, and can act accordingly.  Omniscience agent is impossible.  Aim: For each possible percept sequence a rational agent should select an action  In summary, rational at any time depends on 4 things  The performance measure (degree of success)  Percept sequence (perception history)  Knowledge about the environment  The performed actions
  • 9. 9 Ideal Rational agents  For each possible percept sequence, an ideal rational agent should do whatever action is expected to maximize its performance measure, on the basis of the evidence provided by the percept sequence and whatever built-in knowledge the agent has.
  • 10. 10 Mapping (Percept seq to actions )  A rule transfer percept sequences to action.  Very long list.  Mapping describe agents then ideal mapping describe ideal agents  Square root problem (Program and mapping table e.g).
  • 11. 11 Autonomy  Agent actions are completely based on the built-in knowledge (mapping table) then the agent called lacks autonomy. (e.g. clock)  Behavior can be based on both  Its own experience  Built-in-knowledge  Truly autonomous intelligent agent should be avoid unsuccessful result (e.g. dung beetle).
  • 12. 12 Rational agents  Rationality is distinct from omniscience (all- knowing with infinite knowledge)  Agents can perform actions in order to modify future percepts so as to obtain useful information (information gathering, exploration)  An agent is autonomous if its behavior is determined by its own experience (with ability to learn and adapt)
  • 13. 13 PEAS  Task environment – are essentially the ‘problems’ to which rational agents are ‘solutions’.  Task environment specified by PEAS:  Performance measure  Environment  Actuators  Sensors  Must first specify the setting for intelligent agent design
  • 14. 14 PEAS: Example 1  Agent: Automated taxi driver  Performance measure: Safe, fast, legal, comfortable trip, maximize profits  Environment: Roads, other traffic, pedestrians, customers  Actuators: Steering wheel, accelerator, brake, signal, horn  Sensors: Cameras, sonar, speedometer, GPS, odometer, engine sensors, keyboard
  • 15. 15 PEAS : Example 2  Agent: Medical diagnosis system  Performance measure: Healthy patient, minimize costs, lawsuits  Environment: Patient, hospital, staff  Actuators: Screen display (questions, tests, diagnoses, treatments, referrals)  Sensors: Keyboard (entry of symptoms, findings, patient's answers)
  • 16. 16 PEAS : Example 3  Agent: Part-picking robot  Performance measure: Percentage of parts in correct bins  Environment: Conveyor belt with parts, bins  Actuators: Jointed arm and hand  Sensors: Camera, joint angle sensors
  • 17. 17 PEAS : Example 4  Agent: Interactive English tutor  Performance measure: Maximize student's score on test  Environment: Set of students  Actuators: Screen display (exercises, suggestions, corrections)  Sensors: Keyboard
  • 18. 18 Environment types  Fully observable (vs. partially observable): An agent's sensors give it access to the complete state of the environment at each point in time.  Deterministic (vs. stochastic): The next state of the environment is completely determined by the current state and the action executed by the agent. (If the environment is deterministic except for the actions of other agents, then the environment is strategic)  Episodic (vs. sequential): The agent's experience is divided into atomic "episodes" (each episode consists of the agent perceiving and then performing a single action), and the choice of action in each episode depends only on the episode itself.
  • 19. 19 Environment types  Static (vs. dynamic): The environment is unchanged while an agent is deliberating.  The environment is semi dynamic if the environment itself does not change with the passage of time but the agent's performance score does.  Discrete (vs. continuous): A limited number of distinct, clearly defined percepts and actions.  Single agent (vs. multiagent): An agent operating by itself in an environment.
  • 20. 20 Environment types Chess with Chess without Taxi driving a clock a clock Fully observable Yes Yes No Deterministic Strategic Strategic No Episodic No No No Static Semi Yes No Discrete Yes Yes No Single agent No No No  The environment type largely determines the agent design  The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent
  • 21. 21 Agents and Environments  The agent function maps from percept histories to actions:  [f: P*  A]  The agent program runs on the physical architecture to produce f  agent = architecture + program
  • 22. 22 Agent functions and programs  An agent is completely specified by the agent function mapping percept sequences to actions  An agent function (or a small equivalence class) is rational  Aim: find a way to implement the rational agent function concisely
  • 23. 23 Table-lookup agent  Drawbacks:  Huge table  Take a long time to build the table  No autonomy  Even with learning, need a long time to learn the table entries
  • 24. 24 Agent types  Four basic types in order of increasing generality:  Simple reflex agents  Model-based reflex agents  Goal-based agents  Utility-based agents
  • 25. 25 Simple reflex agents  These agents select actions on the basis of current percept, ignoring the rest of the percept history.  Agent program for a vacuum-cleaner agent
  • 27. 27 Simple reflex agents  This types of agents are very simple to implement.  They work properly if the correct decision can be made on the basis of current percept only i.e. if the environment is fully observable.  Example of vacuum cleaning agent with a dirt sensor only
  • 28. 28 Model-based reflex agents  To handle partial observability it is effective to keep track of the world it can’t see now.  i.e. an agent should maintain an internal state representing a model of the world. The state changes through percepts and actions.  Example:  Changing state by percept: An overtaking car will generally be closer to behind than it was a moment ago.  Changing state by action: If the car moves to the right lane there is a gap in the place before.
  • 31. 31 Goal-based agents  The state of environment is not always sufficient to decide what to do.  Example: At a road junction, the taxi can turn left, turn right, or go straight. The percept and internal state says it’s a junction but which way to go depend on the goal (what way is the passenger’s destination).
  • 33. 33 Utility-based agents  Goals sometimes are not enough to generate high quality behavior.  For example, out of many possible routes a taxi can follow, some are quicker, safer, more reliable and cheaper than others.  The degree of happiness while achieving a goal is represented by utility.  A utility function maps a state (or a sequence of states) onto a real number which describes the degree of happiness.
  • 35. 35 Learning agents  A learning agent has a performance element that decides which actions to take,  and a learning element that can change the performance element to be more efficient as the agents learns.  A critic component is used to evaluate how well the agent is doing and provide feedback to the learning component,  and a problem generator that can deviate from the usual routine and explore new possibilities.