Artificial Intelligence: an introduction.pdf

Artificial intelligence: an introduction
Eleonora Ciceri - 19 January 2021

Artificial intelligence is the intelligence exhibited by machines

Artificial intelligence: a definition
“An ideal intelligent machine is a flexible rational agent that perceives its environment and takes actions that maximize its chance of success at an arbitrary goal”

Are machines really intelligent?
Machines learn to do a task without having a proper knowledge base
Machines are NOT intelligent! They mimic human behaviors without attaching any semantics to them
Human reasoning
✔ Know precisely what a cat is
✔ Recognize the cat in the photo
✔ “The photo contains a cat”
Machine “reasoning”
✔ Cat = set of features
✔ Lack of features in “ears” area
✔ “The photo does not contain a cat”

This leads to classification errors...

...which sometimes are not so good

Well, sometimes even humans are not so good!
“Machines cannot learn what humans cannot do”

Beware!
Artificial intelligence is NOT the solution to every task you could think of
(machines are not as intelligent as humans, remember?)

Remember this guy?
Andrew exhibits artificial general intelligence (or, full AI):
“The machine can successfully perform any intellectual task that a human being can”
(BTW, thanks Asimov for the wishful thinking)

The bicentennial man is NOT the reality
No matter what you wish, what you can achieve is what is called weak AI:
“Weak AI defines a non-sentient computer intelligence that is focused on one narrow task”
(So much for Andrew, right?)

And again, beware!
Artificial intelligence is NOT just about robots
(computers are machines, too)
(or, better: robots are computers, too)

So, what does artificial intelligence solve?

Reasoning
Reasoning is the act of applying logic to establish and verify facts
Example: Automated theorem proving
http://www.repubblica.it/scienze/2016/06/13/news/dimostrazione_matematica_piu_lunga-141910538/?ref=HREC1-36

Planning
Planning is the act of realizing strategies or action sequences, typically for execution by:
- intelligent agents
- autonomous robots
- unmanned vehicles

Learning
Learners are systems that can learn from data
Example: distinguish between spam/non-spam emails
...We will discuss this later…

Natural language processing (NLP)
NLP is the act of understanding natural language, i.e., enabling computers to derive meaning from human or natural language input

Perception
Perception is the identification and interpretation of sensory information in order to understand and represent the environment

What is an algorithm?
An algorithm is a self-contained step-by-step set of operations to be performed
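To make the definition concrete, here is a minimal sketch of a classic algorithm (Euclid's method for the greatest common divisor). It is not part of the original slides; the function name `gcd` is chosen just for illustration.

```python
def gcd(a: int, b: int) -> int:
    """Greatest common divisor of two non-negative integers (Euclid's algorithm)."""
    # Step-by-step rule: replace (a, b) with (b, a mod b) until b becomes 0.
    while b != 0:
        a, b = b, a % b
    return a


print(gcd(48, 18))  # prints 6
```

Each loop iteration is one "step" of the algorithm, and the procedure needs nothing beyond its two inputs, which is what "self-contained" means here.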

What is machine learning?
Machine learning explores the study and construction of algorithms that can:
- learn from data
- make predictions on data

Different “types” of machine learning
Supervised learning: humans assist computers while learning
Unsupervised learning: humans just give data to computers, and let them work out the rules governing the data by themselves
Reinforcement learning: humans let computers do their own learning, but from time to time help them understand whether they are learning well

Supervised learning
(AKA: train your algorithm to make you happy)

Definitions
From now on, I’ll refer to the following concepts:
(Well, I could use boring images of computers, but it’s cuter this way)
- The learner (the dog)
- Task: “sit down”
- Features (x):
- the amount of time required for him to decide to sit down
- the volume of his owner’s voice
- Class (y):
- positive class (label: “happy”)
- negative class (label: “sad”)
This is called binary classification (two classes in the outcomes)

How to learn to make your owner happy?
The learner has to:
- Observe the distribution of classes with respect to features (i.e., the voice volume and time spent distributions)
- Understand which feature values (time/volume) make the owner happy/sad

How does it work?
1. Data collection: build a set of data used to train the model
2. Training: make your learner read and understand the data in the training set, so that it extracts the rules that describe such data
3. Validation: test your algorithm, to evaluate its performance on fresh data
(Figure: the collected data are split into a training set and a test set)

Data collection
What is “data collection” in our case?
1. we make the dog sit in different conditions
- different voice volumes
- different times needed to sit down
2. we observe the owner to understand if he is happy or sad

Data collection: representation
The data we collect are points in this space, where:
- the coordinates of each point are its features (voice volume and time to sit)
- each point is associated with a class (happy/sad)
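The representation above can be sketched in code as follows. This is only an illustration: the `Sample` type, the units and the numeric values are made up, not taken from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    voice_volume: float  # feature x1 (hypothetical 0-10 scale)
    time_to_sit: float   # feature x2 (hypothetical, in seconds)
    label: str           # class y: "happy" or "sad"


# Made-up data points, only to show the shape of a collected dataset.
samples = [
    Sample(2.0, 1.0, "happy"),
    Sample(3.0, 2.5, "happy"),
    Sample(8.0, 9.0, "sad"),
    Sample(9.0, 7.5, "sad"),
]


def features(sample: Sample) -> tuple:
    """Coordinates of the point in the (voice volume, time to sit) space."""
    return (sample.voice_volume, sample.time_to_sit)
```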

Data collection: representation
The more we move in this direction, the sadder the owner should be, because he will have to scream and wait for the dog to sit
Hence, we expect to find more “sad” samples in the upper-right corner of the graph

Data collection
(Table of collected samples: one column for the features (x), one for the class (y))

Data collection: labeling errors
Some labels could be wrongly attributed to data points
The more of these errors there are, the higher the probability that the learner will learn the wrong things

Data collection: forming training and test set
- we keep some samples as the training set and use them to train our learner
- we keep the remaining samples as the test set and use them to compute the performance of the learner
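A minimal sketch of such a split, assuming a random shuffle and a 25% test share (both are illustrative choices, not stated in the slides; `train_test_split` here is a hand-written helper, not a library call):

```python
import random


def train_test_split(data, test_fraction=0.25, seed=0):
    """Shuffle a copy of the data and split it into (training set, test set)."""
    rng = random.Random(seed)      # fixed seed so the split is reproducible
    shuffled = list(data)
    rng.shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


collected = list(range(8))          # stand-ins for 8 collected samples
training_set, test_set = train_test_split(collected)
```

The test samples are never shown to the learner during training, so they can play the role of "fresh data" during validation.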

Training (i.e., using training set to learn from data)
The learner tries to understand how to divide “happy” and “sad” samples
This will help him classify new samples: when a new sample arrives, the learner will know whether it falls on this or that side of the line
(Figure: a line separating the SAD region from the HAPPY region)
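One way a learner could find such a dividing line is the perceptron rule, sketched below. The slides do not say which training algorithm is used, so this is just one possible choice, and the data points are made up (and deliberately easy to separate).

```python
# Each sample: ((voice_volume, time_to_sit), label); +1 = "happy", -1 = "sad".
TRAINING_SET = [
    ((1.0, 1.0), +1), ((2.0, 1.0), +1), ((1.0, 3.0), +1), ((3.0, 2.0), +1),
    ((7.0, 8.0), -1), ((8.0, 6.0), -1), ((9.0, 9.0), -1), ((6.0, 9.0), -1),
]


def predict(weights, bias, x):
    """Side of the line w1*x1 + w2*x2 + b = 0 on which the point x falls."""
    score = weights[0] * x[0] + weights[1] * x[1] + bias
    return +1 if score > 0 else -1


def train_perceptron(samples, max_epochs=1000):
    """Nudge the line towards every misclassified sample until none is left."""
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in samples:
            if predict(weights, bias, x) != y:
                weights[0] += y * x[0]
                weights[1] += y * x[1]
                bias += y
                mistakes += 1
        if mistakes == 0:   # every training sample is on the correct side
            break
    return weights, bias


weights, bias = train_perceptron(TRAINING_SET)
```

The learned `weights` and `bias` describe the line; `predict` then tells on which side a new sample falls.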

Training with a small dataset
(Figure: how the learner divides between happy and sad, with the SAD and HAPPY regions)
Probably our mighty owner would still be happy here, but the learner does not have enough data to know it
OVERFITTING
The learner learned the available data, but not the rule that generated them

Training with an adequate amount of data
(Figure: how the learner divides between happy and sad, with the SAD and HAPPY regions)
The samples that end up on the wrong side of the line determine the classification error

The model
By the way, the line dividing the space into “what belongs to the SAD class” and “what belongs to the HAPPY class” is called the model
We can see it as a function
y = f(x)
i.e., the function that computes the class given the features

How can I use the model?
Given a new sample (e.g., from the test set) described by its features, i.e.:
- time needed to sit down
- volume of voice
the model tells us which class it will be associated with:
- happy, if below the line
- sad, if above the line
(Figure: a new sample above the line, which will be associated with the “sad” class)
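As a sketch, the model y = f(x) can be written as a plain function. The line used here (slope and intercept) is hypothetical: a trained model would supply its own.

```python
def model(time_to_sit, voice_volume, slope=-1.0, intercept=10.0):
    """Classify a sample by comparing it with a (hypothetical) dividing line.

    The line is voice_volume = slope * time_to_sit + intercept.
    """
    boundary = slope * time_to_sit + intercept
    # Above the line -> "sad"; on or below the line -> "happy".
    return "sad" if voice_volume > boundary else "happy"


print(model(time_to_sit=2.0, voice_volume=3.0))  # prints happy
print(model(time_to_sit=8.0, voice_volume=9.0))  # prints sad
```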

Validation: computing the performance on test set
To measure the performance of a model:
                      Actual positive        Actual negative
Predicted positive    True Positive (TP)     False Positive (FP)
Predicted negative    False Negative (FN)    True Negative (TN)
Precision = TP / (TP + FP)
(how many of the samples the model recognized as positive were actually positive)
Recall = TP / (TP + FN)
(how many of the actually positive samples the model was able to recognize)
Accuracy = (TP + TN) / (TP + TN + FP + FN)
(how many samples were assigned their actual, correct label)
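The three measures can be computed directly from the actual and predicted labels. The sketch below treats “happy” as the positive class; the label lists are made-up test results, not data from the slides.

```python
def confusion_counts(actual, predicted, positive="happy"):
    """Return (TP, FP, FN, TN) for a binary classification task."""
    tp = fp = fn = tn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1
        elif p == positive and a != positive:
            fp += 1
        elif p != positive and a == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


# Hypothetical test-set results.
actual    = ["happy", "happy", "sad", "sad",   "happy", "sad"]
predicted = ["happy", "sad",   "sad", "happy", "happy", "sad"]

tp, fp, fn, tn = confusion_counts(actual, predicted)
precision = tp / (tp + fp)
recall = tp / (tp + fn)
accuracy = (tp + tn) / (tp + tn + fp + fn)
```

With these made-up labels there are 2 true positives, 1 false positive, 1 false negative and 2 true negatives, so precision and recall are both 2/3 and accuracy is 4/6.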

Examples of application of supervised learning
Given an ECG beat, determine which arrhythmia class it belongs to
- Features (x): the points constituting the beat
- Classes (y): normal beat, arrhythmia type 1, arrhythmia type 2…
Given some HRV characteristics, determine if the patient was stressed during
their acquisition
- Features (x): the HRV characteristics
- Classes (y): stressed, non-stressed

Problems of supervised learning
Expensive data gathering
Imagine how many times the dog has to sit to
learn how to make his owner happy
The larger the training set:
- The better the result
- The larger the cost

A learner is customized
If owners were all the same, a single learner could make all owners happy…
…but since owners have different behaviors and preferences, each owner has to be provided with his own learner

What does this mean?
To satisfy the first owner, the learner can be trained to sit down:
- with one dataset: this trains a model with a certain accuracy
- or with another dataset: this trains the same model as before, with a different accuracy
To satisfy the second owner instead, the learner can be trained to fetch the ball: this trains a different model with its own accuracy
The first two models can both be used to satisfy the first owner, but not the second
The third model can be used to satisfy the second owner, but not the first

More in “health” terms:
To detect arrhythmia in ECG, we build a model:
- with one dataset of beats labeled “normal” / “arrhythmia”: this trains a model with a certain accuracy
- or with another dataset of labeled beats: this trains the same model as before, with a different accuracy
To detect stress in HRV, we build a model with this dataset: this trains a different model with its own accuracy
2.2 1.7 … 5.0 “stress”
7.8 1.5 … 4.6 “stress”
1.4 5.7 … 9.2 “no stress”

Training data: a good representative of real data?
A practical example:
I may decide to train my learner to recognize cats in photos (classes: “there is a cat” / “there is not a cat”), using super beautiful photos that are taken with professional cameras
The model I extract will probably not work with pictures of my cat, taken with a smartphone, with a different resolution and different cat postures

This leads to a lack of accuracy, even in production
(Figure labels: Villa Del Balbianello, furniture, squirrel)

Multiclass classification
Up to now, we have seen binary classification (i.e., the task of classifying an item into one of two classes)
Multiclass classification is the task of classifying an item into one of three or more classes
Classes
Husky
Samoyed
French bulldog
…
Australian shepherd

Multiclass classification
In our health-related examples:
Stress detection
This task is a binary classification task (stress/no-stress)
Arrhythmia detection
This task is a multiclass classification task (normal/arrhythmia-type-1, …,
arrhythmia-type-N)

Neural networks
(a specific example of supervised learning)

The neuron
Remember the fact that a model can be seen as a function?
y = f(x)
Visually:
(Figure: inputs x1, x2, x3 enter f(x), which outputs y)

The neuron: a practical example
We could build, for instance, a function that computes the price of a house (y) given the size of the house (x):
(Figure: size → f(x) → price)
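A single neuron of this kind can be sketched as a weighted input plus a bias, clipped at zero so that the price is never negative. The weight, the bias and the clipping (a ReLU) are assumptions made for this illustration; a real neuron would learn its parameters from data.

```python
def neuron(size, weight=2000.0, bias=50000.0):
    """Hypothetical price = f(size): linear in the size, never below zero."""
    return max(0.0, weight * size + bias)


print(neuron(100.0))  # prints 250000.0
```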

Connecting more neurons
But what if we can combine many of these functions?
- some of the inputs build up more complex features
- these features are further combined
- at the end, via combinations of combinations, I reach the outcome
(Figure: a network whose nodes are labeled size, number of rooms, city, wealth, family size, walkability, school quality and price)
Artificial neural network
An artificial neural network (ANN) is a model used to learn concepts for which data are described by a large number of inputs (i.e., “many x”)
The magic of a neural network is that you don’t have to define the measures in the middle; we just provide inputs, and the network is able on its own to give semantics to the internal nodes
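The "combinations of combinations" idea can be sketched as a tiny network with one hidden layer. All weights below are hand-picked for illustration only: in a real network they are learned during training, which is exactly why we do not have to define the internal nodes ourselves.

```python
def relu(z):
    return max(0.0, z)


def dense(inputs, weights, biases):
    """One layer: every output neuron combines all the inputs."""
    return [
        relu(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


# Hypothetical parameters: 4 inputs -> 3 hidden neurons -> 1 output.
HIDDEN_W = [
    [0.5, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.5, 1.0],
]
HIDDEN_B = [0.0, 0.0, 0.0]
OUTPUT_W = [[1.0, 2.0, 3.0]]
OUTPUT_B = [0.0]


def predict_price(x):
    """Forward pass: inputs -> hidden features -> price."""
    hidden = dense(x, HIDDEN_W, HIDDEN_B)
    return dense(hidden, OUTPUT_W, OUTPUT_B)[0]


# Made-up, already-scaled inputs standing in for size, rooms, city, wealth.
print(predict_price([2.0, 3.0, 1.0, 4.0]))  # prints 19.5
```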

What does this have to do with the human brain?
Not a lot, actually :)
The only similarity (which is a loose analogy) is as follows:
(Figure: two diagrams side by side, each with inputs x1, x2, x3 and an output y)
However, even today we do not know what a single neuron really does

Types of neural networks
Shallow neural network (NN)
Few layers (i.e., few neurons)
Deep neural network (DNN)
Many layers (i.e., many neurons)
(ah, the trending “deep learning”!)
Convolutional neural network (CNN)
Used mostly for images
Recurrent neural network (RNN)
Used mostly for time series (e.g., audio)

A couple of nice examples
MariFlow: self-driving Mario Kart with Recurrent Neural Network
https://www.youtube.com/watch?v=Ipi40cb_RsI
Neural network plays Flappy Bird:
https://www.youtube.com/watch?v=QWdEub_7EcA

Unsupervised learning
(AKA: leave your learner in the blue)

Unsupervised learning is the machine learning task of inferring a function to describe hidden structure from unlabeled data
Since the examples given to the learner are unlabeled, there is no error or reward signal to evaluate a potential solution
Supervised learning: labeled samples such as {#edges=4;label=‘square’} and {#edges=3;label=‘triangle’} are mapped to class square and class triangle
Unsupervised learning: unlabeled samples such as {#edges=4} and {#edges=3} are grouped into two classes (class 1 and class 2)
Reinforcement learning
(AKA: prizes are learners’ best friends)

Reinforcement learning
With reinforcement learning, a computer program interacts with a dynamic environment in which it must achieve a certain goal, without a teacher explicitly telling it whether it has come close to its goal
(Figure: over time, the learner takes “good” and “bad” actions on its way to the goal)
Genetic algorithms: are they about DNA?

Short answer: no DNA involved
A genetic algorithm mimics the process of natural selection
1. Start with an initial population
2. Evaluate the fitness of each individual to the desired requirements
3. Enough generations? If yes, END; if no, continue
4. Selection: select the individuals that best fit the requirements
5. Crossover: combine aspects of the selected individuals
6. Mutation: randomly change a portion of the chromosome, then go back to step 2
Genetic algorithms are evolutionary algorithms, which are not part of the machine learning suite
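The loop above can be sketched on a toy problem: chromosomes are lists of bits, and the "desired requirement" is to contain as many 1s as possible. The problem, the parameter values and the choice to always carry the best individual over unchanged (elitism) are assumptions made for this example.

```python
import random


def fitness(chromosome):
    """Toy requirement: the more 1s, the fitter the individual."""
    return sum(chromosome)


def evolve(pop_size=20, length=16, generations=30, mutation_rate=0.05, seed=0):
    rng = random.Random(seed)
    # Start with an initial random population.
    population = [[rng.randint(0, 1) for _ in range(length)] for _ in range(pop_size)]
    history = [max(map(fitness, population))]
    for _ in range(generations):
        # Selection: keep the fitter half as parents.
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        children = [parents[0][:]]  # elitism: the best individual survives as is
        while len(children) < pop_size:
            mom, dad = rng.sample(parents, 2)
            cut = rng.randrange(1, length)
            child = mom[:cut] + dad[cut:]  # crossover at a random cut point
            # Mutation: flip each bit with a small probability.
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            children.append(child)
        population = children
        history.append(max(map(fitness, population)))
    return max(population, key=fitness), history


best, history = evolve()
```

`history` records the best fitness of each generation; thanks to elitism it can never decrease.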

A couple of nice examples
Genetic cars: https://rednuht.org/genetic_cars_2/
Genetic walkers: http://rednuht.org/genetic_walkers/

References
Backward chaining: https://en.wikipedia.org/wiki/Backward_chaining
Artificial intelligence: https://en.wikipedia.org/wiki/Artificial_intelligence
General intelligence: https://en.wikipedia.org/wiki/Artificial_general_intelligence
Machine learning: https://en.wikipedia.org/wiki/Machine_learning
Algorithm: https://en.wikipedia.org/wiki/Algorithm
Unsupervised learning: https://en.wikipedia.org/wiki/Unsupervised_learning
Artificial neural networks: https://en.wikipedia.org/wiki/Artificial_neural_network
Introduction to artificial neural networks: https://www.youtube.com/watch?v=IS-PeWbvqbs
Genetic algorithms: https://en.wikipedia.org/wiki/Genetic_algorithm
Genetic algorithms for beginners: http://www.theprojectspot.com/tutorial-post/creating-a-genetic-algorithm-for-beginners/3
Precision and recall: https://en.wikipedia.org/wiki/Precision_and_recall
Neural network and deep learning: https://www.coursera.org/learn/neural-networks-deep-learning?specialization=deep-learning
