SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Neural Networks
Chapter 3
2
Outline
3.1 Introduction
3.2 Training Single TLUs
– Gradient Descent
– Widrow-Hoff Rule
– Generalized Delta Procedure
3.3 Neural Networks
– The Backpropagation Method
– Derivation of the Backpropagation Learning Rule
3.4 Generalization, Accuracy, and Overfitting
3.5 Discussion
3
3.1 Introduction
 TLU (threshold logic unit): Basic units for neural networks
– Based on some properties of biological neurons
 Training set
– Input: real value, boolean value, …
– Output:
 : associated actions (Label, Class …)
 Target of training
– Finding corresponds “acceptably” to the members of
the training set.
– Supervised learning: Labels are given along with the input
vectors.
jd
),...,(Xvector,dim-n:X 1 nxx
( )f X
4
3.2.1 TLU Geometry
 Training TLU: Adjusting variable weights
 A single TLU: Perceptron, Adaline (adaptive linear element)
[Rosenblatt 1962, Widrow 1962]
 Elements of TLU
– Weight:
– Threshold: 
 Output of TLU: Using weighted sum
– 1 if s   > 0
– 0 if s   < 0
 Hyperplane
– WX   = 0
1( ,..., )nW w w
s W X 
5
6
3.2.2 Augmented Vectors
 Adopting the convention that threshold is fixed to 0.
 Arbitrary thresholds: (n + 1)-dimensional vector
 W = (w1, …, wn, 1)
 Output of TLU
– 1 if WX  0
– 0 if WX < 0
7
3.2.3 Gradient Decent Methods
 Training TLU: minimizing the error function by adjusting weight values.
 Two ways: Batch learning v.s. incremental learning
 Commonly used error function: squared error
– Gradient :
– Chain rule:
 Solution of nonlinearity of f / s :
– Ignoring threshod function: f = s
– Replacing threshold function with differentiable nonlinear
function
2
)( fd 















11
,...,,...,
ni
def
www

W
XX
WW s
f
fd
s
s
s 












)(2

8
3.2.4 The Widrow-Hoff Procedure
 Weight update procedure:
– Using f = s = WX
– Data labeled 1  1, Data labeled 0  1
 Gradient:
 New weight vector
 Widrow-Hoff (delta) rule
– (d  f) > 0  increasing s  decreasing (d  f)
– (d  f) < 0  decreasing s  increasing (d  f)
XX
W
)(2)(2 fd
s
f
fd 





XWW )( fdc 
9
The Generalized Delta Procedure
 Sigmoid function (differentiable): [Rumelhart, et al. 1986]
)1/(1)( x
esf 

10
The Generalized Delta Procedure (II)
 Gradient:
 Generalized delta procedure:
– Target output: 1, 0
– Output f = output of sigmoid function
– f(1– f) = 0, where f = 0 or 1
– Weight change can occur only within „fuzzy‟ region
surrounding the hyperplane (near the point f(s) = ½).
XX
W
)1()(2)(2 fffd
s
f
fd 





XWW )1()( fffdc 
11
The Error-Correction Procedure
 Using threshold unit: (d – f) can be either 1 or –1.
 In the linearly separable case, after finite iterations, W will be
converged to the solution.
 In the nonlinearly separable case, W will never be converged.
 The Widrow-Hoff and generalized delta procedures will find
minimum squared error solutions even when the minimum error is
not zero.
XWW )( fdc 
12
Training Process
 Data
 L
kkdkX 1))(),(( 
)()(
)2()2(
)1()1(
LdLX
dX
dX




NN
X(k) f(k)
d(k)
-
)())()(()()( kkfkdckk XWW 
• Update Rule
13
3.3 Neural Networks
 Need for use of multiple TLUs
– Feedforward network: no cycle
– Recurrent network: cycle (treated in a later chapter)
– Layered feedforward network
 jth layer can receive input only from j – 1th layer.
 Example :
2121 xxxxf 
14
Notation
 Hidden unit: neurons in all but the last layer
 Output of j-th layer: X(j)  input of (j+1)-th layer
 Input vector: X(0)
 Final output: f
 The weight of i-th sigmoid unit in the j-th layer: Wi
(j)
 Weighted sum of i-th sigmoid unit in the j-th layer: si
(j)
 Number of sigmoid units in j-th layer: mj
)()1()( j
i
jj
is WX  
 )(
,1
)(
,
)(
,1
)(
)1(
,...,,..., j
im
j
il
j
i
j
i j
www 
W
15
그림 3.5
16
3.3.3 The Backpropagation Method
 Gradient of Wi
(j) :
 Weight update:
)()1()( j
i
jj
is WX  
)()(
2
)(
)(2
)(
j
i
j
i
j
i s
f
fd
s
fd
s 







)1()()()()( 
 jj
i
j
i
j
i
j
i c XWW 
)1()()1(
)(
)1(
)()(
)(
)()(
2)(2 















jj
i
j
j
i
j
j
i
j
i
j
i
j
i
j
i
s
f
fd
s
s
s
XX
X
WW


Local gradient
17
Weight Changes in the Final Layer
 Local gradient:
 Weight update:
)1()(
)( )(
)(
fffd
s
f
fd j
i
k




)1()()()(
)1()( 
 kkkk
fffdc XWW
18
3.3.5 Weights in Intermediate Layers
 Local gradient:
– The final ouput f, depends on si
(j) through of the summed
inputs to the sigmoids in the (j+1)-th layer.
 Need for computation of
)(
)1(
1
)1(
)(
)1(
)1(
1
)(
)1(
)1()(
)1(
)1()(
)1(
1
)1(
1
)(
)(
11
1
1
)(
......)(
)(
j
i
j
l
m
l
j
lj
i
j
l
j
l
m
l
j
i
j
m
j
m
j
i
j
l
j
l
j
i
j
j
j
i
j
i
s
s
s
s
s
f
fd
s
s
s
f
s
s
s
f
s
s
s
f
fd
s
f
fd
jj
j
j




















































)(
)1(
j
i
j
l
s
s

 
19
Weight Update in Hidden Layers (cont.)
– v  i : v = i :
– Conseqeuntly,
 
)1( )()()1(
,
1
1
)(
)(
)1(
,)(
1
1
)1(
,
)(
)(
)1(
1
1
)1(
,
)()1()()1(
j
i
j
i
j
li
m
v
j
i
j
vj
lvj
i
m
v
j
lv
l
v
j
i
j
l
m
v
j
lv
l
v
j
l
jj
l
ffw
s
f
w
s
wf
s
s
wfs
j
j
j






















WX
0)(
)(



j
i
j
v
s
f )1( )()(
)(
)(
j
v
j
vj
i
j
v
ff
s
f








1
1
)1(
.
)1()()()(
)1(
jm
l
j
li
j
l
j
i
j
i
j
i wff 
20
Weight Update in Hidden Layers (cont.)
 Attention to recursive equation of local gradient!
 Backpropagation:
– Error is back-propagated from the output layer to the input layer
– Local gradient of the latter layer is used in calculating local
gradient of the former layer.
))(1()(
fdffk






1
1
)1(
.
)1()()()(
)1(
jm
l
j
li
j
i
j
i
j
i
j
i wff 
21
3.3.5 (con’t)
 Example (even parity function)
– Learning rate: 1.0
input target
1 0 1 0
0 0 1 1
0 1 1 0
1 1 1 1
2121 xxxxf 
22
Generalization, Accuracy, Overfitting
 Generalization ability:
– NN appropriately classifies vectors not in the training set.
– Measurement = accuracy
 Curve fitting
– Number of training input vectors  number of degrees of
freedom of the network.
– In the case of m data points, is (m-1)-degree polynomial best
model? No, it can not capture any special information.
 Overfitting
– Extra degrees of freedom are essentially just fitting the noise.
– Given sufficient data, the Occam’s Razor principle dictates to
choose the lowest-degree polynomial that adequately fits the
data.
23
Overfitting
24
Generalization (cont’d)
 Out-of-sample-set error rate
– Error rate on data drawn from the same underlying distribution of
training set.
 Dividing available data into a training set and a validation set
– Usually use 2/3 for training and 1/3 for validation
 k-fold cross validation
– k disjoint subsets (called folds).
– Repeat training k times with the configuration: one validation set,
k-1 (combined) training sets.
– Take average of the error rate of each validation as the out-of-
sample error.
– Empirically 10-fold is preferred.
25
Fig 3.8 Error Versus Number of
Hidden Units
Fig 9. Estimate of Generalization Error
Versus Number of Hidden Units
26
3.5 Additional Readings & Discussion
 Applications
– Pattern recognition, automatic control, brain-function modeling
– Designing and training neural networks still need experience and
experiments.
 Major annual conferences
– Neural Information Processing Systems (NIPS)
– International Conference on Machine Learning (ICML)
– Computational Learning Theory (COLT)
 Major journals
– Neural Computation
– IEEE Transactions on Neural Networks
– Machine Learning
27
Homework
 Page 55 ~ 57
– Ex 3.1; Ex3.4, Ex3.6, Ex. 3.7
How to submit your homework:
Email to lqzhang@sjtu.edu.cn
Filename specification: [学号]+[姓名]+[章节].doc

Weitere ähnliche Inhalte

Was ist angesagt?

Artificial Neural Network
Artificial Neural Network Artificial Neural Network
Artificial Neural Network Iman Ardekani
 
Artificial Neural Network seminar presentation using ppt.
Artificial Neural Network seminar presentation using ppt.Artificial Neural Network seminar presentation using ppt.
Artificial Neural Network seminar presentation using ppt.Mohd Faiz
 
Artificial neural network model & hidden layers in multilayer artificial neur...
Artificial neural network model & hidden layers in multilayer artificial neur...Artificial neural network model & hidden layers in multilayer artificial neur...
Artificial neural network model & hidden layers in multilayer artificial neur...Muhammad Ishaq
 
Nural network ER. Abhishek k. upadhyay
Nural network ER. Abhishek  k. upadhyayNural network ER. Abhishek  k. upadhyay
Nural network ER. Abhishek k. upadhyayabhishek upadhyay
 
Artificial Neural Networks Lect2: Neurobiology & Architectures of ANNS
Artificial Neural Networks Lect2: Neurobiology & Architectures of ANNSArtificial Neural Networks Lect2: Neurobiology & Architectures of ANNS
Artificial Neural Networks Lect2: Neurobiology & Architectures of ANNSMohammed Bennamoun
 
Fundamental, An Introduction to Neural Networks
Fundamental, An Introduction to Neural NetworksFundamental, An Introduction to Neural Networks
Fundamental, An Introduction to Neural NetworksNelson Piedra
 
Introduction to Neural networks (under graduate course) Lecture 7 of 9
Introduction to Neural networks (under graduate course) Lecture 7 of 9Introduction to Neural networks (under graduate course) Lecture 7 of 9
Introduction to Neural networks (under graduate course) Lecture 7 of 9Randa Elanwar
 
lecture07.ppt
lecture07.pptlecture07.ppt
lecture07.pptbutest
 
Adaline madaline
Adaline madalineAdaline madaline
Adaline madalineNagarajan
 
Artificial Neural Network (draft)
Artificial Neural Network (draft)Artificial Neural Network (draft)
Artificial Neural Network (draft)James Boulie
 
Feedforward neural network
Feedforward neural networkFeedforward neural network
Feedforward neural networkSopheaktra YONG
 
Introduction to Neural networks (under graduate course) Lecture 9 of 9
Introduction to Neural networks (under graduate course) Lecture 9 of 9Introduction to Neural networks (under graduate course) Lecture 9 of 9
Introduction to Neural networks (under graduate course) Lecture 9 of 9Randa Elanwar
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networkIshaneeSharma
 
Artificial Neural Network(Artificial intelligence)
Artificial Neural Network(Artificial intelligence)Artificial Neural Network(Artificial intelligence)
Artificial Neural Network(Artificial intelligence)spartacus131211
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networkmustafa aadel
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networksweetysweety8
 

Was ist angesagt? (20)

Artificial Neural Network
Artificial Neural Network Artificial Neural Network
Artificial Neural Network
 
Artificial Neural Network seminar presentation using ppt.
Artificial Neural Network seminar presentation using ppt.Artificial Neural Network seminar presentation using ppt.
Artificial Neural Network seminar presentation using ppt.
 
Artificial neural network model & hidden layers in multilayer artificial neur...
Artificial neural network model & hidden layers in multilayer artificial neur...Artificial neural network model & hidden layers in multilayer artificial neur...
Artificial neural network model & hidden layers in multilayer artificial neur...
 
Nural network ER. Abhishek k. upadhyay
Nural network ER. Abhishek  k. upadhyayNural network ER. Abhishek  k. upadhyay
Nural network ER. Abhishek k. upadhyay
 
Artificial Neural Networks Lect2: Neurobiology & Architectures of ANNS
Artificial Neural Networks Lect2: Neurobiology & Architectures of ANNSArtificial Neural Networks Lect2: Neurobiology & Architectures of ANNS
Artificial Neural Networks Lect2: Neurobiology & Architectures of ANNS
 
Fundamental, An Introduction to Neural Networks
Fundamental, An Introduction to Neural NetworksFundamental, An Introduction to Neural Networks
Fundamental, An Introduction to Neural Networks
 
Introduction to Neural networks (under graduate course) Lecture 7 of 9
Introduction to Neural networks (under graduate course) Lecture 7 of 9Introduction to Neural networks (under graduate course) Lecture 7 of 9
Introduction to Neural networks (under graduate course) Lecture 7 of 9
 
04 Multi-layer Feedforward Networks
04 Multi-layer Feedforward Networks04 Multi-layer Feedforward Networks
04 Multi-layer Feedforward Networks
 
lecture07.ppt
lecture07.pptlecture07.ppt
lecture07.ppt
 
Adaline madaline
Adaline madalineAdaline madaline
Adaline madaline
 
Artificial Neural Network (draft)
Artificial Neural Network (draft)Artificial Neural Network (draft)
Artificial Neural Network (draft)
 
Feedforward neural network
Feedforward neural networkFeedforward neural network
Feedforward neural network
 
Introduction to Neural networks (under graduate course) Lecture 9 of 9
Introduction to Neural networks (under graduate course) Lecture 9 of 9Introduction to Neural networks (under graduate course) Lecture 9 of 9
Introduction to Neural networks (under graduate course) Lecture 9 of 9
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Artificial Neural Network(Artificial intelligence)
Artificial Neural Network(Artificial intelligence)Artificial Neural Network(Artificial intelligence)
Artificial Neural Network(Artificial intelligence)
 
03 Single layer Perception Classifier
03 Single layer Perception Classifier03 Single layer Perception Classifier
03 Single layer Perception Classifier
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Perceptron
PerceptronPerceptron
Perceptron
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Neural Networks
Neural NetworksNeural Networks
Neural Networks
 

Ähnlich wie 03 neural network

Ähnlich wie 03 neural network (20)

Chapter No. 6: Backpropagation Networks
Chapter No. 6:  Backpropagation NetworksChapter No. 6:  Backpropagation Networks
Chapter No. 6: Backpropagation Networks
 
Neural Networks
Neural NetworksNeural Networks
Neural Networks
 
NN-Ch6.PDF
NN-Ch6.PDFNN-Ch6.PDF
NN-Ch6.PDF
 
Lec 6-bp
Lec 6-bpLec 6-bp
Lec 6-bp
 
UofT_ML_lecture.pptx
UofT_ML_lecture.pptxUofT_ML_lecture.pptx
UofT_ML_lecture.pptx
 
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
 
Presentation on SOM
Presentation on SOMPresentation on SOM
Presentation on SOM
 
Neural Networks - How do they work?
Neural Networks - How do they work?Neural Networks - How do they work?
Neural Networks - How do they work?
 
neural networksNnf
neural networksNnfneural networksNnf
neural networksNnf
 
Artificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competitionArtificial Neural Networks Lect7: Neural networks based on competition
Artificial Neural Networks Lect7: Neural networks based on competition
 
Max net
Max netMax net
Max net
 
CS767_Lecture_05.pptx
CS767_Lecture_05.pptxCS767_Lecture_05.pptx
CS767_Lecture_05.pptx
 
ICPR 2016
ICPR 2016ICPR 2016
ICPR 2016
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
 
Introduction
IntroductionIntroduction
Introduction
 
Artificial Neural Networks
Artificial Neural NetworksArtificial Neural Networks
Artificial Neural Networks
 
机器学习Adaboost
机器学习Adaboost机器学习Adaboost
机器学习Adaboost
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier
 
ACUMENS ON NEURAL NET AKG 20 7 23.pptx
ACUMENS ON NEURAL NET AKG 20 7 23.pptxACUMENS ON NEURAL NET AKG 20 7 23.pptx
ACUMENS ON NEURAL NET AKG 20 7 23.pptx
 
NN-Ch2.PDF
NN-Ch2.PDFNN-Ch2.PDF
NN-Ch2.PDF
 

Mehr von Tianlu Wang

14 pro resolution
14 pro resolution14 pro resolution
14 pro resolutionTianlu Wang
 
13 propositional calculus
13 propositional calculus13 propositional calculus
13 propositional calculusTianlu Wang
 
12 adversal search
12 adversal search12 adversal search
12 adversal searchTianlu Wang
 
11 alternative search
11 alternative search11 alternative search
11 alternative searchTianlu Wang
 
21 situation calculus
21 situation calculus21 situation calculus
21 situation calculusTianlu Wang
 
20 bayes learning
20 bayes learning20 bayes learning
20 bayes learningTianlu Wang
 
19 uncertain evidence
19 uncertain evidence19 uncertain evidence
19 uncertain evidenceTianlu Wang
 
18 common knowledge
18 common knowledge18 common knowledge
18 common knowledgeTianlu Wang
 
17 2 expert systems
17 2 expert systems17 2 expert systems
17 2 expert systemsTianlu Wang
 
17 1 knowledge-based system
17 1 knowledge-based system17 1 knowledge-based system
17 1 knowledge-based systemTianlu Wang
 
16 2 predicate resolution
16 2 predicate resolution16 2 predicate resolution
16 2 predicate resolutionTianlu Wang
 
16 1 predicate resolution
16 1 predicate resolution16 1 predicate resolution
16 1 predicate resolutionTianlu Wang
 
09 heuristic search
09 heuristic search09 heuristic search
09 heuristic searchTianlu Wang
 
08 uninformed search
08 uninformed search08 uninformed search
08 uninformed searchTianlu Wang
 

Mehr von Tianlu Wang (20)

L7 er2
L7 er2L7 er2
L7 er2
 
L8 design1
L8 design1L8 design1
L8 design1
 
L9 design2
L9 design2L9 design2
L9 design2
 
14 pro resolution
14 pro resolution14 pro resolution
14 pro resolution
 
13 propositional calculus
13 propositional calculus13 propositional calculus
13 propositional calculus
 
12 adversal search
12 adversal search12 adversal search
12 adversal search
 
11 alternative search
11 alternative search11 alternative search
11 alternative search
 
10 2 sum
10 2 sum10 2 sum
10 2 sum
 
22 planning
22 planning22 planning
22 planning
 
21 situation calculus
21 situation calculus21 situation calculus
21 situation calculus
 
20 bayes learning
20 bayes learning20 bayes learning
20 bayes learning
 
19 uncertain evidence
19 uncertain evidence19 uncertain evidence
19 uncertain evidence
 
18 common knowledge
18 common knowledge18 common knowledge
18 common knowledge
 
17 2 expert systems
17 2 expert systems17 2 expert systems
17 2 expert systems
 
17 1 knowledge-based system
17 1 knowledge-based system17 1 knowledge-based system
17 1 knowledge-based system
 
16 2 predicate resolution
16 2 predicate resolution16 2 predicate resolution
16 2 predicate resolution
 
16 1 predicate resolution
16 1 predicate resolution16 1 predicate resolution
16 1 predicate resolution
 
15 predicate
15 predicate15 predicate
15 predicate
 
09 heuristic search
09 heuristic search09 heuristic search
09 heuristic search
 
08 uninformed search
08 uninformed search08 uninformed search
08 uninformed search
 

Kürzlich hochgeladen

Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...akbard9823
 
Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...
Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...
Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...anilsa9823
 
RAK Call Girls Service # 971559085003 # Call Girl Service In RAK
RAK Call Girls Service # 971559085003 # Call Girl Service In RAKRAK Call Girls Service # 971559085003 # Call Girl Service In RAK
RAK Call Girls Service # 971559085003 # Call Girl Service In RAKedwardsara83
 
Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...
Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...
Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...anilsa9823
 
Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...
Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...
Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...anilsa9823
 
Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...
Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...
Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...anilsa9823
 
Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...
Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...
Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...akbard9823
 
Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...
Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...
Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...anilsa9823
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | Noida
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | NoidaFULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | Noida
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | Noidasoniya singh
 
Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...
Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...
Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...akbard9823
 
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...home
 
Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...
Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...
Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...anilsa9823
 
Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...
Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...
Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...anilsa9823
 
Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...
Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...
Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...anilsa9823
 
Alex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson StoryboardAlex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson Storyboardthephillipta
 
AaliyahBell_themist_v01.pdf .
AaliyahBell_themist_v01.pdf             .AaliyahBell_themist_v01.pdf             .
AaliyahBell_themist_v01.pdf .AaliyahB2
 
Best Call girls in Lucknow - 9548086042 - with hotel room
Best Call girls in Lucknow - 9548086042 - with hotel roomBest Call girls in Lucknow - 9548086042 - with hotel room
Best Call girls in Lucknow - 9548086042 - with hotel roomdiscovermytutordmt
 
Young⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort ServiceYoung⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort Servicesonnydelhi1992
 

Kürzlich hochgeladen (20)

Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...Patrakarpuram ) Cheap Call Girls In Lucknow  (Adult Only) 🧈 8923113531 𓀓 Esco...
Patrakarpuram ) Cheap Call Girls In Lucknow (Adult Only) 🧈 8923113531 𓀓 Esco...
 
Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...
Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...
Lucknow 💋 Call Girls in Lucknow ₹7.5k Pick Up & Drop With Cash Payment 892311...
 
RAK Call Girls Service # 971559085003 # Call Girl Service In RAK
RAK Call Girls Service # 971559085003 # Call Girl Service In RAKRAK Call Girls Service # 971559085003 # Call Girl Service In RAK
RAK Call Girls Service # 971559085003 # Call Girl Service In RAK
 
Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...
Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...
Lucknow 💋 Call Girl in Lucknow | Whatsapp No 8923113531 VIP Escorts Service A...
 
Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...
Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...
Lucknow 💋 Russian Call Girls Lucknow - Book 8923113531 Call Girls Available 2...
 
Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...
Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...
Lucknow 💋 Call Girls in Lucknow | Service-oriented sexy call girls 8923113531...
 
Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...
Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...
Indira Nagar Lucknow #Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payme...
 
Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...
Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...
Lucknow 💋 Escorts Service Lucknow Phone No 8923113531 Elite Escort Service Av...
 
RAJKOT CALL GIRL 76313*77252 CALL GIRL IN RAJKOT
RAJKOT CALL GIRL 76313*77252 CALL GIRL IN RAJKOTRAJKOT CALL GIRL 76313*77252 CALL GIRL IN RAJKOT
RAJKOT CALL GIRL 76313*77252 CALL GIRL IN RAJKOT
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | Noida
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | NoidaFULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | Noida
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Pari Chowk | Noida
 
Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)
Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)
Dxb Call Girls # +971529501107 # Call Girls In Dxb Dubai || (UAE)
 
Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...
Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...
Hazratganj ] (Call Girls) in Lucknow - 450+ Call Girl Cash Payment 🧄 89231135...
 
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
Authentic # 00971556872006 # Hot Call Girls Service in Dubai By International...
 
Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...
Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...
Lucknow 💋 Cheap Call Girls In Lucknow Finest Escorts Service 8923113531 Avail...
 
Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...
Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...
Lucknow 💋 Russian Call Girls Lucknow | Whatsapp No 8923113531 VIP Escorts Ser...
 
Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...
Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...
Lucknow 💋 Female Escorts Service in Lucknow | Service-oriented sexy call girl...
 
Alex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson StoryboardAlex and Chloe by Daniel Johnson Storyboard
Alex and Chloe by Daniel Johnson Storyboard
 
AaliyahBell_themist_v01.pdf .
AaliyahBell_themist_v01.pdf             .AaliyahBell_themist_v01.pdf             .
AaliyahBell_themist_v01.pdf .
 
Best Call girls in Lucknow - 9548086042 - with hotel room
Best Call girls in Lucknow - 9548086042 - with hotel roomBest Call girls in Lucknow - 9548086042 - with hotel room
Best Call girls in Lucknow - 9548086042 - with hotel room
 
Young⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort ServiceYoung⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort Service
Young⚡Call Girls in Lajpat Nagar Delhi >༒9667401043 Escort Service
 

03 neural network

  • 2. 2 Outline 3.1 Introduction 3.2 Training Single TLUs – Gradient Descent – Widrow-Hoff Rule – Generalized Delta Procedure 3.3 Neural Networks – The Backpropagation Method – Derivation of the Backpropagation Learning Rule 3.4 Generalization, Accuracy, and Overfitting 3.5 Discussion
  • 3. 3 3.1 Introduction  TLU (threshold logic unit): Basic units for neural networks – Based on some properties of biological neurons  Training set – Input: real value, boolean value, … – Output:  : associated actions (Label, Class …)  Target of training – Finding corresponds “acceptably” to the members of the training set. – Supervised learning: Labels are given along with the input vectors. jd ),...,(Xvector,dim-n:X 1 nxx ( )f X
  • 4. 4 3.2.1 TLU Geometry  Training TLU: Adjusting variable weights  A single TLU: Perceptron, Adaline (adaptive linear element) [Rosenblatt 1962, Widrow 1962]  Elements of TLU – Weight: – Threshold:   Output of TLU: Using weighted sum – 1 if s   > 0 – 0 if s   < 0  Hyperplane – WX   = 0 1( ,..., )nW w w s W X 
  • 5. 5
  • 6. 6 3.2.2 Augmented Vectors  Adopting the convention that threshold is fixed to 0.  Arbitrary thresholds: (n + 1)-dimensional vector  W = (w1, …, wn, 1)  Output of TLU – 1 if WX  0 – 0 if WX < 0
  • 7. 7 3.2.3 Gradient Decent Methods  Training TLU: minimizing the error function by adjusting weight values.  Two ways: Batch learning v.s. incremental learning  Commonly used error function: squared error – Gradient : – Chain rule:  Solution of nonlinearity of f / s : – Ignoring threshod function: f = s – Replacing threshold function with differentiable nonlinear function 2 )( fd                 11 ,...,,..., ni def www  W XX WW s f fd s s s              )(2 
  • 8. 8 3.2.4 The Widrow-Hoff Procedure  Weight update procedure: – Using f = s = WX – Data labeled 1  1, Data labeled 0  1  Gradient:  New weight vector  Widrow-Hoff (delta) rule – (d  f) > 0  increasing s  decreasing (d  f) – (d  f) < 0  decreasing s  increasing (d  f) XX W )(2)(2 fd s f fd       XWW )( fdc 
  • 9. 9 The Generalized Delta Procedure  Sigmoid function (differentiable): [Rumelhart, et al. 1986] )1/(1)( x esf  
  • 10. 10 The Generalized Delta Procedure (II)  Gradient:  Generalized delta procedure: – Target output: 1, 0 – Output f = output of sigmoid function – f(1– f) = 0, where f = 0 or 1 – Weight change can occur only within „fuzzy‟ region surrounding the hyperplane (near the point f(s) = ½). XX W )1()(2)(2 fffd s f fd       XWW )1()( fffdc 
  • 11. 11 The Error-Correction Procedure  Using threshold unit: (d – f) can be either 1 or –1.  In the linearly separable case, after finite iterations, W will be converged to the solution.  In the nonlinearly separable case, W will never be converged.  The Widrow-Hoff and generalized delta procedures will find minimum squared error solutions even when the minimum error is not zero. XWW )( fdc 
  • 12. 12 Training Process  Data  L kkdkX 1))(),((  )()( )2()2( )1()1( LdLX dX dX     NN X(k) f(k) d(k) - )())()(()()( kkfkdckk XWW  • Update Rule
  • 13. 13 3.3 Neural Networks  Need for use of multiple TLUs – Feedforward network: no cycle – Recurrent network: cycle (treated in a later chapter) – Layered feedforward network  jth layer can receive input only from j – 1th layer.  Example : 2121 xxxxf 
  • 14. 14 Notation  Hidden unit: neurons in all but the last layer  Output of j-th layer: X(j)  input of (j+1)-th layer  Input vector: X(0)  Final output: f  The weight of i-th sigmoid unit in the j-th layer: Wi (j)  Weighted sum of i-th sigmoid unit in the j-th layer: si (j)  Number of sigmoid units in j-th layer: mj )()1()( j i jj is WX    )( ,1 )( , )( ,1 )( )1( ,...,,..., j im j il j i j i j www  W
  • 16. 16 3.3.3 The Backpropagation Method  Gradient of Wi (j) :  Weight update: )()1()( j i jj is WX   )()( 2 )( )(2 )( j i j i j i s f fd s fd s         )1()()()()(   jj i j i j i j i c XWW  )1()()1( )( )1( )()( )( )()( 2)(2                 jj i j j i j j i j i j i j i j i s f fd s s s XX X WW   Local gradient
  • 17. 17 Weight Changes in the Final Layer  Local gradient:  Weight update: )1()( )( )( )( fffd s f fd j i k     )1()()()( )1()(   kkkk fffdc XWW
  • 18. 18 3.3.5 Weights in Intermediate Layers  Local gradient: – The final ouput f, depends on si (j) through of the summed inputs to the sigmoids in the (j+1)-th layer.  Need for computation of )( )1( 1 )1( )( )1( )1( 1 )( )1( )1()( )1( )1()( )1( 1 )1( 1 )( )( 11 1 1 )( ......)( )( j i j l m l j lj i j l j l m l j i j m j m j i j l j l j i j j j i j i s s s s s f fd s s s f s s s f s s s f fd s f fd jj j j                                                     )( )1( j i j l s s   
  • 19. 19 Weight Update in Hidden Layers (cont.) – v  i : v = i : – Conseqeuntly,   )1( )()()1( , 1 1 )( )( )1( ,)( 1 1 )1( , )( )( )1( 1 1 )1( , )()1()()1( j i j i j li m v j i j vj lvj i m v j lv l v j i j l m v j lv l v j l jj l ffw s f w s wf s s wfs j j j                       WX 0)( )(    j i j v s f )1( )()( )( )( j v j vj i j v ff s f         1 1 )1( . )1()()()( )1( jm l j li j l j i j i j i wff 
  • 20. 20 Weight Update in Hidden Layers (cont.)  Attention to recursive equation of local gradient!  Backpropagation: – Error is back-propagated from the output layer to the input layer – Local gradient of the latter layer is used in calculating local gradient of the former layer. ))(1()( fdffk       1 1 )1( . )1()()()( )1( jm l j li j i j i j i j i wff 
  • 21. 21 3.3.5 (con’t)  Example (even parity function) – Learning rate: 1.0 input target 1 0 1 0 0 0 1 1 0 1 1 0 1 1 1 1 2121 xxxxf 
  • 22. 22 Generalization, Accuracy, Overfitting  Generalization ability: – NN appropriately classifies vectors not in the training set. – Measurement = accuracy  Curve fitting – Number of training input vectors  number of degrees of freedom of the network. – In the case of m data points, is (m-1)-degree polynomial best model? No, it can not capture any special information.  Overfitting – Extra degrees of freedom are essentially just fitting the noise. – Given sufficient data, the Occam’s Razor principle dictates to choose the lowest-degree polynomial that adequately fits the data.
  • 24. 24 Generalization (cont’d)  Out-of-sample-set error rate – Error rate on data drawn from the same underlying distribution of training set.  Dividing available data into a training set and a validation set – Usually use 2/3 for training and 1/3 for validation  k-fold cross validation – k disjoint subsets (called folds). – Repeat training k times with the configuration: one validation set, k-1 (combined) training sets. – Take average of the error rate of each validation as the out-of- sample error. – Empirically 10-fold is preferred.
  • 25. 25 Fig 3.8 Error Versus Number of Hidden Units Fig 9. Estimate of Generalization Error Versus Number of Hidden Units
  • 26. 26 3.5 Additional Readings & Discussion  Applications – Pattern recognition, automatic control, brain-function modeling – Designing and training neural networks still need experience and experiments.  Major annual conferences – Neural Information Processing Systems (NIPS) – International Conference on Machine Learning (ICML) – Computational Learning Theory (COLT)  Major journals – Neural Computation – IEEE Transactions on Neural Networks – Machine Learning
  • 27. 27 Homework  Page 55 ~ 57 – Ex 3.1; Ex3.4, Ex3.6, Ex. 3.7 How to submit your homework: Email to lqzhang@sjtu.edu.cn Filename specification: [学号]+[姓名]+[章节].doc