12. Multi-layer neural network
An MLP uses multiple hidden layers between the input
and output layers to extract meaningful features.
A Neural Network = A Function
MLP (Multi-Layer Perceptron)
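A neural network is just a function from inputs to outputs. Below is a minimal sketch of an MLP forward pass in NumPy; the layer sizes and the ReLU choice are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def mlp_forward(x, params):
    """Forward pass of an MLP: each hidden layer applies an
    affine transform followed by a nonlinearity, extracting
    features layer by layer."""
    h = x
    for W, b in params[:-1]:           # hidden layers
        h = relu(h @ W + b)
    W_out, b_out = params[-1]          # output layer (linear scores here)
    return h @ W_out + b_out

# Example: 4 inputs -> two hidden layers of 8 units -> 3 outputs
rng = np.random.default_rng(0)
sizes = [4, 8, 8, 3]
params = [(rng.normal(0.0, 0.1, (m, n)), np.zeros(n))
          for m, n in zip(sizes[:-1], sizes[1:])]
print(mlp_forward(rng.normal(size=4), params))  # a length-3 output vector
```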
15. Find the network weights that minimize the training
error between the true and estimated labels of the
training examples, e.g.:
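One common choice (an assumption here, since the slide's own formula did not survive extraction) is the squared error summed over the training set, where $f_{\mathbf{w}}$ is the network with weights $\mathbf{w}$ and $(x_j, y_j)$ are the training pairs:

$$E(\mathbf{w}) = \sum_j \big\| y_j - f_{\mathbf{w}}(x_j) \big\|^2$$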
Training of multi-layer networks
16. Back-propagation: gradients are computed in the
direction from the output layer to the input layer and
combined using the chain rule, as in the sketch below.
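A minimal sketch of back-propagation on a one-hidden-layer network; the sigmoid activation and squared-error loss are assumptions for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One training example, one hidden layer: x -> h = sigmoid(W1 x) -> y_hat = W2 h
x = np.array([0.5, -1.0])
y = np.array([1.0])
W1 = np.array([[0.1, 0.2], [-0.3, 0.4]])
W2 = np.array([[0.5, -0.6]])

# Forward pass (intermediates are stored for reuse in the backward pass)
a1 = W1 @ x
h = sigmoid(a1)
y_hat = W2 @ h
loss = 0.5 * np.sum((y_hat - y) ** 2)

# Backward pass: gradients flow from the output toward the input,
# each step multiplying by a local derivative (the chain rule)
d_yhat = y_hat - y                 # dL/dy_hat
dW2 = np.outer(d_yhat, h)          # dL/dW2
d_h = W2.T @ d_yhat                # dL/dh
d_a1 = d_h * h * (1 - h)           # dL/da1 (sigmoid derivative)
dW1 = np.outer(d_a1, x)            # dL/dW1
```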
SGD (Stochastic Gradient Descent): compute the
weight update w.r.t. one training example at a time,
cycling through the training examples in random order
over multiple epochs → slow convergence
(picking one random sample and updating one example
at a time is slow)
• mini-batch SGD (a batch of samples computed
simultaneously) is faster to complete one epoch; see
the sketch below
Optimizer
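A minimal sketch contrasting per-example SGD with mini-batch SGD on a linear least-squares model; the synthetic data, loss, and learning rate are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))                       # 1000 training examples
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + 0.1 * rng.normal(size=1000)

def minibatch_sgd(X, y, batch_size, epochs=5, lr=0.05):
    """Mini-batch SGD on a linear least-squares model;
    batch_size=1 recovers plain per-example SGD."""
    w = np.zeros(X.shape[1])
    n = len(X)
    for _ in range(epochs):
        order = rng.permutation(n)                   # random order each epoch
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            Xb, yb = X[idx], y[idx]
            grad = Xb.T @ (Xb @ w - yb) / len(idx)   # gradient averaged over batch
            w -= lr * grad                           # one update per mini-batch
    return w

w_sgd = minibatch_sgd(X, y, batch_size=1)    # 1000 updates per epoch, slow
w_mb = minibatch_sgd(X, y, batch_size=100)   # 10 updates per epoch
```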
18. The mini-batch partial_fit method is expected to be
called several times consecutively on different chunks of
a dataset, so as to implement out-of-core or online
learning. This is especially useful when the whole
dataset is too big to fit in memory at once.
Mini-batch vs. Epoch
* One epoch = one full pass over all of the training data
* The training data is split into multiple chunks according
  to the mini-batch size. Suppose there are 1000 samples
  in total:
  batch size = 100 → 10 chunks → 10 updates per epoch
  batch size = 10 → 100 chunks → 100 updates per epoch
* How should the batch size be set?
  Not too large; common values are 28, 32, 128, 256, …
mini-batch: the partial_fit method
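A minimal out-of-core sketch using scikit-learn's SGDClassifier and its partial_fit method; the synthetic data, chunk size, and epoch count are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 10))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

clf = SGDClassifier(loss="log_loss")
classes = np.unique(y)            # all class labels must be declared up front

batch_size = 100                  # 1000 samples -> 10 updates per epoch
for epoch in range(5):
    order = rng.permutation(len(X))
    for start in range(0, len(X), batch_size):
        idx = order[start:start + batch_size]
        # Each call fits on one chunk only, so the full dataset never
        # needs to be in memory at once (out-of-core / online learning).
        clf.partial_fit(X[idx], y[idx], classes=classes)

print(clf.score(X, y))
```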
20. To avoid getting stuck in a local minimum and to
further increase the training speed (a sketch of Adam
follows the list below):
Adaptive Learning Rate/Gradient algorithms
1. Adagrad
2. Momentum
3. RMSProp
4. Adam
5. …
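A minimal NumPy sketch of one of these, the Adam update rule, which combines a Momentum-style running mean of the gradient with RMSProp-style per-weight scaling. The β and ε values are Adam's common defaults; the learning rate and test function are assumptions for the demo.

```python
import numpy as np

def adam_step(w, grad, state, lr=0.01, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update: a Momentum-style running mean of the gradient (m)
    plus a per-weight adaptive step from the squared-gradient average (v)."""
    m, v, t = state
    t += 1
    m = beta1 * m + (1 - beta1) * grad            # 1st moment (Momentum)
    v = beta2 * v + (1 - beta2) * grad ** 2       # 2nd moment (RMSProp-like)
    m_hat = m / (1 - beta1 ** t)                  # bias correction
    v_hat = v / (1 - beta2 ** t)
    w = w - lr * m_hat / (np.sqrt(v_hat) + eps)   # adaptive per-weight step
    return w, (m, v, t)

# Usage: minimize f(w) = ||w||^2, whose gradient is 2w
w = np.array([1.0, -2.0])
state = (np.zeros_like(w), np.zeros_like(w), 0)
for _ in range(1000):
    w, state = adam_step(w, 2.0 * w, state)
print(w)  # close to [0, 0]
```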