Neural Turing Machine Tutorial
- 2. Outline
• Neurons -> neural networks
• Short-term memory -> from neural networks to deep learning
• Neural Turing Machine
- 7. Binary Classification: AND Gate
x1 x2 y
0 0 0
0 1 0
1 0 0
1 1 1
[Figure: the points (0,0), (0,1), (1,0) are labeled 0 and (1,1) is labeled 1; a single sigmoid neuron n with weights 20, 20 on x1, x2 and bias -30 computes y and separates them.]
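The single-neuron AND gate above can be checked directly; a minimal Python sketch (the weights 20, 20 and bias -30 are the values from the slide):

```python
import math

def neuron(x1, x2, w1, w2, wb):
    """Single sigmoid neuron: y = 1 / (1 + e^-(w1*x1 + w2*x2 + wb))."""
    n_in = w1 * x1 + w2 * x2 + wb
    return 1.0 / (1.0 + math.exp(-n_in))

# AND gate: weights 20, 20 and bias -30, as in the slide.
for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x1, x2, round(neuron(x1, x2, 20, 20, -30)))
```

Only the (1, 1) input pushes the pre-activation above zero, so only that row rounds to 1.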
- 8. Binary Classification: OR Gate
x1 x2 y
0 0 0
0 1 1
1 0 1
1 1 1
[Figure: (0,0) is labeled 0 and (0,1), (1,0), (1,1) are labeled 1; the same sigmoid neuron with weights 20, 20 but bias -10 separates them.]
- 10. Binary Classification: XOR Gate
[Figure: XOR is not linearly separable, so no single neuron can map (0,0) and (1,1) to 0 while mapping (0,1) and (1,0) to 1. Two hidden neurons solve it: n1 with weights 20, 20 and bias -30 (an AND gate) and n2 with weights 20, 20 and bias -10 (an OR gate), feeding an output neuron with weights -20 (from n1), 20 (from n2) and bias -10.]
x1 x2 n1 n2 y
0 0 0 0 0
0 1 0 1 1
1 0 0 1 1
1 1 1 1 0
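The whole construction in the truth table (AND and OR hidden neurons feeding an output neuron with weights -20, 20 and bias -10) can be sketched as:

```python
import math

def neuron(a, b, w1, w2, wb):
    """Sigmoid neuron on two inputs."""
    return 1.0 / (1.0 + math.exp(-(w1 * a + w2 * b + wb)))

def xor(x1, x2):
    n1 = neuron(x1, x2, 20, 20, -30)   # AND gate
    n2 = neuron(x1, x2, 20, 20, -10)   # OR gate
    # Output fires when OR is on but AND is off, i.e. XOR.
    return neuron(n1, n2, -20, 20, -10)

for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x1, x2, round(xor(x1, x2)))
```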
- 11. Neural Network
[Figure: a 2-2-2 fully connected network. Input layer: x, y (plus bias b); hidden layer: n11, n12; output layer: n21, n22 with targets z1, z2. Each connection carries a weight, e.g. W11,x or W21,11, and each neuron has a bias weight, e.g. W11,b or W21,b.]
- 17. Initialization
• Set every weight W to a random value between -N and N
• Within a layer, the W values must not all be identical (otherwise the neurons stay symmetric)
[Figure: the same 2-2-2 network as slide 11, with all weights to be initialized.]
- 51. Read Operation
Memory M (each column M(i) is one memory location):
M = [[1, 2, ...], [1, 1, ...], [2, 4, ...]]
Head location: w = (0.9, 0.1, 0, ..., 0) over locations 0, 1, ..., i, ..., n
Read vector: r = sum_i w(i) M(i) = (1.1, 1.0, 2.2)
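The read is just a weighted sum over memory locations; a minimal NumPy sketch of this exact example (memory laid out column-per-location, matching the erase/add examples later in the notes):

```python
import numpy as np

# Memory: each column M[:, i] is one memory location.
M = np.array([[1.0, 2.0],
              [1.0, 1.0],
              [2.0, 4.0]])
w = np.array([0.9, 0.1])  # head location: attention weights over locations

r = M @ w  # r = sum_i w(i) * M(:, i)
print(r)
```

With w = (0.9, 0.1) this blends the two locations into r = (1.1, 1.0, 2.2), as on the slide.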
- 55. Addressing Pipeline
[Figure: the controller outputs (including the memory key, e.g. (0, 0, 1)), the memory, and the previous head location (e.g. (0, 0, 0, 1, 0, 0)) pass through four stages (Content Addressing, then Interpolation, then Convolutional Shift, then Sharpening) to produce the new head location (e.g. (.45, .05, .5, 0, 0, 0)).]
- 56. Content Addressing
[Figure: with memory key (0, 0, 1), a uniform head location (.16, .16, .16, .16, .16, .16) becomes (.15, .10, .47, .08, .13, .17), peaking at the location whose content best matches the key.]
Finds the locations whose memory contents are closest to the key.
Parameter β: controls how concentrated the resulting weights are.
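A sketch of content addressing as described: cosine similarity between the key and each memory column, sharpened by β through a softmax. The memory values and key below are illustrative:

```python
import numpy as np

def content_addressing(M, key, beta):
    """w(i) = softmax over i of beta * cosine_similarity(key, M(:, i))."""
    sims = np.array([
        M[:, i] @ key / (np.linalg.norm(M[:, i]) * np.linalg.norm(key))
        for i in range(M.shape[1])
    ])
    e = np.exp(beta * sims)
    return e / e.sum()

M = np.array([[1.0, 2.0, 0.0],
              [1.0, 1.0, 0.0],
              [2.0, 4.0, 1.0]])
key = np.array([0.0, 0.0, 1.0])
print(content_addressing(M, key, beta=5.0))  # peaks at column 2, which matches the key
```

Larger β makes the distribution more peaked; β = 0 gives uniform weights.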
- 58. Convolutional Shift
[Figure: shift kernels over offsets (-1, 0, +1): (0, 1, 0) leaves the head location (.45, .05, .5, 0, 0, 0) unchanged, (0, 0, 1) rotates it by one position, and (.5, 0, .5) spreads it to (.025, .475, .025, .25, 0, .225).]
Shifts the values inside w along the memory locations.
Parameter s: controls the direction (and spread) of the shift.
- 59. Sharpening
[Figure: e.g. (0, .37, 0, .62, 0, 0) sharpens toward (0, 0, 0, 1, 0, 0), while a uniform (.16, .16, .16, .16, .16, .16) stays uniform.]
Makes the values in w more concentrated (or, with γ < 1, more spread out).
Parameter γ: controls the degree of concentration.
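A sketch of sharpening (raise each weight to the power γ, then renormalize); the value of γ below is illustrative:

```python
import numpy as np

def sharpen(w, gamma):
    """w(i) <- w(i)^gamma / sum_j w(j)^gamma.

    gamma > 1 concentrates the weights, gamma < 1 flattens them.
    """
    p = w ** gamma
    return p / p.sum()

w = np.array([0.45, 0.05, 0.5, 0.0, 0.0, 0.0])
print(sharpen(w, 10.0))  # mass concentrates on the largest entries
print(sharpen(w, 1.0))   # gamma = 1 leaves w unchanged
```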
- 62. Evolution of Recurrent Neural Networks
Recurrent Neural Network: short-term memory
Long Short-Term Memory: can control reading and writing of its memory
Neural Turing Machine: can control the position of the memory read/write heads more flexibly
- 64. Further Reading
• Machine learning
– Logistic Regression
• http://cpmarkchang.logdown.com/posts/189069-logisti-regression-model
– Overfitting and Regularization
• http://cpmarkchang.logdown.com/posts/193261-machine-learning-overfitting-and-regularization
– Model Selection
• http://cpmarkchang.logdown.com/posts/193914-machine-learning-model-selection
• Neural networks
– Neural Network Backward Propagation
• http://cpmarkchang.logdown.com/posts/277349-neural-network-backward-propagation
– Recurrent Neural Network
• http://cpmarkchang.logdown.com/posts/278457-neural-network-recurrent-neural-network
– Long Short Term Memory
• http://deeplearning.cs.cmu.edu/pdfs/Hochreiter97_lstm.pdf
• http://www.felixgers.de/papers/phd.pdf
– Neural Turing Machine
• http://arxiv.org/pdf/1410.5401.pdf
• http://awawfumin.blogspot.tw/2015/03/neural-turing-machines-implementation.html
- 67. Speaker Contact Information:
• Mark Chang
– facebook:
https://www.facebook.com/ckmarkoh.chang
– Github:http://github.com/ckmarkoh
– Blog:http://cpmarkchang.logdown.com
– email:ckmarkoh at gmail.com
• Fumin
– Github:https://github.com/fumin
– Email:awawfumin at gmail.com
Editor's Notes
- y = \frac{1}{ 1+e^{- ( w_{1} x_{1} + w_{2}x_{2}+w_{b} ) }}
& n_{in} = w_{1} x_{1} + w_{2}x_{2}+w_{b} \\
& n_{out} = \frac{1}{1+e^{-n_{in}}}
- w_{1}x_{1}+w_{2}x_{2}+w_{b} = 0
w_{1}x_{1}+w_{2}x_{2}+w_{b} < 0
w_{1}x_{1}+w_{2}x_{2}+w_{b} >0
- y = \frac{1}{1+e^{-(20x_{1}+20x_{2}-30)}}
20x_{1}+20x_{2}-30 = 0
- y = \frac{1}{1+e^{-(20x_{1}+20x_{2}-10)}}
20x_{1}+20x_{2}-10 = 0
- & J = -( z_{1} \log(n_{21(out)}) + (1-z_{1}) \log (1 -n_{21(out)} )) \\
&\mspace{30mu} -( z_{2} \log(n_{22(out)}) + (1-z_{2}) \log (1 -n_{22(out)} )) \\
& n_{out} \approx 0 \text{ and } z = 0 \Rightarrow J \approx 0 \\
& n_{out} \approx 1 \text{ and } z = 1 \Rightarrow J \approx 0 \\
& n_{out} \approx 0 \text{ and } z = 1 \Rightarrow J \approx \infty \\
& n_{out} \approx 1 \text{ and } z = 0 \Rightarrow J \approx \infty \\
- & w_{21,11} \leftarrow w_{21,11} - \eta \dfrac{\partial J}{\partial w_{21,11}} \\
& w_{21,12} \leftarrow w_{21,12} - \eta \dfrac{\partial J}{\partial w_{21,12}} \\
& w_{21,b} \leftarrow w_{21,b} - \eta \dfrac{\partial J}{\partial w_{21,b}} \\
& w_{22,11} \leftarrow w_{22,11} - \eta \dfrac{\partial J}{\partial w_{22,11}} \\
& w_{22,12} \leftarrow w_{22,12} - \eta \dfrac{\partial J}{\partial w_{22,12}} \\
& w_{22,b} \leftarrow w_{22,b} - \eta \dfrac{\partial J}{\partial w_{22,b}} \\
&w_{11,x} \leftarrow w_{11,x} - \eta \dfrac{\partial J}{\partial w_{11,x}} \\
&w_{11,y} \leftarrow w_{11,y} - \eta \dfrac{\partial J}{\partial w_{11,y}} \\
&w_{11,b} \leftarrow w_{11,b} - \eta \dfrac{\partial J}{\partial w_{11,b}} \\
&w_{12,x} \leftarrow w_{12,x} - \eta \dfrac{\partial J}{\partial w_{12,x}} \\
&w_{12,y} \leftarrow w_{12,y} - \eta \dfrac{\partial J}{\partial w_{12,y}} \\
&w_{12,b} \leftarrow w_{12,b} - \eta \dfrac{\partial J}{\partial w_{12,b}} \\
( - \dfrac{ \partial J}{\partial w_{0}} , - \dfrac{ \partial J}{\partial w_{1}} )
- \dfrac{\partial J}{\partial w_{21,11}} =
\dfrac{\partial J}{\partial n_{21(out)}}
\dfrac{\partial n_{21(out)}}{\partial n_{21(in)}}
\dfrac{\partial n_{21(in)}}{\partial w_{21,11}}
= (n_{21(out)}-z_{1}) n_{11(out)}
= \delta_{21(in)} n_{11(out)} \\
w_{21,11} \leftarrow w_{21,11} - \eta \delta_{21(in)} n_{11(out)}
- \dfrac{\partial J}{\partial w_{11,x}} =
\dfrac{\partial J}{\partial n_{11(in)}}
\dfrac{\partial n_{11(in)}}{\partial w_{11,x}}
= \delta_{11(in)} x \\
w_{11,x} \leftarrow w_{11,x} - \eta \delta_{11(in)} x
- & \delta_{11(in)}
= \dfrac{\partial J}{\partial n_{11(in)}}
= \dfrac{\partial J}{\partial n_{21(out)}} \dfrac{\partial n_{21(out)}}{\partial n_{11(in)}}
+ \dfrac{\partial J}{\partial n_{22(out)}} \dfrac{\partial n_{22(out)}}{\partial n_{11(in)}} \\
&= \dfrac{\partial J}{\partial n_{21(out)}}
\dfrac{\partial n_{21(out)}}{\partial n_{21(in)}}
\dfrac{\partial n_{21(in)}}{\partial n_{11(out)}}
\dfrac{\partial n_{11(out)}}{\partial n_{11(in)}}
+ \dfrac{\partial J}{\partial n_{22(out)}}
\dfrac{\partial n_{22(out)}}{\partial n_{22(in)}}
\dfrac{\partial n_{22(in)}}{\partial n_{11(out)}}
\dfrac{\partial n_{11(out)}}{\partial n_{11(in)}} \\
&= \Big( \dfrac{\partial J}{\partial n_{21(out)}}
\dfrac{\partial n_{21(out)}}{\partial n_{21(in)}}
\dfrac{\partial n_{21(in)}}{\partial n_{11(out)}}
+ \dfrac{\partial J}{\partial n_{22(out)}}
\dfrac{\partial n_{22(out)}}{\partial n_{22(in)}}
\dfrac{\partial n_{22(in)}}{\partial n_{11(out)}} \Big)
\dfrac{\partial n_{11(out)}}{\partial n_{11(in)}} \\
&= ( \delta_{21(in)} w_{21,11} + \delta_{22(in)} w_{22,11} )
\dfrac{\partial n_{11(out)}}{\partial n_{11(in)}}
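The δ recursions above can be checked numerically; a minimal NumPy sketch of the 2-2-2 sigmoid network with cross-entropy loss (random illustrative weights; the bias is folded in as an extra input fixed at 1):

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

rng = np.random.default_rng(0)
# 2-2-2 network: each weight matrix includes a bias column.
W1 = rng.uniform(-1, 1, (2, 3))  # hidden layer: n11, n12
W2 = rng.uniform(-1, 1, (2, 3))  # output layer: n21, n22

x = np.array([0.5, -0.3, 1.0])   # input (x, y, bias)
z = np.array([1.0, 0.0])         # targets z1, z2

# Forward pass.
h = sigmoid(W1 @ x)                   # n11(out), n12(out)
o = sigmoid(W2 @ np.append(h, 1.0))   # n21(out), n22(out)

# Backward pass: for sigmoid + cross-entropy, delta_in = out - z at the output,
# and delta_11(in) = (delta_21(in) w21,11 + delta_22(in) w22,11) * h * (1 - h).
delta_out = o - z                                  # delta_21(in), delta_22(in)
delta_hid = (W2[:, :2].T @ delta_out) * h * (1 - h)

grad_W2 = np.outer(delta_out, np.append(h, 1.0))   # dJ/dW2
grad_W1 = np.outer(delta_hid, x)                   # dJ/dW1
```

Each gradient entry can be compared against a finite-difference estimate of J to confirm the chain rule above.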
- & n_{in,t} = w_{c}x_{t}+ w_{p}n_{out,t-1} + w_{b} \\
& n_{out,t} = \frac{1}{1+e^{-n_{in,t}}} \\
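The recurrent neuron above, unrolled over a short sequence (the weights here are illustrative):

```python
import math

def rnn_step(x_t, n_out_prev, w_c, w_p, w_b):
    """One step: n_in,t = w_c*x_t + w_p*n_out,t-1 + w_b; n_out,t = sigmoid(n_in,t)."""
    n_in = w_c * x_t + w_p * n_out_prev + w_b
    return 1.0 / (1.0 + math.exp(-n_in))

# Each output feeds back in as the short-term memory of the next step.
n_out = 0.0
for x_t in [1.0, 0.0, 1.0]:
    n_out = rnn_step(x_t, n_out, w_c=2.0, w_p=1.5, w_b=-1.0)
    print(n_out)
```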
- & \delta_{in,0} = \dfrac{\partial J}{\partial n_{out,0}} \dfrac{\partial n_{out,0}}{\partial n_{in,0}}
= \delta_{out,0} \dfrac{\partial n_{out,0}}{\partial n_{in,0}} \\
& \delta_{in,0}
= \dfrac{\partial J}{\partial n_{out,1}} \dfrac{\partial n_{out,1}}{\partial n_{in,1}}
\dfrac{\partial n_{in,1}}{\partial n_{out,0}}
\dfrac{\partial n_{out,0}}{\partial n_{in,0}} \\
& = \delta_{out,1} \dfrac{\partial n_{out,1}}{\partial n_{in,1}}
\dfrac{\partial n_{in,1}}{\partial n_{out,0}}
\dfrac{\partial n_{out,0}}{\partial n_{in,0}} \\
& = \delta_{in,1} \dfrac{\partial n_{in,1}}{\partial n_{out,0}}
\dfrac{\partial n_{out,0}}{\partial n_{in,0}}
= \delta_{out,0} \dfrac{\partial n_{out,0}}{\partial n_{in,0}}
- \delta_{in,s}=
\begin{cases}
\dfrac{\partial J}{ \partial n_{out,s} }
\dfrac{ \partial n_{out,s}}{\partial n_{in,s} } & \text{if } s = t \\
\delta_{in,s+1}
\dfrac{ \partial n_{in,s+1}}{\partial n_{out,s} }
\dfrac{ \partial n_{out,s}}{\partial n_{in,s} }
& \text{otherwise}
\end{cases}
- \delta_{in,0} = \dfrac{\partial J}{\partial n_{in,0}} = \dfrac{\partial J}{\partial n_{out,t}} \dfrac{\partial n_{out,t} }{\partial n_{in,t}} \dfrac{\partial n_{in,t} }{\partial n_{out,t-1}} ...
\dfrac{\partial n_{in,1} }{\partial n_{out,0}} \dfrac{\partial n_{out,0} }{\partial n_{in,0}}
\delta_{in,0} = \delta_{out,t} \dfrac{\partial n_{out,t} }{\partial n_{in,t}} \dfrac{\partial n_{in,t} }{\partial n_{out,t-1}} ...
\dfrac{\partial n_{in,1} }{\partial n_{out,0}} \dfrac{\partial n_{out,0} }{\partial n_{in,0}}
- k_{out} = sigmoid(w_{k,x}x_{t}+w_{k,b}) \\
C_{write} = sigmoid(w_{cw,x}x_{t}+w_{cw,y}y_{t-1}+w_{cw,b}) \\
m_{in,t} = k_{out} \cdot C_{write}
- C_{forget}= sigmoid(w_{cf,x}x_{t} + w_{cf,y}y_{t-1} + w_{cf,b}) \\
m_{out,t} = m_{in,t} + C_{forget} \cdot m_{out,t-1}
- n_{out}=sigmoid(m_{out,t}) \\
C_{read}= sigmoid(w_{cr,x} x_{t} + w_{cr,y} y_{t-1} + w_{cr,b}) \\
C_{out} = n_{out} \cdot C_{read}
- \dfrac{\partial m_{out,t}}{\partial w_{k,x}} = \dfrac{\partial m_{in,t}}{\partial w_{k,x}} + C_{forget} \dfrac{\partial m_{out,t-1}}{\partial w_{k,x}}
- \begin{bmatrix}
r_{0} \\[0.3em]
r_{1} \\[0.3em]
r_{2} \\[0.3em]
\end{bmatrix}
=\begin{bmatrix}
1*0.9+2*0.1 \\[0.3em]
1*0.9+1*0.1 \\[0.3em]
2*0.9+4*0.1 \\[0.3em]
\end{bmatrix}
=
\begin{bmatrix}
1.1 \\[0.3em]
1.0 \\[0.3em]
2.2 \\[0.3em]
\end{bmatrix}
\textbf{r} \leftarrow \sum_{i}w(i)\textbf{M}(i)
&\sum_{i}w(i) = 1 \\
& 0 \leq w(i) \leq 1, \forall i \\
- \textbf{M}(i) \leftarrow (1-w(i) \textbf{e} ) \textbf{M}(i)
0 \leq e(j) \leq 1, \forall j
M=
\begin{bmatrix}
1(1-0.9) & 2(1-0.1) & 3 & ... \\[0.3em]
1 & 1 & 2 & ... \\[0.3em]
2(1-0.9) & 4(1-0.1) & 1 & ... \\[0.3em]
\end{bmatrix}
=\begin{bmatrix}
0.1 & 1.8 & 3 & ... \\[0.3em]
1 & 1 & 2 & ... \\[0.3em]
0.2 & 3.6 & 1 & ... \\[0.3em]
\end{bmatrix}
- \textbf{M}(i) \leftarrow \textbf{M}(i) + w(i) \textbf{a}
M=
\begin{bmatrix}
0.1+0.9 & 1.8+0.1 & 3 & ... \\[0.3em]
1.0+0.9 & 1.0+0.1 & 2 & ... \\[0.3em]
0.2 & 3.6 & 1 & ... \\[0.3em]
\end{bmatrix}
=\begin{bmatrix}
1.0 & 1.9 & 3 & ... \\[0.3em]
1.9 & 1.1 & 2 & ... \\[0.3em]
0.2 & 3.6 & 1 & ... \\[0.3em]
\end{bmatrix}
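Both write steps (erase, then add) can be sketched in NumPy; the memory, head location, erase vector e = (1, 0, 1) and add vector a = (1, 1, 0) below reproduce the numbers in the two examples above:

```python
import numpy as np

def write(M, w, e, a):
    """NTM-style write, with column i of M being memory location i.

    Erase: M(i) <- M(i) * (1 - w(i) * e), then add: M(i) <- M(i) + w(i) * a.
    """
    M = M * (1.0 - np.outer(e, w))   # erase step
    M = M + np.outer(a, w)           # add step
    return M

M = np.array([[1.0, 2.0, 3.0],
              [1.0, 1.0, 2.0],
              [2.0, 4.0, 1.0]])
w = np.array([0.9, 0.1, 0.0])  # head location
e = np.array([1.0, 0.0, 1.0])  # erase vector
a = np.array([1.0, 1.0, 0.0])  # add vector
print(write(M, w, e, a))
```

Locations with w(i) = 0 are left untouched, so the head location controls where the write lands.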
- w(i) \leftarrow \frac{e^{\beta K[\textbf{k},\textbf{M}(i)] } }{ \sum_{j} e^{ \beta K[\textbf{k},\textbf{M}(j)] } }
K[\textbf{u},\textbf{v} ] = \frac{ \textbf{u} \cdot \textbf{v} }{ |\textbf{u}| \cdot |\textbf{v}| }
- \textbf{w}_{t} \leftarrow g \textbf{w}_{t} + (1-g) \textbf{w}_{t-1}
- w(i) \leftarrow w(i-1) s(1) + w(i)s(0) + w(i+1)s(-1)
- w(i) \leftarrow \frac{w(i)^{\gamma}}{\sum_{j}w(j)^{\gamma}}