http://imatge-upc.github.io/telecombcn-2016-dlcv/
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.
5. 5
Multilayer Perceptron
Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”
The output depends ONLY on the current input.
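In a feed-forward network every output is a fixed function of the current input alone; as a minimal sketch (generic notation, not taken from the slides), a one-hidden-layer MLP computes
y = g( W_2 f( W_1 x + b_1 ) + b_2 )
with no term that carries information from earlier inputs.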
6. 6
Recurrent Neural Network (RNN)
Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”
The hidden layers and the output depend on previous states of the hidden layers.
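A minimal form of this recurrence, with generic weight names rather than the slide's labels:
h_t = f( W x_t + U h_{t-1} + b )
y_t = g( V h_t + c )
so the hidden state h_t, and through it the output y_t, depends on the whole input history via h_{t-1}.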
9. 9
Recurrent Neural Networks (RNN)
Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”
Each node represents a layer of neurons at a single timestep.
10. 10
Recurrent Neural Networks (RNN)
Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”
The input is a SEQUENCE x(t) of any length.
11. 11
Recurrent Neural Networks (RNN)
Common visual sequences: a still image, read as a spatial scan (zigzag, snake).
The input is a SEQUENCE x(t) of any length.
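As an illustration of the spatial-scan idea, a small Python sketch (hypothetical helper, not course code) that flattens an image into a snake-order sequence of pixels:

import numpy as np

def snake_scan(image):
    # Flatten a 2D array (H, W) into a 1D sequence, reading even rows
    # left-to-right and odd rows right-to-left (snake order).
    rows = [row if i % 2 == 0 else row[::-1] for i, row in enumerate(image)]
    return np.concatenate(rows)

x = np.arange(16).reshape(4, 4)   # toy 4x4 "image"
sequence = snake_scan(x)          # sequence of 16 values, fed to the RNN one at a time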
12. 12
Recurrent Neural Networks (RNN)
Common visual sequences: a video, read as a temporal sampling of frames.
The input is a SEQUENCE x(t) of any length.
13. 13
Recurrent Neural Networks (RNN)
Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”
Must learn the temporally shared weights w2, in addition to w1 & w3.
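The point is that the same recurrent weights are applied at every timestep. A minimal numpy sketch (weight names chosen to mirror the slide; shapes and activation are assumptions):

import numpy as np

def rnn_forward(xs, w1, w2, w3, h):
    # w1: input-to-hidden, w2: hidden-to-hidden (shared across time), w3: hidden-to-output
    ys = []
    for x in xs:                       # one iteration per timestep
        h = np.tanh(w1 @ x + w2 @ h)   # the SAME w1 and w2 are reused at every step
        ys.append(w3 @ h)              # the SAME w3 produces every output
    return ys, h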
14. 14
Bidirectional RNN (BRNN)
Alex Graves, “Supervised Sequence Labelling with Recurrent Neural Networks”
Must learn weights w2, w3, w4 & w5, in addition to w1 & w6.
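Schematically (generic notation, not the slide's weight labels), a BRNN runs one recurrence forward and another backward over the sequence and combines both hidden states at each timestep:
h_t^{fwd} = f( W x_t + U h_{t-1}^{fwd} )
h_t^{bwd} = f( W' x_t + U' h_{t+1}^{bwd} )
y_t = g( V h_t^{fwd} + V' h_t^{bwd} )
so every output sees both past and future context.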
19. Slide: Santi Pascual
RNN problems
Long-term memory vanishes because of the T nested multiplications by U.
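Written out with a generic activation f, the forward pass nests the same matrix U once per step:
h_t = f( W x_t + U f( W x_{t-1} + U f( W x_{t-2} + U h_{t-3} ) ) )
so whatever h_{t-T} contributed has passed through T multiplications by U (and T saturating nonlinearities), and its effect on h_t fades as T grows.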
20. Slide: Santi Pascual
RNN problems
During training, gradients may explode or vanish because of the temporal depth.
Example: back-propagation through time with 3 steps.
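A worked version of the 3-step example (standard back-propagation-through-time algebra, notation assumed): with the loss L_3 evaluated at step 3,
\partial L_3 / \partial h_1 = ( \partial L_3 / \partial h_3 ) ( \partial h_3 / \partial h_2 ) ( \partial h_2 / \partial h_1 )
where each factor \partial h_k / \partial h_{k-1} = \mathrm{diag}( f'(a_k) ) U, with a_k the pre-activation at step k. With T timesteps the gradient contains up to T-1 such factors, so it shrinks towards zero when the relevant singular values of U are below 1 and explodes when they are above 1.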
22. 22
Hochreiter, Sepp, and Jürgen Schmidhuber. "Long short-term memory." Neural computation 9, no.
8 (1997): 1735-1780.
Long Short-Term Memory (LSTM)
23. Figure: Christopher Olah, “Understanding LSTM Networks” (2015)
Long Short-Term Memory (LSTM)
Based on a standard RNN whose neurons activate with tanh...
24. 24
Long Short-Term Memory (LSTM)
C_t is the cell state, which flows through the entire chain...
Figure: Christopher Olah, “Understanding LSTM Networks” (2015)
25. 25
Long Short-Term Memory (LSTM)
...and is updated with a sum instead of a product. This avoids memory vanishing and exploding/vanishing gradients during backpropagation.
Figure: Christopher Olah, “Understanding LSTM Networks” (2015)
26. 26
Long Short-Term Memory (LSTM)
Three gates, each governed by a sigmoid unit (values between 0 and 1), control the flow of information into and out of the cell state.
Figure: Christopher Olah, “Understanding LSTM Networks” (2015)
27. 27
Long Short-Term Memory (LSTM)
Forget Gate:
Concatenate
Figure: Christopher Olah, “Understanding LSTM Networks” (2015) / Slide: Alberto Montes
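In the standard LSTM equations (matching the figure), the forget gate is
f_t = \sigma( W_f [h_{t-1}, x_t] + b_f )
where [h_{t-1}, x_t] is the concatenation of the previous hidden state and the current input, and f_t \in (0, 1) decides, per dimension, how much of the old cell state C_{t-1} is kept.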
28. 28
Long Short-Term Memory (LSTM)
Input Gate Layer
New contribution to cell state
Classic neuron
Figure: Christopher Olah, “Understanding LSTM Networks” (2015) / Slide: Alberto Montes
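In the same notation, the input gate and the candidate ("new contribution") are
i_t = \sigma( W_i [h_{t-1}, x_t] + b_i )
\tilde{C}_t = \tanh( W_C [h_{t-1}, x_t] + b_C )
where \tilde{C}_t is the classic tanh neuron of the slide and i_t controls how much of it enters the cell state.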
29. 29
Long Short-Term Memory (LSTM)
Update Cell State (memory):
Figure: Christopher Olah, “Understanding LSTM Networks” (2015) / Slide: Alberto Montes
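The update itself:
C_t = f_t \odot C_{t-1} + i_t \odot \tilde{C}_t
Because the old state enters through an elementwise, gated sum, \partial C_t / \partial C_{t-1} = \mathrm{diag}(f_t): there is no repeated multiplication by a recurrent weight matrix along the cell-state path, which is what keeps long-range gradients alive.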
30. 30
Long Short-Term Memory (LSTM)
Output Gate Layer
Output to next layer
Figure: Christopher Olah, “Understanding LSTM Networks” (2015) / Slide: Alberto Montes
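Finally, the output gate filters the cell state into the new hidden state that is passed to the next layer and to the next timestep:
o_t = \sigma( W_o [h_{t-1}, x_t] + b_o )
h_t = o_t \odot \tanh( C_t )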
31. 31
Long Short-Term Memory (LSTM)
Figure: Christopher Olah, “Understanding LSTM Networks” (2015) / Slide: Alberto Montes
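Putting the previous slides together, a compact numpy sketch of a single LSTM step (sigmoid/tanh gating as above; weight names and shapes are assumptions, not course code):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, Wf, Wi, Wc, Wo, bf, bi, bc, bo):
    # Each W* has shape (hidden, hidden + input); each b* has shape (hidden,).
    z = np.concatenate([h_prev, x_t])   # [h_{t-1}, x_t]
    f = sigmoid(Wf @ z + bf)            # forget gate
    i = sigmoid(Wi @ z + bi)            # input gate
    c_tilde = np.tanh(Wc @ z + bc)      # candidate cell state
    c = f * c_prev + i * c_tilde        # additive cell-state update
    o = sigmoid(Wo @ z + bo)            # output gate
    h = o * np.tanh(c)                  # new hidden state
    return h, c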
32. 32
Gated Recurrent Unit (GRU)
Cho, Kyunghyun, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger
Schwenk, and Yoshua Bengio. "Learning phrase representations using RNN encoder-decoder for
statistical machine translation." EMNLP 2014.
Similar performance to the LSTM with less computation.
33. Figure: Christopher Olah, “Understanding LSTM Networks” (2015) / Slide: Alberto Montes
Gated Recurrent Unit (GRU)
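The GRU merges the cell and hidden state and uses two gates, an update gate z_t and a reset gate r_t (standard equations, bias terms omitted):
z_t = \sigma( W_z [h_{t-1}, x_t] )
r_t = \sigma( W_r [h_{t-1}, x_t] )
\tilde{h}_t = \tanh( W [ r_t \odot h_{t-1}, x_t ] )
h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t
Fewer gates and a single state vector are where the computational saving over the LSTM comes from.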
34. 34
Applications: Machine Translation
Cho, Kyunghyun, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger
Schwenk, and Yoshua Bengio. "Learning phrase representations using RNN encoder-decoder for
statistical machine translation." arXiv preprint arXiv:1406.1078 (2014).
Language IN
Language OUT
35. 35
Applications: Image Classification
van den Oord, Aaron, Nal Kalchbrenner, and Koray Kavukcuoglu. "Pixel Recurrent Neural Networks."
arXiv preprint arXiv:1601.06759 (2016).
Row LSTM, Diagonal BiLSTM
Classification on MNIST
36. 36
Applications: Segmentation
Francesco Visin, Marco Ciccone, Adriana Romero, Kyle Kastner, Kyunghyun Cho, Yoshua Bengio,
Matteo Matteucci, Aaron Courville, “ReSeg: A Recurrent Neural Network-Based Model for Semantic
Segmentation”. DeepVision CVPRW 2016.