https://telecombcn-dl.github.io/2017-dlcv/
Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or image captioning.
Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)
1. [course site]
Elisa Sayrol Clols
Elisa.sayrol@upc.edu
Associate Professor
Universitat Politècnica de Catalunya
Technical University of Catalonia
Convolutional Neural
Networks
Day 1 Lecture 3
#DLUPC
2. Deep Neural Network
The i-th layer is defined by a matrix Wi and a vector bi, and its activation is simply a matrix product plus bi:
hi = Wi hi-1 + bi
Number of parameters to learn at the i-th layer: Ni x Ni-1 weights plus Ni biases, where Ni is the number of neurons in layer i.
(Figure: a fully connected network with inputs x1…x4 in Layer 0, hidden Layers 1 and 2, and outputs y1, y2 in Layer 3.)
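As a sanity check on that count, a minimal sketch; the hidden-layer widths below are illustrative assumptions, since the slide's figure only fixes 4 inputs and 2 outputs:

```python
# Hypothetical layer sizes: the figure fixes 4 inputs (x1..x4) and
# 2 outputs (y1, y2); the two hidden widths of 5 are made up.
layer_sizes = [4, 5, 5, 2]  # N0 (input) .. N3 (output)

def params_at_layer(n_in, n_out):
    """Weights Wi (n_out x n_in) plus biases bi (n_out)."""
    return n_out * n_in + n_out

total = sum(params_at_layer(layer_sizes[i - 1], layer_sizes[i])
            for i in range(1, len(layer_sizes)))
print(total)  # 25 + 30 + 12 = 67
```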
3. From Neurons to Convolutional Neural Networks
What if the input is an image?
4. From Neurons to Convolutional Neural Networks
For a 200x200 image, we have 4x10^4 neurons, each one with 4x10^4 inputs; that is 16x10^8 parameters, for just one layer (not counting the bias)!
Figure credit: Ranzato
5. From Neurons to Convolutional Neural Networks
For a 200x200 image, we have 4x10^4 neurons, each one with 10x10 "local connections" (also called the receptive field) as inputs; that is 4x10^6 parameters.
What else can we do to reduce the number of parameters?
Figure credit: Ranzato
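The two counts above follow directly from multiplying neurons by inputs per neuron; a quick check:

```python
H = W = 200
n_neurons = H * W                        # one neuron per pixel: 4x10^4

# Fully connected: every neuron sees every pixel.
fully_connected = n_neurons * (H * W)    # 16x10^8 weights
# Locally connected: every neuron sees only a 10x10 receptive field.
local = n_neurons * (10 * 10)            # 4x10^6 weights

print(fully_connected)  # 1600000000
print(local)            # 4000000
```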
6. From Neurons to Convolutional Neural Networks
Translation invariance: we can use the same parameters to capture a specific "feature" anywhere in the image, and try different sets of parameters to capture different features. These operations are equivalent to performing convolutions with different filters.
Example: with 100 different filters (or feature extractors) of size 10x10, the number of parameters is 10^4.
Figure credit: Ranzato
7. From Neurons to Convolutional Neural Networks
● sparse interactions
● parameter sharing
● equivariant representations
8. From Neurons to Convolutional Neural Networks
… and don't forget the activation function!
ReLU, PReLU
Figure credit: Ranzato
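The two activations on the slide can be sketched in a few lines; the PReLU slope of 0.25 below is only an illustrative initial value (in practice it is learned per channel):

```python
import numpy as np

def relu(x):
    """ReLU: pass positives through, clamp negatives to zero."""
    return np.maximum(0.0, x)

def prelu(x, a=0.25):
    """PReLU: like ReLU, but with a learned slope `a` on the negative side."""
    return np.where(x > 0, x, a * x)

x = np.array([-2.0, -0.5, 0.0, 1.5])
print(relu(x))   # [0.  0.  0.  1.5]
print(prelu(x))  # [-0.5  -0.125  0.  1.5]
```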
9. From Neurons to Convolutional Neural Networks
Most ConvNets use pooling (or subsampling) to reduce dimensionality and provide invariance to small local changes.
Pooling options:
• Max
• Average
• Stochastic pooling
Figure credit: Ranzato
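Max and average pooling over non-overlapping 2x2 windows (the most common setting) can be sketched with a reshape trick:

```python
import numpy as np

def pool2x2(x, mode="max"):
    """Non-overlapping 2x2 pooling (stride 2) on a 2D array."""
    h, w = x.shape
    blocks = x.reshape(h // 2, 2, w // 2, 2)   # group into 2x2 windows
    if mode == "max":
        return blocks.max(axis=(1, 3))
    return blocks.mean(axis=(1, 3))

x = np.array([[1., 2., 5., 6.],
              [3., 4., 7., 8.],
              [0., 1., 2., 3.],
              [1., 0., 4., 5.]])
print(pool2x2(x, "max"))   # [[4. 8.] [1. 5.]]
print(pool2x2(x, "mean"))  # [[2.5 6.5] [0.5 3.5]]
```

Stochastic pooling instead samples one value per window with probability proportional to its magnitude, which is why it is listed separately above.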
10. From Neurons to Convolutional Neural Networks
Padding (P): when computing the convolution at the borders, you may add values around the input so that the filter still fits. When the added values are zero, which is quite common, the technique is called zero-padding. When padding is not used, the output size is reduced.
Filter size FxF = 3x3
11. From Neurons to Convolutional Neural Networks
The same padding idea, illustrated with a larger filter: FxF = 5x5. A 5x5 filter needs two rows and columns of padding (P = 2) to keep the output size unchanged.
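Zero-padding itself is a one-liner with NumPy's `np.pad`:

```python
import numpy as np

x = np.arange(9.0).reshape(3, 3)
# P = 1: one border of zeros on every side, as for a 3x3 filter.
padded = np.pad(x, pad_width=1, mode="constant", constant_values=0.0)
print(padded.shape)  # (5, 5)
```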
12. From Neurons to Convolutional Neural Networks
Stride (S): when computing the convolution or another operation, like pooling, we may decide to slide the window not pixel by pixel but every 2 or more pixels. The number of pixels that we skip at each step is the value of the stride. It can be used to reduce the dimensionality of the output.
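Padding and stride together determine the output size through the standard formula (N - F + 2P) / S + 1; a quick sketch tying the last three slides together:

```python
def conv_output_size(n, f, p, s):
    """Spatial output size of a convolution: (N - F + 2P) / S + 1."""
    assert (n - f + 2 * p) % s == 0, "filter does not tile the input evenly"
    return (n - f + 2 * p) // s + 1

print(conv_output_size(200, 10, 0, 1))  # 191: no padding shrinks the output
print(conv_output_size(200, 3, 1, 1))   # 200: zero-padding preserves the size
print(conv_output_size(200, 2, 0, 2))   # 100: stride 2 halves the dimension
```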
13. References
Kunihiko Fukushima, "Neocognitron: A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position", Biological Cybernetics, 36(4), 1980, 193-202. NHK Broadcasting Science Research Laboratories, Kinuta, Setagaya, Tokyo, Japan.
Ian Goodfellow, Yoshua Bengio and Aaron Courville, Deep Learning, An MIT Press book. http://www.deeplearningbook.org/contents/convnets.html
Stanford course on Convolutional Neural Networks for Visual Recognition (2017): http://cs231n.github.io/convolutional-networks/
2016 course: https://youtu.be/LxfUGhug-iQ