SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Convolutional Neural
Network and Its
Applications
144133E – M.G.K.C.PIYARTHNA
What is CNN?
In machine learning, a convolutional neural network is a class of deep,
feed-forward artificial neural networks that has successfully been
applied fpr analyzing visual imagery.
In the field of ComputerVision and Natural Language Processing, there
can be found more influential innovations by using the concept of
convolutional neural network in Machine Language.
Motivation
• Convolutional Neural Networks (CNN) are biologically-inspired
variants of MLPs. From Hubel andWiesel’s early work on the cat’s
visual cortex ,we know the visual cortex contains a complex
arrangement of cells.These cells are sensitive to small sub-regions of
the visual field, called a receptive field.The sub-regions are tiled to
cover the entire visual field.These cells act as local filters over the
input space and are well-suited to exploit the strong spatially local
correlation present in natural images.
• The animal visual cortex being the most powerful visual processing
system in existence, it seems natural to emulate its behavior
CNN Architecture
ConvNet Architectures
• LeNet (1990s)
• AlexNet (2012)
• ZF Net (2013)
• GoogLeNet (2014)
• VGGNet (2014)
• ResNets (2015)
• DenseNet (August 2016)
Four main operations in the ConvNet
• Convolution
• Non Linearity
• Pooling or Sub Sampling
• Classification (Fully Connected Layer)
• An Image is a matrix of pixel
values
• Channel is a conventional term
used to refer to a certain
component of an image.
• A grayscale image, on the other
hand, has just one channel.
The Convolution Step
• The primary purpose of
Convolution in case of a
ConvNet is to extract features
from the input image.
• In CNN terminology, the 3×3 matrix is called a ‘filter‘ or ‘kernel’ or
‘feature detector’
• the matrix formed by sliding the filter over the image and computing
the dot product is called the ‘Convolved Feature’ or ‘Activation Map’
or the ‘Feature Map‘.
• It is important to note that filters acts as feature detectors from the
original input image.
• In practice, a CNN learns the values of these filters on its own during
the training process.The more number of filters we have, the more
image features get extracted and the better our network becomes at
recognizing patterns in unseen images.
• The size of the Feature Map (Convolved Feature) is controlled by
three parameters
• Depth: Depth corresponds to the number of filters we use for the
convolution operation.
• Stride: Stride is the number of pixels by which we slide our filter
matrix over the input matrix.
• Zero-padding: Sometimes, it is convenient to pad the input
matrix with zeros around the border, so that we can apply the filter to
bordering elements of our input image matrix.
Introducing Non Linearity (ReLU)
• ReLU is an element wise
operation (applied per pixel)
and replaces all negative pixel
values in the feature map by
zero
• Convolution is a linear
operation – element wise
matrix multiplication and
addition, so we account for
non-linearity by introducing a
non-linear function like ReLU
The Pooling Step
• Spatial Pooling (also called
subsampling or downsampling)
reduces the dimensionality of
each feature map but
retains the most
important information. Spatial
Pooling can be of different
types: Max, Average, Sum etc.
• In case of Max Pooling, we
define a spatial neighborhood
(for example, a 2×2 window)
Fully Connected Layer
• The term “Fully Connected”
implies that every neuron in the
previous layer is connected to
every neuron on the next layer.
• The output from the convolutional
and pooling layers represent high-
level features of the input image.
• The purpose of the Fully
Connected layer is to use these
features for classifying the input
image into various classes based
on the training dataset.
Putting it all together – Training using
Backpropagation
• input image is a boat, the
target probability is 1 for Boat
class and 0 for other
three classes
• Input Image = Boat
• TargetVector = [0, 0, 1, 0]
Putting it all together – Training using
Backpropagation
• Step1:We initialize all filters and parameters
• Step2: The network takes a training image as input, goes through the forward propagation step
(convolution, ReLU and pooling operations along with forward propagation in the FullyConnected
layer) and finds the output probabilities for each class
• Lets say the output probabilities for the boat image above are [0.2, 0.4, 0.1, 0.3]
• Step3: Calculate the total error at the output layer (summation over all 4 classes)
• Total Error = ∑ ½ (target probability – output probability) ²
• Step4:The weights are adjusted in proportion to their contribution to the total error.
• When the same image is input again, output probabilities might now be [0.1, 0.1, 0.7, 0.1], which is
closer to the target vector [0, 0, 1, 0].
• This means that the network has learnt to classify this particular image correctly by adjusting its
weights / filters such that the output error is reduced.
CNN Applications
• computer vision
face recognition, scene labeling, image classification, action
recognition, human pose estimation and document analysis
• natural language processing
field of speech recognition and text classification
Face recognition
• Identifying all the faces in the
picture
• Focusing on each face despite
bad lighting or different pose
• Identifying unique features
• Comparing identified features
to existing database and
determining the person's name
Scene labeling
• Real-time scene parsing in
natural conditions.
• Training on SiftFlow dataset(33
classes).
• Display one label per
component in the final
prediction
• Can also used Barcelona
Dataset(170 classes) , Stanford
Background Dataset(8 classes)
Speech Recognition
• Noise robustness
• Distant speech recognition
• Low-footprint models
• Channel-mismatched training-test conditions
Do you know?
• Facebook uses neural nets for
their automatic tagging
algorithms
• Google for their photo search
• Amazon for their product
recommendations
• Pinterest for their home feed
personalization
• Instagram for their search
infrastructure
Q & A?
Convolutional Neural Network and Its Applications

Weitere ähnliche Inhalte

Was ist angesagt?

Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural NetworksAshray Bhandare
 
Introduction to CNN
Introduction to CNNIntroduction to CNN
Introduction to CNNShuai Zhang
 
CNN Machine learning DeepLearning
CNN Machine learning DeepLearningCNN Machine learning DeepLearning
CNN Machine learning DeepLearningAbhishek Sharma
 
Convolutional Neural Network (CNN) - image recognition
Convolutional Neural Network (CNN)  - image recognitionConvolutional Neural Network (CNN)  - image recognition
Convolutional Neural Network (CNN) - image recognitionYUNG-KUEI CHEN
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep LearningOswald Campesato
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Basit Rafiq
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Suraj Aavula
 
Convolutional Neural Network
Convolutional Neural NetworkConvolutional Neural Network
Convolutional Neural NetworkVignesh Suresh
 
CNN and its applications by ketaki
CNN and its applications by ketakiCNN and its applications by ketaki
CNN and its applications by ketakiKetaki Patwari
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...Simplilearn
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural networkFerdous ahmed
 
Image classification using CNN
Image classification using CNNImage classification using CNN
Image classification using CNNNoura Hussein
 
Overview of Convolutional Neural Networks
Overview of Convolutional Neural NetworksOverview of Convolutional Neural Networks
Overview of Convolutional Neural Networksananth
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Muhammad Haroon
 
Deep Learning - Overview of my work II
Deep Learning - Overview of my work IIDeep Learning - Overview of my work II
Deep Learning - Overview of my work IIMohamed Loey
 
Deep Learning - CNN and RNN
Deep Learning - CNN and RNNDeep Learning - CNN and RNN
Deep Learning - CNN and RNNAshray Bhandare
 

Was ist angesagt? (20)

Convolutional Neural Networks
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
 
Introduction to CNN
Introduction to CNNIntroduction to CNN
Introduction to CNN
 
CNN Machine learning DeepLearning
CNN Machine learning DeepLearningCNN Machine learning DeepLearning
CNN Machine learning DeepLearning
 
Convolutional Neural Network (CNN) - image recognition
Convolutional Neural Network (CNN)  - image recognitionConvolutional Neural Network (CNN)  - image recognition
Convolutional Neural Network (CNN) - image recognition
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
 
Convolutional Neural Network
Convolutional Neural NetworkConvolutional Neural Network
Convolutional Neural Network
 
CNN and its applications by ketaki
CNN and its applications by ketakiCNN and its applications by ketaki
CNN and its applications by ketaki
 
CNN Tutorial
CNN TutorialCNN Tutorial
CNN Tutorial
 
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
 
cnn ppt.pptx
cnn ppt.pptxcnn ppt.pptx
cnn ppt.pptx
 
Image classification using CNN
Image classification using CNNImage classification using CNN
Image classification using CNN
 
Overview of Convolutional Neural Networks
Overview of Convolutional Neural NetworksOverview of Convolutional Neural Networks
Overview of Convolutional Neural Networks
 
Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)Convolutional Neural Network (CNN)
Convolutional Neural Network (CNN)
 
Cnn method
Cnn methodCnn method
Cnn method
 
AlexNet
AlexNetAlexNet
AlexNet
 
Deep Learning - Overview of my work II
Deep Learning - Overview of my work IIDeep Learning - Overview of my work II
Deep Learning - Overview of my work II
 
Deep Learning - CNN and RNN
Deep Learning - CNN and RNNDeep Learning - CNN and RNN
Deep Learning - CNN and RNN
 

Ähnlich wie Convolutional Neural Network and Its Applications

Introduction to Convolutional Neural Networks
Introduction to Convolutional Neural NetworksIntroduction to Convolutional Neural Networks
Introduction to Convolutional Neural NetworksParrotAI
 
Automatic Attendace using convolutional neural network Face Recognition
Automatic Attendace using convolutional neural network Face RecognitionAutomatic Attendace using convolutional neural network Face Recognition
Automatic Attendace using convolutional neural network Face Recognitionvatsal199567
 
build a Convolutional Neural Network (CNN) using TensorFlow in Python
build a Convolutional Neural Network (CNN) using TensorFlow in Pythonbuild a Convolutional Neural Network (CNN) using TensorFlow in Python
build a Convolutional Neural Network (CNN) using TensorFlow in PythonKv Sagar
 
intro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxintro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxssuser3aa461
 
Facial Emotion Detection on Children's Emotional Face
Facial Emotion Detection on Children's Emotional FaceFacial Emotion Detection on Children's Emotional Face
Facial Emotion Detection on Children's Emotional FaceTakrim Ul Islam Laskar
 
Deep Computer Vision - 1.pptx
Deep Computer Vision - 1.pptxDeep Computer Vision - 1.pptx
Deep Computer Vision - 1.pptxJawadHaider36
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspectiveAnirban Santara
 
Deep Neural Network DNN.docx
Deep Neural Network DNN.docxDeep Neural Network DNN.docx
Deep Neural Network DNN.docxjaffarbikat
 
Speech Processing with deep learning
Speech Processing  with deep learningSpeech Processing  with deep learning
Speech Processing with deep learningMohamed Essam
 
DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learningPoo Kuan Hoong
 
Autoencoders for image_classification
Autoencoders for image_classificationAutoencoders for image_classification
Autoencoders for image_classificationCenk Bircanoğlu
 
ML in Astronomy - Workshop 1.pptx
ML in Astronomy - Workshop 1.pptxML in Astronomy - Workshop 1.pptx
ML in Astronomy - Workshop 1.pptxAstronomyClubIITBHU
 
11_Saloni Malhotra_SummerTraining_PPT.pptx
11_Saloni Malhotra_SummerTraining_PPT.pptx11_Saloni Malhotra_SummerTraining_PPT.pptx
11_Saloni Malhotra_SummerTraining_PPT.pptxSaloniMalhotra23
 
A Survey of Convolutional Neural Networks
A Survey of Convolutional Neural NetworksA Survey of Convolutional Neural Networks
A Survey of Convolutional Neural NetworksRimzim Thube
 

Ähnlich wie Convolutional Neural Network and Its Applications (20)

Deep Learning
Deep LearningDeep Learning
Deep Learning
 
Introduction to Convolutional Neural Networks
Introduction to Convolutional Neural NetworksIntroduction to Convolutional Neural Networks
Introduction to Convolutional Neural Networks
 
cnn.pdf
cnn.pdfcnn.pdf
cnn.pdf
 
Automatic Attendace using convolutional neural network Face Recognition
Automatic Attendace using convolutional neural network Face RecognitionAutomatic Attendace using convolutional neural network Face Recognition
Automatic Attendace using convolutional neural network Face Recognition
 
build a Convolutional Neural Network (CNN) using TensorFlow in Python
build a Convolutional Neural Network (CNN) using TensorFlow in Pythonbuild a Convolutional Neural Network (CNN) using TensorFlow in Python
build a Convolutional Neural Network (CNN) using TensorFlow in Python
 
intro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptxintro-to-cnn-April_2020.pptx
intro-to-cnn-April_2020.pptx
 
Facial Emotion Detection on Children's Emotional Face
Facial Emotion Detection on Children's Emotional FaceFacial Emotion Detection on Children's Emotional Face
Facial Emotion Detection on Children's Emotional Face
 
Cnn
CnnCnn
Cnn
 
Deep Computer Vision - 1.pptx
Deep Computer Vision - 1.pptxDeep Computer Vision - 1.pptx
Deep Computer Vision - 1.pptx
 
deep learning
deep learningdeep learning
deep learning
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
 
Deep Neural Network DNN.docx
Deep Neural Network DNN.docxDeep Neural Network DNN.docx
Deep Neural Network DNN.docx
 
Speech Processing with deep learning
Speech Processing  with deep learningSpeech Processing  with deep learning
Speech Processing with deep learning
 
Mnist report
Mnist reportMnist report
Mnist report
 
DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learning
 
Autoencoders for image_classification
Autoencoders for image_classificationAutoencoders for image_classification
Autoencoders for image_classification
 
ML in Astronomy - Workshop 1.pptx
ML in Astronomy - Workshop 1.pptxML in Astronomy - Workshop 1.pptx
ML in Astronomy - Workshop 1.pptx
 
11_Saloni Malhotra_SummerTraining_PPT.pptx
11_Saloni Malhotra_SummerTraining_PPT.pptx11_Saloni Malhotra_SummerTraining_PPT.pptx
11_Saloni Malhotra_SummerTraining_PPT.pptx
 
A Survey of Convolutional Neural Networks
A Survey of Convolutional Neural NetworksA Survey of Convolutional Neural Networks
A Survey of Convolutional Neural Networks
 
Mnist report ppt
Mnist report pptMnist report ppt
Mnist report ppt
 

Kürzlich hochgeladen

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Kürzlich hochgeladen (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

Convolutional Neural Network and Its Applications

  • 1. Convolutional Neural Network and Its Applications 144133E – M.G.K.C.PIYARTHNA
  • 2. What is CNN? In machine learning, a convolutional neural network is a class of deep, feed-forward artificial neural networks that has successfully been applied fpr analyzing visual imagery. In the field of ComputerVision and Natural Language Processing, there can be found more influential innovations by using the concept of convolutional neural network in Machine Language.
  • 3. Motivation • Convolutional Neural Networks (CNN) are biologically-inspired variants of MLPs. From Hubel andWiesel’s early work on the cat’s visual cortex ,we know the visual cortex contains a complex arrangement of cells.These cells are sensitive to small sub-regions of the visual field, called a receptive field.The sub-regions are tiled to cover the entire visual field.These cells act as local filters over the input space and are well-suited to exploit the strong spatially local correlation present in natural images. • The animal visual cortex being the most powerful visual processing system in existence, it seems natural to emulate its behavior
  • 5. ConvNet Architectures • LeNet (1990s) • AlexNet (2012) • ZF Net (2013) • GoogLeNet (2014) • VGGNet (2014) • ResNets (2015) • DenseNet (August 2016)
  • 6. Four main operations in the ConvNet • Convolution • Non Linearity • Pooling or Sub Sampling • Classification (Fully Connected Layer)
  • 7. • An Image is a matrix of pixel values • Channel is a conventional term used to refer to a certain component of an image. • A grayscale image, on the other hand, has just one channel.
  • 8. The Convolution Step • The primary purpose of Convolution in case of a ConvNet is to extract features from the input image.
  • 9. • In CNN terminology, the 3×3 matrix is called a ‘filter‘ or ‘kernel’ or ‘feature detector’ • the matrix formed by sliding the filter over the image and computing the dot product is called the ‘Convolved Feature’ or ‘Activation Map’ or the ‘Feature Map‘. • It is important to note that filters acts as feature detectors from the original input image. • In practice, a CNN learns the values of these filters on its own during the training process.The more number of filters we have, the more image features get extracted and the better our network becomes at recognizing patterns in unseen images.
  • 10.
  • 11. • The size of the Feature Map (Convolved Feature) is controlled by three parameters • Depth: Depth corresponds to the number of filters we use for the convolution operation. • Stride: Stride is the number of pixels by which we slide our filter matrix over the input matrix. • Zero-padding: Sometimes, it is convenient to pad the input matrix with zeros around the border, so that we can apply the filter to bordering elements of our input image matrix.
  • 12. Introducing Non Linearity (ReLU) • ReLU is an element wise operation (applied per pixel) and replaces all negative pixel values in the feature map by zero • Convolution is a linear operation – element wise matrix multiplication and addition, so we account for non-linearity by introducing a non-linear function like ReLU
  • 13. The Pooling Step • Spatial Pooling (also called subsampling or downsampling) reduces the dimensionality of each feature map but retains the most important information. Spatial Pooling can be of different types: Max, Average, Sum etc. • In case of Max Pooling, we define a spatial neighborhood (for example, a 2×2 window)
  • 14. Fully Connected Layer • The term “Fully Connected” implies that every neuron in the previous layer is connected to every neuron on the next layer. • The output from the convolutional and pooling layers represent high- level features of the input image. • The purpose of the Fully Connected layer is to use these features for classifying the input image into various classes based on the training dataset.
  • 15. Putting it all together – Training using Backpropagation • input image is a boat, the target probability is 1 for Boat class and 0 for other three classes • Input Image = Boat • TargetVector = [0, 0, 1, 0]
  • 16. Putting it all together – Training using Backpropagation • Step1:We initialize all filters and parameters • Step2: The network takes a training image as input, goes through the forward propagation step (convolution, ReLU and pooling operations along with forward propagation in the FullyConnected layer) and finds the output probabilities for each class • Lets say the output probabilities for the boat image above are [0.2, 0.4, 0.1, 0.3] • Step3: Calculate the total error at the output layer (summation over all 4 classes) • Total Error = ∑ ½ (target probability – output probability) ² • Step4:The weights are adjusted in proportion to their contribution to the total error. • When the same image is input again, output probabilities might now be [0.1, 0.1, 0.7, 0.1], which is closer to the target vector [0, 0, 1, 0]. • This means that the network has learnt to classify this particular image correctly by adjusting its weights / filters such that the output error is reduced.
  • 17. CNN Applications • computer vision face recognition, scene labeling, image classification, action recognition, human pose estimation and document analysis • natural language processing field of speech recognition and text classification
  • 18. Face recognition • Identifying all the faces in the picture • Focusing on each face despite bad lighting or different pose • Identifying unique features • Comparing identified features to existing database and determining the person's name
  • 19. Scene labeling • Real-time scene parsing in natural conditions. • Training on SiftFlow dataset(33 classes). • Display one label per component in the final prediction • Can also used Barcelona Dataset(170 classes) , Stanford Background Dataset(8 classes)
  • 20. Speech Recognition • Noise robustness • Distant speech recognition • Low-footprint models • Channel-mismatched training-test conditions
  • 21. Do you know? • Facebook uses neural nets for their automatic tagging algorithms • Google for their photo search • Amazon for their product recommendations • Pinterest for their home feed personalization • Instagram for their search infrastructure