SlideShare ist ein Scribd-Unternehmen logo
1 von 59
Downloaden Sie, um offline zu lesen
AI and Deep Learning
Biswa Gourav Singh
Deep Learning Practice
AMD
MS Clemson University, 2010
Dr. Subrat Panda
Principal Architect AI and DS
Capillary Technologies
IIT Kharagpur 2002
IIT Kharagpur 2009 - PhD
Brief Introduction about Subrat
● BTech ( 2002) , PhD (2009) – CSE, IIT Kharagpur
● Synopsys (EDA), IBM (CPU), NVIDIA (GPU), Taro (Full Stack Engineer), Capillary
(Principal Architect - AI)
● Applying AI to Retail
● Co-Founded IDLI (for social good) with Prof. Amit Sethi (IIT Bombay), Jacob
Minz (Synopsys) and Biswa Gourav Singh (AMD)
● https://www.facebook.com/groups/idliai/
● Linked In - https://www.linkedin.com/in/subratpanda/
● Facebook - https://www.facebook.com/subratpanda
● Twitter - @subratpanda
Brief Introduction about Biswa
● BTech ( 2005) , MS (2010) – Clemson University
● Synopsys (EDA), IBM (CPU), ARM, Qualcomm, HP, AMD
● Deep Learning Performance
● Co-Founded IDLI (for social good) with Prof. Amit Sethi (IIT Bombay), Jacob
Minz (Synopsys) and Subrat Panda (Capillary)
● https://www.facebook.com/groups/idliai/
● Linked In - https://www.linkedin.com/in/biswagsingh/
● Facebook - https://www.facebook.com/biswa.singh
Agenda
1. Deep Learning: Overview - 5 minutes - Subrat
2. Deep Learning and Industries - 10 minutes - Subrat
3. How to do Deep Learning - 10 minutes - Subrat
4. Tensor-Flow, a DL framework - 20 minutes - Biswa
5. CNN Introduction - 10 minutes - Biswa
6. Transfer Learning - 15 minutes - Subrat
7. Building Deep Learning Application - 10 minutes - Subrat
a. Building a Product in the age of APIs - Subrat
8. Challenges of Deep Learning - 5 minutes - Subrat / Biswa ( Performance )
9. Q and A - 10 minutes
The AI landscape
Deep Learning: An Overview
Courtesy: NVIDIA
Machine Learning – http://techleer.com
Deep Learning: An Overview
The Neuron
• The neuron is the basic information processing unit of a
NN. It consists of:
1 A set of synapses or connecting links, each link
characterized by a weight: W1
, W2
, …, Wm
2 An adder function (linear combiner) which computes the
weighted sum of the inputs:
2 Activation function (squashing function) for limiting
the amplitude of the output of the neuron.
The Neuron
Input
signal
Synaptic
weights
Summing
function
Bias
b
Activation
function
Local
Field
v
Output
y
x1
x2
xm
w2
wm
w1
Is a single perceptron or layer enough ?
- No
- Perceptrons learn linearly separable functions
- Cannot learn a XOR function
- Minsky & Papert (1969) offered solution to XOR problem
by combining perceptron unit responses using a second layer
of Units. Piecewise linear classification using an MLP with
threshold (perceptron) unit
MLP - Multi-Layer Perceptron
• No connections within a layer
• No direct connections between input and output layers
• Fully connected between layers
• Often more than 3 layers
• Number of output units need not equal number of input units
• Number of hidden units per layer can be more or less than
input or output units
3 Layer Neural Network
What do each layer do ?
1st layer draws linear
boundaries
2nd layer combines
the boundaries
3rd layer can
generate arbitrarily
complex boundaries
Forward Activity - Backward Error
Back Propagation
Forward pass phase: computes ‘functional signal’, feed forward
propagation of input pattern signals through network.
Backward pass phase: computes ‘error signal’, propagates
the error backwards through network starting at output units
(where the error is the difference between actual and desired
output values)
Deep Learning and Industries
HORIZONTAL APPLICATIONS
Deep Learning
MARKETING APPLICATIONS
Predicting
Lifetime Value
(LTV)
Wallet share
estimation
Churn
Customer
segmentation
Product mix Cross selling
Recommendatio
n algorithms
Up-selling
Channel
optimization
Discount
targeting
Reactivation
likelihood
Adwords
optimization and
ad buying
RISK APPLICATIONS
Credit risk
Treasury or
currency risk
Fraud
detection
Accounts
Payable
Recovery
Anti-money
laundering
HUMAN RESOURCES APPLICATIONS
Resume
screening
Employee churn
Training
recommendatio
n
Talent
management
MORE HORIZONTAL APPLICATIONS…
Sales
• Lead prioritization
Customer support
• Call routing
• Call center message optimization
• Call center volume forecasting
Logistics
• Demand forecasting
VERTICAL APPLICATIONS
HEALTHCARE APPLICATIONS
Claims review
prioritization
Medicare/medicai
d fraud
Medical resources
allocation
Alerting and
diagnostics from
real-time patient
data
Prescription
compliance
Physician attrition
Survival analysis
Medication
(dosage)
effectiveness
Readmission risk
RETAIL APPLICATIONS
Price
optimization
Location of new
stores
Product layout in
stores
Merchandizing
Inventory
Management
(how many
units)
Shrinkage
analytics
Warranty
Analytics
Market Basket
Analysis
Cannibalization
Analysis
Next Best Offer
Analysis
In store traffic
patterns
TRAVEL APPLICATIONS
Aircraft
scheduling
Seat/gate
management
Air crew
scheduling
Dynamic
pricing
Customer
complaint
resolution
Maintenance
optimization
Tourism
forecasting
Life Sciences
Identifying
biomarkers
Drug/chemical
discovery
Analyzing study
results
Identifying negative
responses
Diagnostic test
development
Diagnostic targeting
Predicting drug
demand
Prescription
adherence
Putative safety
signals
Social media
marketing
Image analysis
Clinical trial design
COGS optimization
Insurance
Claims prediction
Claims handling
Price sensitivity
Investments
Agent & branch
performance
DM, product mix
Hospitality
Dynamic pricing
Promos/upgrades/
offers
Table
management &
reservations
Workforce
management
Manufacturing
Failure analysis
Quality
management
Inventory
management
Warranty/pricing
Direct
Marketing
Response rates
Segmentations for
mailings
Reactivation
likelihood
RFM
Discount targeting
Phone marketing
Email Marketing
Construction
Contractor
performance
Design issue
prediction
Agriculture
Yield management
Automation
Mall Operators
Tenant capacity to
pay
Tenant selection
Education
Automated essay
scoring
Dynamic courses
Utilities
Optimize
Distribution
Network
Predict
Commodity
Requirements
How to do Deep Learning
- Pick up a framework
- Pick up an algorithm to use
- Understand its hyper-parameters
- Get data labelled
- Make sure you understand data well (Visualization Tools etc)
- Is your data enough ? - Very domain specific question.
- Define what accuracy you need for your algorithm ?
- Choose the platform / architecture etc ?
Tools
Building a Deep Learning Application
- Make sure that you actually need Deep Learning - don’t just jump into it.
- You need to have DATA (Labelled) - curated.
- GIGO is the rule on using data.
- Choose the right algorithm/network/framework for your Use case.
- Find out if you can use Transfer Learning - if yes go ahead try it out.
- If you don’t have data - Prepare a data capture strategy (buy it. Collect it
etc) - and use API based service of known providers if you can use it.
- Do not reinvent the wheel of trying to build a network from scratch.
- Play with hyper-parameters of known models/framework etc.
- Some cases an ensemble is good enough for the problem.
- Look at Kaggle, Arxiv and other data science competition platforms
TensorFlow, a DL framework
1) A deep learning library open sourced by google
2) Created for task with heavy numeric computation
3) Based on dataflow graph
Tensorflow code
Tensorflow code: Getting the output
sess.run(fetches, feeds)
Fetches: List of graph
nodes.
To fetch the the output of
the operation execute the
graph in run().
Feeds:
Feed data to the graph
So What we covered so far?
We first built a graph using variables and placeholders
We then deployed the graph onto a session, which is the execution
environment
Next we will see how to train the model
Handwritten Digit classification: MNIST
Matrix Notation
Error Metric
Training
DEMO
RELU and
DROPOUT
Dropout
A regularization technique works great in Deep Neural network
Randomly Drops certain % of nodes in training to reduce overfitting
An efficient way of doing model averaging as we randomly drop nodes for
each training example
Convolutional Neural Network (CNN)
Problem With Fully connected network on images:
Losing shape information when we are flattening
the image into a single array
Visual cortex system of brain doesn't work like
fully connected network.
Huble and Wiesel experiment on cat reveals
1) Local connection
2) Hierarchical layer
3) Spatial Invariance : Any size, rotation, shift
CNN to rescue
Convolution layer
Conv Net on MNIST
Pooling
The idea of pooling in convolutional neural networks is to do two things:
● Reduce the number of parameters in your network (pooling is also
called “down-sampling” for this reason)
● To make feature detection more robust by making it more impervious
to scale and orientation changes
Max Pooling
Average Pooling
Feature extraction
DEMO
Acknowledgement
Materials have been taken from
1) Martin Gorner Tensorflow Tutorial
2) Stanford CS224d Tensorflow Tutorial
3) Stanford CS231n course material
Transfer Learning
Transfer Learning - http://ruder.io/transfer-learning/
Transfer Learning
Transfer Learning - http://ruder.io/transfer-learning/
Transfer Learning
Drivers of ML Success
Transfer Learning
Transfer Learning Demo using TF
http://ruder.io/transfer-learning/
RNNs
LSTMs
Combining CNN and RNN - Image Captioning
Challenges of Deep Learning
- Explainability - How do you learn what you learn ?
- Debugging - What has gone wrong ?
- Hyper-parameter optimization
- Why is this not converging ?
- Is my learning rate good ?
- Do I need to change the number of layers ?
- Am I resilient to bad or erroneous data ?
- Deep domain knowledge is sometimes required to make sense of what
the data speaks about.
Building a AI product in the age of APIs
- Google
- Amazon
- Microsoft
- Other small players.
- Vision, Speech, Video Analytics etc
- Build your own solutions taking inputs from those APIs - enhanced
inferencing
Q and A
Thank you!!
Subrat Panda - https://www.linkedin.com/in/subratpanda/
Biswa G Singh - https://www.linkedin.com/in/biswagsingh/
Consider Joining IDLI - https://www.facebook.com/groups/idliai/ - Indian
Deep Learning Initiative
Navin Manaswi - https://www.linkedin.com/in/navin-manaswi-1a708b8/
"Deep Learning by python using TensorFlow" By Navin - Book to be
launched in Jan end
Acknowledgements
- Team at Capillary Technologies
- SMC organizing team
Navin Manaswi
Deep Learning-AI Architect and Partner
at Symphony AI Incubator
IIT Kanpur 2004
References
- www.tensorflow.org
- https://www.slideshare.net/LuMa921/deep-learning-a-visual-introduction
- http://ruder.io/transfer-learning/
- Many more PPTs available on the internet from Universities and Good
Samaritans

Weitere ähnliche Inhalte

Was ist angesagt?

Application of machine learning in industrial applications
Application of machine learning in industrial applicationsApplication of machine learning in industrial applications
Application of machine learning in industrial applications
Anish Das
 
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical ResearchII-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
Dr. Haxel Consult
 
intelligent computing relating to cloud computing
intelligent computing relating to cloud computingintelligent computing relating to cloud computing
intelligent computing relating to cloud computing
Er. rahul abhishek
 

Was ist angesagt? (20)

II-SDV 2017: Auto Classification: Can/Should AI replace You?
II-SDV 2017: Auto Classification: Can/Should AI replace You? II-SDV 2017: Auto Classification: Can/Should AI replace You?
II-SDV 2017: Auto Classification: Can/Should AI replace You?
 
Application of machine learning in industrial applications
Application of machine learning in industrial applicationsApplication of machine learning in industrial applications
Application of machine learning in industrial applications
 
Machine learning seminar presentation
Machine learning seminar presentationMachine learning seminar presentation
Machine learning seminar presentation
 
Deep learning and neural network converted
Deep learning and neural network convertedDeep learning and neural network converted
Deep learning and neural network converted
 
introduction to machin learning
introduction to machin learningintroduction to machin learning
introduction to machin learning
 
Notes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew NgNotes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew Ng
 
Wiki stage 20151128 - v001
Wiki stage   20151128 - v001Wiki stage   20151128 - v001
Wiki stage 20151128 - v001
 
The Convergence of HPC and Deep Learning
The Convergence of HPC and Deep LearningThe Convergence of HPC and Deep Learning
The Convergence of HPC and Deep Learning
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Cognitive Computing - A Primer
Cognitive Computing - A PrimerCognitive Computing - A Primer
Cognitive Computing - A Primer
 
From Narrow AI to Artificial General Intelligence (AGI)
From Narrow AI to Artificial General Intelligence (AGI)From Narrow AI to Artificial General Intelligence (AGI)
From Narrow AI to Artificial General Intelligence (AGI)
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
 
Cognitive Computing
Cognitive ComputingCognitive Computing
Cognitive Computing
 
Harry Surden - Artificial Intelligence and Law Overview
Harry Surden - Artificial Intelligence and Law OverviewHarry Surden - Artificial Intelligence and Law Overview
Harry Surden - Artificial Intelligence and Law Overview
 
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical ResearchII-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
 
intelligent computing relating to cloud computing
intelligent computing relating to cloud computingintelligent computing relating to cloud computing
intelligent computing relating to cloud computing
 
101 Webinar - Artificial Intelligence, Deep Learning and Geospatial
101 Webinar - Artificial Intelligence, Deep Learning and Geospatial101 Webinar - Artificial Intelligence, Deep Learning and Geospatial
101 Webinar - Artificial Intelligence, Deep Learning and Geospatial
 
Artificial Intelligence, Machine Learning & Tools (v. 2020 ITA)
Artificial Intelligence, Machine Learning & Tools (v. 2020 ITA)Artificial Intelligence, Machine Learning & Tools (v. 2020 ITA)
Artificial Intelligence, Machine Learning & Tools (v. 2020 ITA)
 

Ähnlich wie AI and Deep Learning

Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
Sanghamitra Deb
 
Separating Hype from Reality in Deep Learning with Sameer Farooqui
 Separating Hype from Reality in Deep Learning with Sameer Farooqui Separating Hype from Reality in Deep Learning with Sameer Farooqui
Separating Hype from Reality in Deep Learning with Sameer Farooqui
Databricks
 

Ähnlich wie AI and Deep Learning (20)

Distributed deep learning_over_spark_20_nov_2014_ver_2.8
Distributed deep learning_over_spark_20_nov_2014_ver_2.8Distributed deep learning_over_spark_20_nov_2014_ver_2.8
Distributed deep learning_over_spark_20_nov_2014_ver_2.8
 
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdfMachine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearning
 
Novi sad ai event 1-2018
Novi sad ai event 1-2018Novi sad ai event 1-2018
Novi sad ai event 1-2018
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
 
Deep Learning And Business Models (VNITC 2015-09-13)
Deep Learning And Business Models (VNITC 2015-09-13)Deep Learning And Business Models (VNITC 2015-09-13)
Deep Learning And Business Models (VNITC 2015-09-13)
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
 
Cognitive Toolkit - Deep Learning framework from Microsoft
Cognitive Toolkit - Deep Learning framework from MicrosoftCognitive Toolkit - Deep Learning framework from Microsoft
Cognitive Toolkit - Deep Learning framework from Microsoft
 
Machine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERMachine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWER
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
 
AI & ML in Defence Systems - Sunil Chomal
AI & ML in Defence Systems   - Sunil ChomalAI & ML in Defence Systems   - Sunil Chomal
AI & ML in Defence Systems - Sunil Chomal
 
Model Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningModel Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep Learning
 
Distributed Deep Learning + others for Spark Meetup
Distributed Deep Learning + others for Spark MeetupDistributed Deep Learning + others for Spark Meetup
Distributed Deep Learning + others for Spark Meetup
 
Big data 2.0, deep learning and financial Usecases
Big data 2.0, deep learning and financial UsecasesBig data 2.0, deep learning and financial Usecases
Big data 2.0, deep learning and financial Usecases
 
MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀MachinaFiesta: A Vision into Machine Learning 🚀
MachinaFiesta: A Vision into Machine Learning 🚀
 
Deep learning
Deep learningDeep learning
Deep learning
 
Separating Hype from Reality in Deep Learning with Sameer Farooqui
 Separating Hype from Reality in Deep Learning with Sameer Farooqui Separating Hype from Reality in Deep Learning with Sameer Farooqui
Separating Hype from Reality in Deep Learning with Sameer Farooqui
 
Machine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsMachine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning Systems
 
Best Artificial Intelligence Course | Online program | certification course
Best Artificial Intelligence Course | Online program | certification course Best Artificial Intelligence Course | Online program | certification course
Best Artificial Intelligence Course | Online program | certification course
 
Introduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-LearnIntroduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-Learn
 

Mehr von Subrat Panda, PhD (9)

Role of technology in agriculture courses by srmist & the hindu
Role of technology in agriculture courses by srmist & the hinduRole of technology in agriculture courses by srmist & the hindu
Role of technology in agriculture courses by srmist & the hindu
 
Journey so far
Journey so farJourney so far
Journey so far
 
AI in security
AI in securityAI in security
AI in security
 
AI in Retail
AI in RetailAI in Retail
AI in Retail
 
Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial Industry
 
An introduction to reinforcement learning
An introduction to reinforcement learningAn introduction to reinforcement learning
An introduction to reinforcement learning
 
ML Workshop at SACON 2018
ML Workshop at SACON 2018ML Workshop at SACON 2018
ML Workshop at SACON 2018
 
AI and The future of work
AI and The future of work AI and The future of work
AI and The future of work
 
AI in retail
AI in retailAI in retail
AI in retail
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

AI and Deep Learning

  • 1. AI and Deep Learning Biswa Gourav Singh Deep Learning Practice AMD MS Clemson University, 2010 Dr. Subrat Panda Principal Architect AI and DS Capillary Technologies IIT Kharagpur 2002 IIT Kharagpur 2009 - PhD
  • 2. Brief Introduction about Subrat ● BTech ( 2002) , PhD (2009) – CSE, IIT Kharagpur ● Synopsys (EDA), IBM (CPU), NVIDIA (GPU), Taro (Full Stack Engineer), Capillary (Principal Architect - AI) ● Applying AI to Retail ● Co-Founded IDLI (for social good) with Prof. Amit Sethi (IIT Bombay), Jacob Minz (Synopsys) and Biswa Gourav Singh (AMD) ● https://www.facebook.com/groups/idliai/ ● Linked In - https://www.linkedin.com/in/subratpanda/ ● Facebook - https://www.facebook.com/subratpanda ● Twitter - @subratpanda
  • 3. Brief Introduction about Biswa ● BTech ( 2005) , MS (2010) – Clemson University ● Synopsys (EDA), IBM (CPU), ARM, Qualcomm, HP, AMD ● Deep Learning Performance ● Co-Founded IDLI (for social good) with Prof. Amit Sethi (IIT Bombay), Jacob Minz (Synopsys) and Subrat Panda (Capillary) ● https://www.facebook.com/groups/idliai/ ● Linked In - https://www.linkedin.com/in/biswagsingh/ ● Facebook - https://www.facebook.com/biswa.singh
  • 4. Agenda 1. Deep Learning: Overview - 5 minutes - Subrat 2. Deep Learning and Industries - 10 minutes - Subrat 3. How to do Deep Learning - 10 minutes - Subrat 4. Tensor-Flow, a DL framework - 20 minutes - Biswa 5. CNN Introduction - 10 minutes - Biswa 6. Transfer Learning - 15 minutes - Subrat 7. Building Deep Learning Application - 10 minutes - Subrat a. Building a Product in the age of APIs - Subrat 8. Challenges of Deep Learning - 5 minutes - Subrat / Biswa ( Performance ) 9. Q and A - 10 minutes
  • 5. The AI landscape Deep Learning: An Overview Courtesy: NVIDIA
  • 6. Machine Learning – http://techleer.com Deep Learning: An Overview
  • 7. The Neuron • The neuron is the basic information processing unit of a NN. It consists of: 1 A set of synapses or connecting links, each link characterized by a weight: W1 , W2 , …, Wm 2 An adder function (linear combiner) which computes the weighted sum of the inputs: 2 Activation function (squashing function) for limiting the amplitude of the output of the neuron.
  • 9. Is a single perceptron or layer enough ? - No - Perceptrons learn linearly separable functions - Cannot learn a XOR function - Minsky & Papert (1969) offered solution to XOR problem by combining perceptron unit responses using a second layer of Units. Piecewise linear classification using an MLP with threshold (perceptron) unit
  • 10. MLP - Multi-Layer Perceptron • No connections within a layer • No direct connections between input and output layers • Fully connected between layers • Often more than 3 layers • Number of output units need not equal number of input units • Number of hidden units per layer can be more or less than input or output units
  • 11. 3 Layer Neural Network
  • 12. What do each layer do ? 1st layer draws linear boundaries 2nd layer combines the boundaries 3rd layer can generate arbitrarily complex boundaries
  • 13. Forward Activity - Backward Error
  • 14. Back Propagation Forward pass phase: computes ‘functional signal’, feed forward propagation of input pattern signals through network. Backward pass phase: computes ‘error signal’, propagates the error backwards through network starting at output units (where the error is the difference between actual and desired output values)
  • 15. Deep Learning and Industries
  • 17. MARKETING APPLICATIONS Predicting Lifetime Value (LTV) Wallet share estimation Churn Customer segmentation Product mix Cross selling Recommendatio n algorithms Up-selling Channel optimization Discount targeting Reactivation likelihood Adwords optimization and ad buying
  • 18. RISK APPLICATIONS Credit risk Treasury or currency risk Fraud detection Accounts Payable Recovery Anti-money laundering
  • 19. HUMAN RESOURCES APPLICATIONS Resume screening Employee churn Training recommendatio n Talent management
  • 20. MORE HORIZONTAL APPLICATIONS… Sales • Lead prioritization Customer support • Call routing • Call center message optimization • Call center volume forecasting Logistics • Demand forecasting
  • 22. HEALTHCARE APPLICATIONS Claims review prioritization Medicare/medicai d fraud Medical resources allocation Alerting and diagnostics from real-time patient data Prescription compliance Physician attrition Survival analysis Medication (dosage) effectiveness Readmission risk
  • 23. RETAIL APPLICATIONS Price optimization Location of new stores Product layout in stores Merchandizing Inventory Management (how many units) Shrinkage analytics Warranty Analytics Market Basket Analysis Cannibalization Analysis Next Best Offer Analysis In store traffic patterns
  • 25. Life Sciences Identifying biomarkers Drug/chemical discovery Analyzing study results Identifying negative responses Diagnostic test development Diagnostic targeting Predicting drug demand Prescription adherence Putative safety signals Social media marketing Image analysis Clinical trial design COGS optimization Insurance Claims prediction Claims handling Price sensitivity Investments Agent & branch performance DM, product mix Hospitality Dynamic pricing Promos/upgrades/ offers Table management & reservations Workforce management Manufacturing Failure analysis Quality management Inventory management Warranty/pricing Direct Marketing Response rates Segmentations for mailings Reactivation likelihood RFM Discount targeting Phone marketing Email Marketing Construction Contractor performance Design issue prediction Agriculture Yield management Automation Mall Operators Tenant capacity to pay Tenant selection Education Automated essay scoring Dynamic courses Utilities Optimize Distribution Network Predict Commodity Requirements
  • 26. How to do Deep Learning - Pick up a framework - Pick up an algorithm to use - Understand its hyper-parameters - Get data labelled - Make sure you understand data well (Visualization Tools etc) - Is your data enough ? - Very domain specific question. - Define what accuracy you need for your algorithm ? - Choose the platform / architecture etc ?
  • 27. Tools
  • 28. Building a Deep Learning Application - Make sure that you actually need Deep Learning - don’t just jump into it. - You need to have DATA (Labelled) - curated. - GIGO is the rule on using data. - Choose the right algorithm/network/framework for your Use case. - Find out if you can use Transfer Learning - if yes go ahead try it out. - If you don’t have data - Prepare a data capture strategy (buy it. Collect it etc) - and use API based service of known providers if you can use it. - Do not reinvent the wheel of trying to build a network from scratch. - Play with hyper-parameters of known models/framework etc. - Some cases an ensemble is good enough for the problem. - Look at Kaggle, Arxiv and other data science competition platforms
  • 29. TensorFlow, a DL framework 1) A deep learning library open sourced by google 2) Created for task with heavy numeric computation 3) Based on dataflow graph
  • 31. Tensorflow code: Getting the output sess.run(fetches, feeds) Fetches: List of graph nodes. To fetch the the output of the operation execute the graph in run(). Feeds: Feed data to the graph
  • 32. So What we covered so far? We first built a graph using variables and placeholders We then deployed the graph onto a session, which is the execution environment Next we will see how to train the model
  • 38. Dropout A regularization technique works great in Deep Neural network Randomly Drops certain % of nodes in training to reduce overfitting An efficient way of doing model averaging as we randomly drop nodes for each training example
  • 39. Convolutional Neural Network (CNN) Problem With Fully connected network on images: Losing shape information when we are flattening the image into a single array Visual cortex system of brain doesn't work like fully connected network. Huble and Wiesel experiment on cat reveals 1) Local connection 2) Hierarchical layer 3) Spatial Invariance : Any size, rotation, shift
  • 42. Conv Net on MNIST
  • 43. Pooling The idea of pooling in convolutional neural networks is to do two things: ● Reduce the number of parameters in your network (pooling is also called “down-sampling” for this reason) ● To make feature detection more robust by making it more impervious to scale and orientation changes Max Pooling Average Pooling
  • 45. DEMO
  • 46. Acknowledgement Materials have been taken from 1) Martin Gorner Tensorflow Tutorial 2) Stanford CS224d Tensorflow Tutorial 3) Stanford CS231n course material
  • 48. Transfer Learning - http://ruder.io/transfer-learning/ Transfer Learning
  • 49. Transfer Learning - http://ruder.io/transfer-learning/ Transfer Learning
  • 50. Drivers of ML Success Transfer Learning
  • 51. Transfer Learning Demo using TF http://ruder.io/transfer-learning/
  • 52. RNNs
  • 53. LSTMs
  • 54. Combining CNN and RNN - Image Captioning
  • 55. Challenges of Deep Learning - Explainability - How do you learn what you learn ? - Debugging - What has gone wrong ? - Hyper-parameter optimization - Why is this not converging ? - Is my learning rate good ? - Do I need to change the number of layers ? - Am I resilient to bad or erroneous data ? - Deep domain knowledge is sometimes required to make sense of what the data speaks about.
  • 56. Building a AI product in the age of APIs - Google - Amazon - Microsoft - Other small players. - Vision, Speech, Video Analytics etc - Build your own solutions taking inputs from those APIs - enhanced inferencing
  • 57. Q and A Thank you!! Subrat Panda - https://www.linkedin.com/in/subratpanda/ Biswa G Singh - https://www.linkedin.com/in/biswagsingh/ Consider Joining IDLI - https://www.facebook.com/groups/idliai/ - Indian Deep Learning Initiative Navin Manaswi - https://www.linkedin.com/in/navin-manaswi-1a708b8/ "Deep Learning by python using TensorFlow" By Navin - Book to be launched in Jan end
  • 58. Acknowledgements - Team at Capillary Technologies - SMC organizing team Navin Manaswi Deep Learning-AI Architect and Partner at Symphony AI Incubator IIT Kanpur 2004
  • 59. References - www.tensorflow.org - https://www.slideshare.net/LuMa921/deep-learning-a-visual-introduction - http://ruder.io/transfer-learning/ - Many more PPTs available on the internet from Universities and Good Samaritans