SlideShare a Scribd company logo
1 of 42
Download to read offline
Introduction to Machine
Learning
ThePhilosophy
Advantages
Applications
Algorithms
Venkat Reddy
The Learning
1
2
3
4
5
6
BigdataAnalytics
VenkatReddy
2
Machine Learning
• How did our brain process the images?
• How did the grouping happen?
• Human brain processed the given images - learning
• After learning the brain simply looked at the new image and
compared with the groups classified the image to the closest
group - Classification
• If a machine has to perform the same operation we use
Machine Learning
• We write programs for learning and then classification, this is
nothing but machine learning
BigdataAnalytics
VenkatReddy
3
Machine Learning
Learning
algorithm
TRAINING
DATA Answer
Trained
machine
Query
BigdataAnalytics
VenkatReddy
4
Example: Classification using ML
• Image processing:
• Machine learning can be used in classification Images & objects in
an image
• Ship
• Water
• Rock
• Iron object
• Fiber Object etc.,
• Does this really help?
BigdataAnalytics
VenkatReddy
5
Image processing Example
BigdataAnalytics
VenkatReddy
6
Machine learning application
BigdataAnalytics
VenkatReddy
7
The main advantage of ML
• Learning and writing an algorithm
• Its easy for human brain but it is tough for a machine. It takes
some time and good amount of training data for machine to
accurately classify objects
• Implementation and automation
• This is easy for a machine. Once learnt a machine can process one
million images without any fatigue where as human brain can’t
• That’s why ML with bigdata is a deadly combination
BigdataAnalytics
VenkatReddy
8
Applications of Machine Learning
• Banking / Telecom / Retail
• Identify:
• Prospective customers
• Dissatisfied customers
• Good customers
• Bad payers
• Obtain:
• More effective advertising
• Less credit risk
• Fewer fraud
• Decreased churn rate
BigdataAnalytics
VenkatReddy
9
Applications of Machine Learning
• Biomedical / Biometrics
• Medicine:
• Screening
• Diagnosis and prognosis
• Drug discovery
• Security:
• Face recognition
• Signature / fingerprint / iris verification
• DNA fingerprinting
BigdataAnalytics
VenkatReddy
10
Applications of Machine Learning
• Computer / Internet
• Computer interfaces:
• Troubleshooting wizards
• Handwriting and speech
• Brain waves
• Internet
• Hit ranking
• Spam filtering
• Text categorization
• Text translation
• Recommendation
BigdataAnalytics
VenkatReddy
11
Main Parts in Machine Learning
process
• Any Machine Learning algorithm has three parts
1. The Output
2. The Objective Function or Performance Matrix
3. The Input
Email Spam Classification
• The Output: Categorize email messages as spam or legitimate.
• Objective Function: Percentage of email messages correctly classified.
• The Input: Database of emails, some with human-given labels
Hand Writing Recognition
• The Output: Recognizing hand-written words
• Objective Function: Percentage of words correctly classified
• The Input: Database of human-labeled images of handwritten words
BigdataAnalytics
VenkatReddy
12
Some Machine Learning
Methods
BigdataAnalytics
VenkatReddy
13
Everything should be made as simple as possible, but no simpler
BigdataAnalytics
VenkatReddy
14
Bayes Network
BigdataAnalytics
VenkatReddy
15
The Bayes Method
A(2%) B(90%) C(8%)
1% 60% 2%
• A, B, C Generate bolts, given a machine what is the probability that
it will generate a defective bolt.
• Now given a defective bolt, can we tell which machine has
generated it?
BigdataAnalytics
VenkatReddy
16
Several Machine in Several Layers
• Several intermediate machines
• Each has its on error chance, This problem requires a Bayesian network
to solve
• Does C has anything to do if the bolt is finally coming out of F?
A(20%) B(50%) C(30%)
1% 5% 2%
D(60%) E(40%)
F G H
BigdataAnalytics
VenkatReddy
17
Artificial Neural Networks
BigdataAnalytics
VenkatReddy
18
The Philosophy of Neural Network
• If we have to predict whether we win in a chess game
• Can we use logistic regression?
• Since there can be almost infinite number of intermediate steps
its almost impossible to predict the probability of winning
• If each move is considered as an input variable then there are
almost (countable)infinite moves. A coefficient for each of them
is always negligible.
Neural network uses a simple technique, instead of predicting the
final output they predict the intermediate output that lead to a win
previously, this will be an input for predicting the next move
BigdataAnalytics
VenkatReddy
19
Biological Motivation for NN
• Aartificial neural networks are built out of a densely
interconnected set of simple units, where each unit takes a
number of real-valued inputs (possibly the outputs of other
units) and produces a single real-valued output (which may
become the input to many other units).
• The human brain is estimated to contain a densely
interconnected network of approximately 1011 neurons, each
connected, on average, to 104 others.
• Neuron activity is typically inhibited through
connections to other neurons.
BigdataAnalytics
VenkatReddy
20
ANN in Self Driving Car
BigdataAnalytics
VenkatReddy
21
PCA and FA
BigdataAnalytics
VenkatReddy
22
Look at below Cricket Team Players Data
Player Avg Runs Total
wickets
Height Not
outs
Highest
Score
Best
Bowling
1 45 3 5.5 15 120 1
2 50 34 5.2 34 209 2
3 38 0 6 36 183 0
4 46 9 6.1 78 160 3
5 37 45 5.8 56 98 1
6 32 0 5.10 89 183 0
7 18 123 6 2 35 4
8 19 239 6.1 3 56 5
9 18 96 6.6 5 87 7
10 16 83 5.9 7 32 7
11 17 138 5.10 9 12 6
BigdataAnalytics
VenkatReddy
23
Describe the players
• If we have to describe or segregate the players do we really
need Avg Runs, Total wickets, Height, Not outs, Highest Score,
Best bowling?
• Can we simply take
• Avg Runs+ Not outs+ Highest Score as one factor?
• Total wickets+ Height+ Best bowling as second factor?
Defining these imaginary variables or a linear combination of
variables to reduce the dimensions is called PCA or FA
BigdataAnalytics
VenkatReddy
24
Look at below Cricket Team Players Data
Player Avg Runs Total
wickets
Height Not
outs
Highest
Score
Best
Bowling
1 45 3 5.5 15 120 1
2 50 34 5.2 34 209 2
3 38 0 6 36 183 0
4 46 9 6.1 78 160 3
5 37 45 5.8 56 98 1
6 32 0 5.10 89 183 0
7 18 123 6 2 35 4
8 19 239 6.1 3 56 5
9 18 96 6.6 5 87 7
10 16 83 5.9 7 32 7
11 17 138 5.10 9 12 6
BigdataAnalytics
VenkatReddy
25
SVM Classification
BigdataAnalytics
VenkatReddy
26
Linear Classifier
Consider a two dimensional dataset
with two classes
How would we classify this dataset?
BigdataAnalytics
VenkatReddy
27
Linear Classifier
Both of the lines can be linear classifiers.
BigdataAnalytics
VenkatReddy
28
Linear Classifier
There are many lines that can be linear classifiers.
Which one is the optimal classifier.
BigdataAnalytics
VenkatReddy
29
Classifier Margin
Define the margin of a linear classifier as the width that theboundary could be
increased by before hitting a datapoint.
BigdataAnalytics
VenkatReddy
30
Maximum Margin
• The maximum margin linear classifier is the linear classifier
• with the maximum margin.
• This is the simplest kind of SVM (Called Linear SVM)
BigdataAnalytics
VenkatReddy
31
Support Vectors
• Examples closest to the hyper plane are support vectors.
• Margin ρ of the separator is the distance between
support vectors.
ρw . x + b > 0
w . x + b < 0
w . x + b = 0
Support Vectors
f(x) = sign(w . x + b)
red is +1
blue is -1
BigdataAnalytics
VenkatReddy
32
Difference Traditional Models &
Machine Learning
• ML is more heuristic
• Focused on improving performance of a learning agent
• Also looks at real-time self learning and robotics – areas
not part of data mining
• Some algorithms are too heuristic that there is no one
right or wrong answer
• Lets take K-Means example
BigdataAnalytics
VenkatReddy
33
K-Means clustering
Overall population
BigdataAnalytics
VenkatReddy
34
K-Means clustering
Fix the number of clusters
BigdataAnalytics
VenkatReddy
35
K-Means clustering
Calculate the distance of
each case from all clusters
BigdataAnalytics
VenkatReddy
36
K-Means clustering
Re calculate the cluster
centers
BigdataAnalytics
VenkatReddy
37
K-Means clustering
Continue till there is no
significant change between
two iterations
BigdataAnalytics
VenkatReddy
38
Calculating the distance
Weight
Cust1 68
Cust2 72
Cust3 100
Weight Age
Cust1 68 25
Cust2 72 70
Cust3 100 28
Weight Age Income
Cust1 68 25 60,000
Cust2 72 70 9,000
Cust3 100 28 62,000
Which of the two customers are similar?
Which of the two customers are similar
now?
Which two of the customers are
similar in this case?
BigdataAnalytics
VenkatReddy
39
Distance Measures
• Euclidean distance
• City-block (Manhattan) distance
• Chebychev similarity
• Minkowski distance
• Mahalanobis distance
• Maximum distance
• Cosine similarity
• Simple correlation between observations
• Minimum distance
• Weighted distance
• Venkat Reddy distance
• Not sure all these measures will result in same clusters in the
above example.
• So, same ML algorithms with different results. Generally this is not
the case with conventional methods
BigdataAnalytics
VenkatReddy
40
Thank you
BigdataAnalytics
VenkatReddy
41
More ..
BigdataAnalytics
VenkatReddy
42
http://www.slideshare.net/21_venkat/presentations
https://www.facebook.com/groups/SASanalysts/

More Related Content

What's hot

Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningKoundinya Desiraju
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AIBill Liu
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)SwatiTripathi44
 
Introduction to-machine-learning
Introduction to-machine-learningIntroduction to-machine-learning
Introduction to-machine-learningBabu Priyavrat
 
Linear regression
Linear regressionLinear regression
Linear regressionMartinHogg9
 
Machine Learning Course | Edureka
Machine Learning Course | EdurekaMachine Learning Course | Edureka
Machine Learning Course | EdurekaEdureka!
 
Explainable AI in Industry (KDD 2019 Tutorial)
Explainable AI in Industry (KDD 2019 Tutorial)Explainable AI in Industry (KDD 2019 Tutorial)
Explainable AI in Industry (KDD 2019 Tutorial)Krishnaram Kenthapadi
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learningUmmeSalmaM1
 
Machine learning - AI
Machine learning - AIMachine learning - AI
Machine learning - AIWitekio
 
Naive Bayes Classifier
Naive Bayes ClassifierNaive Bayes Classifier
Naive Bayes ClassifierYiqun Hu
 
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...Simplilearn
 
Machine Learning
Machine LearningMachine Learning
Machine LearningRahul Kumar
 
Performance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning AlgorithmsPerformance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning AlgorithmsKush Kulshrestha
 
Applications in Machine Learning
Applications in Machine LearningApplications in Machine Learning
Applications in Machine LearningJoel Graff
 
Machine Learning for Dummies (without mathematics)
Machine Learning for Dummies (without mathematics)Machine Learning for Dummies (without mathematics)
Machine Learning for Dummies (without mathematics)ActiveEon
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
 
Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overviewprih_yah
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesRui Pedro Paiva
 

What's hot (20)

Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
machine learning
machine learningmachine learning
machine learning
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AI
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)
 
Introduction to-machine-learning
Introduction to-machine-learningIntroduction to-machine-learning
Introduction to-machine-learning
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Machine Learning Course | Edureka
Machine Learning Course | EdurekaMachine Learning Course | Edureka
Machine Learning Course | Edureka
 
Explainable AI in Industry (KDD 2019 Tutorial)
Explainable AI in Industry (KDD 2019 Tutorial)Explainable AI in Industry (KDD 2019 Tutorial)
Explainable AI in Industry (KDD 2019 Tutorial)
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learning
 
Machine learning - AI
Machine learning - AIMachine learning - AI
Machine learning - AI
 
Naive Bayes Classifier
Naive Bayes ClassifierNaive Bayes Classifier
Naive Bayes Classifier
 
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...
Artificial Neural Network | Deep Neural Network Explained | Artificial Neural...
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Performance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning AlgorithmsPerformance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning Algorithms
 
Applications in Machine Learning
Applications in Machine LearningApplications in Machine Learning
Applications in Machine Learning
 
Machine Learning for Dummies (without mathematics)
Machine Learning for Dummies (without mathematics)Machine Learning for Dummies (without mathematics)
Machine Learning for Dummies (without mathematics)
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overview
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and Techniques
 
Deep learning
Deep learningDeep learning
Deep learning
 

Viewers also liked

Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI
 
Resource Scheduling
Resource SchedulingResource Scheduling
Resource SchedulingNicola2903
 
Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksData Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksBICA Labs
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Marina Santini
 
5 Presentation design trends 2017
5 Presentation design trends 20175 Presentation design trends 2017
5 Presentation design trends 2017Annova Studio
 
Ppt on automation
Ppt on automation Ppt on automation
Ppt on automation harshaa
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningLars Marius Garshol
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentationlpaviglianiti
 

Viewers also liked (8)

Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent Apps
 
Resource Scheduling
Resource SchedulingResource Scheduling
Resource Scheduling
 
Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksData Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural Networks
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
5 Presentation design trends 2017
5 Presentation design trends 20175 Presentation design trends 2017
5 Presentation design trends 2017
 
Ppt on automation
Ppt on automation Ppt on automation
Ppt on automation
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 

Similar to Machine Learning for Dummies

Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionTe-Yen Liu
 
Machine learning for sensor Data Analytics
Machine learning for sensor Data AnalyticsMachine learning for sensor Data Analytics
Machine learning for sensor Data AnalyticsMATLABISRAEL
 
Business intelligence and data warehousing
Business intelligence and data warehousingBusiness intelligence and data warehousing
Business intelligence and data warehousingVaishnavi
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for BeginnersSanghamitra Deb
 
Entity embeddings for categorical data
Entity embeddings for categorical dataEntity embeddings for categorical data
Entity embeddings for categorical dataPaul Skeie
 
Analytics demystified
Analytics demystifiedAnalytics demystified
Analytics demystifiedMarc Moreau
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisionsDeepu S Nath
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisionsDeepu S Nath
 
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Mathieu DESPRIEE
 
Master guide to become a Data Scientist -by zekeLabs
Master guide to become a Data Scientist  -by zekeLabsMaster guide to become a Data Scientist  -by zekeLabs
Master guide to become a Data Scientist -by zekeLabszekeLabs Technologies
 
AI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are DangerousAI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are DangerousRaffael Marty
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloOCTO Technology
 
AI and ML for Everyone
AI and ML for EveryoneAI and ML for Everyone
AI and ML for Everyonebigdata trunk
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareTigerGraph
 
Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"Demi Ben-Ari
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellSri Ambati
 

Similar to Machine Learning for Dummies (20)

Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis Introduction
 
Machine learning for sensor Data Analytics
Machine learning for sensor Data AnalyticsMachine learning for sensor Data Analytics
Machine learning for sensor Data Analytics
 
Business intelligence and data warehousing
Business intelligence and data warehousingBusiness intelligence and data warehousing
Business intelligence and data warehousing
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
 
Entity embeddings for categorical data
Entity embeddings for categorical dataEntity embeddings for categorical data
Entity embeddings for categorical data
 
Analytics demystified
Analytics demystifiedAnalytics demystified
Analytics demystified
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisions
 
How machines can take decisions
How machines can take decisionsHow machines can take decisions
How machines can take decisions
 
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
 
AI and Deep Learning
AI and Deep Learning AI and Deep Learning
AI and Deep Learning
 
Master guide to become a Data Scientist -by zekeLabs
Master guide to become a Data Scientist  -by zekeLabsMaster guide to become a Data Scientist  -by zekeLabs
Master guide to become a Data Scientist -by zekeLabs
 
AI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are DangerousAI & ML in Cyber Security - Why Algorithms Are Dangerous
AI & ML in Cyber Security - Why Algorithms Are Dangerous
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
 
Data science guide
Data science guideData science guide
Data science guide
 
AI and ML for Everyone
AI and ML for EveryoneAI and ML for Everyone
AI and ML for Everyone
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA Hardware
 
Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"
 
Ml - A shallow dive
Ml  - A shallow diveMl  - A shallow dive
Ml - A shallow dive
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin Ledell
 
Future se oct15
Future se oct15Future se oct15
Future se oct15
 

More from Venkata Reddy Konasani

Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Venkata Reddy Konasani
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniquesVenkata Reddy Konasani
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Venkata Reddy Konasani
 
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau -  Data, Graphs, Filters, Dashboards and Advanced featuresLearning Tableau -  Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced featuresVenkata Reddy Konasani
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitizationVenkata Reddy Konasani
 

More from Venkata Reddy Konasani (20)

Transformers 101
Transformers 101 Transformers 101
Transformers 101
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniques
 
Neural Network Part-2
Neural Network Part-2Neural Network Part-2
Neural Network Part-2
 
GBM theory code and parameters
GBM theory code and parametersGBM theory code and parameters
GBM theory code and parameters
 
Neural Networks made easy
Neural Networks made easyNeural Networks made easy
Neural Networks made easy
 
Decision tree
Decision treeDecision tree
Decision tree
 
Step By Step Guide to Learn R
Step By Step Guide to Learn RStep By Step Guide to Learn R
Step By Step Guide to Learn R
 
Credit Risk Model Building Steps
Credit Risk Model Building StepsCredit Risk Model Building Steps
Credit Risk Model Building Steps
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS
 
SAS basics Step by step learning
SAS basics Step by step learningSAS basics Step by step learning
SAS basics Step by step learning
 
Testing of hypothesis case study
Testing of hypothesis case study Testing of hypothesis case study
Testing of hypothesis case study
 
L101 predictive modeling case_study
L101 predictive modeling case_studyL101 predictive modeling case_study
L101 predictive modeling case_study
 
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau -  Data, Graphs, Filters, Dashboards and Advanced featuresLearning Tableau -  Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
 
Online data sources for analaysis
Online data sources for analaysis Online data sources for analaysis
Online data sources for analaysis
 
A data analyst view of Bigdata
A data analyst view of Bigdata A data analyst view of Bigdata
A data analyst view of Bigdata
 
R- Introduction
R- IntroductionR- Introduction
R- Introduction
 
Cluster Analysis for Dummies
Cluster Analysis for DummiesCluster Analysis for Dummies
Cluster Analysis for Dummies
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitization
 
ARIMA
ARIMA ARIMA
ARIMA
 

Recently uploaded

Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Recently uploaded (20)

Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Machine Learning for Dummies

  • 3. Machine Learning • How did our brain process the images? • How did the grouping happen? • Human brain processed the given images - learning • After learning the brain simply looked at the new image and compared with the groups classified the image to the closest group - Classification • If a machine has to perform the same operation we use Machine Learning • We write programs for learning and then classification, this is nothing but machine learning BigdataAnalytics VenkatReddy 3
  • 5. Example: Classification using ML • Image processing: • Machine learning can be used in classification Images & objects in an image • Ship • Water • Rock • Iron object • Fiber Object etc., • Does this really help? BigdataAnalytics VenkatReddy 5
  • 8. The main advantage of ML • Learning and writing an algorithm • Its easy for human brain but it is tough for a machine. It takes some time and good amount of training data for machine to accurately classify objects • Implementation and automation • This is easy for a machine. Once learnt a machine can process one million images without any fatigue where as human brain can’t • That’s why ML with bigdata is a deadly combination BigdataAnalytics VenkatReddy 8
  • 9. Applications of Machine Learning • Banking / Telecom / Retail • Identify: • Prospective customers • Dissatisfied customers • Good customers • Bad payers • Obtain: • More effective advertising • Less credit risk • Fewer fraud • Decreased churn rate BigdataAnalytics VenkatReddy 9
  • 10. Applications of Machine Learning • Biomedical / Biometrics • Medicine: • Screening • Diagnosis and prognosis • Drug discovery • Security: • Face recognition • Signature / fingerprint / iris verification • DNA fingerprinting BigdataAnalytics VenkatReddy 10
  • 11. Applications of Machine Learning • Computer / Internet • Computer interfaces: • Troubleshooting wizards • Handwriting and speech • Brain waves • Internet • Hit ranking • Spam filtering • Text categorization • Text translation • Recommendation BigdataAnalytics VenkatReddy 11
  • 12. Main Parts in Machine Learning process • Any Machine Learning algorithm has three parts 1. The Output 2. The Objective Function or Performance Matrix 3. The Input Email Spam Classification • The Output: Categorize email messages as spam or legitimate. • Objective Function: Percentage of email messages correctly classified. • The Input: Database of emails, some with human-given labels Hand Writing Recognition • The Output: Recognizing hand-written words • Objective Function: Percentage of words correctly classified • The Input: Database of human-labeled images of handwritten words BigdataAnalytics VenkatReddy 12
  • 14. Everything should be made as simple as possible, but no simpler BigdataAnalytics VenkatReddy 14
  • 16. The Bayes Method A(2%) B(90%) C(8%) 1% 60% 2% • A, B, C Generate bolts, given a machine what is the probability that it will generate a defective bolt. • Now given a defective bolt, can we tell which machine has generated it? BigdataAnalytics VenkatReddy 16
  • 17. Several Machine in Several Layers • Several intermediate machines • Each has its on error chance, This problem requires a Bayesian network to solve • Does C has anything to do if the bolt is finally coming out of F? A(20%) B(50%) C(30%) 1% 5% 2% D(60%) E(40%) F G H BigdataAnalytics VenkatReddy 17
  • 19. The Philosophy of Neural Network • If we have to predict whether we win in a chess game • Can we use logistic regression? • Since there can be almost infinite number of intermediate steps its almost impossible to predict the probability of winning • If each move is considered as an input variable then there are almost (countable)infinite moves. A coefficient for each of them is always negligible. Neural network uses a simple technique, instead of predicting the final output they predict the intermediate output that lead to a win previously, this will be an input for predicting the next move BigdataAnalytics VenkatReddy 19
  • 20. Biological Motivation for NN • Aartificial neural networks are built out of a densely interconnected set of simple units, where each unit takes a number of real-valued inputs (possibly the outputs of other units) and produces a single real-valued output (which may become the input to many other units). • The human brain is estimated to contain a densely interconnected network of approximately 1011 neurons, each connected, on average, to 104 others. • Neuron activity is typically inhibited through connections to other neurons. BigdataAnalytics VenkatReddy 20
  • 21. ANN in Self Driving Car BigdataAnalytics VenkatReddy 21
  • 23. Look at below Cricket Team Players Data Player Avg Runs Total wickets Height Not outs Highest Score Best Bowling 1 45 3 5.5 15 120 1 2 50 34 5.2 34 209 2 3 38 0 6 36 183 0 4 46 9 6.1 78 160 3 5 37 45 5.8 56 98 1 6 32 0 5.10 89 183 0 7 18 123 6 2 35 4 8 19 239 6.1 3 56 5 9 18 96 6.6 5 87 7 10 16 83 5.9 7 32 7 11 17 138 5.10 9 12 6 BigdataAnalytics VenkatReddy 23
  • 24. Describe the players • If we have to describe or segregate the players do we really need Avg Runs, Total wickets, Height, Not outs, Highest Score, Best bowling? • Can we simply take • Avg Runs+ Not outs+ Highest Score as one factor? • Total wickets+ Height+ Best bowling as second factor? Defining these imaginary variables or a linear combination of variables to reduce the dimensions is called PCA or FA BigdataAnalytics VenkatReddy 24
  • 25. Look at below Cricket Team Players Data Player Avg Runs Total wickets Height Not outs Highest Score Best Bowling 1 45 3 5.5 15 120 1 2 50 34 5.2 34 209 2 3 38 0 6 36 183 0 4 46 9 6.1 78 160 3 5 37 45 5.8 56 98 1 6 32 0 5.10 89 183 0 7 18 123 6 2 35 4 8 19 239 6.1 3 56 5 9 18 96 6.6 5 87 7 10 16 83 5.9 7 32 7 11 17 138 5.10 9 12 6 BigdataAnalytics VenkatReddy 25
  • 27. Linear Classifier Consider a two dimensional dataset with two classes How would we classify this dataset? BigdataAnalytics VenkatReddy 27
  • 28. Linear Classifier Both of the lines can be linear classifiers. BigdataAnalytics VenkatReddy 28
  • 29. Linear Classifier There are many lines that can be linear classifiers. Which one is the optimal classifier. BigdataAnalytics VenkatReddy 29
  • 30. Classifier Margin Define the margin of a linear classifier as the width that theboundary could be increased by before hitting a datapoint. BigdataAnalytics VenkatReddy 30
  • 31. Maximum Margin • The maximum margin linear classifier is the linear classifier • with the maximum margin. • This is the simplest kind of SVM (Called Linear SVM) BigdataAnalytics VenkatReddy 31
  • 32. Support Vectors • Examples closest to the hyper plane are support vectors. • Margin ρ of the separator is the distance between support vectors. ρw . x + b > 0 w . x + b < 0 w . x + b = 0 Support Vectors f(x) = sign(w . x + b) red is +1 blue is -1 BigdataAnalytics VenkatReddy 32
  • 33. Difference Traditional Models & Machine Learning • ML is more heuristic • Focused on improving performance of a learning agent • Also looks at real-time self learning and robotics – areas not part of data mining • Some algorithms are too heuristic that there is no one right or wrong answer • Lets take K-Means example BigdataAnalytics VenkatReddy 33
  • 35. K-Means clustering Fix the number of clusters BigdataAnalytics VenkatReddy 35
  • 36. K-Means clustering Calculate the distance of each case from all clusters BigdataAnalytics VenkatReddy 36
  • 37. K-Means clustering Re calculate the cluster centers BigdataAnalytics VenkatReddy 37
  • 38. K-Means clustering Continue till there is no significant change between two iterations BigdataAnalytics VenkatReddy 38
  • 39. Calculating the distance Weight Cust1 68 Cust2 72 Cust3 100 Weight Age Cust1 68 25 Cust2 72 70 Cust3 100 28 Weight Age Income Cust1 68 25 60,000 Cust2 72 70 9,000 Cust3 100 28 62,000 Which of the two customers are similar? Which of the two customers are similar now? Which two of the customers are similar in this case? BigdataAnalytics VenkatReddy 39
  • 40. Distance Measures • Euclidean distance • City-block (Manhattan) distance • Chebychev similarity • Minkowski distance • Mahalanobis distance • Maximum distance • Cosine similarity • Simple correlation between observations • Minimum distance • Weighted distance • Venkat Reddy distance • Not sure all these measures will result in same clusters in the above example. • So, same ML algorithms with different results. Generally this is not the case with conventional methods BigdataAnalytics VenkatReddy 40