SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Downloaden Sie, um offline zu lesen
Machine Learning Training
in Mohali
E2Matrix Training and Research Institute Jalandhar
Contact : +91 9041262727
Email : support@e2matrix.com
What is machine learning?
Data Model
training
Output
Predictions
Classifications
Clusters
Ordinals
examples
Why: Face Recognition?
Categories of problems
Classification Ordinal Reg.
Regression Prediction
By output:
Clustering
By input:
Vector, X Time Series, x(t)
One size never fits all…
• Improving an algorithm:
– First option: better features
• Visualize classes
• Trends
• Histograms
– Next: make the algorithm smarter (more complicated)
• Interaction of features
• Better objective and training criteria
WEKA or GGOBI
-4 -2 0 2 4 6
-20
-10
0
10
20
30
40
y=1 + 0.5t + 4t2 - t3
-4 -2 0 2 4 6
-20
-10
0
10
20
30
40
input
output
Categories of ML algorithms
By training:
Supervised (labeled) Unsupervised (unlabeled)
By model:
Non-parametric
Raw data only
Parametric
Model parameters only
-4 -2 0 2 4 6
-20
-10
0
10
20
30
40
input
output
Kernel
methods
-4 -2 0 2 4 6
-20
-10
0
10
20
30
40
input
output
50 100 150 200 250
0
0
0.05
0.1
0.15
0.2
-2 0 2 4 6
-20
-10
0
10
20
30
40
input
tupt
u
o
-2 0 2 4 6 -4
-20
-10
0
10
20
30
40
input
tupt
u
o
-4 -2 0 2 4 6 -4
-20
-10
0
10
20
30
40
input
output
Training a ML algorithm
• Choose data
• Optimize model parameters according to:
– Objective function
-4 -2 0 2 4 6
-20
-10
0
10
20
30
40
Regression Classification
-2 0 2 4 6 8
-2
0
2
4
6
8
10
1
2
Mean Square Error
Max Margin
Pitfalls of ML algorithms
• Clean your features:
– Training volume: more is better
– Outliers: remove them!
– Dynamic range: normalize it!
• Generalization
– Over fitting
– Under fitting
• Speed: parametric vs. non
• What are you learning? …features, features, features…
outliers
-4 -2 0 2 4 6
-20
-10
0
10
20
30
40
input
output
-4 -2 0 2 4 6
-20
-10
0
10
20
30
40
input
output
-4 -2 0 2 4 6
-10
-20
0
10
20
50
40
30
input
output
Keep a “good” percentile range!
5-95, 1-99: depends on your data
Dynamic range
0 0.2 0.4 0.6 0.8 1
-0.2
0
0.2
0.4
0.6
0.8
1
1.2
f1
f2
1
2
0 200 400 600 800 1000
-1
0
1
2
3
4
5
6
f1
f2
1
2
0 200 400 600 800 1000
0
50
100
150
200
250
300
350
400
f1
f2
1
2
-2 0 2 4 6 8
-1
0
1
2
3
4
5
6
f1
f2
1
2
Over fitting and comparing
algorithms
• Early stop
• Regularization
• Validation Sets
Under
fitting
Curse of dimensionality
Under
fitting
Curse of dimensionality
K-Means clustering
•Planar decision boundaries,
depending on space you are in…
•Highly Efficient
•Not always great (but usually
pretty good)
•Needs good starting criteria
K-Nearest Neighbor
•Arbitrary decision boundaries
•Not so efficient…
•With enough data in each class…
optimal
•Easy to train, known as a lazy classifier
Mixture of Gaussians
•Arbitrary decision boundaries
with enough boundaries
•Efficient, depending on number
of models and Gaussians
•Can represent more than just
Gaussian distributions
•Generative, sometimes tough to
train up
•Spurious singularities
•Can get a distribution for a
specific class and feature(s)… and
get a Bayesian classifier
Components
Analysis (principal or
independent)•Reduces dimensionality
•All other classifiers work in a
rotated space
•Remember Eigen-values and
Vectors?
Trees Classifiers
•Arbitrary Decision boundaries
•Can be quite efficient (or not!)
•Needs good criteria for splitting
•Easy to visualize
Multi-Layer Perceptron
•Arbitrary (but linear) Decision
boundaries
•Can be quite efficient (or not!)
•What did it learn?
Support Vector Machines
•Arbitrary Decision boundaries
•Efficiency depends on support
vector size and feature size
Hidden Markov Models
•Arbitrary Decision boundaries
•Efficiency depends on state
space and number of models
•Generalizes to incorporate
features that change over time
More sophisticated approaches
• Graphical models (like an HMM)
– Bayesian network
– Markov random fields
• Boosting
– Adaboost
• Voting
• Cascading
• Stacking…

Weitere ähnliche Inhalte

Ähnlich wie Machine Learning Training in Mohali

Build Deep Learning model to identify santader bank's dissatisfied customers
Build Deep Learning model to identify santader bank's dissatisfied customersBuild Deep Learning model to identify santader bank's dissatisfied customers
Build Deep Learning model to identify santader bank's dissatisfied customers
sriram30691
 
CUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdfCUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdf
sumyeong ahn
 

Ähnlich wie Machine Learning Training in Mohali (20)

MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott Clark
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient search
 
Py conie 2014
Py conie 2014Py conie 2014
Py conie 2014
 
From Black Box to Black Magic, Pycon Ireland 2014
From Black Box to Black Magic, Pycon Ireland 2014From Black Box to Black Magic, Pycon Ireland 2014
From Black Box to Black Magic, Pycon Ireland 2014
 
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdfES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
ES2022-Minh-Nguyen-ShapingTestsIntoModelsForAutomatedTCGeneration.pdf
 
General Tips for participating Kaggle Competitions
General Tips for participating Kaggle CompetitionsGeneral Tips for participating Kaggle Competitions
General Tips for participating Kaggle Competitions
 
Machine learning for IoT - unpacking the blackbox
Machine learning for IoT - unpacking the blackboxMachine learning for IoT - unpacking the blackbox
Machine learning for IoT - unpacking the blackbox
 
Build Deep Learning model to identify santader bank's dissatisfied customers
Build Deep Learning model to identify santader bank's dissatisfied customersBuild Deep Learning model to identify santader bank's dissatisfied customers
Build Deep Learning model to identify santader bank's dissatisfied customers
 
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- EvaluationBridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
Bridging the Gap: Machine Learning for Ubiquitous Computing -- Evaluation
 
CAPTCHA Cracking System
CAPTCHA Cracking SystemCAPTCHA Cracking System
CAPTCHA Cracking System
 
Building ML Pipelines
Building ML PipelinesBuilding ML Pipelines
Building ML Pipelines
 
Building Machine Learning Pipelines
Building Machine Learning PipelinesBuilding Machine Learning Pipelines
Building Machine Learning Pipelines
 
background.pptx
background.pptxbackground.pptx
background.pptx
 
Application of Machine Learning in Agriculture
Application of Machine  Learning in AgricultureApplication of Machine  Learning in Agriculture
Application of Machine Learning in Agriculture
 
The Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it WorkThe Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it Work
 
Intro to Machine Learning by Microsoft Ventures
Intro to Machine Learning by Microsoft VenturesIntro to Machine Learning by Microsoft Ventures
Intro to Machine Learning by Microsoft Ventures
 
Counterfactual evaluation of machine learning models
Counterfactual evaluation of machine learning modelsCounterfactual evaluation of machine learning models
Counterfactual evaluation of machine learning models
 
CUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdfCUDA_ICLR_2023.pdf
CUDA_ICLR_2023.pdf
 
Future stock performance presentation
Future stock performance presentationFuture stock performance presentation
Future stock performance presentation
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with Azure
 

Mehr von E2MATRIX

Mehr von E2MATRIX (20)

Electrical Training in Phagwara
Electrical Training in PhagwaraElectrical Training in Phagwara
Electrical Training in Phagwara
 
Electrical Training in Mohali
Electrical Training in MohaliElectrical Training in Mohali
Electrical Training in Mohali
 
Electrical Training in Ludhiana
Electrical Training in LudhianaElectrical Training in Ludhiana
Electrical Training in Ludhiana
 
Electrical Training in Jalandhar
Electrical Training in JalandharElectrical Training in Jalandhar
Electrical Training in Jalandhar
 
Electrical Training in Chandigarh
Electrical Training in ChandigarhElectrical Training in Chandigarh
Electrical Training in Chandigarh
 
Electrical Training in Amritsar
Electrical Training in AmritsarElectrical Training in Amritsar
Electrical Training in Amritsar
 
Big Data Training in Amritsar
Big Data Training in AmritsarBig Data Training in Amritsar
Big Data Training in Amritsar
 
Big Data Training in Mohali
Big Data Training in MohaliBig Data Training in Mohali
Big Data Training in Mohali
 
Big Data Training in Ludhiana
Big Data Training in LudhianaBig Data Training in Ludhiana
Big Data Training in Ludhiana
 
Raspberry Pi training in Ludhiana
Raspberry Pi training in LudhianaRaspberry Pi training in Ludhiana
Raspberry Pi training in Ludhiana
 
Raspberry Pi Training in Phagwara
Raspberry Pi Training in PhagwaraRaspberry Pi Training in Phagwara
Raspberry Pi Training in Phagwara
 
Raspberry Pi Training in Mohali
Raspberry Pi Training in MohaliRaspberry Pi Training in Mohali
Raspberry Pi Training in Mohali
 
Raspberry Pi Training in Chandigarh
Raspberry Pi Training in ChandigarhRaspberry Pi Training in Chandigarh
Raspberry Pi Training in Chandigarh
 
Raspberry Pi Training in Amritsar
Raspberry Pi Training in AmritsarRaspberry Pi Training in Amritsar
Raspberry Pi Training in Amritsar
 
Raspberry Pi Training in Jalandhar
Raspberry Pi Training in JalandharRaspberry Pi Training in Jalandhar
Raspberry Pi Training in Jalandhar
 
Selenium Training in Phagwara
Selenium Training in PhagwaraSelenium Training in Phagwara
Selenium Training in Phagwara
 
Selenium Training in Mohali
Selenium Training in MohaliSelenium Training in Mohali
Selenium Training in Mohali
 
Selenium Training in Ludhiana
Selenium Training in LudhianaSelenium Training in Ludhiana
Selenium Training in Ludhiana
 
Selenium Training in Jalandhar
Selenium Training in JalandharSelenium Training in Jalandhar
Selenium Training in Jalandhar
 
Selenium Training in Amritsar
Selenium Training in AmritsarSelenium Training in Amritsar
Selenium Training in Amritsar
 

Kürzlich hochgeladen

Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 

Kürzlich hochgeladen (20)

Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 

Machine Learning Training in Mohali

  • 1. Machine Learning Training in Mohali E2Matrix Training and Research Institute Jalandhar Contact : +91 9041262727 Email : support@e2matrix.com
  • 2. What is machine learning? Data Model training Output Predictions Classifications Clusters Ordinals examples Why: Face Recognition?
  • 3. Categories of problems Classification Ordinal Reg. Regression Prediction By output: Clustering By input: Vector, X Time Series, x(t)
  • 4. One size never fits all… • Improving an algorithm: – First option: better features • Visualize classes • Trends • Histograms – Next: make the algorithm smarter (more complicated) • Interaction of features • Better objective and training criteria WEKA or GGOBI
  • 5. -4 -2 0 2 4 6 -20 -10 0 10 20 30 40 y=1 + 0.5t + 4t2 - t3 -4 -2 0 2 4 6 -20 -10 0 10 20 30 40 input output Categories of ML algorithms By training: Supervised (labeled) Unsupervised (unlabeled) By model: Non-parametric Raw data only Parametric Model parameters only -4 -2 0 2 4 6 -20 -10 0 10 20 30 40 input output Kernel methods
  • 6. -4 -2 0 2 4 6 -20 -10 0 10 20 30 40 input output 50 100 150 200 250 0 0 0.05 0.1 0.15 0.2 -2 0 2 4 6 -20 -10 0 10 20 30 40 input tupt u o -2 0 2 4 6 -4 -20 -10 0 10 20 30 40 input tupt u o -4 -2 0 2 4 6 -4 -20 -10 0 10 20 30 40 input output
  • 7. Training a ML algorithm • Choose data • Optimize model parameters according to: – Objective function -4 -2 0 2 4 6 -20 -10 0 10 20 30 40 Regression Classification -2 0 2 4 6 8 -2 0 2 4 6 8 10 1 2 Mean Square Error Max Margin
  • 8. Pitfalls of ML algorithms • Clean your features: – Training volume: more is better – Outliers: remove them! – Dynamic range: normalize it! • Generalization – Over fitting – Under fitting • Speed: parametric vs. non • What are you learning? …features, features, features…
  • 9. outliers -4 -2 0 2 4 6 -20 -10 0 10 20 30 40 input output -4 -2 0 2 4 6 -20 -10 0 10 20 30 40 input output -4 -2 0 2 4 6 -10 -20 0 10 20 50 40 30 input output Keep a “good” percentile range! 5-95, 1-99: depends on your data
  • 10. Dynamic range 0 0.2 0.4 0.6 0.8 1 -0.2 0 0.2 0.4 0.6 0.8 1 1.2 f1 f2 1 2 0 200 400 600 800 1000 -1 0 1 2 3 4 5 6 f1 f2 1 2 0 200 400 600 800 1000 0 50 100 150 200 250 300 350 400 f1 f2 1 2 -2 0 2 4 6 8 -1 0 1 2 3 4 5 6 f1 f2 1 2
  • 11. Over fitting and comparing algorithms • Early stop • Regularization • Validation Sets
  • 14. K-Means clustering •Planar decision boundaries, depending on space you are in… •Highly Efficient •Not always great (but usually pretty good) •Needs good starting criteria
  • 15. K-Nearest Neighbor •Arbitrary decision boundaries •Not so efficient… •With enough data in each class… optimal •Easy to train, known as a lazy classifier
  • 16. Mixture of Gaussians •Arbitrary decision boundaries with enough boundaries •Efficient, depending on number of models and Gaussians •Can represent more than just Gaussian distributions •Generative, sometimes tough to train up •Spurious singularities •Can get a distribution for a specific class and feature(s)… and get a Bayesian classifier
  • 17. Components Analysis (principal or independent)•Reduces dimensionality •All other classifiers work in a rotated space •Remember Eigen-values and Vectors?
  • 18. Trees Classifiers •Arbitrary Decision boundaries •Can be quite efficient (or not!) •Needs good criteria for splitting •Easy to visualize
  • 19. Multi-Layer Perceptron •Arbitrary (but linear) Decision boundaries •Can be quite efficient (or not!) •What did it learn?
  • 20. Support Vector Machines •Arbitrary Decision boundaries •Efficiency depends on support vector size and feature size
  • 21. Hidden Markov Models •Arbitrary Decision boundaries •Efficiency depends on state space and number of models •Generalizes to incorporate features that change over time
  • 22. More sophisticated approaches • Graphical models (like an HMM) – Bayesian network – Markov random fields • Boosting – Adaboost • Voting • Cascading • Stacking…