SlideShare ist ein Scribd-Unternehmen logo
1 von 95
Data Mining  Classification: Alternative Techniques Lecture Notes for Chapter 5 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar    Introduction to Data Mining    4/18/2004
Chapter 5:  Classification: Alternative Techniques   ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
5.1 Rule-Based Classifier
Rule-Based Classifier ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5.1, page 207]
Rule-based Classifier (Example) ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5.1, page 209]
Application of Rule-Based Classifier ,[object Object],R1: (Give Birth = no)    (Can Fly = yes)    Birds R2: (Give Birth = no)    (Live in Water = yes)    Fishes R3: (Give Birth = yes)    (Blood Type = warm)    Mammals R4: (Give Birth = no)    (Can Fly = no)    Reptiles R5: (Live in Water = sometimes)    Amphibians  The rule R1 covers a hawk => Bird The rule R3 covers the grizzly bear => Mammal [Chapter 5.1, page 208]
Rule Coverage and Accuracy ,[object Object],[object Object],[object Object],[object Object],(Status=Single)    No Coverage = 40%,  Accuracy = 50% [Chapter 5.1, page 208]
How does Rule-based Classifier Work? R1: (Give Birth = no)    (Can Fly = yes)    Birds R2: (Give Birth = no)    (Live in Water = yes)    Fishes R3: (Give Birth = yes)    (Blood Type = warm)    Mammals R4: (Give Birth = no)    (Can Fly = no)    Reptiles R5: (Live in Water = sometimes)    Amphibians  A lemur triggers rule R3, so it is classified as a mammal A turtle triggers both R4 and R5 A dogfish shark triggers none of the rules [Chapter 5 . 1 . 1, page 209]
Characteristics of Rule-Based Classifier ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 1, page 210]
From Decision Trees To Rules Rules are mutually exclusive and exhaustive Rule set contains as much information as the tree [Optional]
Rules Can Be Simplified Initial Rule:  (Refund=No)     (Status=Married)    No Simplified Rule:  (Status=Married)    No [Optional]
Effect of Rule Simplification ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Optional]
Ordered Rule Set ,[object Object],[object Object],[object Object],[object Object],[object Object],R1: (Give Birth = no)    (Can Fly = yes)    Birds R2: (Give Birth = no)    (Live in Water = yes)    Fishes R3: (Give Birth = yes)    (Blood Type = warm)    Mammals R4: (Give Birth = no)    (Can Fly = no)    Reptiles R5: (Live in Water = sometimes)    Amphibians  [Chapter 5 . 1 . 1, page 211]
Rule Ordering Schemes ,[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 2, page 212]
Building Classification Rules ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 3 – 5 . 1. 5, page 213]
Direct Method: Sequential Covering ,[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4, page 213]
Example of Sequential Covering [Chapter 5 . 1 . 4, page 214]
Example of Sequential Covering… [Chapter 5 . 1 . 4, page 214]
Aspects of Sequential Covering ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4, page 215 - 218]
Rule Growing ,[object Object],[Chapter 5 . 1 . 4, page 215]
Rule Growing (Examples) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4 ]
Instance Elimination ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Optional]
Rule Evaluation ,[object Object],[object Object],[object Object],[object Object],n :  Number of instances covered by rule n c  :  Number of instances covered by rule k  : Number of classes p  : Prior probability [Chapter 5 . 1 . 4, page 216]
Stopping Criterion and Rule Pruning ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4, page 216]
Summary of Direct Method ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4 ]
Direct Method: RIPPER ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4, page 220]
Direct Method: RIPPER ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4, page 220]
Direct Method: RIPPER ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4, page 220]
Direct Method: RIPPER ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 4, page 220]
Indirect Methods [Chapter 5 . 1 . 5, page 221]
Indirect Method: C4.5rules ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 5, page 222]
Indirect Method: C4.5rules ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 5, page 222]
Example [Optional]
C4.5 versus C4.5rules versus RIPPER C4.5rules: (Give Birth=No, Can Fly=Yes)    Birds (Give Birth=No, Live in Water=Yes)    Fishes (Give Birth=Yes)    Mammals (Give Birth=No, Can Fly=No, Live in Water=No)    Reptiles ( )    Amphibians RIPPER: (Live in Water=Yes)    Fishes (Have Legs=No)    Reptiles (Give Birth=No, Can Fly=No, Live In Water=No)    Reptiles (Can Fly=Yes,Give Birth=No)    Birds ()    Mammals [Chapter 5 . 1 . 5, page 222]
C4.5 versus C4.5rules versus RIPPER C4.5 and C4.5rules: RIPPER: [Optional]
Advantages of Rule-Based Classifiers ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 1 . 6, page 223]
Instance-Based Classifiers ,[object Object],[object Object],[Optional]
5.2 Nearest-Neighbor Classifiers
Instance Based Classifiers ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 2, page 223]
Nearest Neighbor Classifiers ,[object Object],[object Object],[Chapter 5 . 2,  page 223] Training Records Test Record Compute Distance Choose k of the “nearest” records
Nearest-Neighbor Classifiers ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 2]
Definition of Nearest Neighbor K-nearest neighbors of a record x are data points that have the k smallest distance to x [Chapter 5 . 2,  page 224]
1 nearest-neighbor Voronoi Diagram [Chapter 5 .  2]
Nearest Neighbor Classification ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 2 . 1, page 226]
Nearest Neighbor Classification… ,[object Object],[object Object],[object Object],[Chapter 5 . 2 . page 225]
Nearest Neighbor Classification… ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Optional]
Nearest Neighbor Classification… ,[object Object],[object Object],[object Object],[object Object],1 1 1 1 1 1 1 1 1 1 1 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 vs d = 1.4142 d = 1.4142 ,[object Object],[Optional]
Nearest neighbor Classification… ,[object Object],[object Object],[object Object],[object Object],[Optional]
Example: PEBLS ,[object Object],[object Object],[object Object],[object Object],[object Object],[Optional]
Example: PEBLS Distance between nominal attribute values : d(Single,Married)  =  | 2/4 – 0/4 | + | 2/4 – 4/4 | =  1 d(Single,Divorced)  =  | 2/4 – 1/2 | + | 2/4 – 1/2 | =  0 d(Married,Divorced)  =  | 0/4 – 1/2 | + | 4/4 – 1/2 | =  1 d(Refund=Yes,Refund=No) = | 0/3 – 3/7 | + | 3/3 – 4/7 | = 6/7 [Optional] 1 4 2 No 1 0 2 Yes Divorced Married Single Marital Status Class 4 3 No 3 0 Yes No Yes Refund Class
Example: PEBLS Distance between record X and record Y:  where:   w X     1 if X makes accurate prediction most of the time w X   > 1 if X is not reliable for making predictions [Optional]
5.3 Bayesian Classifiers
Bayes Classifier ,[object Object],[object Object],[object Object],[Chapter 5 . 3, page 227]
Example of Bayes Theorem ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 3 . 1, page 228]
Bayesian Classifiers ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 3 . 2, page 229]
Bayesian Classifiers ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 3 . 2]
Naïve Bayes Classifier ,[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 3 . 2]
How to Estimate Probabilities from Data? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],k [Chapter 5 . 3 . 2, page 230]
How to Estimate Probabilities from Data? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],k [Chapter 5 . 3 . 3, page 232]
How to Estimate Probabilities from Data? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 3 . 3, page 235]
Example of Naïve Bayes Classifier ,[object Object],[object Object],[object Object],[object Object],Given a Test Record: [Chapter 5 . 3 . 3, page 235]
Naïve Bayes Classifier ,[object Object],[object Object],c: number of classes p: prior probability m: parameter [Optional]
Example of Naïve Bayes Classifier A: attributes M: mammals N: non-mammals P(A|M)P(M) > P(A|N)P(N) => Mammals [Optional]
Naïve Bayes (Summary) ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 3 . 5, page 240]
5.4 Artificial Neural Network (ANN)
Artificial Neural Networks (ANN) Output Y is 1 if at least two of the three inputs are equal to 1. [Chapter 5 . 4, page 247]
Artificial Neural Networks (ANN) [Chapter 5 . 4]
Artificial Neural Networks (ANN) ,[object Object],[object Object],[object Object],Perceptron Model or [Chapter 5 . 4 ]
General Structure of ANN Training ANN means learning the weights of the neurons [Chapter 5 . 4 . 2, page 251]
Algorithm for learning ANN ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 4 . 2, page 253]
5.5 Support Vector Machine (SVM)
Support Vector Machines ,[object Object],[Chapter 5 . 5, page 256]
Support Vector Machines ,[object Object],[Chapter 5 . 5 ]
Support Vector Machines ,[object Object],[Chapter 5 . 5 ]
Support Vector Machines ,[object Object],[Chapter 5 . 5 ]
Support Vector Machines ,[object Object],[object Object],[Chapter 5 . 5 ]
Support Vector Machines ,[object Object],[Chapter 5 . 5 ]
Support Vector Machines [Chapter 5 . 5 ]
Support Vector Machines ,[object Object],[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 5 ]
Support Vector Machines ,[object Object],[Chapter 5 . 5 ]
Support Vector Machines ,[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 5 ]
Nonlinear Support Vector Machines ,[object Object],[Chapter 5 . 5 . 4, page 270]
Nonlinear Support Vector Machines ,[object Object],[Chapter 5 . 5 . 4 ]
5.6 Ensemble Methods
Ensemble Methods ,[object Object],[object Object],[Chapter 5 . 6, page 276]
General Idea [Chapter 5 . 6 . 1,  page 278]
Why does it work? ,[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 6 . 1,  page 277]
Examples of Ensemble Methods ,[object Object],[object Object],[object Object],[Chapter 5 . 6 . 2,  page 279]
Bagging ,[object Object],[object Object],[object Object],[Chapter 5 . 6 . 4,  page 283]
Boosting ,[object Object],[object Object],[object Object],[Chapter 5 . 6 . 5,  page 285]
Boosting ,[object Object],[object Object],[object Object],[object Object],[Chapter 5 . 6 . 5,  page 287]
Example: AdaBoost ,[object Object],[object Object],[object Object],[Chapter 5 . 6 . 5,  page 288-289]
Example: AdaBoost ,[object Object],[object Object],[object Object],[Chapter 5 . 6 . 5,  page 288]
Illustrating AdaBoost [Chapter 5 . 6 . 5 ] Data points for training Initial weights for each data point
Illustrating AdaBoost [Chapter 5 . 6 . 5 ]

Weitere ähnliche Inhalte

Andere mochten auch (6)

Neural networks
Neural networksNeural networks
Neural networks
 
Neural network
Neural networkNeural network
Neural network
 
Artificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKSArtificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKS
 
Artificial neural networks
Artificial neural networksArtificial neural networks
Artificial neural networks
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Machine Learning and Data Mining: 12 Classification Rules
Machine Learning and Data Mining: 12 Classification RulesMachine Learning and Data Mining: 12 Classification Rules
Machine Learning and Data Mining: 12 Classification Rules
 

Ähnlich wie Ch5 alternative classification

lec06_Classification_NaiveBayes_RuleBased.pptx
lec06_Classification_NaiveBayes_RuleBased.pptxlec06_Classification_NaiveBayes_RuleBased.pptx
lec06_Classification_NaiveBayes_RuleBased.pptx
Mmine1
 
17 2 expert systems
17 2 expert systems17 2 expert systems
17 2 expert systems
Tianlu Wang
 
Test suite minimization
Test suite minimizationTest suite minimization
Test suite minimization
Gaurav Saxena
 
PSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docx
PSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docxPSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docx
PSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docx
amrit47
 

Ähnlich wie Ch5 alternative classification (20)

Rule-Based Classifiers
Rule-Based ClassifiersRule-Based Classifiers
Rule-Based Classifiers
 
lec06_Classification_NaiveBayes_RuleBased.pptx
lec06_Classification_NaiveBayes_RuleBased.pptxlec06_Classification_NaiveBayes_RuleBased.pptx
lec06_Classification_NaiveBayes_RuleBased.pptx
 
New_ML_Lecture_9.ppt
New_ML_Lecture_9.pptNew_ML_Lecture_9.ppt
New_ML_Lecture_9.ppt
 
Sparsenet
SparsenetSparsenet
Sparsenet
 
Pharo Quality Engine: The Last Strokes (esug2017)
Pharo Quality Engine: The Last Strokes (esug2017)Pharo Quality Engine: The Last Strokes (esug2017)
Pharo Quality Engine: The Last Strokes (esug2017)
 
Pharo Quality Engine: The Last Strokes
Pharo Quality Engine: The Last StrokesPharo Quality Engine: The Last Strokes
Pharo Quality Engine: The Last Strokes
 
Prediction Of Bioactivity From Chemical Structure
Prediction Of Bioactivity From Chemical StructurePrediction Of Bioactivity From Chemical Structure
Prediction Of Bioactivity From Chemical Structure
 
B0950814
B0950814B0950814
B0950814
 
Factor analysis
Factor analysisFactor analysis
Factor analysis
 
Leveraging Scala Macros for Better Validation
Leveraging Scala Macros for Better ValidationLeveraging Scala Macros for Better Validation
Leveraging Scala Macros for Better Validation
 
Classification Continued
Classification ContinuedClassification Continued
Classification Continued
 
Classification Continued
Classification ContinuedClassification Continued
Classification Continued
 
Ot regularization and_gradient_descent
Ot regularization and_gradient_descentOt regularization and_gradient_descent
Ot regularization and_gradient_descent
 
Ruleby
RulebyRuleby
Ruleby
 
17 2 expert systems
17 2 expert systems17 2 expert systems
17 2 expert systems
 
Test suite minimization
Test suite minimizationTest suite minimization
Test suite minimization
 
PSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docx
PSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docxPSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docx
PSYC 3663 SAFMEDS TERMS – Assignment 1Provide a definition for e.docx
 
A GENETIC-FROG LEAPING ALGORITHM FOR TEXT DOCUMENT CLUSTERING
A GENETIC-FROG LEAPING ALGORITHM FOR TEXT DOCUMENT CLUSTERINGA GENETIC-FROG LEAPING ALGORITHM FOR TEXT DOCUMENT CLUSTERING
A GENETIC-FROG LEAPING ALGORITHM FOR TEXT DOCUMENT CLUSTERING
 
3 recursion
3 recursion3 recursion
3 recursion
 
3-Recursion.ppt
3-Recursion.ppt3-Recursion.ppt
3-Recursion.ppt
 

Kürzlich hochgeladen

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Kürzlich hochgeladen (20)

Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 

Ch5 alternative classification

  • 1. Data Mining Classification: Alternative Techniques Lecture Notes for Chapter 5 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004
  • 2.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8. How does Rule-based Classifier Work? R1: (Give Birth = no)  (Can Fly = yes)  Birds R2: (Give Birth = no)  (Live in Water = yes)  Fishes R3: (Give Birth = yes)  (Blood Type = warm)  Mammals R4: (Give Birth = no)  (Can Fly = no)  Reptiles R5: (Live in Water = sometimes)  Amphibians A lemur triggers rule R3, so it is classified as a mammal A turtle triggers both R4 and R5 A dogfish shark triggers none of the rules [Chapter 5 . 1 . 1, page 209]
  • 9.
  • 10. From Decision Trees To Rules Rules are mutually exclusive and exhaustive Rule set contains as much information as the tree [Optional]
  • 11. Rules Can Be Simplified Initial Rule: (Refund=No)  (Status=Married)  No Simplified Rule: (Status=Married)  No [Optional]
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17. Example of Sequential Covering [Chapter 5 . 1 . 4, page 214]
  • 18. Example of Sequential Covering… [Chapter 5 . 1 . 4, page 214]
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30. Indirect Methods [Chapter 5 . 1 . 5, page 221]
  • 31.
  • 32.
  • 34. C4.5 versus C4.5rules versus RIPPER C4.5rules: (Give Birth=No, Can Fly=Yes)  Birds (Give Birth=No, Live in Water=Yes)  Fishes (Give Birth=Yes)  Mammals (Give Birth=No, Can Fly=No, Live in Water=No)  Reptiles ( )  Amphibians RIPPER: (Live in Water=Yes)  Fishes (Have Legs=No)  Reptiles (Give Birth=No, Can Fly=No, Live In Water=No)  Reptiles (Can Fly=Yes,Give Birth=No)  Birds ()  Mammals [Chapter 5 . 1 . 5, page 222]
  • 35. C4.5 versus C4.5rules versus RIPPER C4.5 and C4.5rules: RIPPER: [Optional]
  • 36.
  • 37.
  • 39.
  • 40.
  • 41.
  • 42. Definition of Nearest Neighbor K-nearest neighbors of a record x are data points that have the k smallest distance to x [Chapter 5 . 2, page 224]
  • 43. 1 nearest-neighbor Voronoi Diagram [Chapter 5 . 2]
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50. Example: PEBLS Distance between nominal attribute values : d(Single,Married) = | 2/4 – 0/4 | + | 2/4 – 4/4 | = 1 d(Single,Divorced) = | 2/4 – 1/2 | + | 2/4 – 1/2 | = 0 d(Married,Divorced) = | 0/4 – 1/2 | + | 4/4 – 1/2 | = 1 d(Refund=Yes,Refund=No) = | 0/3 – 3/7 | + | 3/3 – 4/7 | = 6/7 [Optional] 1 4 2 No 1 0 2 Yes Divorced Married Single Marital Status Class 4 3 No 3 0 Yes No Yes Refund Class
  • 51. Example: PEBLS Distance between record X and record Y: where: w X  1 if X makes accurate prediction most of the time w X > 1 if X is not reliable for making predictions [Optional]
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.
  • 61.
  • 62.
  • 63. Example of Naïve Bayes Classifier A: attributes M: mammals N: non-mammals P(A|M)P(M) > P(A|N)P(N) => Mammals [Optional]
  • 64.
  • 65. 5.4 Artificial Neural Network (ANN)
  • 66. Artificial Neural Networks (ANN) Output Y is 1 if at least two of the three inputs are equal to 1. [Chapter 5 . 4, page 247]
  • 67. Artificial Neural Networks (ANN) [Chapter 5 . 4]
  • 68.
  • 69. General Structure of ANN Training ANN means learning the weights of the neurons [Chapter 5 . 4 . 2, page 251]
  • 70.
  • 71. 5.5 Support Vector Machine (SVM)
  • 72.
  • 73.
  • 74.
  • 75.
  • 76.
  • 77.
  • 78. Support Vector Machines [Chapter 5 . 5 ]
  • 79.
  • 80.
  • 81.
  • 82.
  • 83.
  • 85.
  • 86. General Idea [Chapter 5 . 6 . 1, page 278]
  • 87.
  • 88.
  • 89.
  • 90.
  • 91.
  • 92.
  • 93.
  • 94. Illustrating AdaBoost [Chapter 5 . 6 . 5 ] Data points for training Initial weights for each data point