Cluster Analysis (modified by Donghui Zhang)
Chapter 7. Cluster Analysis
What is Cluster Analysis?
General Applications of Clustering
Examples of Clustering Applications
What Is Good Clustering?
Requirements of Clustering in Data Mining
Major Clustering Approaches
Partitioning Algorithms: Basic Concept
The K-Means Clustering Method
The K-Means Clustering Method [Figure: example with K = 2 on a 0-10 grid. Arbitrarily choose K objects as initial cluster centers; assign each object to the most similar center; update the cluster means; reassign and update the cluster means again.]
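The loop shown in the figure (choose K initial centers, assign each object to the most similar center, update the cluster means, reassign) can be sketched in a few lines of Python. This is a minimal illustration and not the slide author's code: the function name `kmeans`, the squared Euclidean distance as the similarity measure, the `max_iter` cap, and the fixed random `seed` are all assumptions made for the example.

```python
import random

def kmeans(points, k, max_iter=100, seed=0):
    """Plain K-means on a list of equal-length numeric tuples (illustrative sketch)."""
    rng = random.Random(seed)
    # Arbitrarily choose K objects as the initial cluster centers.
    centers = rng.sample(points, k)
    assignment = None
    for _ in range(max_iter):
        # Assign each object to the most similar (nearest) center.
        new_assignment = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        if new_assignment == assignment:
            break  # no object changed cluster, so the means would not move
        assignment = new_assignment
        # Update each cluster mean from its current members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old center if a cluster becomes empty
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centers, assignment

if __name__ == "__main__":
    data = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
    print(kmeans(data, k=2))
```

Because the initial centers are picked at random, different seeds can lead to different local optima on less well-separated data.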
Comments on the K-Means Method
Variations of the K-Means Method
Hierarchical Clustering [Figure: objects a, b, c, d, e. Agglomerative clustering (AGNES) merges them from Step 0 to Step 4; divisive clustering (DIANA) splits them in the reverse order, from Step 4 back to Step 0.]
AGNES (Agglomerative Nesting)
A Dendrogram Shows How the Clusters are Merged Hierarchically
DIANA (Divisive Analysis)
More on Hierarchical Clustering Methods
BIRCH (1996): Balanced Iterative Reducing and Clustering using Hierarchies
Clustering Feature Vector
Clustering Feature: $CF = (N, LS, SS)$
$N$: number of data points
$LS = \sum_{i=1}^{N} X_i$
$SS = \sum_{i=1}^{N} X_i^2$
Example: the points (3,4), (2,6), (4,5), (4,7), (3,8) give $CF = (5, (16, 30), 244)$.
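The example CF can be checked with a short Python sketch. The class name `CF` and its methods are hypothetical names chosen for illustration; `merge` relies on the standard additivity property of clustering features (two CFs combine by component-wise addition), which is assumed here rather than taken from the slide text.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class CF:
    n: int        # N: number of data points
    ls: tuple     # LS: per-dimension linear sum of the points
    ss: float     # SS: sum of squared coordinates over all points

    @staticmethod
    def from_points(points):
        n = len(points)
        ls = tuple(sum(col) for col in zip(*points))
        ss = sum(x * x for p in points for x in p)
        return CF(n, ls, ss)

    def merge(self, other):
        # Additivity (assumed standard property): CF1 + CF2 is component-wise.
        return CF(self.n + other.n,
                  tuple(a + b for a, b in zip(self.ls, other.ls)),
                  self.ss + other.ss)

    def centroid(self):
        return tuple(x / self.n for x in self.ls)

if __name__ == "__main__":
    pts = [(3, 4), (2, 6), (4, 5), (4, 7), (3, 8)]
    print(CF.from_points(pts))   # CF(n=5, ls=(16, 30), ss=244)
```

Storing only (N, LS, SS) is enough to recover the centroid LS / N without revisiting the individual points.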
Some Characteristics of CF
Some Characteristics of CF
CF-Tree in BIRCH
CF Tree [Figure: CF tree with B = 7 and L = 6. The root and non-leaf nodes hold entries of the form (CF i, child i); leaf nodes hold CF entries and are linked to each other by prev and next pointers.]
CF-Tree Construction
CURE (Clustering Using REpresentatives)
Motivation: K-means is not good
AGNES + d min is not good, either
Cure: The Basic Version
Representative points
Cure: for large data set. A representative is close to it.
Data Partitioning and Clustering [Figure: x/y diagrams of partitioned data; details not recoverable.]
Cure: Shrinking Representative Points [Figure: x/y diagrams; details not recoverable.]
CHAMELEON
Example of CHAMELEON: Data Set, Construct Sparse Graph, Partition the Graph, Merge Partition, Final Clusters
Density-Based Clustering Methods
Density Concepts
Density-Based Clustering: Background [Figure: points p and q, with MinPts = 5 and Eps = 1 cm.]
Density-Based Clustering: Background (II) [Figure: points p, q, p1, and o.]
DBSCAN: Density Based Spatial Clustering of Applications with Noise [Figure: core, border, and outlier points, with Eps = 1 cm and MinPts = 5.]
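The core, border, and outlier labels in the figure can be illustrated with a small Python sketch. It assumes the usual DBSCAN definitions, which are not spelled out in the surviving slide text: a point is a core point if its Eps-neighborhood (counting the point itself) holds at least MinPts points, a border point if it is not core but lies within Eps of a core point, and an outlier otherwise. The function name `classify_points` and the sample data are hypothetical.

```python
import math

def classify_points(points, eps, min_pts):
    """Label each point 'core', 'border', or 'outlier' (illustrative sketch)."""
    # Eps-neighborhood of every point; each point counts as its own neighbor.
    neighbors = [
        [j for j, q in enumerate(points) if math.dist(p, q) <= eps]
        for p in points
    ]
    core = {i for i, nb in enumerate(neighbors) if len(nb) >= min_pts}
    labels = []
    for i, nb in enumerate(neighbors):
        if i in core:
            labels.append("core")
        elif any(j in core for j in nb):
            labels.append("border")   # not dense itself, but next to a core point
        else:
            labels.append("outlier")
    return labels

if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (1, 0), (1, 1), (2, 1), (5, 5)]
    print(classify_points(pts, eps=1.0, min_pts=3))
```

This brute-force version compares every pair of points; a spatial index would normally replace the inner loop on larger data sets.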
DBSCAN: The Algorithm
OPTICS: A Cluster-Ordering Method (1999)
OPTICS: Some Extensions from DBSCAN [Figure: data set D with object o and points p1 and p2; MinPts = 5, ε = 3 cm. Reachability-distance of p from o: max(core-distance(o), d(o, p)). In the example, r(p1, o) = 2.8 cm and r(p2, o) = 4 cm.]
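The formula max(core-distance(o), d(o, p)) can be written out as a short Python sketch. The helper names are hypothetical, and two conventions are assumed rather than taken from the slide: the neighborhood of o counts o itself, and the core-distance is the distance from o to its MinPts-th nearest point, left undefined (`None`) when that distance exceeds the radius `eps`.

```python
import math

def core_distance(o, points, eps, min_pts):
    """Distance from o to its min_pts-th nearest point, or None if o is not core."""
    dists = sorted(math.dist(o, p) for p in points)  # includes o itself at 0.0
    if len(dists) < min_pts or dists[min_pts - 1] > eps:
        return None  # undefined: fewer than min_pts points within eps of o
    return dists[min_pts - 1]

def reachability_distance(p, o, points, eps, min_pts):
    """max(core-distance(o), d(o, p)), or None when o is not a core object."""
    cd = core_distance(o, points, eps, min_pts)
    if cd is None:
        return None
    return max(cd, math.dist(o, p))

if __name__ == "__main__":
    o = (0.0, 0.0)
    data = [o, (1.0, 0.0), (0.0, 2.0), (0.0, -2.5)]
    for p in data[1:]:
        print(p, reachability_distance(p, o, data, eps=3.0, min_pts=3))
```

A point closer to o than the core-distance gets the core-distance as its reachability-distance, which mirrors how r(p1, o) in the figure can be larger than the plain distance between p1 and o.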
[Figure: reachability plot. Horizontal axis: cluster order of the objects; vertical axis: reachability-distance, which may be undefined.]
Density-Based Cluster analysis: OPTICS & Its Applications
DENCLUE: Using density functions
Denclue: Technical Essence
Gradient: The steepness of a slope
Density Attractor
Center-Defined and Arbitrary
Grid-Based Clustering Method
STING: A Statistical Information Grid Approach
STING: A Statistical Information Grid Approach (2)
STING: A Statistical Information Grid Approach (3)
WaveCluster (1998)
What is Wavelet (1)?
WaveCluster (1998)
Wavelet Transform
What Is Wavelet (2)?
Quantization
Transformation
WaveCluster (1998)
CLIQUE (Clustering In QUEst)
CLIQUE: The Major Steps
[Figure: CLIQUE example. Two grids plot salary (in units of 10,000) and vacation (in weeks), each on a 0-7 scale, against age (20-60), with threshold τ = 3. A three-dimensional sketch over age, vacation, and salary marks the age range 30 to 50.]
Strength and Weakness of CLIQUE
Model-Based Clustering Methods
COBWEB Clustering Method [Figure: a classification tree.]
More on Statistical-Based Clustering
Other Model-Based Clustering Methods
Model-Based Clustering Methods
Self-organizing feature maps (SOMs)
What Is Outlier Discovery?
Outlier Discovery: Statistical Approaches
Outlier Discovery: Distance-Based Approach
Outlier Discovery: Deviation-Based Approach
Problems and Challenges
Constraint-Based Clustering Analysis
Clustering With Obstacle Objects [Figure: two panels, "Not taking obstacles into account" and "Taking obstacles into account".]
Summary
