SlideShare ist ein Scribd-Unternehmen logo
1 von 24
1
Spatial Data Mining
Spatial Database
2
Stores a large amount of space-related data
Maps
Remote Sensing
Medical Imaging
VLSI chip layout
Have Topological and distance information
Require spatial indexing, data access, reasoning ,geometric
computation and knowledge representation techniques
Spatial Data Mining
3
Extraction of knowledge, spatial relationships from
spatial databases
Can be used for understanding spatial data and spatial
relationships
Applications:
GIS, Geomarketing, Remote Sensing, Image database
exploration, medical imaging, Navigation
Challenges
Complexity of spatial data types and access methods
Large amounts of data
Cont.
4
Non-spatial Information
Same as data in traditional data mining
Numerical, categorical, ordinal, boolean, etc
e.g., city name, city population
Spatial Information
Spatial attribute: geographically referenced
 Neighborhood and extent
Location, e.g., longitude, latitude, elevation
Spatial data representations
Raster: gridded space
Vector: point, line, polygon
Graph: node, edge, path
Spatial Data
5
Cont.
6
Statistical techniques
7
Popular approach to analyze spatial data
Assumes independence among spatial data
Can be performed only by experts
Do not work well with symbolic values
Spatial Data Warehousing
8
Spatial data warehouse: Integrated, subject-oriented, time-variant,
and nonvolatile spatial data repository.
It consists of both spatial and non spatial in support of spatial data mining
and spatial-data-related decision-making processes.
Spatial data cube: multidimensional spatial database
Both dimensions and measures may contain spatial components.
Challenging issues:
Spatial data integration: a big issue
Structure-specific formats (raster- vs. vector-based, OO vs. relational models,
different storage and indexing, etc.)
Vendor-specific formats (ESRI, MapInfo, Intergraph, IDRISI, etc.)
Realization of Fast and flexible OLAP in spatial data warehouses.
Dimensions and Measures in Spatial
Data Warehouse
9
Dimensions
non-spatial
e.g. “25-30 degrees” generalizes to“hot” (both are strings)
spatial-to-non spatial
e.g. Seattle generalizes to description “Pacific Northwest” (as a string)
spatial-to-spatial
e.g. Seattle generalizes to Pacific Northwest (as a spatial region)
Measures
numerical (e.g. monthly revenue of a region)
distributive (e.g. count, sum)
algebraic (e.g. average)
holistic (e.g. median, rank)
spatial
collection of spatial pointers (e.g. pointers to all regions with temperature of
25-30 degrees in July)
Example: British Columbia Weather
Pattern Analysis
10
Input
A map with about 3,000 weather probes scattered in B.C.
Recording daily data for temperature, precipitation, wind velocity, etc. for a designated
small area and transmitting signal to a provincial weather station.
Data warehouse using star schema
Output
A map that reveals patterns: merged (similar) regions
Goals
Interactive analysis (drill-down, slice, dice, pivot, roll-up)
Fast response time
Minimizing storage space used
Challenge
A merged region may contain hundreds of “primitive” regions (polygons)
Star Schema of the BC Weather
Warehouse
Spatial data warehouse
Dimensions
region_name
time
temperature
precipitation
Measurements
region_map
area
count
11Fact tableDimension table
12
Can we precompute all of the possible spatial merges
and store them in the corresponding cuboid cells of a
spatial data cube?
Probably not.
It requires multi-megabytes of storage.
On-line computation is slow and expensive.
Dynamic Merging of Spatial
Objects
13
Methods for Computing Spatial Data
Cubes
14
On-line aggregation: collect and store pointers to spatial
objects in a spatial data cube
expensive and slow, need efficient aggregation techniques
Precompute and store all the possible combinations
huge space overhead
Precompute and store rough approximations in a spatial data
cube
accuracy trade-off, MBR
Selective computation: only materialize those which will be
accessed frequently
a reasonable choice
Mining Spatial Association and
Co-location Patterns
15
Spatial association rule: A ⇒ B [s%, c%]
A and B are sets of spatial or non-spatial predicates
Topological relations: intersects, overlaps, disjoint, etc.
Spatial orientations: left_of, west_of, under, etc.
Distance information: close_to, within_distance, etc.
s% is the support and c% is the confidence of the rule
Examples
is_a(x, “School”) ^ Close_to(x, “Sports_Center”) → close_to(x, “Park”)
[7%, 85%]
Progressive Refinement
16
Progressive Refinement:
spatial association mining needs to evaluate multiple spatial relationships
among a large no. of spatial object – expensive.
Hierarchy of spatial relationship:
First search for rough relationship and then refine it
Superset coverage property – all the potential answers should be perserved
(i.e.false-positive test).
Two-step mining of spatial association:
Step 1: Rough spatial computation (as a filter)
 Using MBR for rough estimation
Step2: Detailed spatial algorithm (as refinement)
 Apply only to those objects which have passed the rough spatial association test
(no less than min_support)
Spatial co-locations
17
Just what one really wants to explore.
Based on the property of spatial autocorrelation, interesting
features likely coexist in closely located regions.
Efficient methods - Apriori , progressive refinement,etc.
18
Spatial Cluster Analysis
19
• Mining clusters—k-means, k-medoids, hierarchical, density-based,
etc.
• Analysis of distinct features of the clusters
Spatial Classification
20
Analyze spatial objects to derive classification schemes, such
as decision trees, in relevance to certain spatial properties
(district, highway, river, etc.)
Classifying medium-size families according to income, region, and infant mortality
rates
Mining for volcanoes on Venus
Employ methods such as:
Decision-tree classification, Naïve-Bayesian classifier + boosting, neural network,
genetic programming, etc.
Spatial Trend Analysis
21
Function
Detect changes and trends along a spatial dimension
Study the trend of non-spatial or spatial data changing with space
Application examples
Observe the trend of changes of the climate or vegetation with
increasing distance from an ocean
Crime rate or unemployment rate change with regard to city geo-
distribution.
Traffic flows in highways and in cities.
Mining Raster Databases
22
Vector data Mining
Maps
Graphs
Molecular chains
Raster data mining
Satellite Images
23
Other Applications
24
Spatial data mining is used in
 NASA Earth Observing System (EOS): Earth science data
National Inst. of Justice: crime mapping
 Census Bureau, Dept. of Commerce: census data
 Dept. of Transportation (DOT): traffic data
National Inst. of Health(NIH): cancer clusters
 Commerce, e.g. Retail Analysis

Weitere ähnliche Inhalte

Was ist angesagt?

Data cube computation
Data cube computationData cube computation
Data cube computation
Rashmi Sheikh
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
ankur bhalla
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
Slideshare
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
pcherukumalla
 

Was ist angesagt? (20)

3.2 partitioning methods
3.2 partitioning methods3.2 partitioning methods
3.2 partitioning methods
 
Data Mining
Data MiningData Mining
Data Mining
 
Data cube computation
Data cube computationData cube computation
Data cube computation
 
3. mining frequent patterns
3. mining frequent patterns3. mining frequent patterns
3. mining frequent patterns
 
1.2 steps and functionalities
1.2 steps and functionalities1.2 steps and functionalities
1.2 steps and functionalities
 
Spatial Database
Spatial DatabaseSpatial Database
Spatial Database
 
Multimedia Database
Multimedia Database Multimedia Database
Multimedia Database
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Kdd process
Kdd processKdd process
Kdd process
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
 
5.2 mining time series data
5.2 mining time series data5.2 mining time series data
5.2 mining time series data
 
Data mining primitives
Data mining primitivesData mining primitives
Data mining primitives
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
 
OLAP operations
OLAP operationsOLAP operations
OLAP operations
 
Data mining: Classification and prediction
Data mining: Classification and predictionData mining: Classification and prediction
Data mining: Classification and prediction
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationData Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalization
 
Multidimensional data models
Multidimensional data  modelsMultidimensional data  models
Multidimensional data models
 
Temporal databases
Temporal databasesTemporal databases
Temporal databases
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 

Ähnlich wie 4.2 spatial data mining

Geographical Information System By Zewde Alemayehu Tilahun.pptx
Geographical Information System By Zewde Alemayehu Tilahun.pptxGeographical Information System By Zewde Alemayehu Tilahun.pptx
Geographical Information System By Zewde Alemayehu Tilahun.pptx
zewde alemayehu
 

Ähnlich wie 4.2 spatial data mining (20)

Spatial databases
Spatial databasesSpatial databases
Spatial databases
 
Data warehousing and data mining Chapter 9
Data warehousing and data mining Chapter 9Data warehousing and data mining Chapter 9
Data warehousing and data mining Chapter 9
 
Spatial databases
Spatial databasesSpatial databases
Spatial databases
 
RS_GIS_Crop_monitoring-converted.pptx
RS_GIS_Crop_monitoring-converted.pptxRS_GIS_Crop_monitoring-converted.pptx
RS_GIS_Crop_monitoring-converted.pptx
 
Scattered gis handbook
Scattered gis handbookScattered gis handbook
Scattered gis handbook
 
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSAPIAS 2013-GIS.pptxfskjczjsbchdbfscnnND  dHSA
PIAS 2013-GIS.pptxfskjczjsbchdbfscnnND dHSA
 
Fundamentals of GIS
Fundamentals of GISFundamentals of GIS
Fundamentals of GIS
 
Geographic Information System unit 1
Geographic Information System   unit 1Geographic Information System   unit 1
Geographic Information System unit 1
 
Basic of gis concept and theories
Basic of gis concept and theoriesBasic of gis concept and theories
Basic of gis concept and theories
 
Archaeological Applications of Geographic Information Systems (GIS)
Archaeological Applications of Geographic Information Systems (GIS)Archaeological Applications of Geographic Information Systems (GIS)
Archaeological Applications of Geographic Information Systems (GIS)
 
17 manjula aakunuri final_paper--185-190
17 manjula aakunuri final_paper--185-19017 manjula aakunuri final_paper--185-190
17 manjula aakunuri final_paper--185-190
 
Gis
GisGis
Gis
 
Data collection
Data collectionData collection
Data collection
 
INTRODUCTION_TO_GIS.ppt
INTRODUCTION_TO_GIS.pptINTRODUCTION_TO_GIS.ppt
INTRODUCTION_TO_GIS.ppt
 
Spatial Databases
Spatial DatabasesSpatial Databases
Spatial Databases
 
Geographical Information System By Zewde Alemayehu Tilahun.pptx
Geographical Information System By Zewde Alemayehu Tilahun.pptxGeographical Information System By Zewde Alemayehu Tilahun.pptx
Geographical Information System By Zewde Alemayehu Tilahun.pptx
 
Geographical information system by zewde alemayehu tilahun
Geographical information system by zewde alemayehu tilahunGeographical information system by zewde alemayehu tilahun
Geographical information system by zewde alemayehu tilahun
 
spatial databases ADBMS ppt
spatial databases ADBMS pptspatial databases ADBMS ppt
spatial databases ADBMS ppt
 
Geographic Information System for Bachelor in Agriculture Engineering
Geographic Information System for Bachelor in Agriculture EngineeringGeographic Information System for Bachelor in Agriculture Engineering
Geographic Information System for Bachelor in Agriculture Engineering
 
Gis basic-2
Gis basic-2Gis basic-2
Gis basic-2
 

Mehr von Krish_ver2

Mehr von Krish_ver2 (20)

5.5 back tracking
5.5 back tracking5.5 back tracking
5.5 back tracking
 
5.5 back track
5.5 back track5.5 back track
5.5 back track
 
5.5 back tracking 02
5.5 back tracking 025.5 back tracking 02
5.5 back tracking 02
 
5.4 randomized datastructures
5.4 randomized datastructures5.4 randomized datastructures
5.4 randomized datastructures
 
5.4 randomized datastructures
5.4 randomized datastructures5.4 randomized datastructures
5.4 randomized datastructures
 
5.4 randamized algorithm
5.4 randamized algorithm5.4 randamized algorithm
5.4 randamized algorithm
 
5.3 dynamic programming 03
5.3 dynamic programming 035.3 dynamic programming 03
5.3 dynamic programming 03
 
5.3 dynamic programming
5.3 dynamic programming5.3 dynamic programming
5.3 dynamic programming
 
5.3 dyn algo-i
5.3 dyn algo-i5.3 dyn algo-i
5.3 dyn algo-i
 
5.2 divede and conquer 03
5.2 divede and conquer 035.2 divede and conquer 03
5.2 divede and conquer 03
 
5.2 divide and conquer
5.2 divide and conquer5.2 divide and conquer
5.2 divide and conquer
 
5.2 divede and conquer 03
5.2 divede and conquer 035.2 divede and conquer 03
5.2 divede and conquer 03
 
5.1 greedyyy 02
5.1 greedyyy 025.1 greedyyy 02
5.1 greedyyy 02
 
5.1 greedy
5.1 greedy5.1 greedy
5.1 greedy
 
5.1 greedy 03
5.1 greedy 035.1 greedy 03
5.1 greedy 03
 
4.4 hashing02
4.4 hashing024.4 hashing02
4.4 hashing02
 
4.4 hashing
4.4 hashing4.4 hashing
4.4 hashing
 
4.4 hashing ext
4.4 hashing  ext4.4 hashing  ext
4.4 hashing ext
 
4.4 external hashing
4.4 external hashing4.4 external hashing
4.4 external hashing
 
4.2 bst
4.2 bst4.2 bst
4.2 bst
 

Kürzlich hochgeladen

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 

Kürzlich hochgeladen (20)

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 

4.2 spatial data mining

  • 2. Spatial Database 2 Stores a large amount of space-related data Maps Remote Sensing Medical Imaging VLSI chip layout Have Topological and distance information Require spatial indexing, data access, reasoning ,geometric computation and knowledge representation techniques
  • 3. Spatial Data Mining 3 Extraction of knowledge, spatial relationships from spatial databases Can be used for understanding spatial data and spatial relationships Applications: GIS, Geomarketing, Remote Sensing, Image database exploration, medical imaging, Navigation Challenges Complexity of spatial data types and access methods Large amounts of data
  • 4. Cont. 4 Non-spatial Information Same as data in traditional data mining Numerical, categorical, ordinal, boolean, etc e.g., city name, city population Spatial Information Spatial attribute: geographically referenced  Neighborhood and extent Location, e.g., longitude, latitude, elevation Spatial data representations Raster: gridded space Vector: point, line, polygon Graph: node, edge, path
  • 7. Statistical techniques 7 Popular approach to analyze spatial data Assumes independence among spatial data Can be performed only by experts Do not work well with symbolic values
  • 8. Spatial Data Warehousing 8 Spatial data warehouse: Integrated, subject-oriented, time-variant, and nonvolatile spatial data repository. It consists of both spatial and non spatial in support of spatial data mining and spatial-data-related decision-making processes. Spatial data cube: multidimensional spatial database Both dimensions and measures may contain spatial components. Challenging issues: Spatial data integration: a big issue Structure-specific formats (raster- vs. vector-based, OO vs. relational models, different storage and indexing, etc.) Vendor-specific formats (ESRI, MapInfo, Intergraph, IDRISI, etc.) Realization of Fast and flexible OLAP in spatial data warehouses.
  • 9. Dimensions and Measures in Spatial Data Warehouse 9 Dimensions non-spatial e.g. “25-30 degrees” generalizes to“hot” (both are strings) spatial-to-non spatial e.g. Seattle generalizes to description “Pacific Northwest” (as a string) spatial-to-spatial e.g. Seattle generalizes to Pacific Northwest (as a spatial region) Measures numerical (e.g. monthly revenue of a region) distributive (e.g. count, sum) algebraic (e.g. average) holistic (e.g. median, rank) spatial collection of spatial pointers (e.g. pointers to all regions with temperature of 25-30 degrees in July)
  • 10. Example: British Columbia Weather Pattern Analysis 10 Input A map with about 3,000 weather probes scattered in B.C. Recording daily data for temperature, precipitation, wind velocity, etc. for a designated small area and transmitting signal to a provincial weather station. Data warehouse using star schema Output A map that reveals patterns: merged (similar) regions Goals Interactive analysis (drill-down, slice, dice, pivot, roll-up) Fast response time Minimizing storage space used Challenge A merged region may contain hundreds of “primitive” regions (polygons)
  • 11. Star Schema of the BC Weather Warehouse Spatial data warehouse Dimensions region_name time temperature precipitation Measurements region_map area count 11Fact tableDimension table
  • 12. 12 Can we precompute all of the possible spatial merges and store them in the corresponding cuboid cells of a spatial data cube? Probably not. It requires multi-megabytes of storage. On-line computation is slow and expensive.
  • 13. Dynamic Merging of Spatial Objects 13
  • 14. Methods for Computing Spatial Data Cubes 14 On-line aggregation: collect and store pointers to spatial objects in a spatial data cube expensive and slow, need efficient aggregation techniques Precompute and store all the possible combinations huge space overhead Precompute and store rough approximations in a spatial data cube accuracy trade-off, MBR Selective computation: only materialize those which will be accessed frequently a reasonable choice
  • 15. Mining Spatial Association and Co-location Patterns 15 Spatial association rule: A ⇒ B [s%, c%] A and B are sets of spatial or non-spatial predicates Topological relations: intersects, overlaps, disjoint, etc. Spatial orientations: left_of, west_of, under, etc. Distance information: close_to, within_distance, etc. s% is the support and c% is the confidence of the rule Examples is_a(x, “School”) ^ Close_to(x, “Sports_Center”) → close_to(x, “Park”) [7%, 85%]
  • 16. Progressive Refinement 16 Progressive Refinement: spatial association mining needs to evaluate multiple spatial relationships among a large no. of spatial object – expensive. Hierarchy of spatial relationship: First search for rough relationship and then refine it Superset coverage property – all the potential answers should be perserved (i.e.false-positive test). Two-step mining of spatial association: Step 1: Rough spatial computation (as a filter)  Using MBR for rough estimation Step2: Detailed spatial algorithm (as refinement)  Apply only to those objects which have passed the rough spatial association test (no less than min_support)
  • 17. Spatial co-locations 17 Just what one really wants to explore. Based on the property of spatial autocorrelation, interesting features likely coexist in closely located regions. Efficient methods - Apriori , progressive refinement,etc.
  • 18. 18
  • 19. Spatial Cluster Analysis 19 • Mining clusters—k-means, k-medoids, hierarchical, density-based, etc. • Analysis of distinct features of the clusters
  • 20. Spatial Classification 20 Analyze spatial objects to derive classification schemes, such as decision trees, in relevance to certain spatial properties (district, highway, river, etc.) Classifying medium-size families according to income, region, and infant mortality rates Mining for volcanoes on Venus Employ methods such as: Decision-tree classification, Naïve-Bayesian classifier + boosting, neural network, genetic programming, etc.
  • 21. Spatial Trend Analysis 21 Function Detect changes and trends along a spatial dimension Study the trend of non-spatial or spatial data changing with space Application examples Observe the trend of changes of the climate or vegetation with increasing distance from an ocean Crime rate or unemployment rate change with regard to city geo- distribution. Traffic flows in highways and in cities.
  • 22. Mining Raster Databases 22 Vector data Mining Maps Graphs Molecular chains Raster data mining Satellite Images
  • 23. 23
  • 24. Other Applications 24 Spatial data mining is used in  NASA Earth Observing System (EOS): Earth science data National Inst. of Justice: crime mapping  Census Bureau, Dept. of Commerce: census data  Dept. of Transportation (DOT): traffic data National Inst. of Health(NIH): cancer clusters  Commerce, e.g. Retail Analysis