SlideShare ist ein Scribd-Unternehmen logo
1 von 46
10 Steps to Optimize Your Crime Analysis




             340 N 12th St, Suite 402
             Philadelphia, PA 19107
                  215.925.2600
               info@azavea.com
           www.azavea.com/hunchlab
About Us


    Robert Cheetham
    President & CEO
    cheetham@azavea.com
    215.701.7713




   Jeremy Heffner
   HunchLab Product Manager
   jheffner@azavea.com
   215.701.7712
About Azavea

• Founded in 2000
• 32 people
• Based in Philadelphia
   – Boston office
   – Minneapolis office
• Geospatial + web + mobile
   – Software development
   – Spatial analysis services
Clients & Industries

•   Public Safety
•   Municipal Services
•   Public Health
•   Human Services
•   Culture
•   Elections & Politics
•   Land Conservation
•   Economic Development
web-based crime analysis, early warning, and risk forecasting
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis

    Crime Analysis


             Data              Analytic
                                                       Use Cases
            Quality          Techniques

     •   Geocoding       •   Kernel Density Map    •   Open data
                             Predictive Accuracy
     •   Dates & Times                             •   Find research
                         •   NNI and Gi*               partners
     •   Polygon
         Hierarchies     •   Near Repeat
                             Calculator
                         •   Randomized
                             controlled trial
                         •   Risk Terrain
                             Modeling
Data Quality
1. Examine Geocoding Accuracy

• Geocoding
  – Process of turning addresses into geographic coordinates
  – Examine accuracy
     • Correct locations
         – Geocoding method
             Âť Commercial geocoder
             Âť Street center line
             Âť Parcel database
         – POIs and landmarks
         – Incorrect clustering at:
             Âť Precinct locations, zip code and city centroids
     • High geocoding success rates
         – Ratcliffe suggests at least 85% (lowest acceptable)
             Âť http://bit.ly/ratcliffegeocoding
         – Examine unsuccessful geocodes for patterns
2. Examine Dates & Times

• Dates & Times
   – Event-related times
      • Actual occurrence time
          – From / to time interval
      • Report time
      • Officers responded time
2. Examine Dates & Times

• Dates & Times
   – Examining accuracy
      • Data entry defaults
      • Data validation on input
      • Clustering by time cycles
          –   Day of week
          –   Day of month
          –   Day of year
          –   Hour of day
          –   Minute of hour
2. Examine Dates & Times
3. Examine Polygon Hierarchy

• Polygon Hierarchy
   – A set of geographic areas that nest within each other used
     to organize resources (i.e. divisions, districts, PSAs, beats)
• What makes a good hierarchy?
   – Perfectly nested polygons
       • No sliver polygons
   – Areas should be periodically rebalanced based on
     changing crime levels
   – Consider that splitting areas based on streets means one
     side of street is in one district / other side is in a different
     district.
Analytic Techniques
4. Test Predictive Accuracy of KDE

• Kernel Density Estimation
   – A smoothing technique that generates hotspot maps
4. Test Predictive Accuracy of KDE

• When we look at a hotspot map what are we assuming?
   – That crimes will happen in the hotspots again.
   – But…
      •   How predictive is it?
      •   How much historic data should we use?
      •   What search radius should we use?
      •   What is the density cutoff for a hotspot?




          Source: Chainey, http://www.popcenter.org/conference/conferencepapers/2010/Chainey-Gi-hotSpots.pdf
4. Test Predictive Accuracy of KDE

• Predictive Accuracy Index
   – Spencer Chainey, Jill Dando Institute
      • http://www.palgrave-journals.com/sj/journal/v21/n1/full/8350066a.html
   – Incorporates:
      • Desire for a high hit rate
           – Lots of crime incidents in a prior ‘hotspot’
      • Desire for a small geographic area
           – Less to patrol, etc.
4. Test Predictive Accuracy of KDE

• Predictive Accuracy Index Steps
    1. Generate kernel density map for historic period
    2. Measure predictive validity against future time period
    3. Higher number better
•   Example from Chainey
4. Test Predictive Accuracy of KDE

• Predictive Accuracy Index
   – Caveats
      • Number is relative to crime type and geography
      • Best for comparing different techniques (or parameter
        variations of techniques) for the same predictive period
4. Test Predictive Accuracy of KDE

• Predictive Accuracy Index
   – Caveats
      • Number is relative to crime type and geography
      • Best for comparing different techniques (or parameter
        variations of techniques) for the same predictive period




         Is there something similar but better than kernel density?
5. Test NNI and use Gi*

• Gi*
  – Spencer Chainey, Jill Dando Institute
        • http://www.popcenter.org/conference/conferencepapers/2010/Chainey-Gi-hotSpots.pdf

  – LISA statistic
        • Local indicator of spatial association
  – Compares local averages to global averages
  – Generates map visually similar to KDE
5. Test NNI and use Gi*
5. Test NNI and use Gi*

• How do we know hotspots exist though?
   – Calculate nearest neighbor index (NNI)
      • Determines if clustering exists
          – NNI ~ 1: data is randomly distributed
          – NNI < 1: data is clustered
          – NNI > 1: data is uniformly distributed
      • Helps to answer if we have enough historic data for statistical
        significance
      • If data is not clustered neither Gi* nor KDE should be used
5. Test NNI and use Gi*

• Summary of Steps
   –   Test for clustering with nearest neighbor index
   –   Calculate crime counts within a grid
   –   Run Gi* statistic
   –   Set color ramp breakpoints based on fixed statistical
       significance levels
5. Test NNI and use Gi*

• Summary of Steps
   –   Test for clustering with nearest neighbor index
   –   Calculate crime counts within a grid
   –   Run Gi* statistic
   –   Set color ramp breakpoints based on fixed statistical
       significance levels



            Remember our friend the predictive accuracy index?
5. Test NNI and use Gi*

• Is it really better than KDE?
   – Example from Chainey
6. Run the Near Repeat Calculator

• Near Repeat Pattern Analysis
   – Measures ‘contagion’ effect of crime incident
   – How does one burglary change the risk that another
     burglary will occur nearby in the coming days?
• Common in Some Types of Crime
   –   Burglary
   –   Theft from Vehicle
   –   Gun Crime
   –   Robbery
   –   Bicycle Theft
6. Run the Near Repeat Calculator
6. Run the Near Repeat Calculator
6. Run the Near Repeat Calculator

• Near Repeat Calculator
   – http://www.temple.edu/cj/misc/nr/
• Papers
   – Near-Repeat Patterns in Philadelphia Shootings (2008)
      • One city block & two weeks after one shooting
           – 33% increase in likelihood of a second event




                                               Jerry Ratcliffe
                                             Temple University
7. Conduct a Randomized Trial

• Randomized Controlled Trial
   – An experiment where study subjects (e.g. locations) are
     randomly assigned to different treatment protocols
   – Academics do this regularly
• But why do this yourself?
   – Proves/disproves a technique’s efficacy for your
     department
   – Successfully mimicking a published trial gives you the skills
     to experiment based on local anecdotal evidence
7. Conduct a Randomized Trial

• Philadelphia Foot Patrol Experiment
   – Jerry Ratcliffe, Temple University
      • http://bit.ly/phillyfootpatrol
   – Concentrated patrol in 60 violent crime hotspots
   – Outlines full methodology
      • Experimental design
      • Evaluation
   – Result was a net reduction of 53 violent crimes
8. Generate a Risk Terrain Model

• Risk Terrain Modeling
   – Joel Caplan & Les Kennedy, Rutgers University
      • http://www.rutgerscps.org/rtm/
   – Forms a combined risk surface of several spatial risk factors
     that correlate with a particular type of crime
   – Describes the environmental context within crime occurs
8. Generate a Risk Terrain Model

• Steps to Building a Model
   1. List potential risk factors
      •   Literature review
      •   Departmental experience
   2. Assemble GIS data sets for each factor
   3. Operationalize each factor and test for correlation
   4. Combine correlated factors into combined risk terrain
8. Generate a Risk Terrain Model




                       Gun shootings example
Source: Rutgers, http://www.rutgerscps.org/rtm/irvrtmgoogearth.htm
8. Generate a Risk Terrain Model




                       Gun shootings example
Source: Rutgers, http://www.rutgerscps.org/rtm/irvrtmgoogearth.htm
8. Generate a Risk Terrain Model




                       Gun shootings example
Source: Rutgers, http://www.rutgerscps.org/rtm/irvrtmgoogearth.htm
8. Generate a Risk Terrain Model

• Risk Terrain Modeling
   – Risk Terrain Modeling Manual
      • http://www.rutgerscps.org/rtm/
   – Online training
      • http://www.rutgerscps.org/rtm/webinar.html
Use Cases
9. Open Up Data (Appropriately)

• Increase transparency and community engagement
  –   Open data movement
  –   Increases trust
  –   Allows novel uses for crime data
  –   Increases perceived value
      of good data
• Not a new idea
  – NIJ guide released in 2001
       • https://www.ncjrs.gov/pdffiles1/nij/188739.pdf
9. Open Up Data (Appropriately)

• Public crime mapping sites
   – Omega Group
      • http://www.crimemapping.com/
9. Open Up Data (Appropriately)

• Data portals
   – Chicago Data Portal
      • http://data.cityofchicago.org/
      • Raw incident data from 2001 to present
10. Find Research Partners

• Benefits of Conducting Research
   – Lowers the risk of trying something new
   – Supplements limited resources
      • Labor
      • Software
      • Knowledge/techniques/statistical rigor
   – Encourages cutting edge analysis
• Types of Partners
   – Academic crime researchers
   – Nonprofits
   – Commercial entities
10. Find Research Partners




         (A brief plug)
Q&A
About Us


              Robert Cheetham
              President & CEO
              cheetham@azavea.com
              215.701.7713


              Jeremy Heffner
              HunchLab Product Manager
              jheffner@azavea.com
              215.701.7712




www.azavea.com/hunchlab

Weitere ähnliche Inhalte

Was ist angesagt?

Crime analysis mapping, intrusion detection using data mining
Crime analysis mapping, intrusion detection using data miningCrime analysis mapping, intrusion detection using data mining
Crime analysis mapping, intrusion detection using data mining
Venkat Projects
 
The Real-time Police Force: Publishing Analytic Information to the Field with...
The Real-time Police Force: Publishing Analytic Information to the Field with...The Real-time Police Force: Publishing Analytic Information to the Field with...
The Real-time Police Force: Publishing Analytic Information to the Field with...
Azavea
 
Crime Identification Denver Colorado
Crime Identification Denver ColoradoCrime Identification Denver Colorado
Crime Identification Denver Colorado
Chad Yowler
 
Jamal Shakeenab Resume
Jamal Shakeenab ResumeJamal Shakeenab Resume
Jamal Shakeenab Resume
Jamal Shakeenab
 

Was ist angesagt? (20)

Analytics-Based Crime Prediction
Analytics-Based Crime PredictionAnalytics-Based Crime Prediction
Analytics-Based Crime Prediction
 
Crime Analysis using Data Analysis
Crime Analysis using Data AnalysisCrime Analysis using Data Analysis
Crime Analysis using Data Analysis
 
Crime analysis mapping, intrusion detection using data mining
Crime analysis mapping, intrusion detection using data miningCrime analysis mapping, intrusion detection using data mining
Crime analysis mapping, intrusion detection using data mining
 
Crime Analytics: Analysis of crimes through news paper articles
Crime Analytics: Analysis of crimes through news paper articlesCrime Analytics: Analysis of crimes through news paper articles
Crime Analytics: Analysis of crimes through news paper articles
 
The Real-time Police Force: Publishing Analytic Information to the Field with...
The Real-time Police Force: Publishing Analytic Information to the Field with...The Real-time Police Force: Publishing Analytic Information to the Field with...
The Real-time Police Force: Publishing Analytic Information to the Field with...
 
Using Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime PatternUsing Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime Pattern
 
Crime analysis
Crime analysisCrime analysis
Crime analysis
 
Crime Mapping & Analysis – Georgia Tech
Crime Mapping & Analysis – Georgia TechCrime Mapping & Analysis – Georgia Tech
Crime Mapping & Analysis – Georgia Tech
 
Crime rate analysis using k nn in python
Crime rate analysis using k nn in python Crime rate analysis using k nn in python
Crime rate analysis using k nn in python
 
Social Life Networks (Eventshop and Personal Event Shop)
Social Life Networks (Eventshop and Personal Event Shop)Social Life Networks (Eventshop and Personal Event Shop)
Social Life Networks (Eventshop and Personal Event Shop)
 
Observing real world phenomena through event web
Observing real world phenomena through event webObserving real world phenomena through event web
Observing real world phenomena through event web
 
EvIM: a real time complex event discovery platform for CPSS
EvIM: a real time complex event discovery platform for CPSSEvIM: a real time complex event discovery platform for CPSS
EvIM: a real time complex event discovery platform for CPSS
 
EventShop ISG talk 140213
EventShop ISG talk 140213EventShop ISG talk 140213
EventShop ISG talk 140213
 
EventShop Demo
EventShop DemoEventShop Demo
EventShop Demo
 
9th may net sci presentation (1)
9th may net sci presentation (1)9th may net sci presentation (1)
9th may net sci presentation (1)
 
IRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data AnalyticsIRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data Analytics
 
Crime Identification Denver Colorado
Crime Identification Denver ColoradoCrime Identification Denver Colorado
Crime Identification Denver Colorado
 
Situation Recognition from Multimodal Data Tutorial (ICME2016)
Situation Recognition from Multimodal Data Tutorial (ICME2016)Situation Recognition from Multimodal Data Tutorial (ICME2016)
Situation Recognition from Multimodal Data Tutorial (ICME2016)
 
NASA Data Science Day Plenary: Applied Machine Learning (ML)
NASA Data Science Day Plenary: Applied Machine Learning (ML)NASA Data Science Day Plenary: Applied Machine Learning (ML)
NASA Data Science Day Plenary: Applied Machine Learning (ML)
 
Jamal Shakeenab Resume
Jamal Shakeenab ResumeJamal Shakeenab Resume
Jamal Shakeenab Resume
 

Ähnlich wie 10 Steps to Optimize Your Crime Analysis

Get Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California HighwaysGet Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California Highways
Aerospike, Inc.
 
Aug2013 bioinformatics working group
Aug2013 bioinformatics working groupAug2013 bioinformatics working group
Aug2013 bioinformatics working group
GenomeInABottle
 
Power Decision-making at Scale with Address-based Spatial Data Science
Power Decision-making at Scale with Address-based Spatial Data SciencePower Decision-making at Scale with Address-based Spatial Data Science
Power Decision-making at Scale with Address-based Spatial Data Science
Precisely
 
Planning an Investigation
Planning an InvestigationPlanning an Investigation
Planning an Investigation
aatkinson7
 
From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...
From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...
From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...
Alex Pinto
 
March 2013 Bioinformatics Working Group
March 2013 Bioinformatics Working GroupMarch 2013 Bioinformatics Working Group
March 2013 Bioinformatics Working Group
GenomeInABottle
 
Mike Goodchild's keynote - GISRUK 2010
Mike Goodchild's keynote - GISRUK 2010Mike Goodchild's keynote - GISRUK 2010
Mike Goodchild's keynote - GISRUK 2010
GISRUK conference
 

Ähnlich wie 10 Steps to Optimize Your Crime Analysis (20)

Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
 
Data Science At Zillow
Data Science At ZillowData Science At Zillow
Data Science At Zillow
 
Get Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California HighwaysGet Started with Data Science by Analyzing Traffic Data from California Highways
Get Started with Data Science by Analyzing Traffic Data from California Highways
 
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
Rent, Rain, and Regulations | Du Phan, Dataiku | DN18
 
DBMS
DBMSDBMS
DBMS
 
PPT.pptx
PPT.pptxPPT.pptx
PPT.pptx
 
Aug2013 bioinformatics working group
Aug2013 bioinformatics working groupAug2013 bioinformatics working group
Aug2013 bioinformatics working group
 
Big Data
Big Data Big Data
Big Data
 
What’s State of the Data?
What’s State of the Data?What’s State of the Data?
What’s State of the Data?
 
Opportunities for alternative data sources
Opportunities for alternative data sourcesOpportunities for alternative data sources
Opportunities for alternative data sources
 
Kenett On Information NYU-Poly 2013
Kenett On Information NYU-Poly 2013Kenett On Information NYU-Poly 2013
Kenett On Information NYU-Poly 2013
 
Power Decision-making at Scale with Address-based Spatial Data Science
Power Decision-making at Scale with Address-based Spatial Data SciencePower Decision-making at Scale with Address-based Spatial Data Science
Power Decision-making at Scale with Address-based Spatial Data Science
 
Planning an Investigation
Planning an InvestigationPlanning an Investigation
Planning an Investigation
 
Presentation of PhD thesis on Location Data Fusion
Presentation of PhD thesis on Location Data Fusion Presentation of PhD thesis on Location Data Fusion
Presentation of PhD thesis on Location Data Fusion
 
From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...
From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...
From Threat Intelligence to Defense Cleverness: A Data Science Approach (#tid...
 
March 2013 Bioinformatics Working Group
March 2013 Bioinformatics Working GroupMarch 2013 Bioinformatics Working Group
March 2013 Bioinformatics Working Group
 
Reliable and Scalable Data Ingestion at Airbnb
Reliable and Scalable Data Ingestion at AirbnbReliable and Scalable Data Ingestion at Airbnb
Reliable and Scalable Data Ingestion at Airbnb
 
Mike Goodchild's keynote - GISRUK 2010
Mike Goodchild's keynote - GISRUK 2010Mike Goodchild's keynote - GISRUK 2010
Mike Goodchild's keynote - GISRUK 2010
 
Geospatial Rectification of Web Transactions and Data Security
Geospatial Rectification of Web Transactions and Data SecurityGeospatial Rectification of Web Transactions and Data Security
Geospatial Rectification of Web Transactions and Data Security
 
Anomalies and events keep us on our toes
Anomalies and events keep us on our toesAnomalies and events keep us on our toes
Anomalies and events keep us on our toes
 

Mehr von Azavea

HunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodHunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the Hood
Azavea
 
OpenTreeMap NCGIS
OpenTreeMap NCGISOpenTreeMap NCGIS
OpenTreeMap NCGIS
Azavea
 

Mehr von Azavea (20)

Using New Tools to Analyze and Plan Your Urban Forest
Using New Tools to Analyze and Plan Your Urban Forest Using New Tools to Analyze and Plan Your Urban Forest
Using New Tools to Analyze and Plan Your Urban Forest
 
7 misconceptions about predictive policing webinar
7 misconceptions about predictive policing webinar7 misconceptions about predictive policing webinar
7 misconceptions about predictive policing webinar
 
Tracking Your Green Infrastructure
Tracking Your Green InfrastructureTracking Your Green Infrastructure
Tracking Your Green Infrastructure
 
Growing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
Growing Your Urban Forest: Using the OpenTreeMap Bulk UploaderGrowing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
Growing Your Urban Forest: Using the OpenTreeMap Bulk Uploader
 
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
November 12, 2014 Webinar: Hackers, Beer Geeks, and Arborly Love - Reaching o...
 
Mobile Citizen Science
Mobile Citizen Science Mobile Citizen Science
Mobile Citizen Science
 
Getting Started with OpenTreeMap Cloud
Getting Started with OpenTreeMap CloudGetting Started with OpenTreeMap Cloud
Getting Started with OpenTreeMap Cloud
 
HunchLab 2.0 Getting Started
HunchLab 2.0 Getting StartedHunchLab 2.0 Getting Started
HunchLab 2.0 Getting Started
 
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
Is it a Package or a Wrapper? Designing, Documenting, and Distributing a Pyth...
 
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
Your New Partners: Understanding Civic Hackathons, Why You Should be Involved...
 
HunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodHunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the Hood
 
Using Open Data and Citizen Science to Promote Citizen Engagement with Green ...
Using Open Data and Citizen Science to Promote Citizen Engagement with Green ...Using Open Data and Citizen Science to Promote Citizen Engagement with Green ...
Using Open Data and Citizen Science to Promote Citizen Engagement with Green ...
 
HunchLab 2.0 Preview Webinar - Place
HunchLab 2.0 Preview Webinar - PlaceHunchLab 2.0 Preview Webinar - Place
HunchLab 2.0 Preview Webinar - Place
 
Five Technology Trends Every Nonprofit Needs to Know
Five Technology Trends Every Nonprofit Needs to KnowFive Technology Trends Every Nonprofit Needs to Know
Five Technology Trends Every Nonprofit Needs to Know
 
PhillyHistory.org - Tracking Metrics for a Digital Project
PhillyHistory.org - Tracking Metrics for a Digital ProjectPhillyHistory.org - Tracking Metrics for a Digital Project
PhillyHistory.org - Tracking Metrics for a Digital Project
 
Fed Geo Day - Applying GeoTrellis at the US Army Corps
Fed Geo Day - Applying GeoTrellis at the US Army CorpsFed Geo Day - Applying GeoTrellis at the US Army Corps
Fed Geo Day - Applying GeoTrellis at the US Army Corps
 
Fed Geo Day - GeoTrellis Intro
Fed Geo Day - GeoTrellis IntroFed Geo Day - GeoTrellis Intro
Fed Geo Day - GeoTrellis Intro
 
Fed Geo Day 2013 - Azavea Intro
Fed Geo Day 2013 - Azavea Intro Fed Geo Day 2013 - Azavea Intro
Fed Geo Day 2013 - Azavea Intro
 
Modeling Count-based Raster Data with ArcGIS and R
Modeling Count-based Raster Data with ArcGIS and RModeling Count-based Raster Data with ArcGIS and R
Modeling Count-based Raster Data with ArcGIS and R
 
OpenTreeMap NCGIS
OpenTreeMap NCGISOpenTreeMap NCGIS
OpenTreeMap NCGIS
 

KĂźrzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

KĂźrzlich hochgeladen (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 

10 Steps to Optimize Your Crime Analysis

  • 1. 10 Steps to Optimize Your Crime Analysis 340 N 12th St, Suite 402 Philadelphia, PA 19107 215.925.2600 info@azavea.com www.azavea.com/hunchlab
  • 2. About Us Robert Cheetham President & CEO cheetham@azavea.com 215.701.7713 Jeremy Heffner HunchLab Product Manager jheffner@azavea.com 215.701.7712
  • 3. About Azavea • Founded in 2000 • 32 people • Based in Philadelphia – Boston office – Minneapolis office • Geospatial + web + mobile – Software development – Spatial analysis services
  • 4. Clients & Industries • Public Safety • Municipal Services • Public Health • Human Services • Culture • Elections & Politics • Land Conservation • Economic Development
  • 5. web-based crime analysis, early warning, and risk forecasting
  • 6. 10 Steps to Optimize Your Crime Analysis
  • 7. 10 Steps to Optimize Your Crime Analysis Crime Analysis Data Analytic Use Cases Quality Techniques • Geocoding • Kernel Density Map • Open data Predictive Accuracy • Dates & Times • Find research • NNI and Gi* partners • Polygon Hierarchies • Near Repeat Calculator • Randomized controlled trial • Risk Terrain Modeling
  • 9. 1. Examine Geocoding Accuracy • Geocoding – Process of turning addresses into geographic coordinates – Examine accuracy • Correct locations – Geocoding method Âť Commercial geocoder Âť Street center line Âť Parcel database – POIs and landmarks – Incorrect clustering at: Âť Precinct locations, zip code and city centroids • High geocoding success rates – Ratcliffe suggests at least 85% (lowest acceptable) Âť http://bit.ly/ratcliffegeocoding – Examine unsuccessful geocodes for patterns
  • 10. 2. Examine Dates & Times • Dates & Times – Event-related times • Actual occurrence time – From / to time interval • Report time • Officers responded time
  • 11. 2. Examine Dates & Times • Dates & Times – Examining accuracy • Data entry defaults • Data validation on input • Clustering by time cycles – Day of week – Day of month – Day of year – Hour of day – Minute of hour
  • 12. 2. Examine Dates & Times
  • 13. 3. Examine Polygon Hierarchy • Polygon Hierarchy – A set of geographic areas that nest within each other used to organize resources (i.e. divisions, districts, PSAs, beats) • What makes a good hierarchy? – Perfectly nested polygons • No sliver polygons – Areas should be periodically rebalanced based on changing crime levels – Consider that splitting areas based on streets means one side of street is in one district / other side is in a different district.
  • 15. 4. Test Predictive Accuracy of KDE • Kernel Density Estimation – A smoothing technique that generates hotspot maps
  • 16. 4. Test Predictive Accuracy of KDE • When we look at a hotspot map what are we assuming? – That crimes will happen in the hotspots again. – But… • How predictive is it? • How much historic data should we use? • What search radius should we use? • What is the density cutoff for a hotspot? Source: Chainey, http://www.popcenter.org/conference/conferencepapers/2010/Chainey-Gi-hotSpots.pdf
  • 17. 4. Test Predictive Accuracy of KDE • Predictive Accuracy Index – Spencer Chainey, Jill Dando Institute • http://www.palgrave-journals.com/sj/journal/v21/n1/full/8350066a.html – Incorporates: • Desire for a high hit rate – Lots of crime incidents in a prior ‘hotspot’ • Desire for a small geographic area – Less to patrol, etc.
  • 18. 4. Test Predictive Accuracy of KDE • Predictive Accuracy Index Steps 1. Generate kernel density map for historic period 2. Measure predictive validity against future time period 3. Higher number better • Example from Chainey
  • 19. 4. Test Predictive Accuracy of KDE • Predictive Accuracy Index – Caveats • Number is relative to crime type and geography • Best for comparing different techniques (or parameter variations of techniques) for the same predictive period
  • 20. 4. Test Predictive Accuracy of KDE • Predictive Accuracy Index – Caveats • Number is relative to crime type and geography • Best for comparing different techniques (or parameter variations of techniques) for the same predictive period Is there something similar but better than kernel density?
  • 21. 5. Test NNI and use Gi* • Gi* – Spencer Chainey, Jill Dando Institute • http://www.popcenter.org/conference/conferencepapers/2010/Chainey-Gi-hotSpots.pdf – LISA statistic • Local indicator of spatial association – Compares local averages to global averages – Generates map visually similar to KDE
  • 22. 5. Test NNI and use Gi*
  • 23. 5. Test NNI and use Gi* • How do we know hotspots exist though? – Calculate nearest neighbor index (NNI) • Determines if clustering exists – NNI ~ 1: data is randomly distributed – NNI < 1: data is clustered – NNI > 1: data is uniformly distributed • Helps to answer if we have enough historic data for statistical significance • If data is not clustered neither Gi* nor KDE should be used
  • 24. 5. Test NNI and use Gi* • Summary of Steps – Test for clustering with nearest neighbor index – Calculate crime counts within a grid – Run Gi* statistic – Set color ramp breakpoints based on fixed statistical significance levels
  • 25. 5. Test NNI and use Gi* • Summary of Steps – Test for clustering with nearest neighbor index – Calculate crime counts within a grid – Run Gi* statistic – Set color ramp breakpoints based on fixed statistical significance levels Remember our friend the predictive accuracy index?
  • 26. 5. Test NNI and use Gi* • Is it really better than KDE? – Example from Chainey
  • 27. 6. Run the Near Repeat Calculator • Near Repeat Pattern Analysis – Measures ‘contagion’ effect of crime incident – How does one burglary change the risk that another burglary will occur nearby in the coming days? • Common in Some Types of Crime – Burglary – Theft from Vehicle – Gun Crime – Robbery – Bicycle Theft
  • 28. 6. Run the Near Repeat Calculator
  • 29. 6. Run the Near Repeat Calculator
  • 30. 6. Run the Near Repeat Calculator • Near Repeat Calculator – http://www.temple.edu/cj/misc/nr/ • Papers – Near-Repeat Patterns in Philadelphia Shootings (2008) • One city block & two weeks after one shooting – 33% increase in likelihood of a second event Jerry Ratcliffe Temple University
  • 31. 7. Conduct a Randomized Trial • Randomized Controlled Trial – An experiment where study subjects (e.g. locations) are randomly assigned to different treatment protocols – Academics do this regularly • But why do this yourself? – Proves/disproves a technique’s efficacy for your department – Successfully mimicking a published trial gives you the skills to experiment based on local anecdotal evidence
  • 32. 7. Conduct a Randomized Trial • Philadelphia Foot Patrol Experiment – Jerry Ratcliffe, Temple University • http://bit.ly/phillyfootpatrol – Concentrated patrol in 60 violent crime hotspots – Outlines full methodology • Experimental design • Evaluation – Result was a net reduction of 53 violent crimes
  • 33. 8. Generate a Risk Terrain Model • Risk Terrain Modeling – Joel Caplan & Les Kennedy, Rutgers University • http://www.rutgerscps.org/rtm/ – Forms a combined risk surface of several spatial risk factors that correlate with a particular type of crime – Describes the environmental context within crime occurs
  • 34. 8. Generate a Risk Terrain Model • Steps to Building a Model 1. List potential risk factors • Literature review • Departmental experience 2. Assemble GIS data sets for each factor 3. Operationalize each factor and test for correlation 4. Combine correlated factors into combined risk terrain
  • 35. 8. Generate a Risk Terrain Model Gun shootings example Source: Rutgers, http://www.rutgerscps.org/rtm/irvrtmgoogearth.htm
  • 36. 8. Generate a Risk Terrain Model Gun shootings example Source: Rutgers, http://www.rutgerscps.org/rtm/irvrtmgoogearth.htm
  • 37. 8. Generate a Risk Terrain Model Gun shootings example Source: Rutgers, http://www.rutgerscps.org/rtm/irvrtmgoogearth.htm
  • 38. 8. Generate a Risk Terrain Model • Risk Terrain Modeling – Risk Terrain Modeling Manual • http://www.rutgerscps.org/rtm/ – Online training • http://www.rutgerscps.org/rtm/webinar.html
  • 40. 9. Open Up Data (Appropriately) • Increase transparency and community engagement – Open data movement – Increases trust – Allows novel uses for crime data – Increases perceived value of good data • Not a new idea – NIJ guide released in 2001 • https://www.ncjrs.gov/pdffiles1/nij/188739.pdf
  • 41. 9. Open Up Data (Appropriately) • Public crime mapping sites – Omega Group • http://www.crimemapping.com/
  • 42. 9. Open Up Data (Appropriately) • Data portals – Chicago Data Portal • http://data.cityofchicago.org/ • Raw incident data from 2001 to present
  • 43. 10. Find Research Partners • Benefits of Conducting Research – Lowers the risk of trying something new – Supplements limited resources • Labor • Software • Knowledge/techniques/statistical rigor – Encourages cutting edge analysis • Types of Partners – Academic crime researchers – Nonprofits – Commercial entities
  • 44. 10. Find Research Partners (A brief plug)
  • 45. Q&A
  • 46. About Us Robert Cheetham President & CEO cheetham@azavea.com 215.701.7713 Jeremy Heffner HunchLab Product Manager jheffner@azavea.com 215.701.7712 www.azavea.com/hunchlab