SlideShare ist ein Scribd-Unternehmen logo
1 von 53
Downloaden Sie, um offline zu lesen
Matt Pazoles
GovEx - Chief Data Scientist
pazoles@jhu.edu
Predictive Analytics & Neighborhood Health
A collaboration by GovEx and Kansas City, MO
Today
● Analytics in city government
● The problem in Kansas City
● Exploratory analysis
● Random forest regression
● Final scheduling tool
● Questions
govex.jhu.edu / @gov_ex
Quick Primer on GovEx
● Help organizations make informed
decisions with data to improve
people’s lives
● Provide targeted training and
technical assistance for dedicated
civil servants
● Team of 25 with practical
government and NGO experience
govex.jhu.edu / @gov_ex
Where We Work
132
US cities
2,000+
civil servants trained
govex.jhu.edu / @gov_ex
● Direct technical
assistance
○ Data management
○ Performance
○ Advanced analytics
● Training and education
● Applied research
How We Work
Analytics in City Government
...a brief history
1970’S
The Abbreviated Evolution of Performance Management
Private sector
renews
emphasis on
quality and
process
improvement
(e.g. japanese auto
manufacturers)
Malcolm Baldridge
Quality Improvement
Program
Six Sigma
GPRA
COMPSTAT
STAT MANIA:
CITISTAT
CAPSTAT
SCHOOLSTAT
HUDSTAT
FEMASTAT
Moneyball
GPRA
Modernization Act
MODA
EAD
1980’S 1990’S 2000’S 2010’S
Results
Focused
Management
What did Performance Management give us?
A few wins:
“NYC cuts crime by 60%”
“Baltimore saves $40m in
overtime”
“DC houses 500 homeless
men”
Some
believers
Some
skeptics
Some
victims
A few losses:
Standardized Testing
Cheating Scandals
VA Wait-Time Gaming
Airline on-time departure
gaming
A lot of process:
Meetings (including STAT)
Metrics
Reports
Dashboards
Reviews
Appraisals
Results
Focused
Performance
Management
High
Quality
Analytics
Performance
Analytics
Without this,
analysis
does not get
used.
Without this,
opportunities
are missed.
Problem Definition Making Decisions Taking Action
What happened?
Why did it happen?
What will happen?
What should we do?
Insights Foresight
Insights
Insights to Action
Identify problem
Explore data
Create model
Adjust operations
Track results
Add dataUpdate model
Communicate
Results
Some examples
Jackson, MS
Problem:
Dwindling general fund reserves,
population decline, overstaffed?
Analysis:
Clustering algorithm to identify
similar cities, then normalize and
compare budgets
Conclusion:
Jackson was overspending on
Public Works Operations
San Jose, CA
Problem:
Low citywide compliance with pet
licensing
Analysis:
Identify what areas are least
compliant and most able to take
action
Conclusion:
Target veterinarians and animal
hospitals in identified geographies
New Orleans, LA
Problem:
Large backlog of blight abatement
cases at the review stage
Analysis:
Predict likely course of action
based on historical data
Conclusion:
Create decision-support tool to
cut down on work needed to
advance the case
Questions?
KCMO Neighborhood Preservation
Predictive Analytics Project
2008-2011
2010 Foreclosure Rates by Zip Code
Source: Kansas City Star, 7/9/2017
Source: Bureau of Labor
Statistics
Unemployment Rate in Kansas City, MO
2011
Source: Kansas City
• Increase in property violations
overwhelmed existing workflow
• 20% of reported violations
were not seen for 10+ weeks
• Prioritized initial inspections
• Goal: 5 to 10 days
• Result: 6 days
Existing Inspection Process
1. Violation is reported through 311
2. Case is opened as a priority 1 for initial
inspection (within 6 days)
3. If a violation is found, owner is sent a
notification letter
4. After 3 to 6 weeks, case is added as a priority 3
(after initial inspections and court requirements)
5. Eventually, property is reinspected; if violation
remains, another letter is issued
Legal/Policy
constraints
● City can fine owners who
live within 50 miles
○ Fines are minimal
● Owners who live more
than 50 miles from the
city are not pursued
○ Weak ability to take
possession of
blighted properties
2012-2016
Source: Kansas City Open Data
201
1
201
2
201
3
201
4
201
6201
5
2011
2012
2013
2014
2016
2015
2012-2016
• Inspector capacity hasn’t changed
• More open cases per inspector = higher workload
• Hurt by success of prioritizing initial inspections?
What is driving inspector workload?
Can we reduce the amount of
work to get a property into
compliance?
Identify problem
Explore data
Create model
Adjust operations
Track results
Add dataUpdate model
Communicate
Results
Questions
● Is the backlog still growing?
Is the backlog of inspections growing?
Year Net Violations
2012 5,204
2013 13,703
2014 1,398
2015 1,402
2016 -4,431
Questions
● Is the backlog still growing?
● Are more violations being
issued?
2012-2016
Source: Kansas City Open Data Portal
Are there more violations per month?
Questions
● Is the backlog still growing?
● Are more violations being
issued?
● Are a few properties
responsible?
Are more properties becoming noncompliant?
Are repeat offenders a growing issue?
Violations by
Property
Most Recent Violation Opened
2010 2011 2012 2013 2014 2015 2016
Grand
Total
Oldest
Violation
Opened
2010 3,783 1,010 1,020 1,130 1,241 1,603 3,541 13,406
2011 3,433 1,038 972 1,101 1,266 2,873 10,749
2012 3,183 837 778 852 1,911 7,600
2013 3,658 1,129 943 1,896 7,667
2014 3,530 1,192 1,771 6,548
2015 3,589 1,934 5,560
2016 4,656 4,690
Grand
Total 3,794 4,452 5,252 6,605 7,790 9,462 18,617 56,220
Years After
Initial
% Repeat
1 14%
2 25%
3 34%
4 43%
5 52%
6 59%
Questions
● Is the backlog still growing?
● Are more violations being
issued?
● Are a few properties
responsible?
● Do more inspections
decrease resolution time?
2012-2016
Source: Kansas City Open Data Portal
60% of
cases
70% of
reinspections
Do inspections reduce resolution times?
Questions ● Are inspections themselves
part of the issue?
Are inspections closing violations efficiently?
Average number of inspections per year:
32,291
+/-3,000
“Inefficient Reinspection”
Reinspections where no violations are closed
Nearly doubled between 2012 and 2016
Insights
● Reinspection of open cases
accounts for most work
○ Majority of reinspections
on a minority of cases
● Timing of inspections is
more important than
quantity
● Many reinspections create
more work for inspectors
Improve inspection efficiency by modeling
how long a violation will take to resolve.
Reduce the number of inspections
that close 0 violations
At the address/case level
Identify problem
Explore data
Create model
Adjust operations
Track results
Add dataUpdate model
Communicate
Results
PostgreSQL
database
+
PostGIS
spatial
extension
Data Infrastructure
Property violation, 311,
property ownership, and
neighborhood related data
from OpenData KC
Demographic data by census
block from the US Census
Bureau
Property values and building
features from county data
Data Infrastructure
Data Infrastructure
PostgreSQL
database
+
PostGIS
Schedule of Inspections
• Tier 1: Initial Inspections
• Tier 2: Court-ordered
• Tier 3: Model Prioritized
• Tier 4: Low Priority
• AWS hosted server for analysis
• Python + Jupyter, R + R Studio
• Scikit-Learn
• Random Forest Regression
• 70% of data as training set
■ Corrected for class imbalance
• Mutual regression feature selection
• Grid Search parameter optimization
• Target: Violation Days Open
• Grouped violations by property to produce
recommendations for inspectors
• How many days has it been open?
• What are the other violations present?
Correctly sorts
violations from
shortest to longest.
(Bars rise from left to right)
Underestimated
time to resolution
Overestimated time to
resolution
Red line represents a
perfect prediction
Blue dots represent
violations from the
“test” data
MSE = 45.5 days
Current Status
Cases predicted as ready to be closed:
3,870 properties (62% of open cases)
(38% of open cases can be deferred)
Model-informed
inspection cycle
INITIAL INSPECTION
New violations come in
ACTION
Enhance inspection
schedule with
model-informed
targets
DATA WRANGLING
Add other relevant data and
run model
AGGREGATION
Compare prediction to
current violation age;
group by address
Anticipated outcomes
•More time helping residents
•More timely inspections
•Fewer follow-up visits documenting existing issues
•Less frustration all around!
Future opportunities for analysis
• “Inspector Effect”
• The inspection area is a predictive feature, but no data linking inspectors to specific inspections
• Weather impacts
• More detailed analyses could find relations between ground saturation, wind speed, and specific
violation types
• Utility Analysis
• Can water bills be used to proactively identify severe violations?
• Court Data
• How effective are current penalties in preventing reviolation?
• Equity in Outcomes
• Testing for bias by examining outcomes of inspection processes
Next Steps
(this project)
● Technical infrastructure
● Fold into prioritization
● Field testing
● Future analyses
Questions?
Thank you!
Matt Pazoles
pazoles@jhu.edu

Weitere ähnliche Inhalte

Ähnlich wie Predictive analytics help KCMO improve neighborhood inspections

DCBADD2015 public sector agile
DCBADD2015 public sector agileDCBADD2015 public sector agile
DCBADD2015 public sector agileIIBADCBADD
 
London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20LG Inform Plus
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data Sharjeel Imtiaz
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
t Success: An Update to the Georgia Southern IT Community
t Success: An Update to the Georgia Southern IT Communityt Success: An Update to the Georgia Southern IT Community
t Success: An Update to the Georgia Southern IT CommunityCurtis A. Carver Jr.
 
Information Technology Services 201608 v1.5
Information Technology Services 201608 v1.5Information Technology Services 201608 v1.5
Information Technology Services 201608 v1.5John Halbert
 
Regina Food Summit - Data for Good
Regina Food Summit - Data for GoodRegina Food Summit - Data for Good
Regina Food Summit - Data for GoodData For Good Regina
 
MSCPA Update 2017
MSCPA Update 2017MSCPA Update 2017
MSCPA Update 2017Allen Lloyd
 
Lake Hill Analytics General Capabilities Overview
Lake Hill Analytics General Capabilities OverviewLake Hill Analytics General Capabilities Overview
Lake Hill Analytics General Capabilities OverviewDamian Herrick
 
The 2019 State of Grantseeking: Benchmarks and Trends
The 2019 State of Grantseeking: Benchmarks and TrendsThe 2019 State of Grantseeking: Benchmarks and Trends
The 2019 State of Grantseeking: Benchmarks and TrendsTechSoup
 
S&OP maturity comes prior to advance planning software
S&OP maturity comes prior to advance planning softwareS&OP maturity comes prior to advance planning software
S&OP maturity comes prior to advance planning softwareTristan Wiggill
 
Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform
Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform
Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform Ellicium Solutions Inc.
 
Agile Trends in Government
Agile Trends in GovernmentAgile Trends in Government
Agile Trends in GovernmentLitheSpeed
 
Put Agile to the Test: A Case Study for Test Agility on a Large IT Project
Put Agile to the Test: A Case Study for Test Agility on a Large IT ProjectPut Agile to the Test: A Case Study for Test Agility on a Large IT Project
Put Agile to the Test: A Case Study for Test Agility on a Large IT ProjectTechWell
 
How to do UX in Government Without Losing Your Freaking Mind
How to do UX in Government Without Losing Your Freaking MindHow to do UX in Government Without Losing Your Freaking Mind
How to do UX in Government Without Losing Your Freaking MindUXPA DC
 

Ähnlich wie Predictive analytics help KCMO improve neighborhood inspections (20)

DCBADD2015 public sector agile
DCBADD2015 public sector agileDCBADD2015 public sector agile
DCBADD2015 public sector agile
 
London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20London data and digital masterclass for councillors slides 14-Feb-20
London data and digital masterclass for councillors slides 14-Feb-20
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
t Success: An Update to the Georgia Southern IT Community
t Success: An Update to the Georgia Southern IT Communityt Success: An Update to the Georgia Southern IT Community
t Success: An Update to the Georgia Southern IT Community
 
Information Technology Services 201608 v1.5
Information Technology Services 201608 v1.5Information Technology Services 201608 v1.5
Information Technology Services 201608 v1.5
 
Regina Food Summit - Data for Good
Regina Food Summit - Data for GoodRegina Food Summit - Data for Good
Regina Food Summit - Data for Good
 
MSCPA Update 2017
MSCPA Update 2017MSCPA Update 2017
MSCPA Update 2017
 
Lake Hill Analytics General Capabilities Overview
Lake Hill Analytics General Capabilities OverviewLake Hill Analytics General Capabilities Overview
Lake Hill Analytics General Capabilities Overview
 
The 2019 State of Grantseeking: Benchmarks and Trends
The 2019 State of Grantseeking: Benchmarks and TrendsThe 2019 State of Grantseeking: Benchmarks and Trends
The 2019 State of Grantseeking: Benchmarks and Trends
 
S&OP maturity comes prior to advance planning software
S&OP maturity comes prior to advance planning softwareS&OP maturity comes prior to advance planning software
S&OP maturity comes prior to advance planning software
 
Big_Data.pptx
Big_Data.pptxBig_Data.pptx
Big_Data.pptx
 
Data Collection Transformation Programme
Data Collection Transformation ProgrammeData Collection Transformation Programme
Data Collection Transformation Programme
 
Rock eagle thursday presentation
Rock eagle thursday presentationRock eagle thursday presentation
Rock eagle thursday presentation
 
PQF Overview
PQF OverviewPQF Overview
PQF Overview
 
Manchester Annual Planning Survey Presentation Feb 2015
Manchester Annual Planning Survey Presentation Feb 2015Manchester Annual Planning Survey Presentation Feb 2015
Manchester Annual Planning Survey Presentation Feb 2015
 
Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform
Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform
Ellicium's Gadfly - Next Generation Big Data Text Analytics Platform
 
Agile Trends in Government
Agile Trends in GovernmentAgile Trends in Government
Agile Trends in Government
 
Put Agile to the Test: A Case Study for Test Agility on a Large IT Project
Put Agile to the Test: A Case Study for Test Agility on a Large IT ProjectPut Agile to the Test: A Case Study for Test Agility on a Large IT Project
Put Agile to the Test: A Case Study for Test Agility on a Large IT Project
 
How to do UX in Government Without Losing Your Freaking Mind
How to do UX in Government Without Losing Your Freaking MindHow to do UX in Government Without Losing Your Freaking Mind
How to do UX in Government Without Losing Your Freaking Mind
 

Mehr von Data Works MD

Data Journalism at The Baltimore Banner
Data Journalism at The Baltimore BannerData Journalism at The Baltimore Banner
Data Journalism at The Baltimore BannerData Works MD
 
Jolt’s Picks - Machine Learning and Major League Baseball Hit Streaks
Jolt’s Picks - Machine Learning and Major League Baseball Hit StreaksJolt’s Picks - Machine Learning and Major League Baseball Hit Streaks
Jolt’s Picks - Machine Learning and Major League Baseball Hit StreaksData Works MD
 
Introducing DataWave
Introducing DataWaveIntroducing DataWave
Introducing DataWaveData Works MD
 
Malware Detection, Enabled by Machine Learning
Malware Detection, Enabled by Machine LearningMalware Detection, Enabled by Machine Learning
Malware Detection, Enabled by Machine LearningData Works MD
 
Using AWS, Terraform, and Ansible to Automate Splunk at Scale
Using AWS, Terraform, and Ansible to Automate Splunk at ScaleUsing AWS, Terraform, and Ansible to Automate Splunk at Scale
Using AWS, Terraform, and Ansible to Automate Splunk at ScaleData Works MD
 
A Day in the Life of a Data Journalist
A Day in the Life of a Data JournalistA Day in the Life of a Data Journalist
A Day in the Life of a Data JournalistData Works MD
 
Robotics and Machine Learning: Working with NVIDIA Jetson Kits
Robotics and Machine Learning: Working with NVIDIA Jetson KitsRobotics and Machine Learning: Working with NVIDIA Jetson Kits
Robotics and Machine Learning: Working with NVIDIA Jetson KitsData Works MD
 
Connect Data and Devices with Apache NiFi
Connect Data and Devices with Apache NiFiConnect Data and Devices with Apache NiFi
Connect Data and Devices with Apache NiFiData Works MD
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningData Works MD
 
Data in the City: Analytics and Civic Data in Baltimore
Data in the City: Analytics and Civic Data in BaltimoreData in the City: Analytics and Civic Data in Baltimore
Data in the City: Analytics and Civic Data in BaltimoreData Works MD
 
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Data Works MD
 
Automated Software Requirements Labeling
Automated Software Requirements LabelingAutomated Software Requirements Labeling
Automated Software Requirements LabelingData Works MD
 
Introduction to Elasticsearch for Business Intelligence and Application Insights
Introduction to Elasticsearch for Business Intelligence and Application InsightsIntroduction to Elasticsearch for Business Intelligence and Application Insights
Introduction to Elasticsearch for Business Intelligence and Application InsightsData Works MD
 
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...Data Works MD
 
RAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceRAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceData Works MD
 
Two Algorithms for Weakly Supervised Denoising of EEG Data
Two Algorithms for Weakly Supervised Denoising of EEG DataTwo Algorithms for Weakly Supervised Denoising of EEG Data
Two Algorithms for Weakly Supervised Denoising of EEG DataData Works MD
 
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph KernelDetecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph KernelData Works MD
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis WorkshopData Works MD
 

Mehr von Data Works MD (18)

Data Journalism at The Baltimore Banner
Data Journalism at The Baltimore BannerData Journalism at The Baltimore Banner
Data Journalism at The Baltimore Banner
 
Jolt’s Picks - Machine Learning and Major League Baseball Hit Streaks
Jolt’s Picks - Machine Learning and Major League Baseball Hit StreaksJolt’s Picks - Machine Learning and Major League Baseball Hit Streaks
Jolt’s Picks - Machine Learning and Major League Baseball Hit Streaks
 
Introducing DataWave
Introducing DataWaveIntroducing DataWave
Introducing DataWave
 
Malware Detection, Enabled by Machine Learning
Malware Detection, Enabled by Machine LearningMalware Detection, Enabled by Machine Learning
Malware Detection, Enabled by Machine Learning
 
Using AWS, Terraform, and Ansible to Automate Splunk at Scale
Using AWS, Terraform, and Ansible to Automate Splunk at ScaleUsing AWS, Terraform, and Ansible to Automate Splunk at Scale
Using AWS, Terraform, and Ansible to Automate Splunk at Scale
 
A Day in the Life of a Data Journalist
A Day in the Life of a Data JournalistA Day in the Life of a Data Journalist
A Day in the Life of a Data Journalist
 
Robotics and Machine Learning: Working with NVIDIA Jetson Kits
Robotics and Machine Learning: Working with NVIDIA Jetson KitsRobotics and Machine Learning: Working with NVIDIA Jetson Kits
Robotics and Machine Learning: Working with NVIDIA Jetson Kits
 
Connect Data and Devices with Apache NiFi
Connect Data and Devices with Apache NiFiConnect Data and Devices with Apache NiFi
Connect Data and Devices with Apache NiFi
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Data in the City: Analytics and Civic Data in Baltimore
Data in the City: Analytics and Civic Data in BaltimoreData in the City: Analytics and Civic Data in Baltimore
Data in the City: Analytics and Civic Data in Baltimore
 
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
 
Automated Software Requirements Labeling
Automated Software Requirements LabelingAutomated Software Requirements Labeling
Automated Software Requirements Labeling
 
Introduction to Elasticsearch for Business Intelligence and Application Insights
Introduction to Elasticsearch for Business Intelligence and Application InsightsIntroduction to Elasticsearch for Business Intelligence and Application Insights
Introduction to Elasticsearch for Business Intelligence and Application Insights
 
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...
An Asynchronous Distributed Deep Learning Based Intrusion Detection System fo...
 
RAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceRAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data Science
 
Two Algorithms for Weakly Supervised Denoising of EEG Data
Two Algorithms for Weakly Supervised Denoising of EEG DataTwo Algorithms for Weakly Supervised Denoising of EEG Data
Two Algorithms for Weakly Supervised Denoising of EEG Data
 
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph KernelDetecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph Kernel
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis Workshop
 

Kürzlich hochgeladen

定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一Fs
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxeditsforyah
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationLinaWolf1
 
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Dana Luther
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作ys8omjxb
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)Christopher H Felton
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012rehmti665
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Sonam Pathan
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一Fs
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一z xss
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleanscorenetworkseo
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhimiss dipika
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一Fs
 

Kürzlich hochgeladen (20)

定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptx
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 Documentation
 
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
 
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
 
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
 
Elevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New OrleansElevate Your Business with Our IT Expertise in New Orleans
Elevate Your Business with Our IT Expertise in New Orleans
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhi
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
 
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
 

Predictive analytics help KCMO improve neighborhood inspections

  • 1. Matt Pazoles GovEx - Chief Data Scientist pazoles@jhu.edu Predictive Analytics & Neighborhood Health A collaboration by GovEx and Kansas City, MO
  • 2. Today ● Analytics in city government ● The problem in Kansas City ● Exploratory analysis ● Random forest regression ● Final scheduling tool ● Questions
  • 3. govex.jhu.edu / @gov_ex Quick Primer on GovEx ● Help organizations make informed decisions with data to improve people’s lives ● Provide targeted training and technical assistance for dedicated civil servants ● Team of 25 with practical government and NGO experience
  • 4. govex.jhu.edu / @gov_ex Where We Work 132 US cities 2,000+ civil servants trained
  • 5. govex.jhu.edu / @gov_ex ● Direct technical assistance ○ Data management ○ Performance ○ Advanced analytics ● Training and education ● Applied research How We Work
  • 6. Analytics in City Government ...a brief history
  • 7. 1970’S The Abbreviated Evolution of Performance Management Private sector renews emphasis on quality and process improvement (e.g. japanese auto manufacturers) Malcolm Baldridge Quality Improvement Program Six Sigma GPRA COMPSTAT STAT MANIA: CITISTAT CAPSTAT SCHOOLSTAT HUDSTAT FEMASTAT Moneyball GPRA Modernization Act MODA EAD 1980’S 1990’S 2000’S 2010’S
  • 8. Results Focused Management What did Performance Management give us? A few wins: “NYC cuts crime by 60%” “Baltimore saves $40m in overtime” “DC houses 500 homeless men” Some believers Some skeptics Some victims A few losses: Standardized Testing Cheating Scandals VA Wait-Time Gaming Airline on-time departure gaming A lot of process: Meetings (including STAT) Metrics Reports Dashboards Reviews Appraisals
  • 10.
  • 11. Problem Definition Making Decisions Taking Action What happened? Why did it happen? What will happen? What should we do? Insights Foresight Insights Insights to Action
  • 12. Identify problem Explore data Create model Adjust operations Track results Add dataUpdate model Communicate Results
  • 14. Jackson, MS Problem: Dwindling general fund reserves, population decline, overstaffed? Analysis: Clustering algorithm to identify similar cities, then normalize and compare budgets Conclusion: Jackson was overspending on Public Works Operations
  • 15. San Jose, CA Problem: Low citywide compliance with pet licensing Analysis: Identify what areas are least compliant and most able to take action Conclusion: Target veterinarians and animal hospitals in identified geographies
  • 16. New Orleans, LA Problem: Large backlog of blight abatement cases at the review stage Analysis: Predict likely course of action based on historical data Conclusion: Create decision-support tool to cut down on work needed to advance the case
  • 19. 2008-2011 2010 Foreclosure Rates by Zip Code Source: Kansas City Star, 7/9/2017 Source: Bureau of Labor Statistics Unemployment Rate in Kansas City, MO
  • 20. 2011 Source: Kansas City • Increase in property violations overwhelmed existing workflow • 20% of reported violations were not seen for 10+ weeks • Prioritized initial inspections • Goal: 5 to 10 days • Result: 6 days
  • 21. Existing Inspection Process 1. Violation is reported through 311 2. Case is opened as a priority 1 for initial inspection (within 6 days) 3. If a violation is found, owner is sent a notification letter 4. After 3 to 6 weeks, case is added as a priority 3 (after initial inspections and court requirements) 5. Eventually, property is reinspected; if violation remains, another letter is issued Legal/Policy constraints ● City can fine owners who live within 50 miles ○ Fines are minimal ● Owners who live more than 50 miles from the city are not pursued ○ Weak ability to take possession of blighted properties
  • 22. 2012-2016 Source: Kansas City Open Data 201 1 201 2 201 3 201 4 201 6201 5 2011 2012 2013 2014 2016 2015
  • 23. 2012-2016 • Inspector capacity hasn’t changed • More open cases per inspector = higher workload • Hurt by success of prioritizing initial inspections? What is driving inspector workload?
  • 24. Can we reduce the amount of work to get a property into compliance?
  • 25. Identify problem Explore data Create model Adjust operations Track results Add dataUpdate model Communicate Results
  • 26. Questions ● Is the backlog still growing?
  • 27. Is the backlog of inspections growing? Year Net Violations 2012 5,204 2013 13,703 2014 1,398 2015 1,402 2016 -4,431
  • 28. Questions ● Is the backlog still growing? ● Are more violations being issued?
  • 29. 2012-2016 Source: Kansas City Open Data Portal
  • 30. Are there more violations per month?
  • 31. Questions ● Is the backlog still growing? ● Are more violations being issued? ● Are a few properties responsible?
  • 32. Are more properties becoming noncompliant?
  • 33. Are repeat offenders a growing issue? Violations by Property Most Recent Violation Opened 2010 2011 2012 2013 2014 2015 2016 Grand Total Oldest Violation Opened 2010 3,783 1,010 1,020 1,130 1,241 1,603 3,541 13,406 2011 3,433 1,038 972 1,101 1,266 2,873 10,749 2012 3,183 837 778 852 1,911 7,600 2013 3,658 1,129 943 1,896 7,667 2014 3,530 1,192 1,771 6,548 2015 3,589 1,934 5,560 2016 4,656 4,690 Grand Total 3,794 4,452 5,252 6,605 7,790 9,462 18,617 56,220 Years After Initial % Repeat 1 14% 2 25% 3 34% 4 43% 5 52% 6 59%
  • 34. Questions ● Is the backlog still growing? ● Are more violations being issued? ● Are a few properties responsible? ● Do more inspections decrease resolution time?
  • 35. 2012-2016 Source: Kansas City Open Data Portal 60% of cases 70% of reinspections
  • 36. Do inspections reduce resolution times?
  • 37. Questions ● Are inspections themselves part of the issue?
  • 38. Are inspections closing violations efficiently? Average number of inspections per year: 32,291 +/-3,000 “Inefficient Reinspection” Reinspections where no violations are closed Nearly doubled between 2012 and 2016
  • 39.
  • 40. Insights ● Reinspection of open cases accounts for most work ○ Majority of reinspections on a minority of cases ● Timing of inspections is more important than quantity ● Many reinspections create more work for inspectors
  • 41. Improve inspection efficiency by modeling how long a violation will take to resolve. Reduce the number of inspections that close 0 violations At the address/case level
  • 42. Identify problem Explore data Create model Adjust operations Track results Add dataUpdate model Communicate Results
  • 43. PostgreSQL database + PostGIS spatial extension Data Infrastructure Property violation, 311, property ownership, and neighborhood related data from OpenData KC Demographic data by census block from the US Census Bureau Property values and building features from county data Data Infrastructure
  • 44. Data Infrastructure PostgreSQL database + PostGIS Schedule of Inspections • Tier 1: Initial Inspections • Tier 2: Court-ordered • Tier 3: Model Prioritized • Tier 4: Low Priority • AWS hosted server for analysis • Python + Jupyter, R + R Studio • Scikit-Learn • Random Forest Regression • 70% of data as training set ■ Corrected for class imbalance • Mutual regression feature selection • Grid Search parameter optimization • Target: Violation Days Open • Grouped violations by property to produce recommendations for inspectors • How many days has it been open? • What are the other violations present?
  • 45. Correctly sorts violations from shortest to longest. (Bars rise from left to right)
  • 46. Underestimated time to resolution Overestimated time to resolution Red line represents a perfect prediction Blue dots represent violations from the “test” data MSE = 45.5 days
  • 47. Current Status Cases predicted as ready to be closed: 3,870 properties (62% of open cases) (38% of open cases can be deferred)
  • 48. Model-informed inspection cycle INITIAL INSPECTION New violations come in ACTION Enhance inspection schedule with model-informed targets DATA WRANGLING Add other relevant data and run model AGGREGATION Compare prediction to current violation age; group by address
  • 49. Anticipated outcomes •More time helping residents •More timely inspections •Fewer follow-up visits documenting existing issues •Less frustration all around!
  • 50. Future opportunities for analysis • “Inspector Effect” • The inspection area is a predictive feature, but no data linking inspectors to specific inspections • Weather impacts • More detailed analyses could find relations between ground saturation, wind speed, and specific violation types • Utility Analysis • Can water bills be used to proactively identify severe violations? • Court Data • How effective are current penalties in preventing reviolation? • Equity in Outcomes • Testing for bias by examining outcomes of inspection processes
  • 51. Next Steps (this project) ● Technical infrastructure ● Fold into prioritization ● Field testing ● Future analyses