SlideShare a Scribd company logo
1 of 1
Download to read offline
Project Methodology
Customer Analytics for Social Good:
Machine Learning to identify “Sister Cities” for Benchmarking
Consultant: Krishnesh Pujari | Professor: Dr. Kathy Weaver
Client: Montgomery County – Office of the County Executive – CountyStat Team
Introduction
• Montgomery County is a recognized
leader in open data and innovation
• County seeks to benchmark itself
against like jurisdictions to assess its
performance and identify
opportunities for improvement
Problem Statement
• Currently, the CountyStat team
identifies similar counties purely
based on subjective determination
• CountyStat team wishes to automate
the process using unsupervised
machine learning algorithm
Goals
• Need to implement clustering
algorithm that identifies similar
jurisdictions
• Create a tool that can be used by any
County Government to benchmark
themselves against their peers
Challenges
• Working with multi-dimensional data
to identify prime factors
• Feature engineer the data discard
masking variables for better results
• Build interactive visualization using
Rstudio’s Shiny App Framework
Accomplishments
• Automated benchmarking process by
identifying clusters of similar counties
• Developed Interactive tool that can be
used by any US County Government
• Hands on experience working on R
programming, Shiny App, and Excel
Future Scope
• Optimize the algorithm to factor data
and identify similar counties across
different industry sectors
• Integrate data preparation task with
the tool
United States Census
Bureau Data Source
Data Preparation and
Analysis
Data Transformation
Unsupervised K-
Means Clustering
Client Server

More Related Content

Similar to PujariKrishnesh_Capstone_Project Poster

Unearth the limitless possibilities with Amazon Sagemaker.pptx
Unearth the limitless possibilities with Amazon Sagemaker.pptxUnearth the limitless possibilities with Amazon Sagemaker.pptx
Unearth the limitless possibilities with Amazon Sagemaker.pptxMarketing CloudThat
 
Government and Software as-a-service
Government and Software as-a-serviceGovernment and Software as-a-service
Government and Software as-a-serviceProudCity
 
Application of Analytics in Construction Management
Application of Analytics in Construction ManagementApplication of Analytics in Construction Management
Application of Analytics in Construction ManagementLogiKal Projects
 
Transportation Planning Implications of Automated Vehicles on Texas Highways
Transportation Planning Implications of Automated Vehicles on Texas Highways Transportation Planning Implications of Automated Vehicles on Texas Highways
Transportation Planning Implications of Automated Vehicles on Texas Highways Texas A&M Transportation Institute
 
Why is programmatic taking off? What is this revolution all about?
Why is programmatic taking off?  What is this revolution all about?Why is programmatic taking off?  What is this revolution all about?
Why is programmatic taking off? What is this revolution all about?Datacratic
 
Lavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScript
Lavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScriptLavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScript
Lavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScriptLavanya Dubbani
 
Process and Project Metrics-1
Process and Project Metrics-1Process and Project Metrics-1
Process and Project Metrics-1Saqib Raza
 
Continuous Improvement
Continuous ImprovementContinuous Improvement
Continuous ImprovementReading Room
 
CWP Meetup - All of Government Design
CWP Meetup - All of Government DesignCWP Meetup - All of Government Design
CWP Meetup - All of Government DesignAndrew Underwood
 
Effective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoSEffective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoSsethuraman R
 
Anjali gupta resume
Anjali gupta resumeAnjali gupta resume
Anjali gupta resumeANJALI GUPTA
 
BMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist DeckBMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist DeckSasha Lazarevic
 
URISA’s Local Government GIS Capability Maturity Model
URISA’s Local Government GIS Capability Maturity ModelURISA’s Local Government GIS Capability Maturity Model
URISA’s Local Government GIS Capability Maturity ModelGreg Babinski
 
LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.
LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.
LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.Souma Maiti
 
Ramanan Resume New
Ramanan Resume NewRamanan Resume New
Ramanan Resume NewRam Ramanan
 
AI and the Financial Service Segment
AI and the Financial Service SegmentAI and the Financial Service Segment
AI and the Financial Service SegmentGraeme Wood
 

Similar to PujariKrishnesh_Capstone_Project Poster (20)

J sai subrahmanyam_insofe
J sai subrahmanyam_insofeJ sai subrahmanyam_insofe
J sai subrahmanyam_insofe
 
Unearth the limitless possibilities with Amazon Sagemaker.pptx
Unearth the limitless possibilities with Amazon Sagemaker.pptxUnearth the limitless possibilities with Amazon Sagemaker.pptx
Unearth the limitless possibilities with Amazon Sagemaker.pptx
 
Government and Software as-a-service
Government and Software as-a-serviceGovernment and Software as-a-service
Government and Software as-a-service
 
Application of Analytics in Construction Management
Application of Analytics in Construction ManagementApplication of Analytics in Construction Management
Application of Analytics in Construction Management
 
Transportation Planning Implications of Automated Vehicles on Texas Highways
Transportation Planning Implications of Automated Vehicles on Texas Highways Transportation Planning Implications of Automated Vehicles on Texas Highways
Transportation Planning Implications of Automated Vehicles on Texas Highways
 
Why is programmatic taking off? What is this revolution all about?
Why is programmatic taking off?  What is this revolution all about?Why is programmatic taking off?  What is this revolution all about?
Why is programmatic taking off? What is this revolution all about?
 
Lavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScript
Lavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScriptLavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScript
Lavanya_4.3yrsExp_AutomationTesting_QTP_UFT_VBScript
 
Process and Project Metrics-1
Process and Project Metrics-1Process and Project Metrics-1
Process and Project Metrics-1
 
Continuous Improvement
Continuous ImprovementContinuous Improvement
Continuous Improvement
 
CWP Meetup - All of Government Design
CWP Meetup - All of Government DesignCWP Meetup - All of Government Design
CWP Meetup - All of Government Design
 
Effective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoSEffective Semantic Web Service Composition Framework Based on QoS
Effective Semantic Web Service Composition Framework Based on QoS
 
Anjali gupta resume
Anjali gupta resumeAnjali gupta resume
Anjali gupta resume
 
BMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist DeckBMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist Deck
 
URISA’s Local Government GIS Capability Maturity Model
URISA’s Local Government GIS Capability Maturity ModelURISA’s Local Government GIS Capability Maturity Model
URISA’s Local Government GIS Capability Maturity Model
 
Ch 3
Ch   3Ch   3
Ch 3
 
LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.
LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.
LOAN APPROVAL PRDICTION SYSTEM USING MACHINE LEARNING.
 
ml.pptx
ml.pptxml.pptx
ml.pptx
 
Ramanan Resume New
Ramanan Resume NewRamanan Resume New
Ramanan Resume New
 
Resume
ResumeResume
Resume
 
AI and the Financial Service Segment
AI and the Financial Service SegmentAI and the Financial Service Segment
AI and the Financial Service Segment
 

PujariKrishnesh_Capstone_Project Poster

  • 1. Project Methodology Customer Analytics for Social Good: Machine Learning to identify “Sister Cities” for Benchmarking Consultant: Krishnesh Pujari | Professor: Dr. Kathy Weaver Client: Montgomery County – Office of the County Executive – CountyStat Team Introduction • Montgomery County is a recognized leader in open data and innovation • County seeks to benchmark itself against like jurisdictions to assess its performance and identify opportunities for improvement Problem Statement • Currently, the CountyStat team identifies similar counties purely based on subjective determination • CountyStat team wishes to automate the process using unsupervised machine learning algorithm Goals • Need to implement clustering algorithm that identifies similar jurisdictions • Create a tool that can be used by any County Government to benchmark themselves against their peers Challenges • Working with multi-dimensional data to identify prime factors • Feature engineer the data discard masking variables for better results • Build interactive visualization using Rstudio’s Shiny App Framework Accomplishments • Automated benchmarking process by identifying clusters of similar counties • Developed Interactive tool that can be used by any US County Government • Hands on experience working on R programming, Shiny App, and Excel Future Scope • Optimize the algorithm to factor data and identify similar counties across different industry sectors • Integrate data preparation task with the tool United States Census Bureau Data Source Data Preparation and Analysis Data Transformation Unsupervised K- Means Clustering Client Server