SlideShare ist ein Scribd-Unternehmen logo
1 von 40
Downloaden Sie, um offline zu lesen
Predicting Churn in Telecom
Outline
• Business Problem
• Variable Description
• Exploratory Data Analysis
• Feature Selection
• Data Pre-Processing
• Model Development
• Model Validation
Business Problem
• Consumers today go through a complex decision
making process before subscribing to any one of the
numerous Telecom service options.
• The services provided by the Telecom vendors are not
highly differentiated and number portability is
commonplace.
• customer loyalty becomes an issue. Hence, it is
becoming increasingly important for
telecommunications companies to proactively
identify factors that have a tendency to unsubscribe
and take preventive measures to retain customers.
Variable Description
• State : categorical, for the 50 states and the District of Columbia
• Account Length : integer-valued, how long account has been active
• Area Code : categorical
• Phone : Phone number of customer
• Int'l Plan : International plan activated ( yes , no)
• VMail Plan : Voice Mail plan activated ( yes , no )
• VMail Message : No. of voice mail messages
• Day Mins : Total day minutes used
• Day Calls : Total day calls made
• Day Charge : Total day charge
• Eve Mins : Total evening minutes
• Eve Calls : Total evening calls
• Eve Charge : Total evening charge
• Night Mins : Total night minutes
• Night Calls : Total night calls
• Night Charge : Total night charge
• Intl Mins : Total International minutes used
• Intl Calls : Total International calls made
• Intl Charge : Total International charge
• CustServ Calls : Number of customer service calls made
• Churn : Customer churn (Target Variable 1= churn , 0= not churned )
Exploratory Data Analysis
Summary statistics
Visualizing statistics
Plot 1
Plot 2:
Plot 3
Few observation from exploratory
analysis
• Customers with the International Plan tend to
churn more frequently
• Customers with the Voice Mail Plan tend to
churn less frequently.
• Customers with four or more customer service
calls churn more than four times as often as
do the other customers.
Feature Selection
• Important features were identified during
model building process for ex:
– Stepwise regression indicates important variable
to consider
– Variable importance graph has been generated
using random forest and so on
Data Pre-Processing
• Dataset considered for this project is already
cleaned
• We have partitioned our dataset into training and
testing set using simple random sampling
• We have dropped following four variables as they
are not adding any meaning for modelling
purpose
– State
– Area.code
– Account.length
– Phone number
Model 1: Decision Tree
• Easy to interpret
• Generates if-else business rules
• Recursive partitioning and classification
technique is used
• Tree build
– Fully grown (results in overfitting of data)
– Pruned tree (optimal tree)
• R packages used:
– Rpart
– Caret
Tree 1: Full Tree
Performance measure of full tree :
ROC Curve
Performance measure of full tree :
Confusion Matrix and other statistics
Tree 2: Pruned Tree
Performance Measure of Pruned Tree:
ROC Curve
Performance measure of Pruned tree :
Confusion Matrix and other statistics
Comparing Performance of both the
tree: ROC Curve
Compare : Confusion Matrix and other
statistics
Full Tree Pruned Tree
Model 2: Logistic Regression
• Widely used across industry
• R packages used
– Glm for model building
– Caret for model evaluation
Model Summary on all variable as
Input
Model summary on statistically
significant variables
Model Evaluation-Confusion Matrix
Model 3: Support Vector Machine
• Widely used black box technique for binary
classification
• R packages used
– e1071 (for model building)
– Caret (for model evaluation)
Model performance: Confusion Matrix
Model Evaluation: SVM Roc Curve
Model 4: Ensemble (Random Forest)
• Ensembling of decision trees will be done
• R packages used:
– randomForest (model development)
– caret (model evaluation)
Variable Importance Plot : Random
Forest
Model Evaluation : Confusion Matrix
Model Evaluation : ROC curve
(Random Forest)
Models Comparison: ROC curve
CUSTMER SEGMENTATION & CLTV
CALCULATION
• Different techniques are available for
customer segmentation.
• Customer can be segmented into different
kind of profiles like high value, low value,
warm, cold and so on.
• RFM analaysis, CLTV based segmentation,
clustering based segmentation are few
techniques to name
CLTV( customer life time value)
• CLTV (Customer LifeTime Value) refers to the
amount of revenues that you expect to
generate from a customer during the period
over which your service will be of value.
• On the basis of above values we segment
customer profiles and treat them accordingly
Assumptions
• Due to limitation in our dataset we performed CLTV
analysis on the basis of the following assumptions:
– Given data contains one year of transaction details
– Unit of amount is dollars
– following are the margins that company is getting from
their customer
• 5% of day charge
• 10% of evening hours
• 20% of night and international calls
– Monthly churn rate of telecom industry is 4%
Note: above numbers are for illustration purpose only and it depends on domain knowledge of analyst.
CLTV calculation
• On the basis of this assumptions net profit
from any customer can be calculated as:
-> Net profit = 0.05*daycharge + 0.10* eve.charge + 0.15 *night charge + 0.20 * Intnl charge
->Churnrate = 0.04
->Customer_cltv = (netprofit-0.5*cust_serv_call)/churnrate
• For illustration purpose in our case customers
whose cltv is less than mean(cltv) are
considered as LVC and other are HVC
Note: Above segmentation can be done in a better way with the help of
business domain expert
• THANK YOU

Weitere ähnliche Inhalte

Was ist angesagt?

Data analytics telecom churn final ppt
Data analytics telecom churn final ppt Data analytics telecom churn final ppt
Data analytics telecom churn final ppt Gunvansh Khanna
 
churn prediction in telecom
churn prediction in telecom churn prediction in telecom
churn prediction in telecom Hong Bui Van
 
Churn prediction
Churn predictionChurn prediction
Churn predictionGigi Lino
 
Customer attrition and churn modeling
Customer attrition and churn modelingCustomer attrition and churn modeling
Customer attrition and churn modelingMariya Korsakova
 
Customer Churn Analysis and Prediction
Customer Churn Analysis and PredictionCustomer Churn Analysis and Prediction
Customer Churn Analysis and PredictionSOUMIT KAR
 
Churn prediction data modeling
Churn prediction data modelingChurn prediction data modeling
Churn prediction data modelingPierre Gutierrez
 
Ways to Reduce the Customer Churn Rate
Ways to Reduce the Customer Churn RateWays to Reduce the Customer Churn Rate
Ways to Reduce the Customer Churn RateFORMCEPT
 
Churn customer analysis
Churn customer analysisChurn customer analysis
Churn customer analysisDr.Bechoo Lal
 
Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationVishva Abeyrathne
 
Customer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R OpenCustomer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R OpenPoo Kuan Hoong
 
Telecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticsTelecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticssheetal sharma
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPranov Mishra
 
Bank churn with Data Science
Bank churn with Data ScienceBank churn with Data Science
Bank churn with Data ScienceCarolyn Knight
 
IRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom IndustryIRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom IndustryIRJET Journal
 
Churn Prediction in Practice
Churn Prediction in PracticeChurn Prediction in Practice
Churn Prediction in PracticeBigData Republic
 

Was ist angesagt? (20)

Data analytics telecom churn final ppt
Data analytics telecom churn final ppt Data analytics telecom churn final ppt
Data analytics telecom churn final ppt
 
churn prediction in telecom
churn prediction in telecom churn prediction in telecom
churn prediction in telecom
 
Churn prediction
Churn predictionChurn prediction
Churn prediction
 
Customer attrition and churn modeling
Customer attrition and churn modelingCustomer attrition and churn modeling
Customer attrition and churn modeling
 
Predicting the e-commerce churn
Predicting the e-commerce churnPredicting the e-commerce churn
Predicting the e-commerce churn
 
Customer Churn Analysis and Prediction
Customer Churn Analysis and PredictionCustomer Churn Analysis and Prediction
Customer Churn Analysis and Prediction
 
Churn prediction data modeling
Churn prediction data modelingChurn prediction data modeling
Churn prediction data modeling
 
Ways to Reduce the Customer Churn Rate
Ways to Reduce the Customer Churn RateWays to Reduce the Customer Churn Rate
Ways to Reduce the Customer Churn Rate
 
Churn customer analysis
Churn customer analysisChurn customer analysis
Churn customer analysis
 
Predicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using ClassificationPredicting Bank Customer Churn Using Classification
Predicting Bank Customer Churn Using Classification
 
Customer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R OpenCustomer Churn Analytics using Microsoft R Open
Customer Churn Analytics using Microsoft R Open
 
Telcom churn .pptx
Telcom churn .pptxTelcom churn .pptx
Telcom churn .pptx
 
DTH Case Study
DTH Case StudyDTH Case Study
DTH Case Study
 
Telecom customer churn prediction
Telecom customer churn predictionTelecom customer churn prediction
Telecom customer churn prediction
 
Telecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analyticsTelecommunication Analysis (3 use-cases) with IBM watson analytics
Telecommunication Analysis (3 use-cases) with IBM watson analytics
 
Customer churn prediction in banking
Customer churn prediction in bankingCustomer churn prediction in banking
Customer churn prediction in banking
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom Industry
 
Bank churn with Data Science
Bank churn with Data ScienceBank churn with Data Science
Bank churn with Data Science
 
IRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom IndustryIRJET - Customer Churn Analysis in Telecom Industry
IRJET - Customer Churn Analysis in Telecom Industry
 
Churn Prediction in Practice
Churn Prediction in PracticeChurn Prediction in Practice
Churn Prediction in Practice
 

Andere mochten auch

Telco Churn Roi V3
Telco Churn Roi V3Telco Churn Roi V3
Telco Churn Roi V3hkaul
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...BAINIDA
 
Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]Flytxt
 
Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?Idiro Analytics
 
Big Telco - Yousun Jeong
Big Telco - Yousun JeongBig Telco - Yousun Jeong
Big Telco - Yousun JeongSpark Summit
 
Idiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online GamingIdiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online GamingIdiro Analytics
 
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics
 
Idiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics
 
Social Network Analysis for Telecoms
Social Network Analysis for TelecomsSocial Network Analysis for Telecoms
Social Network Analysis for TelecomsDataspora
 
Predicting churn in telco industry: machine learning approach - Marko Mitić
 Predicting churn in telco industry: machine learning approach - Marko Mitić Predicting churn in telco industry: machine learning approach - Marko Mitić
Predicting churn in telco industry: machine learning approach - Marko MitićInstitute of Contemporary Sciences
 
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...Helena Edelson
 
Decide on technology stack & data architecture
Decide on technology stack & data architectureDecide on technology stack & data architecture
Decide on technology stack & data architectureSV.CO
 
How to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-sellingHow to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-sellingRedspire Ltd
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachAndry Alamsyah
 
Big Data: Social Network Analysis
Big Data: Social Network AnalysisBig Data: Social Network Analysis
Big Data: Social Network AnalysisMichel Bruley
 

Andere mochten auch (15)

Telco Churn Roi V3
Telco Churn Roi V3Telco Churn Roi V3
Telco Churn Roi V3
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
 
Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]Deriving economic value for CSPs with Big Data [read-only]
Deriving economic value for CSPs with Big Data [read-only]
 
Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?Idiro Analytics - What is Rotational Churn and how can we tackle it?
Idiro Analytics - What is Rotational Churn and how can we tackle it?
 
Big Telco - Yousun Jeong
Big Telco - Yousun JeongBig Telco - Yousun Jeong
Big Telco - Yousun Jeong
 
Idiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online GamingIdiro Analytics - Social Network Analysis for Online Gaming
Idiro Analytics - Social Network Analysis for Online Gaming
 
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
 
Idiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big Data
 
Social Network Analysis for Telecoms
Social Network Analysis for TelecomsSocial Network Analysis for Telecoms
Social Network Analysis for Telecoms
 
Predicting churn in telco industry: machine learning approach - Marko Mitić
 Predicting churn in telco industry: machine learning approach - Marko Mitić Predicting churn in telco industry: machine learning approach - Marko Mitić
Predicting churn in telco industry: machine learning approach - Marko Mitić
 
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
 
Decide on technology stack & data architecture
Decide on technology stack & data architectureDecide on technology stack & data architecture
Decide on technology stack & data architecture
 
How to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-sellingHow to use your CRM for upselling and cross-selling
How to use your CRM for upselling and cross-selling
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network Approach
 
Big Data: Social Network Analysis
Big Data: Social Network AnalysisBig Data: Social Network Analysis
Big Data: Social Network Analysis
 

Ähnlich wie Churn modelling

Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptxAniket Patil
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptxpatilaniket2418
 
Revenue assurance 102
Revenue assurance 102Revenue assurance 102
Revenue assurance 102ntel
 
NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...DATA Inc.
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformArvind Sathi
 
Insight-2015-Session-3193
Insight-2015-Session-3193Insight-2015-Session-3193
Insight-2015-Session-3193Michal Miklas
 
Business Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeBusiness Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeTatiana Ivanova
 
Acqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - PresentationAcqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - PresentationAcqueon Technologies Inc.
 
Mathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementMathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementZehra Kendir
 
Inspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill LynchInspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill LynchAltan Atabarut, MSc.
 
ASUG Utilities Presentation
ASUG Utilities PresentationASUG Utilities Presentation
ASUG Utilities PresentationMichael Robinson
 
Modelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessModelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessDatabricks
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
Key findings when upgrading your sap crm system
Key findings when upgrading your sap crm systemKey findings when upgrading your sap crm system
Key findings when upgrading your sap crm systemrobgirvan
 
Calculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to ActionCalculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to ActionPerformLaw
 

Ähnlich wie Churn modelling (20)

Customer Life Time Value
Customer Life Time ValueCustomer Life Time Value
Customer Life Time Value
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Revenue assurance 102
Revenue assurance 102Revenue assurance 102
Revenue assurance 102
 
NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...NJTC Workshop on Financial Services Technology and Application Development Pr...
NJTC Workshop on Financial Services Technology and Application Development Pr...
 
Implementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics Platform
 
Final mis power
Final mis powerFinal mis power
Final mis power
 
Insight-2015-Session-3193
Insight-2015-Session-3193Insight-2015-Session-3193
Insight-2015-Session-3193
 
Business Intelligence and OLAP Practice
Business Intelligence and OLAP PracticeBusiness Intelligence and OLAP Practice
Business Intelligence and OLAP Practice
 
Krrushnan Resume - Mainframe (2)
Krrushnan Resume - Mainframe (2)Krrushnan Resume - Mainframe (2)
Krrushnan Resume - Mainframe (2)
 
Acqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - PresentationAcqueon's LCM - for Cisco Unified CCE Dialer - Presentation
Acqueon's LCM - for Cisco Unified CCE Dialer - Presentation
 
Mathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer ManagementMathematical Model For Customer Life Time Based Offer Management
Mathematical Model For Customer Life Time Based Offer Management
 
Inspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill LynchInspire2015 Bank of America Merrill Lynch
Inspire2015 Bank of America Merrill Lynch
 
ASUG Utilities Presentation
ASUG Utilities PresentationASUG Utilities Presentation
ASUG Utilities Presentation
 
Modelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription BusinessModelling Customer Lifetime Revenue for Subscription Business
Modelling Customer Lifetime Revenue for Subscription Business
 
VAN
VANVAN
VAN
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
1710 track3 zhu
1710 track3 zhu1710 track3 zhu
1710 track3 zhu
 
Key findings when upgrading your sap crm system
Key findings when upgrading your sap crm systemKey findings when upgrading your sap crm system
Key findings when upgrading your sap crm system
 
Calculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to ActionCalculating Client Profitability: Analysis to Action
Calculating Client Profitability: Analysis to Action
 

Kürzlich hochgeladen

Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligencePriyadharshiniG41
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...boychatmate1
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 

Kürzlich hochgeladen (20)

Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligence
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 

Churn modelling

  • 2. Outline • Business Problem • Variable Description • Exploratory Data Analysis • Feature Selection • Data Pre-Processing • Model Development • Model Validation
  • 3. Business Problem • Consumers today go through a complex decision making process before subscribing to any one of the numerous Telecom service options. • The services provided by the Telecom vendors are not highly differentiated and number portability is commonplace. • customer loyalty becomes an issue. Hence, it is becoming increasingly important for telecommunications companies to proactively identify factors that have a tendency to unsubscribe and take preventive measures to retain customers.
  • 4. Variable Description • State : categorical, for the 50 states and the District of Columbia • Account Length : integer-valued, how long account has been active • Area Code : categorical • Phone : Phone number of customer • Int'l Plan : International plan activated ( yes , no) • VMail Plan : Voice Mail plan activated ( yes , no ) • VMail Message : No. of voice mail messages • Day Mins : Total day minutes used • Day Calls : Total day calls made • Day Charge : Total day charge • Eve Mins : Total evening minutes • Eve Calls : Total evening calls • Eve Charge : Total evening charge • Night Mins : Total night minutes • Night Calls : Total night calls • Night Charge : Total night charge • Intl Mins : Total International minutes used • Intl Calls : Total International calls made • Intl Charge : Total International charge • CustServ Calls : Number of customer service calls made • Churn : Customer churn (Target Variable 1= churn , 0= not churned )
  • 8.
  • 12. Few observation from exploratory analysis • Customers with the International Plan tend to churn more frequently • Customers with the Voice Mail Plan tend to churn less frequently. • Customers with four or more customer service calls churn more than four times as often as do the other customers.
  • 13. Feature Selection • Important features were identified during model building process for ex: – Stepwise regression indicates important variable to consider – Variable importance graph has been generated using random forest and so on
  • 14. Data Pre-Processing • Dataset considered for this project is already cleaned • We have partitioned our dataset into training and testing set using simple random sampling • We have dropped following four variables as they are not adding any meaning for modelling purpose – State – Area.code – Account.length – Phone number
  • 15. Model 1: Decision Tree • Easy to interpret • Generates if-else business rules • Recursive partitioning and classification technique is used • Tree build – Fully grown (results in overfitting of data) – Pruned tree (optimal tree) • R packages used: – Rpart – Caret
  • 16. Tree 1: Full Tree
  • 17. Performance measure of full tree : ROC Curve
  • 18. Performance measure of full tree : Confusion Matrix and other statistics
  • 20. Performance Measure of Pruned Tree: ROC Curve
  • 21. Performance measure of Pruned tree : Confusion Matrix and other statistics
  • 22. Comparing Performance of both the tree: ROC Curve
  • 23. Compare : Confusion Matrix and other statistics Full Tree Pruned Tree
  • 24. Model 2: Logistic Regression • Widely used across industry • R packages used – Glm for model building – Caret for model evaluation
  • 25. Model Summary on all variable as Input
  • 26. Model summary on statistically significant variables
  • 28. Model 3: Support Vector Machine • Widely used black box technique for binary classification • R packages used – e1071 (for model building) – Caret (for model evaluation)
  • 31. Model 4: Ensemble (Random Forest) • Ensembling of decision trees will be done • R packages used: – randomForest (model development) – caret (model evaluation)
  • 32. Variable Importance Plot : Random Forest
  • 33. Model Evaluation : Confusion Matrix
  • 34. Model Evaluation : ROC curve (Random Forest)
  • 36. CUSTMER SEGMENTATION & CLTV CALCULATION • Different techniques are available for customer segmentation. • Customer can be segmented into different kind of profiles like high value, low value, warm, cold and so on. • RFM analaysis, CLTV based segmentation, clustering based segmentation are few techniques to name
  • 37. CLTV( customer life time value) • CLTV (Customer LifeTime Value) refers to the amount of revenues that you expect to generate from a customer during the period over which your service will be of value. • On the basis of above values we segment customer profiles and treat them accordingly
  • 38. Assumptions • Due to limitation in our dataset we performed CLTV analysis on the basis of the following assumptions: – Given data contains one year of transaction details – Unit of amount is dollars – following are the margins that company is getting from their customer • 5% of day charge • 10% of evening hours • 20% of night and international calls – Monthly churn rate of telecom industry is 4% Note: above numbers are for illustration purpose only and it depends on domain knowledge of analyst.
  • 39. CLTV calculation • On the basis of this assumptions net profit from any customer can be calculated as: -> Net profit = 0.05*daycharge + 0.10* eve.charge + 0.15 *night charge + 0.20 * Intnl charge ->Churnrate = 0.04 ->Customer_cltv = (netprofit-0.5*cust_serv_call)/churnrate • For illustration purpose in our case customers whose cltv is less than mean(cltv) are considered as LVC and other are HVC Note: Above segmentation can be done in a better way with the help of business domain expert

Hinweis der Redaktion

  1. We can also add transaction data and demographic data of the customer for better insights . Since we are limited to these dummy data for our analysis we will try to explain different machine learning algorithm on this dataset.
  2. Hypotheses: No of calls to customer care results in more churning & graph b/w both variables is also indicating the same
  3. Hypotheses 2: International call subscriber has more chances of churning figure also reflecting the same
  4. Hypotheses 3: subscribing to voice mail service has no significant impact on churning
  5. More conclusions can be drawn by plotting differnet graphs between different variables as per hypotheses
  6. Since we are already dealing with small number of features we are not going into specific feature selection techniques
  7. There is little difference in sensitivity and specificity which clearly indicates that our full tree has overfitted data in model building therefore one should go for pruned tree where less number of rules are generated as compared to full tree
  8. Underlined variables are statistically significant
  9. _