SlideShare a Scribd company logo
1 of 5
Download to read offline
Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering
               Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                            Vol. 2, Issue 4, June-July 2012, pp.693-697
                              A Proposed Churn Prediction Model
               Essam Shaaban*, Yehia Helmy, Ayman Khedr, Mona Nasr**
                    *(Department of Information Systems, MUST University, 6 October, Cairo, Egypt
                       ** (Department of Information Systems, Helwan University, Cairo, Egypt



ABSTRACT
         Churn prediction aims to detect customers            techniques may use Decision Tree (DT), Support Vector
intended to leave a service provider. Retaining one           Machine (SVM) in addition to Neural Networks (NN),
customer costs an organization from 5 to 10 times             Genetic Algorithms (GA) or Fuzzy Logic (FL) to predict
than gaining a new one. Predictive models can                 churners.
provide correct identification of possible churners in                 This paper is organized as follows. Section 2
the near future in order to provide a retention               describes the types of churners. Section 3 shows the
solution. This paper presents a new prediction model          existing prediction models rather than the techniques of
based on Data Mining (DM) techniques. The                     developing a predictive model. Section 4 describes the
proposed model is composed of six steps which are;            proposed churn prediction model besides the results of
identify problem domain, data selection, investigate          an implemented case study. Finally; conclusion and
data set, classification, clustering and knowledge            future work are presented.
usage. A data set with 23 attributes and 5000
instances is used. 4000 instances used for training the       2. TYPES OF CHURNERS
model and 1000 instances used as a testing set. The                     As figure 1 depicts; There are two main
predicted churners are clustered into 3 categories in         categories of churners which are voluntary and
case of using in a retention strategy. The data mining        involuntary [4]. Involuntary churners are the easiest to
techniques used in this paper are Decision Tree,              identify. These are the customers that Telco decides to
Support Vector Machine and Neural Network                     remove from subscribers list. Therefore’ this category
throughout an open source software name WEKA.                 includes people that are churned for fraud, non-payment
                                                              and customers who don’t use the phone. Voluntary
Keywords:-Churn          prediction,    classification,
                                                              churner is more difficult to determine; it occurs when a
clustering, data mining, prediction model
                                                              customer makes a decision to terminate his/her service
                                                              with the provider. When people think about Telco churn
1. INTRODUCTION                                               it is usually the voluntary kind that comes to mind.
          Churn prediction process is a highly debated
research area for more than ten years. Researchers from
different disciplines have tried to analyze this problem
from their own perspectives to figure out a clear
understanding and to recommend an effective solution
for churners in many business areas. Abbasimehr et al.
[1] state that churn prediction is a useful tool to predict
customer at churn risk. Conventional churn prediction         Figure 1: churn taxonomy
techniques have the advantage of being simple and
robust with respect to defects in the input data, they                  Voluntary churn can be sub-divided into two
possess serious limitations to the interpretation of          main categories, incidental churn and deliberate churn.
reasons for churn. Therefore, measuring the                   Incidental churn occurs, not because the customers
effectiveness of a prediction model depends also on how       planned on it but because something happened in their
well the results can be interpreted for inferring the         lives. For example: change in financial condition churn,
possible reasons of churn [2]. The purpose of prediction      change in location churn, etc. Deliberate churn happens
is to anticipate the value that a random variable will        for reasons of technology (customers wanting newer or
assume in the future or to estimate the likelihood of         better technology), economics (price sensitivity), service
future events [3]. Most DM techniques derive their            quality factors, social or psychological factors, and
predictions from the value of a set of variables              convenience reasons. Deliberate churn is the problem
associated with the entities in a database. DM models         that most churn management solutions try to solve [4]
may be employed to predict customer churn developed           [5].
in many disciplines such as demographic data and/or
behavioral data. There are many DM techniques that can
be used in classification and clustering customer data to
predict churners in the near future. These


                                                                                                        693 | P a g e
Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering
              Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                           Vol. 2, Issue 4, June-July 2012, pp.693-697
3. PREDICTIVE MODELS                                3.2.2 Fuzzy Logic (FL)
          Predictive modeling is mainly concerned with       FL is a conceptually easy to understand. The
predicting how the customer will behave in the future by     mathematical concepts behind fuzzy reasoning are very
analyzing their past behavior. Predicting customers who      simple. Naturalness of the approach makes it preferable
are likely to churn is one example of the predictive         to the other techniques. FL is flexible, tolerant of
modeling [6]. Predictive modeling is used in analyzing       imprecise data, and it can model nonlinear functions of
Customer Relationship Management (CRM) data                  arbitrary complexity. It can be blended with
and DM to produce customer-level models that describe        conventional control techniques. In many cases fuzzy
the likelihood that a customer will take a particular        systems expends the concept of the conventional control
action. The actions are usually sales, marketing and         techniques and simplify their implementation. Regarding
customer retention related. There are many models that       the telecom industry; there is no work achieved related
can used to define distinguish between churners and non-     to churn prediction using the fuzzy techniques [10].
churners in an organization. These models can be             There are a lot of studies have implemented in the area
classified into traditional models or techniques (RA and     of telecom churn prediction. Summary of latest churn
DT) and soft computing techniques (FL and NN) [2].           prediction studies is shown in table 1.
                                                             Table 1: churn prediction studies
3.1 Traditional techniques                                   Year Author                          Technique
3.1.1 Decision trees                                         2001 Datta et al. [11]               DT
          DT is most popular type of predictive model. It    2002 Ping and Tang [12]              DT induction
has become an important knowledge structure, used for        2003 Au et al. [13]                  GA
the classification of future events [7]. DT usually          2006 Ahn et al. [14]                 partial defection
consists of two main steps, tree building and tree                                                Survival      Analysis
pruning. The tree-building step consists of recursively      2007 Junxiang [15]
                                                                                                  Modeling
partitioning the training sets according to the values of    2008 Piotr [16]                      rough-sets
the attributes. The partitioning process continues until
                                                                      Seo, and Ranganathan
all, or most of the records in each of the partitions        2008                                 Two-level model
                                                                      [17]
contain identical values. Some branches may be removed
                                                             2009 Jahromi et al [18]              NN and DT
because it could consist of noisy data. The pruning step
involves selecting and removing the branches containing      2010 Gotovac [4]                     DT
the largest estimated error rate. Tree pruning is known to                                        partial least squares
                                                             2011 Lee et al. [19]
enhance the predictive accuracy of the decision tree,                                             (PLS) model
while reducing the complexity [8].                           2011 Yeshwanth et al [20]            Hybrid Learning
3.1.2 Regression Analysis                                                                         Local Linear Model
          RA is another popular technique used to deal                Fasanghari           and
                                                             2011                                 Tree     (LOLIMOT)
with predicting customer satisfaction it is based on                  Keramati [21]
                                                                                                  algorithm.
supervised learning models. Regression models deal
with a dataset consisting of past observations, for which    4. THE PROPOSED MODEL
both the value of the explanatory attributes and the value             The proposed model is composed of six steps.
of the continuous numerical target variable are known        As shown in figure 1, these steps are: identify problem
[3].                                                         domain, data selection, investigate data set,
3.2 Soft computing techniques                                classification, clustering and knowledge usage. As figure
3.2.1 Neural Networks                                        2 depicts; the classification step produces two types of
          NN has been successfully used to estimate          customers (churners and non-churners) while the
intricate non-linear functions. A NN is an analogous data    clustering step produces 3 clusters which are used to be
processing structure that possesses the ability to learn.    evaluated according to the retention strategy in further
The concept is loosely based on a biological brain and       usage.
has successfully been applied to many types of
problems, such as classification, control, and prediction
[9]. NN is different from DT and other classification
techniques because they can provide a prediction with its
likelihood. Various neural network approaches have
emerged over time, each with varying advantages and
disadvantages (Liao et al., 2004), however greater detail
into these variances is beyond the scope of this paper.
Research suggests that neural networks outperform
decision trees and regression models for churn prediction
[8].


                                                                                                        694 | P a g e
Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering
               Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                            Vol. 2, Issue 4, June-July 2012, pp.693-697
                                                                           The attributes for this data set are 23 attributes
                                                                 as shown in table 2. The class of evaluating the status of
                                                                 each instance in the a data set is named churn. Churn
                                                                 attribute is labeled churn=True if a customer left the
                                                                 service provider but it is labeled churn=False if he/she is
                                                                 still continuing with the service provider. churn=True if
                                                                 a customer left the service provider but it is labeled
                                                                 churn=False if he/she is still continuing with the service
                                                                 provider.
                                                                 Table 2: attributes of the data set.
                                                                 Attribute Data             Attribute Description
                                                                               Type
                                                                                            Customer’s age categorized
                                                                 Age           Number
                                                                                            into five groups
                                                                                            Customer’s gender (1=male,
                                                                 Gender        Number
                                                                                            0=female)
                                                                                            Customer’s marital status
                                                                 mar_st        Number
                                                                                            (1=yes, 0=no)
                                                                 M_in_sm                    mean of in minutes of use from
                                                                               Number
                                                                 _MOU                       the same service provider
Figure 2: The Proposed Churn Prediction model                    M_out_s                    mean of out minutes of use
         The proposed model can produce more than 3                            Number
                                                                 m_MOU                      from the same service provider
clusters based on the types of acquired knowledge.               M_sm_M                     mean of all minutes of use
Knowledge usage receives the produced clusters for                             Number
                                                                 OU                         from the same service provider
assign a retaining solution for each type of churners.           M_in_oth                   mean of in minutes of use from
Churners can be clustered according to many criteria                           Number
                                                                 _MOU                       other service provider
such as profitability or dissatisfactory of customers.           M_out_ot                   mean of out minutes of use
                                                                               Number
                                                                 h_MOU                      from other service provider
4.1 Churn Prediction with the Proposed Model                     M_oth_                     mean of all minutes of use
          During this case study an open source DM tool                        Number
                                                                 MOU                        from other service provider
named WEKA [22] is used in addition to a data set of                                        mean of in minutes of use
5000 instances. The data set is obtained from an                 M_in_M
                                                                               Number       either from the same or other
anonymous mobile service provider. As figure 2 depicts;          OU
                                                                                            service provider
the data set is divided into a training set and a testing set.                              mean of out minutes of use
The training set is 4000 (80%) instances and the testing         M_out_
                                                                               Number       either from the same or other
set is 1000 (20%) instances. The training data contains          MOU
                                                                                            service provider
3200 (80%) instances are labeled non-churners bot the                                       mean of all in and our minutes
others are 800 (20%) are labeled churners.                       MOU           Number       of use either from the same or
                                                                                            other service provider
                                                                                            Change in minutes of use from
                                                                                            one month to another during
                                                                 chng_M
                                                                               Nominal the time of the experiment
                                                                 OU
                                                                                            (decreased,      Normal       or
                                                                                            increased)
Figure 2: the distribution of the data set                                                  The mean number of messages
                                                                 M_sms         Number       during the time of the
                                                                                            experiment
                                                                 M_M_re
                                                                               Number       Mean monthly revenues
                                                                 v
                                                                 Ass_prod Number            Associated product (yes or no)
Figure 3: Training set                                           Ass_ser       Number       Associated services (yes or no)
                                                                 M_CC_c                     Mean number of customer care
                                                                               Number
                                                                 alls                       calls
                                                                 M_drop_
                                                                               Nominal       Mean number of dropped calls
                                                                 calls
                                                                 Compaint                   Mean number of complaints
                                                                               Nominal
                                                                 s
Figure 4: Testing set                                            no_chng_                   Number of changes in tariff
                                                                               Number
                                                                 tif_plan                   plan

                                                                                                             695 | P a g e
Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering
               Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                            Vol. 2, Issue 4, June-July 2012, pp.693-697
max_call                 The maximum time distance                                   3 Clusters with 421 instances
            Number
_distance                between 2 calls                                             Cluster 0     Cluster 1
                                                                                                                  Cluster 2
CHURN       Nominal      Churn=True or False                                         (88) 20.9 (208)
                                                                                                                  (125) 29.7%
                                                                                     %             49.4%
                                                              Age                    12            43             25
4.2 Interpretation of Results                                 Gender                 Female        Male           Female
The result of the classification step is represented in       mar_st                 Single        Married        Single
table 3.                                                      M_in_sm_MOU            47            29             23
Table 3: Confusion Matrix of results                          M_out_sm_MOU           12            45             12
            Actual                                            M_sm_MOU               698           43             52
                            Actual Prediction
            Class
                                                              M_in_oth_MOU           41            45             65
                            Non-
                                           Churners           M_out_oth_MOU          14            25             28
                            churners
                                                              M_oth_MOU              77            52             158
Decision    Non-            376/500        124/500
                                                              M_in_MOU               50            54             69
Tree        churners        (75.2%)        (14.8%)
                                                              M_out_MOU              730           142            85
                            97/500         403/500
            Churners                                          MOU                    194           232            210
                            (19.4%)        (80.6%)
                                                              chng_MOU               Decreased     Decreased      Decreased
            Non-            420/500        80/500
            churners        (84%)          (16%)              M_sms                  5             25             1
Neural
Networks                    83/500         417/500            M_M_rev                21.4          112.14         29
            Churners                                          Ass_prod               NO            NO             NO
                            (16.6%)        (83.4%)
            Non-            416/500        84/500             Ass_ser                YES           NO             NO
Support                                                       M_CC_calls             2             6              5
            churners        (83.2%)        (16.8)
Vector                                                        M_drop_calls           0             0              0
                            79/500         421/500
Machine Churners                                              Compaints              2             1              0
                            (15.8%)        (84.2%)
                                                              no_chng_tif_plan       0             2              3
          The accuracy and error rate of the predicted        max_call_distance      0             0              0
results are shown is table 4. The accuracy rate and error
rate are computed as shown in the following equations         The output of the clustering step can be used for the
Accuracy = (no. of correct / Total no. of predictions)        knowledge usage step in order to assign a retention
Error rate = (no. of wrong / Total no. of predictions)        strategy for a specific cluster or customer. In case of
                                                              clustering for a specific purpose such as customer’s
                                                              profitability, dissatisfactory, or cross selling this requires
                                                              a specific attribute selection from table 5.
Table 4: Accuracy and error rates comparison for
DT, NN, SVM techniques.                                       5. CONCLUSION
Technique       Accuracy       Error rate                               Many churn prediction models and techniques
Decision Tree 77.9%            22.1%                          have been presented to date. However, a simple model is
(DT)            (779/1000)      (221/1000)                    required to distinguish churners from non- churners then
Neural          83.7%          16.3%                          clustering the resulted churners for providing retention
Networks (NN)   (837/1000)      (163/1000)                    solutions. In this paper, a simple model based on DM
Support Vector 83.7%           16.3%                          techniques was introduced to help a CRM department to
Machine (SVM) (837/1000)        (163/1000)                    keep track its customers and their behavior against
                                                              churn. A data set of 5000 instances with 23 attributes is
          The best classification results can be extracted    used to train and test the model. Using 3 different
from tables 3 and 4 which is found in SVM classification      techniques which are DT, SVM, and NN for
technique that is the most relevant with NN classification    classification and simple K Means techniques for
technique for the data set in hand. SVM classification        clustering results indicate that the best output for the data
technique predicts 421 churners from 500. The predicted       set in hand is SVM technique. The next stage of the
churners are used to be clustered in the next step.           authors’ research will involve performing a deeper
During the clustering step the 421 predicted churners are     analysis into the customer data to try to establish new
used to be clustered using simple K Means algorithm. 22       churn prediction retention model that will use the
attributes are used in clustering as shown in table 5. The    predicted and clustered data to assign a suitable retention
clustering process use 3 types of clusters which one can      strategies for each churner type.
name Cluster 0 (Low), Cluster 1 (Medium) and Cluster 2
(High) according to the suitable situation of clustering.
                                                              6. REFERENCES
The 3 resulting clusters can be assigned for profitability,
priority for retaining, or dissatisfactory.
                                                              [1]    H. Abbasimehr, M. Setak, M. Tarokh. A Neuro-
Table 5: Clustering Output
                                                                     Fuzzy Classifier for Customer Churn Prediction.

                                                                                                           696 | P a g e
Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering
                 Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                              Vol. 2, Issue 4, June-July 2012, pp.693-697
        International Journal of Computer Applications,            Systems with Applications, Elsevier , vol. 23,
        vol. 19, no. 8, pp. 35-41, April, 2011                     no. 2, pp. 103-112, Aug. 2002.
[2]     V. Lazarov and M. Capota. Churn                     [13]   W. Au, , C. Chan, and X. Yao, A Novel
        Prediction. Business Analytics Course. TUM                 Evolutionary Data Mining Algorithm with
        Computer        Science,       December     2007.          Applications to Churn Prediction, IEEE
        http://home.in.tum.de/~lazarov/files/research/pap          transactions on evolutionary computation, 7, 6,
        ers/churn-prediction.pdf                                   532-545, 2003
[3]     Carlo Vercellis, Business Intelligence: Data        [14]   H. Ahn, , P. Hana, and S. Lee, Customer churn
        Mining and Optimization for Decision Making,               analysis: Churn determinants and mediation
        John Wiley & Sons, Ltd. 2009 ISBN: 978-0-470-              effects of partial defection in the Korean mobile
        51138-1                                                    telecommunications          service      industry.
[4]     S.     Gotovac.     “Modeling      Data   Mining           Telecommunications Policy. Vol. 30 No.: PP.
        Applications for Prediction of Prepaid Churn in            552-568, 2006)
        Telecommunication Services,” vol. 51, no. 3, pp.    [15]   L. Junxiang. Predicting Customer Churn in the
        275-283, 2010                                              Telecommunications Industry – An Application
[5]     H. Kim, and C. Yoon, “Determinants of                      of Survival Analysis Modeling Using SAS. Data
        subscriber churn and customer loyalty in the               mining techniques. SUGI 27. Paper 114, 2007
        Korean        mobile       telephony     market.”   [16]   S. Piotr. Global Perspectives Mobile Operator
        Telecommunications Policy. Vol. 28 No.: PP.                Customer Classification in Churn Analysis.
        751-765, 2004.                                             Technical University of Szczecin, Poland SAS
[6]     M. Hassouna, Agent Based Modelling and                     Global Forum, pp. 1-5, 2008.
        Simulation: An Examination of Customer              [17]   B. Seo, C. Ranganathan, and Y. Babad,. “Two-
        Retention in the UK Mobile Market. PhD thesis,             level model of customer retention in the US
        Brunel University, UK, 2012.                               mobile telecommunications service market.”
[7]     K. Muata, and O. Bryson, Evaluation of                     Telecommunications Policy. Vol. 32 No.: PP.
        Decision Trees: AMulti Criteria Approach,                  182-196, 2008
        Computers and Operational Research, 31, 1933-       [18]   A. tamaddoni, M. Moeini, I. Akbari, A.
        1945, 2004                                                 Akbarzadeh, "A dual-step multi-algorithm
[8]     W. Au, C. Chan, and X. Yao, A Novel                        approach for churn prediction in Pre-paid
        Evolutionary Data Mining Algorithm with                    telecommunications service providers", the 6th
        Applications to Churn Prediction, IEEE                     International Conference on Innovation &
        transactions on evolutionary computation, 7, 6,            Management, SÃO PAULO, Brazil, 2009.
        532-545, 2003.                                      [19]   H. Lee, Y. Lee, H. Cho, K. Im, and Y. S. Kim,
[9]     R. Behara, W. Fisher, and J. Lemmink,                      “Mining churning behaviors and developing
        Modelling and Evaluating Service Quality                   retention strategies based on a partial least
        Measurement        Using      Neural    Networks,          squares (PLS) model,” Decision Support
        International journal of operations and                    Systems, July 2011.
        production management, 22, 10, 1162-1185,           [20]   V. Yeshwanth, V. Vemal Raj, and M.
        2002.                                                      Sharavanan, “Evolutionary Churn Prediction in
[10]    Ö. SELVİ, Traffic Accident Predictions Based               Mobile Networks Using Hybrid Learning,”
        On Fuzzy Logic Approach For Safer Urban                    Proceedings of the Twenty-Fourth International
        Environments, Case Study: Izmir Metropolitan               Florida Artificial Intelligence Research Society
        Area, PhD thesis, 2009.                                    Conference, pp. 471-476, 2011
[11]    P Datta, B. Masand, D. Mani, and B. Li,             [21]   M. Fasanghari and A. Keramati. Customer
        Automated Cellular Modeling and Prediction on              Churn Prediction Using Local Linear Model
        a Large Scale, Issues on the application of data           Tree for Iranian Telecommunication Companies.
        mining, 14, 485- 502, 2001                                 pp. 25-37, July 2011.
[12]    C.      Wei      and     I.     Chiu,    “Turning   [22]   http://www.cs.waikato.ac.nz/ml/weka/
        telecommunications call details to churn
        prediction: a data mining approach,” Expert




                                                                                                      697 | P a g e

More Related Content

What's hot

Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...
Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...
Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...TELKOMNIKA JOURNAL
 
Automated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning ModelsAutomated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning ModelsIRJET Journal
 
Possibility fuzzy c means clustering for expression invariant face recognition
Possibility fuzzy c means clustering for expression invariant face recognitionPossibility fuzzy c means clustering for expression invariant face recognition
Possibility fuzzy c means clustering for expression invariant face recognitionIJCI JOURNAL
 
Selecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality ConceptsSelecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality Conceptsijdms
 
Ijatcse71852019
Ijatcse71852019Ijatcse71852019
Ijatcse71852019loki536577
 
Filtering of Frequency Components for Privacy Preserving Facial Recognition
Filtering of Frequency Components for Privacy Preserving Facial RecognitionFiltering of Frequency Components for Privacy Preserving Facial Recognition
Filtering of Frequency Components for Privacy Preserving Facial RecognitionArtur Filipowicz
 
An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...
An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...
An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...ijctcm
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
IRJET - Clustering Algorithm for Brain Image Segmentation
IRJET - Clustering Algorithm for Brain Image SegmentationIRJET - Clustering Algorithm for Brain Image Segmentation
IRJET - Clustering Algorithm for Brain Image SegmentationIRJET Journal
 
GRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTION
GRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTIONGRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTION
GRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTIONIJCSEA Journal
 
Feature selection in multimodal
Feature selection in multimodalFeature selection in multimodal
Feature selection in multimodalijcsa
 
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5ssuser33da69
 
The use of genetic algorithm, clustering and feature selection techniques in ...
The use of genetic algorithm, clustering and feature selection techniques in ...The use of genetic algorithm, clustering and feature selection techniques in ...
The use of genetic algorithm, clustering and feature selection techniques in ...IJMIT JOURNAL
 
T OWARDS A S YSTEM D YNAMICS M ODELING M E- THOD B ASED ON DEMATEL
T OWARDS A  S YSTEM  D YNAMICS  M ODELING  M E- THOD B ASED ON  DEMATELT OWARDS A  S YSTEM  D YNAMICS  M ODELING  M E- THOD B ASED ON  DEMATEL
T OWARDS A S YSTEM D YNAMICS M ODELING M E- THOD B ASED ON DEMATELijcsit
 
Optimizing Market Segmentation
Optimizing Market SegmentationOptimizing Market Segmentation
Optimizing Market SegmentationRobert Colner
 
Intrusion Detection System for Classification of Attacks with Cross Validation
Intrusion Detection System for Classification of Attacks with Cross ValidationIntrusion Detection System for Classification of Attacks with Cross Validation
Intrusion Detection System for Classification of Attacks with Cross Validationinventionjournals
 

What's hot (19)

Malhotra06
Malhotra06Malhotra06
Malhotra06
 
Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...
Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...
Rank Computation Model for Distribution Product in Fuzzy Multiple Attribute D...
 
Automated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning ModelsAutomated Feature Selection and Churn Prediction using Deep Learning Models
Automated Feature Selection and Churn Prediction using Deep Learning Models
 
Possibility fuzzy c means clustering for expression invariant face recognition
Possibility fuzzy c means clustering for expression invariant face recognitionPossibility fuzzy c means clustering for expression invariant face recognition
Possibility fuzzy c means clustering for expression invariant face recognition
 
Selecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality ConceptsSelecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality Concepts
 
Distributed Streams
Distributed StreamsDistributed Streams
Distributed Streams
 
Ijatcse71852019
Ijatcse71852019Ijatcse71852019
Ijatcse71852019
 
Filtering of Frequency Components for Privacy Preserving Facial Recognition
Filtering of Frequency Components for Privacy Preserving Facial RecognitionFiltering of Frequency Components for Privacy Preserving Facial Recognition
Filtering of Frequency Components for Privacy Preserving Facial Recognition
 
An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...
An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...
An Empirical Comparison and Feature Reduction Performance Analysis of Intrusi...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
IRJET - Clustering Algorithm for Brain Image Segmentation
IRJET - Clustering Algorithm for Brain Image SegmentationIRJET - Clustering Algorithm for Brain Image Segmentation
IRJET - Clustering Algorithm for Brain Image Segmentation
 
GRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTION
GRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTIONGRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTION
GRID COMPUTING: STRATEGIC DECISION MAKING IN RESOURCE SELECTION
 
Feature selection in multimodal
Feature selection in multimodalFeature selection in multimodal
Feature selection in multimodal
 
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
Decision treeinductionmethodsandtheirapplicationtobigdatafinal 5
 
The use of genetic algorithm, clustering and feature selection techniques in ...
The use of genetic algorithm, clustering and feature selection techniques in ...The use of genetic algorithm, clustering and feature selection techniques in ...
The use of genetic algorithm, clustering and feature selection techniques in ...
 
T OWARDS A S YSTEM D YNAMICS M ODELING M E- THOD B ASED ON DEMATEL
T OWARDS A  S YSTEM  D YNAMICS  M ODELING  M E- THOD B ASED ON  DEMATELT OWARDS A  S YSTEM  D YNAMICS  M ODELING  M E- THOD B ASED ON  DEMATEL
T OWARDS A S YSTEM D YNAMICS M ODELING M E- THOD B ASED ON DEMATEL
 
Optimizing Market Segmentation
Optimizing Market SegmentationOptimizing Market Segmentation
Optimizing Market Segmentation
 
Intrusion Detection System for Classification of Attacks with Cross Validation
Intrusion Detection System for Classification of Attacks with Cross ValidationIntrusion Detection System for Classification of Attacks with Cross Validation
Intrusion Detection System for Classification of Attacks with Cross Validation
 
Malhotra04
Malhotra04Malhotra04
Malhotra04
 

Viewers also liked (20)

As24295300
As24295300As24295300
As24295300
 
Ba24345348
Ba24345348Ba24345348
Ba24345348
 
Be24365370
Be24365370Be24365370
Be24365370
 
Ay24334338
Ay24334338Ay24334338
Ay24334338
 
Am24262265
Am24262265Am24262265
Am24262265
 
Bi24385389
Bi24385389Bi24385389
Bi24385389
 
A24001005
A24001005A24001005
A24001005
 
Ar24289294
Ar24289294Ar24289294
Ar24289294
 
Ag24227232
Ag24227232Ag24227232
Ag24227232
 
Cg31545555
Cg31545555Cg31545555
Cg31545555
 
Cw31641675
Cw31641675Cw31641675
Cw31641675
 
Bw31494497
Bw31494497Bw31494497
Bw31494497
 
Ct31628631
Ct31628631Ct31628631
Ct31628631
 
Cu31632635
Cu31632635Cu31632635
Cu31632635
 
Z31157162
Z31157162Z31157162
Z31157162
 
Cz31693698
Cz31693698Cz31693698
Cz31693698
 
Slide
SlideSlide
Slide
 
O sangue
O sangueO sangue
O sangue
 
Инструкция по грабберу
Инструкция по грабберуИнструкция по грабберу
Инструкция по грабберу
 
Pressupost nuria 1
Pressupost nuria 1Pressupost nuria 1
Pressupost nuria 1
 

Similar to Df24693697

A Proposed Churn Prediction Model
A Proposed Churn Prediction ModelA Proposed Churn Prediction Model
A Proposed Churn Prediction ModelMona Nasr
 
TERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTIONTERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTIONIRJET Journal
 
Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...IOSRjournaljce
 
Projection pursuit Random Forest using discriminant feature analysis model fo...
Projection pursuit Random Forest using discriminant feature analysis model fo...Projection pursuit Random Forest using discriminant feature analysis model fo...
Projection pursuit Random Forest using discriminant feature analysis model fo...IJECEIAES
 
IRJET - An Overview of Machine Learning Algorithms for Data Science
IRJET - An Overview of Machine Learning Algorithms for Data ScienceIRJET - An Overview of Machine Learning Algorithms for Data Science
IRJET - An Overview of Machine Learning Algorithms for Data ScienceIRJET Journal
 
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...gerogepatton
 
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...ijaia
 
An Explanation Framework for Interpretable Credit Scoring
An Explanation Framework for Interpretable Credit Scoring An Explanation Framework for Interpretable Credit Scoring
An Explanation Framework for Interpretable Credit Scoring gerogepatton
 
AN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORING
AN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORINGAN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORING
AN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORINGijaia
 
CUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTIONCUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTIONIRJET Journal
 
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUESCREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUESijaia
 
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...IRJET Journal
 
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRYPROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRYIJITCA Journal
 
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNINGPREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNINGIRJET Journal
 
Fake Reviews Detection using Supervised Machine Learning
Fake Reviews Detection using Supervised Machine LearningFake Reviews Detection using Supervised Machine Learning
Fake Reviews Detection using Supervised Machine LearningIRJET Journal
 
Pattern recognition using context dependent memory model (cdmm) in multimodal...
Pattern recognition using context dependent memory model (cdmm) in multimodal...Pattern recognition using context dependent memory model (cdmm) in multimodal...
Pattern recognition using context dependent memory model (cdmm) in multimodal...ijfcstjournal
 
Selecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-RSelecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-RIOSR Journals
 
IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...
IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...
IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...IRJET Journal
 
A Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine LearningA Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine LearningIRJET Journal
 
Survey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data MiningSurvey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data Miningijcsit
 

Similar to Df24693697 (20)

A Proposed Churn Prediction Model
A Proposed Churn Prediction ModelA Proposed Churn Prediction Model
A Proposed Churn Prediction Model
 
TERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTIONTERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTION
 
Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...Different Classification Technique for Data mining in Insurance Industry usin...
Different Classification Technique for Data mining in Insurance Industry usin...
 
Projection pursuit Random Forest using discriminant feature analysis model fo...
Projection pursuit Random Forest using discriminant feature analysis model fo...Projection pursuit Random Forest using discriminant feature analysis model fo...
Projection pursuit Random Forest using discriminant feature analysis model fo...
 
IRJET - An Overview of Machine Learning Algorithms for Data Science
IRJET - An Overview of Machine Learning Algorithms for Data ScienceIRJET - An Overview of Machine Learning Algorithms for Data Science
IRJET - An Overview of Machine Learning Algorithms for Data Science
 
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
 
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
A NOVEL SCHEME FOR ACCURATE REMAINING USEFUL LIFE PREDICTION FOR INDUSTRIAL I...
 
An Explanation Framework for Interpretable Credit Scoring
An Explanation Framework for Interpretable Credit Scoring An Explanation Framework for Interpretable Credit Scoring
An Explanation Framework for Interpretable Credit Scoring
 
AN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORING
AN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORINGAN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORING
AN EXPLANATION FRAMEWORK FOR INTERPRETABLE CREDIT SCORING
 
CUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTIONCUSTOMER CHURN PREDICTION
CUSTOMER CHURN PREDICTION
 
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUESCREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
 
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...
 
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRYPROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
PROVIDING A METHOD FOR DETERMINING THE INDEX OF CUSTOMER CHURN IN INDUSTRY
 
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNINGPREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
PREDICTION OF DISEASE WITH MINING ALGORITHMS IN MACHINE LEARNING
 
Fake Reviews Detection using Supervised Machine Learning
Fake Reviews Detection using Supervised Machine LearningFake Reviews Detection using Supervised Machine Learning
Fake Reviews Detection using Supervised Machine Learning
 
Pattern recognition using context dependent memory model (cdmm) in multimodal...
Pattern recognition using context dependent memory model (cdmm) in multimodal...Pattern recognition using context dependent memory model (cdmm) in multimodal...
Pattern recognition using context dependent memory model (cdmm) in multimodal...
 
Selecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-RSelecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-R
 
IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...
IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...
IRJET- Deep Learning Based Card-Less Atm Using Fingerprint And Face Recogniti...
 
A Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine LearningA Compendium of Various Applications of Machine Learning
A Compendium of Various Applications of Machine Learning
 
Survey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data MiningSurvey of the Euro Currency Fluctuation by Using Data Mining
Survey of the Euro Currency Fluctuation by Using Data Mining
 

Df24693697

  • 1. Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.693-697 A Proposed Churn Prediction Model Essam Shaaban*, Yehia Helmy, Ayman Khedr, Mona Nasr** *(Department of Information Systems, MUST University, 6 October, Cairo, Egypt ** (Department of Information Systems, Helwan University, Cairo, Egypt ABSTRACT Churn prediction aims to detect customers techniques may use Decision Tree (DT), Support Vector intended to leave a service provider. Retaining one Machine (SVM) in addition to Neural Networks (NN), customer costs an organization from 5 to 10 times Genetic Algorithms (GA) or Fuzzy Logic (FL) to predict than gaining a new one. Predictive models can churners. provide correct identification of possible churners in This paper is organized as follows. Section 2 the near future in order to provide a retention describes the types of churners. Section 3 shows the solution. This paper presents a new prediction model existing prediction models rather than the techniques of based on Data Mining (DM) techniques. The developing a predictive model. Section 4 describes the proposed model is composed of six steps which are; proposed churn prediction model besides the results of identify problem domain, data selection, investigate an implemented case study. Finally; conclusion and data set, classification, clustering and knowledge future work are presented. usage. A data set with 23 attributes and 5000 instances is used. 4000 instances used for training the 2. TYPES OF CHURNERS model and 1000 instances used as a testing set. The As figure 1 depicts; There are two main predicted churners are clustered into 3 categories in categories of churners which are voluntary and case of using in a retention strategy. The data mining involuntary [4]. Involuntary churners are the easiest to techniques used in this paper are Decision Tree, identify. These are the customers that Telco decides to Support Vector Machine and Neural Network remove from subscribers list. Therefore’ this category throughout an open source software name WEKA. includes people that are churned for fraud, non-payment and customers who don’t use the phone. Voluntary Keywords:-Churn prediction, classification, churner is more difficult to determine; it occurs when a clustering, data mining, prediction model customer makes a decision to terminate his/her service with the provider. When people think about Telco churn 1. INTRODUCTION it is usually the voluntary kind that comes to mind. Churn prediction process is a highly debated research area for more than ten years. Researchers from different disciplines have tried to analyze this problem from their own perspectives to figure out a clear understanding and to recommend an effective solution for churners in many business areas. Abbasimehr et al. [1] state that churn prediction is a useful tool to predict customer at churn risk. Conventional churn prediction Figure 1: churn taxonomy techniques have the advantage of being simple and robust with respect to defects in the input data, they Voluntary churn can be sub-divided into two possess serious limitations to the interpretation of main categories, incidental churn and deliberate churn. reasons for churn. Therefore, measuring the Incidental churn occurs, not because the customers effectiveness of a prediction model depends also on how planned on it but because something happened in their well the results can be interpreted for inferring the lives. For example: change in financial condition churn, possible reasons of churn [2]. The purpose of prediction change in location churn, etc. Deliberate churn happens is to anticipate the value that a random variable will for reasons of technology (customers wanting newer or assume in the future or to estimate the likelihood of better technology), economics (price sensitivity), service future events [3]. Most DM techniques derive their quality factors, social or psychological factors, and predictions from the value of a set of variables convenience reasons. Deliberate churn is the problem associated with the entities in a database. DM models that most churn management solutions try to solve [4] may be employed to predict customer churn developed [5]. in many disciplines such as demographic data and/or behavioral data. There are many DM techniques that can be used in classification and clustering customer data to predict churners in the near future. These 693 | P a g e
  • 2. Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.693-697 3. PREDICTIVE MODELS 3.2.2 Fuzzy Logic (FL) Predictive modeling is mainly concerned with FL is a conceptually easy to understand. The predicting how the customer will behave in the future by mathematical concepts behind fuzzy reasoning are very analyzing their past behavior. Predicting customers who simple. Naturalness of the approach makes it preferable are likely to churn is one example of the predictive to the other techniques. FL is flexible, tolerant of modeling [6]. Predictive modeling is used in analyzing imprecise data, and it can model nonlinear functions of Customer Relationship Management (CRM) data arbitrary complexity. It can be blended with and DM to produce customer-level models that describe conventional control techniques. In many cases fuzzy the likelihood that a customer will take a particular systems expends the concept of the conventional control action. The actions are usually sales, marketing and techniques and simplify their implementation. Regarding customer retention related. There are many models that the telecom industry; there is no work achieved related can used to define distinguish between churners and non- to churn prediction using the fuzzy techniques [10]. churners in an organization. These models can be There are a lot of studies have implemented in the area classified into traditional models or techniques (RA and of telecom churn prediction. Summary of latest churn DT) and soft computing techniques (FL and NN) [2]. prediction studies is shown in table 1. Table 1: churn prediction studies 3.1 Traditional techniques Year Author Technique 3.1.1 Decision trees 2001 Datta et al. [11] DT DT is most popular type of predictive model. It 2002 Ping and Tang [12] DT induction has become an important knowledge structure, used for 2003 Au et al. [13] GA the classification of future events [7]. DT usually 2006 Ahn et al. [14] partial defection consists of two main steps, tree building and tree Survival Analysis pruning. The tree-building step consists of recursively 2007 Junxiang [15] Modeling partitioning the training sets according to the values of 2008 Piotr [16] rough-sets the attributes. The partitioning process continues until Seo, and Ranganathan all, or most of the records in each of the partitions 2008 Two-level model [17] contain identical values. Some branches may be removed 2009 Jahromi et al [18] NN and DT because it could consist of noisy data. The pruning step involves selecting and removing the branches containing 2010 Gotovac [4] DT the largest estimated error rate. Tree pruning is known to partial least squares 2011 Lee et al. [19] enhance the predictive accuracy of the decision tree, (PLS) model while reducing the complexity [8]. 2011 Yeshwanth et al [20] Hybrid Learning 3.1.2 Regression Analysis Local Linear Model RA is another popular technique used to deal Fasanghari and 2011 Tree (LOLIMOT) with predicting customer satisfaction it is based on Keramati [21] algorithm. supervised learning models. Regression models deal with a dataset consisting of past observations, for which 4. THE PROPOSED MODEL both the value of the explanatory attributes and the value The proposed model is composed of six steps. of the continuous numerical target variable are known As shown in figure 1, these steps are: identify problem [3]. domain, data selection, investigate data set, 3.2 Soft computing techniques classification, clustering and knowledge usage. As figure 3.2.1 Neural Networks 2 depicts; the classification step produces two types of NN has been successfully used to estimate customers (churners and non-churners) while the intricate non-linear functions. A NN is an analogous data clustering step produces 3 clusters which are used to be processing structure that possesses the ability to learn. evaluated according to the retention strategy in further The concept is loosely based on a biological brain and usage. has successfully been applied to many types of problems, such as classification, control, and prediction [9]. NN is different from DT and other classification techniques because they can provide a prediction with its likelihood. Various neural network approaches have emerged over time, each with varying advantages and disadvantages (Liao et al., 2004), however greater detail into these variances is beyond the scope of this paper. Research suggests that neural networks outperform decision trees and regression models for churn prediction [8]. 694 | P a g e
  • 3. Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.693-697 The attributes for this data set are 23 attributes as shown in table 2. The class of evaluating the status of each instance in the a data set is named churn. Churn attribute is labeled churn=True if a customer left the service provider but it is labeled churn=False if he/she is still continuing with the service provider. churn=True if a customer left the service provider but it is labeled churn=False if he/she is still continuing with the service provider. Table 2: attributes of the data set. Attribute Data Attribute Description Type Customer’s age categorized Age Number into five groups Customer’s gender (1=male, Gender Number 0=female) Customer’s marital status mar_st Number (1=yes, 0=no) M_in_sm mean of in minutes of use from Number _MOU the same service provider Figure 2: The Proposed Churn Prediction model M_out_s mean of out minutes of use The proposed model can produce more than 3 Number m_MOU from the same service provider clusters based on the types of acquired knowledge. M_sm_M mean of all minutes of use Knowledge usage receives the produced clusters for Number OU from the same service provider assign a retaining solution for each type of churners. M_in_oth mean of in minutes of use from Churners can be clustered according to many criteria Number _MOU other service provider such as profitability or dissatisfactory of customers. M_out_ot mean of out minutes of use Number h_MOU from other service provider 4.1 Churn Prediction with the Proposed Model M_oth_ mean of all minutes of use During this case study an open source DM tool Number MOU from other service provider named WEKA [22] is used in addition to a data set of mean of in minutes of use 5000 instances. The data set is obtained from an M_in_M Number either from the same or other anonymous mobile service provider. As figure 2 depicts; OU service provider the data set is divided into a training set and a testing set. mean of out minutes of use The training set is 4000 (80%) instances and the testing M_out_ Number either from the same or other set is 1000 (20%) instances. The training data contains MOU service provider 3200 (80%) instances are labeled non-churners bot the mean of all in and our minutes others are 800 (20%) are labeled churners. MOU Number of use either from the same or other service provider Change in minutes of use from one month to another during chng_M Nominal the time of the experiment OU (decreased, Normal or increased) Figure 2: the distribution of the data set The mean number of messages M_sms Number during the time of the experiment M_M_re Number Mean monthly revenues v Ass_prod Number Associated product (yes or no) Figure 3: Training set Ass_ser Number Associated services (yes or no) M_CC_c Mean number of customer care Number alls calls M_drop_ Nominal Mean number of dropped calls calls Compaint Mean number of complaints Nominal s Figure 4: Testing set no_chng_ Number of changes in tariff Number tif_plan plan 695 | P a g e
  • 4. Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.693-697 max_call The maximum time distance 3 Clusters with 421 instances Number _distance between 2 calls Cluster 0 Cluster 1 Cluster 2 CHURN Nominal Churn=True or False (88) 20.9 (208) (125) 29.7% % 49.4% Age 12 43 25 4.2 Interpretation of Results Gender Female Male Female The result of the classification step is represented in mar_st Single Married Single table 3. M_in_sm_MOU 47 29 23 Table 3: Confusion Matrix of results M_out_sm_MOU 12 45 12 Actual M_sm_MOU 698 43 52 Actual Prediction Class M_in_oth_MOU 41 45 65 Non- Churners M_out_oth_MOU 14 25 28 churners M_oth_MOU 77 52 158 Decision Non- 376/500 124/500 M_in_MOU 50 54 69 Tree churners (75.2%) (14.8%) M_out_MOU 730 142 85 97/500 403/500 Churners MOU 194 232 210 (19.4%) (80.6%) chng_MOU Decreased Decreased Decreased Non- 420/500 80/500 churners (84%) (16%) M_sms 5 25 1 Neural Networks 83/500 417/500 M_M_rev 21.4 112.14 29 Churners Ass_prod NO NO NO (16.6%) (83.4%) Non- 416/500 84/500 Ass_ser YES NO NO Support M_CC_calls 2 6 5 churners (83.2%) (16.8) Vector M_drop_calls 0 0 0 79/500 421/500 Machine Churners Compaints 2 1 0 (15.8%) (84.2%) no_chng_tif_plan 0 2 3 The accuracy and error rate of the predicted max_call_distance 0 0 0 results are shown is table 4. The accuracy rate and error rate are computed as shown in the following equations The output of the clustering step can be used for the Accuracy = (no. of correct / Total no. of predictions) knowledge usage step in order to assign a retention Error rate = (no. of wrong / Total no. of predictions) strategy for a specific cluster or customer. In case of clustering for a specific purpose such as customer’s profitability, dissatisfactory, or cross selling this requires a specific attribute selection from table 5. Table 4: Accuracy and error rates comparison for DT, NN, SVM techniques. 5. CONCLUSION Technique Accuracy Error rate Many churn prediction models and techniques Decision Tree 77.9% 22.1% have been presented to date. However, a simple model is (DT) (779/1000) (221/1000) required to distinguish churners from non- churners then Neural 83.7% 16.3% clustering the resulted churners for providing retention Networks (NN) (837/1000) (163/1000) solutions. In this paper, a simple model based on DM Support Vector 83.7% 16.3% techniques was introduced to help a CRM department to Machine (SVM) (837/1000) (163/1000) keep track its customers and their behavior against churn. A data set of 5000 instances with 23 attributes is The best classification results can be extracted used to train and test the model. Using 3 different from tables 3 and 4 which is found in SVM classification techniques which are DT, SVM, and NN for technique that is the most relevant with NN classification classification and simple K Means techniques for technique for the data set in hand. SVM classification clustering results indicate that the best output for the data technique predicts 421 churners from 500. The predicted set in hand is SVM technique. The next stage of the churners are used to be clustered in the next step. authors’ research will involve performing a deeper During the clustering step the 421 predicted churners are analysis into the customer data to try to establish new used to be clustered using simple K Means algorithm. 22 churn prediction retention model that will use the attributes are used in clustering as shown in table 5. The predicted and clustered data to assign a suitable retention clustering process use 3 types of clusters which one can strategies for each churner type. name Cluster 0 (Low), Cluster 1 (Medium) and Cluster 2 (High) according to the suitable situation of clustering. 6. REFERENCES The 3 resulting clusters can be assigned for profitability, priority for retaining, or dissatisfactory. [1] H. Abbasimehr, M. Setak, M. Tarokh. A Neuro- Table 5: Clustering Output Fuzzy Classifier for Customer Churn Prediction. 696 | P a g e
  • 5. Essam Shaaban, Yehia Helmy, Ayman Khedr, Mona Nasr / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 4, June-July 2012, pp.693-697 International Journal of Computer Applications, Systems with Applications, Elsevier , vol. 23, vol. 19, no. 8, pp. 35-41, April, 2011 no. 2, pp. 103-112, Aug. 2002. [2] V. Lazarov and M. Capota. Churn [13] W. Au, , C. Chan, and X. Yao, A Novel Prediction. Business Analytics Course. TUM Evolutionary Data Mining Algorithm with Computer Science, December 2007. Applications to Churn Prediction, IEEE http://home.in.tum.de/~lazarov/files/research/pap transactions on evolutionary computation, 7, 6, ers/churn-prediction.pdf 532-545, 2003 [3] Carlo Vercellis, Business Intelligence: Data [14] H. Ahn, , P. Hana, and S. Lee, Customer churn Mining and Optimization for Decision Making, analysis: Churn determinants and mediation John Wiley & Sons, Ltd. 2009 ISBN: 978-0-470- effects of partial defection in the Korean mobile 51138-1 telecommunications service industry. [4] S. Gotovac. “Modeling Data Mining Telecommunications Policy. Vol. 30 No.: PP. Applications for Prediction of Prepaid Churn in 552-568, 2006) Telecommunication Services,” vol. 51, no. 3, pp. [15] L. Junxiang. Predicting Customer Churn in the 275-283, 2010 Telecommunications Industry – An Application [5] H. Kim, and C. Yoon, “Determinants of of Survival Analysis Modeling Using SAS. Data subscriber churn and customer loyalty in the mining techniques. SUGI 27. Paper 114, 2007 Korean mobile telephony market.” [16] S. Piotr. Global Perspectives Mobile Operator Telecommunications Policy. Vol. 28 No.: PP. Customer Classification in Churn Analysis. 751-765, 2004. Technical University of Szczecin, Poland SAS [6] M. Hassouna, Agent Based Modelling and Global Forum, pp. 1-5, 2008. Simulation: An Examination of Customer [17] B. Seo, C. Ranganathan, and Y. Babad,. “Two- Retention in the UK Mobile Market. PhD thesis, level model of customer retention in the US Brunel University, UK, 2012. mobile telecommunications service market.” [7] K. Muata, and O. Bryson, Evaluation of Telecommunications Policy. Vol. 32 No.: PP. Decision Trees: AMulti Criteria Approach, 182-196, 2008 Computers and Operational Research, 31, 1933- [18] A. tamaddoni, M. Moeini, I. Akbari, A. 1945, 2004 Akbarzadeh, "A dual-step multi-algorithm [8] W. Au, C. Chan, and X. Yao, A Novel approach for churn prediction in Pre-paid Evolutionary Data Mining Algorithm with telecommunications service providers", the 6th Applications to Churn Prediction, IEEE International Conference on Innovation & transactions on evolutionary computation, 7, 6, Management, SÃO PAULO, Brazil, 2009. 532-545, 2003. [19] H. Lee, Y. Lee, H. Cho, K. Im, and Y. S. Kim, [9] R. Behara, W. Fisher, and J. Lemmink, “Mining churning behaviors and developing Modelling and Evaluating Service Quality retention strategies based on a partial least Measurement Using Neural Networks, squares (PLS) model,” Decision Support International journal of operations and Systems, July 2011. production management, 22, 10, 1162-1185, [20] V. Yeshwanth, V. Vemal Raj, and M. 2002. Sharavanan, “Evolutionary Churn Prediction in [10] Ö. SELVİ, Traffic Accident Predictions Based Mobile Networks Using Hybrid Learning,” On Fuzzy Logic Approach For Safer Urban Proceedings of the Twenty-Fourth International Environments, Case Study: Izmir Metropolitan Florida Artificial Intelligence Research Society Area, PhD thesis, 2009. Conference, pp. 471-476, 2011 [11] P Datta, B. Masand, D. Mani, and B. Li, [21] M. Fasanghari and A. Keramati. Customer Automated Cellular Modeling and Prediction on Churn Prediction Using Local Linear Model a Large Scale, Issues on the application of data Tree for Iranian Telecommunication Companies. mining, 14, 485- 502, 2001 pp. 25-37, July 2011. [12] C. Wei and I. Chiu, “Turning [22] http://www.cs.waikato.ac.nz/ml/weka/ telecommunications call details to churn prediction: a data mining approach,” Expert 697 | P a g e