SlideShare ist ein Scribd-Unternehmen logo
1 von 6
Downloaden Sie, um offline zu lesen
INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 
978-1-4673-5788-3/13/$31.00©2013IEEE 
7 
An Empirical Study on applying Data Mining 
Techniques for the Analysis and Prediction 
of Heart Disease 
Sivagowry .S#1, Dr. Durairaj. M*2 and Persia.A#3 
#Research Scholar, School of Computer Science and Engg. 
Bharathidasan University, Trichy, Tamilnadu, India. 
*Assistant Professor, School of Computer Science and Engg. 
Bharathidasan University, Trichy, Tamilnadu, India 
Abstract— The health care environment is found to be rich in 
information, but poor in extracting knowledge from the 
information. This is because of the lack of effective analysis tool 
to discover hidden relationships and trends in them. By applying 
the data mining techniques, valuable knowledge can be extracted 
from the health care system. Heart disease is a group of condition 
affecting the structure and functions of heart and has many root 
causes. Heart disease is the leading cause of death in the world 
over past ten years. Researches have been made with many 
hybrid techniques for diagnosing heart disease. This paper deals 
with an overall review of application of data mining in heart 
disease prediction. 
Index Terms— Data Mining, Decision Tree, Naïve Bayes, 
Classification, Clustering. 
I. INTRODUCTION 
Data Mining is the exploration of large datasets to extract 
hidden and previously unknown patterns, relationships and 
knowledge that are difficult to detect with traditional statistics 
[17]. Data mining techniques are the result of a long process 
of research and product development. Data Mining is divided 
into two tasks such as Predictive Tasks and Descriptive Tasks. 
Predictive Tasks predict the value of a specific attribute based 
on other attribute. Classification, Regression and Deviation 
Deduction come under Predictive Tasks. Descriptive Tasks 
derive pattern that summarize the relationship between data. 
Clustering, Association Rule Mining and Sequential Pattern 
Discovery are coming under Descriptive Tasks.Data Mining 
involves few steps from raw data collection to some form of 
new knowledge. The iterative process consists of following 
steps like Data cleaning, Data Integration, Data Selection, 
Data transformation, Data Mining, Pattern Evaluation, and 
Knowledge the core for Knowledge Discovery Process 
Figure 1: Data Mining is the core of Knowledge Discovery Process 
Medical Data Mining is a domain of challenge which involves 
lot of imprecision and uncertainty. Provision of quality 
services at affordable cost is the major challenge faced in the 
health care organization. Poor clinical decision may lead to 
disastrous consequences. Health care data is massive. Clinical 
decisions are often made based on doctor’s experience rather 
than on the knowledge rich data hidden in the data base. This 
in some cases will result in errors, excessive medical cost 
which affects the quality of service to the patients [33]. 
Medical history data comprises of a number of tests essentials 
to diagnose a particular disease. It is possible to gain the 
advantage of Data mining in health care by employing it as an 
intelligent diagnostic tool. The researchers in the medical field 
identify and predict the disease with the aid of Data mining 
techniques [29]. 
II. HEART DISEASE 
The initial diagnosis of a heart attack is made by a 
combination of clinical symptoms and characteristic 
electrocardiogram (ECG) changes. An ECG is a recording of 
the electrical activity of the heart. Confirmation of a heart 
attack can only be made hours later through detection of 
elevated creatinine phosphokinase (CPK) in the blood. CPK is 
a muscle protein enzyme which is released into the blood 
circulation by dying heart muscles when their surrounding 
dissolves [3]. 
World Health Organization in the year 2003 reported that 
29.2% of total global deaths are due to Cardio Vascular 
Disease (CVD). By the end of this year, CVD is expected to 
be the leading cause for deaths in developing countries due to 
change in life style, work culture and food habits. Hence, 
more careful and efficient methods of cardiac diseases and 
periodic examination are of high importance [1]. 
III. DATA SETS 
The Data set is taken from Data mining repository of 
University of California, Irvine (UCI). Data set from 
Cleveland data set, Hungary Data set, Switzerland Data set, 
long beach and Statlog data set are collected. Cleveland, 
Hungary, Switzerland and va long beach data set contains 76 
attributes in all. But only 14 attributes are used. Among all 
those Cleveland data set and Statlog data set are the most
INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 
978-1-4673-5788-3/13/$31.00©2013IEEE 
2 
commonly used data set. Because all the other has missing 
values. The Figure 2 shows some sample of data set collected 
from the UCI repository. 
Figure 2: Sample Data Set 
Attributes Used 
No Name Description 
1 Age Age in Years 
2 Sex 1=male, 0=female 
3 cp Chest pain type(1 = 
typical angina, 2 
=atypical angina, 3 = 
non-anginal pain, 4 = 
asymptomatic) 
4 trestbps Resting blood sugar(in 
mm Hg on admission 
to hospital) 
5 chol Serum cholesterol in 
mg/dl 
6 fbs Fasting blood 
sugar>120 mg/dl(1= 
true, 0=false) 
7 restecg Resting 
electrocardiographic 
results(0 = normal, 1 = 
having ST-T wave 
abnormality, 2 = left 
ventricular 
hypertrophy) 
8 thalach Maximum heart rate 
9 exang Exercise induced 
angina 
10 oldpeak ST depression induced 
by exercise relative to 
rest 
11 slope Slope of the peak 
exercise ST segment 
(1=upsloping, 2=flat, 
3= downsloping) 
12 Ca Number of major 
vessels colored by 
fluoroscopy 
13 thal 3= normal, 6=fixed 
defect, 7= reversible 
defect 
14 num Class(0=healthy, 
1=have heart disease) 
IV. DATA MINING TECHNIQUES USED IN HEART 
DISEASE PREDICTION 
Data mining techniques such as clustering, Classification, 
Regression and Association Rule mining are used in 
extracting knowledge from database. Medical data are mined 
by using the techniques above and the diagnosis of disease is 
carried out. Application of Data mining technique in medical 
data is as explained below: 
Data Mining and Association Rules 
An Association Rule is an implication expression of the form 
;ҏҏY, where X and Y are disjoint sets (i.e) XŀY = Ø. The 
strength of Association Rule can be measured in terms of 
support and confidence. Support determines how often a rule 
is applicable to a given data set. Confidence determines how 
frequently the item set is used. 
Support (XҏҏY) = P (XŀY) 
Confidence (XҏҏY) = P (XŀY)/P(X). 
Minimum support and minimum confidence are the two 
thresh hold in Association Rule mining. Association Rule 
Mining is divided into two tasks like frequent item set 
generation and Rule generation. 
Carlos Ordonez and et al.,[7] used Association rules to 
predict the heart disease. They used a simple mapping 
algorithm. This algorithm uniformly treats attributes as 
numerical or categorical. This is used to transform medical 
records to a transaction format. An improved algorithm is 
used to mine the constrained association rules. A mapping 
table is constructed and attribute values are mapped to items. 
The Decision tree is found incapable in mining data because 
they automatically split numerical values [7]. But in medical 
fields the standard cutoffs are in the numerical format only. 
The split point chosen by the Decision tree are of little use 
only. And interpreting the experimental results using Decision 
tree is very difficult. Clustering was useful to have a global 
understanding of data. A constrained version of clustering 
focusing on projection of data could be useful, but it deserves 
further research. So all these factors justify the use of 
Association Rule in mining medical data. 
Deepika [11] have used Pruning-Classification Association 
Rule (PCAR), for mining Association Rule. 
PCAR comes from analyzing and considering of Apriori 
algorithm. PCAR deletes minimum frequency item with 
minimum frequency item sets. It deletes infrequent item from 
item sets. Then classifies item sets based on frequency of item 
sets and discovers frequent item sets. The below Figure 3 
shows how the PCAR algorithm works [11].
INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 
978-1-4673-5788-3/13/$31.00©2013IEEE 
3 
Figure 3: Procedure of PCAR Algorithm 
B. Data Mining and Classification 
Usha Rani [38] has implemented artificial neural network in 
heart disease database using feed forward and back 
propagation algorithm. The experiment is conducted by 
considering single and multilayered neural network models. 
Parallelism is implemented at each neuron in all hidden and 
output layer to speed up the learning process. 13 neurons are 
set for input, each neuron corresponds to each attributes. 
Neural network provides satisfactory results for classification 
task. 
Training 
Samples 
Test 
Samples 
Classification 
Efficiency 
Single 
Layer 
Multi 
Layer 
100 300 76% 82% 
150 200 79.4% 83% 
250 150 86.2% 89.3% 
350 100 90.6% 94% 
Table 1: Experimental Results of Heart Disease Data set [37] 
In Asha Rajkumar [3] , the data mining Classification is 
based on supervised machine learning Algorithm. Tangara 
tool is used to classify the data and evaluated using 10 fold 
cross validation. The performance analysis of Naïve Bayes, K-nn, 
Decision List Algorithm is based on accuracy and time 
taken to build the model. Naïve bayes algorithm is considered 
to be better since it takes only some to calculate the accuracy 
than other algorithm. And also it resulted in lower error rates. 
Naïve Bayes algorithm gives 52.23% of accurate result. The 
below Table 2 shows the performance study of the algorithm. 
Table 2: Performance study of the Algorithm [3] 
In [24] R. Setthukkarasi,, proposed a novel neuro fuzzy 
technique to diagnose the severity of the disease from the set 
of the patient records. A generalized database is constructed 
for decision making from the reduced attributes set obtained 
through genetic algorithm. A four layered fuzzy neural 
network for efficient modeling and reasoning with temporal 
dependencies under uncertainity is used. A Decision Support 
Subsystem is constructed by extracting rules from the 
temporal patterns retrieved from the relationship between 
trained data set. Radial Basic Function (RBF) neural network 
is constructed with five input nodes, training and 
normalization in hidden layer and output layer with one output 
node. RBF is used to train large amount of data with very few 
inputs. Data preprocessing by genetic algorithm, generalized 
database generation, Dataset training by RBF fuzzy neural 
network, Rule generation, Query processing and Severity 
based diagnosis are the steps followed for predicting the heart 
disease. The Figure 4 shows the logical view of Rule 
generation Process [24]. 
Figure 4: Logical view of Rule Generation Process 
Rafiah Awang and Sellapan Palaniappan [25, 26] developed a 
prototype called Intelligent Heart Disease Prediction System 
(IHDPS) using data mining technique namely Decision Trees, 
Naïve Bayes and Neural Network. Each technique has its own 
strength in realizing the objective of data mining. Data Mining 
Extension (DMX), a SQL based query language is for 
building and accessing the models’ contents. IHDPS answers 
complex “what if” queries where decision support system 
cannot. The most common modeling objective used in Data 
mining is Classification and Prediction. Decision Tree and 
Neural network come under Classification and Regression, 
Association Rules and Clustering [26] comes under 
prediction. Naïve bayes is the most effective in heart disease 
prediction as it has the highest percentage of correct 
predictions (86.53%). Five Data mining rules are defined and 
evaluated using the three techniques namely Decision Trees, 
Naïve Bayes and Neural Network. 
Anbarasi.M and et Al., [1] has used Genetic Algorithm (GA) 
to determine the attributes for the diagnosis of heart disease. 
Future extraction is done using GA. Attributes number is 
reduced as 6. The Table 3 shows the reduced number of 
attributes using GA [1]. 
Algorithm 
used 
Accuracy Time 
taken 
Naïve 
Bayes 
52.33% 609ms 
Decision 
List 
52% 719ms 
KNN 45.67% 1000ms
INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 
978-1-4673-5788-3/13/$31.00©2013IEEE 
4 
Predictable attribute: 
Diagnosis 
Value Healthy: No heart disease 
Value Sick: Has Heart disease 
Reduced Input attributes: 
1.Type - Chest Pain Type 
2.Rbp - Resting blood pressure 
3.Eia - Exercise induced angina 
4.Oldpk - Old peak 
5.Vsl - No. of vessels colored 
6.Thal -Maximum heart rate 
achieved 
Table 3: Reduced Attributes list using GA [1] 
Three Classifiers are used for comparing the performance in 
predicting the heart disease. The classifiers taken for study are 
Naïve Bayes, Classification by Clustering and Decision Tree. 
Decision tree outperforms but it takes more time to build the 
model. Naïve bayes perform consistently before and after 
reduction of attributes. Classification via Clustering is very 
poor in its performance. Weka tool is used in evaluation the 
performance. 
Shantakumar B. Patil and et al.,[30, 31 ] clustered the data 
warehouse using K-means clustering algorithm after pre 
processing the data. Maximal Frequent Item Set Algorithm 
(MAFIA) is employed for the extraction of association rules 
from the clustered dataset besides performing efficiently when 
the database consists of very long item sets specifically. 
Multilayer perception Network (MLPNN) and Back 
propagation is used as training algorithm. 
Pseudo code for MAFIA [29]: 
MAFIA(C, MFI, Boolean IsHUT) { 
name HUT = C.head C.tail; 
if HUT is in MFI 
stop generation of children and return 
Count all children, use PEP to trim the tail, and 
recorder by increasing support, 
For each item i in C, trimmed_tail { 
IsHUT = whether i is the first item in the tail 
newNode = C I 
MAFIA (newNode, MFI, IsHUT)} 
if (IsHUT and all extensions are frequent) 
Stop search and go back up subtree 
If (C is a leaf and C.head is not in MFI) 
Add C.head toMFI 
} 
In Application of Data Mining Techniques in Health Care and 
Prediction of Heart Attack [34] One Dependency Augmented 
Naïve Bayes Classifier [17] (ODANB) and Naïve Credal 
Classifier 2 (NCC2) is used. The objective of this work is to 
examine the potential use of Classification based Data Mining 
techniques like Naïve Bayes, Rule based, Decision Tree and 
Artificial Neural Network. For the prediction of heart disease 
Naïve Bayes found to be the best when compared with 
ODANB. Text Mining can also be used in future to mine the 
vast amount of unstructured data available in health care 
databases. 
In [35], Subbulakshmi and et al., used Naïve Bayes for 
predicting Decision Support in heart disease prediction 
System (DSHDPS). It can serve as a training tool for nurses 
and medical students in diagnosing the heart disease. Naïve 
Bayes is found to be the best in heart disease prediction. It is 
used to create models with predictive capability. It provides 
new ways of exploring and understanding the data. Naïve 
Bayes is used to get efficient output. 
Enter 
Patient 
Record 
Naïve 
Bayes 
Data set 
Calculates 
probability of each 
Calculates yes or no 
probability 
Tells about 
risk 
Fig 5: Implementation of Naïve Bayes Algorithm on the patient data [35] 
In Naïve Bayesian Classification Approach in Health Care 
application, Bhuvaneshwari. R [6] proposed that Naïve Bayes 
Classification can be used as a best decision support system. 
In [15], Kavitha K.S used hybridization to train neural 
network using Genetic Algorithm. In this Feed- forward 
neural network and Back propagation is used as a learning 
algorithm. Chen A.H [10] used Artificial Neural Network 
(ANN) algorithm for classifying the heart disease based on 
input. Learning Vector Quantization (LVQ) is a prototype 
based supervised Classification Algorithm. Accuracy is 80%, 
Sensitivity is 85% and Specificity is 70%. To prove 
goodness, ROC Curve is also displayed. The below Figure 6 
shows the computational steps of LAV model. 
Figure 6: The Computational Steps of LAV Model 
In [9] Chaitrali S. Dangare and Sulabha have added 2 more 
attributes such as obesity and smoking to the existing 13
INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 
978-1-4673-5788-3/13/$31.00©2013IEEE 
5 
attributes. Decision tree, Naïve Bayes and Artificial Neural 
Network has been tested with both 13 and 15 attributes. In 
both the case, the neural network showed better performance. 
The Figure 7 shows the Graphical Representation of Accuracy 
for each method. 
Figure 7: Graphical Representation of Accuracy for each method [9] 
Clustering, Time Series and Association Rule can also be 
used for Data Mining Classification. Text mining can be used 
to mine huge amount of unstructured data in future. 
In [20], Milan Kumari compares RIPPER, SVM, Decision 
Tree and Artificial Neural Network based on Sensitivity, 
Specificity, Accuracy, Error Rate, True Positive Rate and 
False Positive Rate. SVM predicts with least error rate and 
highest accuracy when compared with other techniques [19]. 
Future work is predicted to done using Meta models. In [23], 
Feed forward back propagation neural network is used as a 
classifier to distinguish between infected or non-infected 
person in both the cases. Neural network tool box from 
Matlab 7.9 is used to evaluate the performance of the 
proposed network. 
Levenburg-Marquardt back propagation algorithm is used to 
train the network. Neural network can be used for identifying 
the patients suffering Heart disease. 
A Novel Approach for Heart Disease diagnosis using Data 
Mining and Fuzzy Logic [21] reduces the number of attributes 
to reduce the number of tests for the patients. Decision Tree 
and Naïve Bayes combined with Fuzzy logic outplayed other 
Data Mining techniques in predicting Heart disease. In [32], 
Shouman and et. Al., used K-Nearest Neighbor (K-NN) in 
classification problem. Integrating voting with K-NN cannot 
enhance its accuracy in diagnosing the heart disease. When 
voting is applied to Decision Tree, different decision rules are 
extracted from each subset, which extract new knowledge 
leads to increasing accuracy in Decision Tree. But in K-NN 
no new knowledge is extracted, just distance between clusters 
in measured. Applying K-NN achieved an accuracy of 97.4% 
without voting. But voting in K-NN results in decrease in 
accuracy. 
C.Data Mining and Clustering 
In [4], Bala Sundar and et. al., used K-means Clustering 
Algorithm for the prediction of the heart disease. It is a 
method of cluster analysis which aims to partition ‘n’ 
observations to ‘k’ clusters. Euclidaen distance formula is 
used to minimize the sum of square of distance between data. 
Naïve Bayes is found to slow and Neural Network takes 
number of iterations in predicting the disease when compared 
using time and accuracy. 
Figure 8: Exponential accuracy of k-means with existing techniques 
The Figure 8 explains the exponential accuracy of k-means 
with existing techniques [4]. Santhi and et. Al., [28] proposes 
the performance of clustering and classification algorithm 
using heart disease data. Performance of classifiers of Bayes, 
functions, Lazy, Metammulti Boost AB, Rules, Trees, etc., are 
calculated. The performance of classification is calculated 
using cross validation test mode and clusers by mode of 
classes to evaluate clusters. Data preprocessing is done by 
attribute subset selection algorithm. Final result showed that 
Naïve Bayes tress has highest prediction accuracy than 
clustering Algorithm. 
V. CONCLUSION 
Heart disease prediction is a major challenge in the health 
care industry. Instead of going for a number of tests, 
predicting heart disease with less number of attributes is a 
challenging task in Data Mining. Existing literature shows that 
Classification task in Data Mining plays a vital role in heart 
disease prediction when compared with Clustering, 
Association Rule and Regression. In Classification Decision 
Tree outperforms in some cases, where Neural Network and 
Naïve Bayes outperforms in some other cases. Each technique 
has its own merits and demerits. When combined with each 
other or with Fuzzy logic, Heart disease prediction with Data 
Mining techniques will become most successful with less 
number of attributes. Text mining the medical data is another 
extension found in predicting the health care data. 
REFERENCES 
[1] Anbarasi.M, Anupriya and Iyengar “Enhanced Prediction of Heart 
Disease with Feature Subset Selection using Genetic Algorithm”,
INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 
978-1-4673-5788-3/13/$31.00©2013IEEE 
6 
International Journal of Engineering and Technology, Vol 2(10), 2010, 
pp 5370-5376. 
[2] Annoj P.K.,” Clinical decision support system: Risk level prediction of 
heart disease using Data Mining Algorithms”, Journal of King Saud 
University- Computer and Information Sciences, 2012,pp 27-40. 
[3] Asha Rajkumar and Mrs. Sophia Reena, “ Diagnosis of Heart Disease 
using Data Mining Algorithms, Global Journal of Computer Science and 
Technology, vol. 10(10), 2010, pp 38-43. 
[4] Bala Sundar V, “Development of Data Clustering Algorithm for 
predicting Heart”, IJCA, Vol 48(7), June 2012, pp 8-13. 
[5] Bhagyashree Ambulkar and Vaishali Borkar “Data Mining in Cloud 
Computing”, MPGINMC, Recent Trends in Computing, ISSN 0975- 
8887,2012, pp 23-26. 
[6] Bhuvaneswari. R, “Naïve Bayesian Classification Approach in Health 
Care Application”, International Journal of Computer Science and 
Telecommunication, vol 3(1), Jan 2012, pp 106-112. 
[7] Carlos Ordonez, Edward Omincenski and Levien de Braal “Mining 
Constraint Association Rules to Predict Heart Disease”, Proceeding of 
2001, IEEE International Conference of Data Mining, IEEE Computer 
Society, ISBN-0-7695-1119-8, 2001, pp: 433-440. 
[8] Cengiz colak.M , Cemiz colak and Hasan Kocatruk “Predicting 
coronary artery disease using different artificial neural network 
models”, CAD and Artificial neural network, pp 249-254, 2008. 
[9] Chaltrali S. Dangare and Sulabha, “Improved Study of Heart Disease 
Prediction System using Data Mining Classification Techniques”, 
IJCA, Vol 47(10), pp 44-48, June 2012. 
[10] Chen A.H., “HDPS: Heart Disease Prediction System”, Computing in 
Cardiology, ISSN 0276-6574, pp 557-560, 2011. 
[11] Deepika. N, “Association Rule for Classification of Heart Attack 
patients”, IJAEST, Vol 11(2), pp 253-257, 2011. 
[12] Jabbar M.A., “Knowledge discovery from mining association rules for 
Heart disease Prediction”, JATIT, Vol 41(2), pp 166-174, 2012. 
[13] Jyothi Soni, Uzma ansari and Dipesh Ansari “Intelligent and Effective 
Heart Disease Prediction System using Weighted Associate Classifer”, 
IJCSE, Vol 3(6), pp 2385-2392, June 2011. 
[14] K.Rajeswari, “Prediction of Risk Score for Heart Disease in India using 
Machine Intelligence”,IPCSIT, Vol 4, 2011. 
[15] Kavitha K.S, “Modeling and designing of evolutionary neural network 
for heart disease prediction”, IJCSI, Vol 7(5), pp 272-283, September 
2010. 
[16] Latha Parthiban and R.Subramanian, “Intelligent Heart Disease 
Prediction System using CANFIS and Genetic Algorithm”, 
International Journal of Biological and Life Sciences, Vol 3(3), pp157- 
160,2007. 
[17] Liangxiao. J, Harry.Z, Zhihua.C and Jiang.S “One Dependency 
Augmented Naïve Bayes”, ADMA, pp 186-194, 2005. 
[18] Mia Shouman, “Using data mining techniques in heart disease diagnosis 
and treatment”, 978-1-4673-0483-2, Japan-Egypt Conference on 
Electronics, Communications and Computers, pp 189-193, 2012. 
[19] Milan Kumari and Sunila Godara, “Review of Data Mining 
Classification Model in Cardio Vascular Disease diagnosis”, IJCA, 
2011. 
[20] Milan Kumari and Sunila Godara, “Comparative Study of Data Mining 
Classification Methods in Cardio-Vascular Disease Prediction”, IJCST, 
Vol 2(2), June 2011. 
[21] Nidhi Bhatia and Kiran Jyothi, “A Novel Approach for heart disease 
diagnosis using Data Mining and Fuzzy logic”, IJCA, Vol 54(17), pp 
16-21, September 2012. 
[22] Nithya N.S, Sarumathi. S and Dr. Duraisamy. K “ Assessment of the 
risk factors of Heart Attack using frequent feature Selection Method”, 
International Journal of Communications and Enggineering, Vol 1(1), 
ISSN 0988-0382, pp 127-133, March 2012. 
[23] Qeethara Kadhim Al. Shayea, “Artificial neural network in Medical 
Diagnosis”, IJCSI, Vol 3(2), March 2011. 
[24] R. Setthukkarase and Kannan “An Intelligent System for mining 
Temporal rules in Clinical database using Fuzzy neural 
network”,European Journal of Scientific Research, ISSN 1450-216, Vol 
70(3), pp 386-395, 2012. 
[25] Rafiah Awang and Palaniappan. S “Intelligent Heart Disease Prediction 
System Using Data Mining techniques”, IJCSNS, Vol 8(8), pp 343-350, 
Aug 2008. 
[26] Rafiah Awang and Palaniappan. S “Web based Heart Disease Decision 
Support System using Data Mining Classification Modeling techniques” 
, Proceedings of iiWAS, pp 177-187, 2007. 
[27] Raghu. D.Dr, “Probability Based Heart Disease Prediction using Data 
Mining Techniques”, IJCST, Vol 2(4), pp 66-68, Dec 2011. 
[28] Santhi. P, “Improving the performance of Data Mining Algorithm in 
Health Care data”, IJCST, Vol 2(3), 2011. 
[29] Setiawan N.A, “ Rule Selection for Coronary Artery Disease Diagnosis 
Based on Rough Set” ,International Journal of Recent Trends in 
Engineering, Vol 2(5), pp 198-202, Dec 2009. 
[30] Shantakumar B.Patil, “Intelligent and Effective Heart Attack Prediction 
System using Data Mining and Artifical Neural Network”, European 
Journal of Scientific Research, Vol 31(4), pp 642-656, 2009. 
[31] Shanthakumar B. Patil, “Extraction of Significant patterns from Heart 
Disease Ware Houses for Heart Attack Prediction”, IJCSNS, Vol 9(2), 
pp 228-235, Feb 2009. 
[32] Shouman.M, Turner.T and Stocker.R, “Applying K-Nearest Neighbour 
in diagnosing Heart Disease Patients”, International Journal of 
Information and Education Technology, Vol 2(3), June 2012. 
[33] Siri Krishnan Wasan, Vasutha Bhatnagar and Harleen Kaur “The Impact 
of Data Mining techniques on medical diagnostics”, Data Science 
Journal, Vol 5(19), pp 119-126, October 2006. 
[34] Srinivas, Kavitha Rani and Dr. Govarthan, “Application of Data Mining 
Techniques in Health Care and Prediction of Heart Attack”, IJCSE, Vol 
2(2), pp 250-255, 2010. 
[35] Subbulakshmi, Ramesh and Chinna Rao “Decision Support in Heart 
Disease Prediction System using Naïve Bayes”, IJCSE, ISSN 0976- 
5166, Vol 2(2), May 2011. 
[36] Sudha.A, Gayathri.p and Jaishankar. N “Utilization of Data Mining 
Approaches for prediction of life Threatening Disease Survivability”, 
IJAC (0975-8887), Vol 14(17), March 2012. 
[37] Jyothi. S, Ujma.A, Dipesh. S and Sunita. S “Predictive Data Mining for 
Medical Diagnosis : An Overview of Heart Disease Prediction”, IJCA, 
Vol 17(8), pp 43-48, March 2011. 
[38] Usha. K Dr, “Analysis of Heart Disease Dataset using Neural network 
approach”, IJDKP, Vol 1(5), Sep 2011.

Weitere ähnliche Inhalte

Was ist angesagt?

Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction System
ijtsrd
 

Was ist angesagt? (20)

Heart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine LearningHeart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine Learning
 
Health care analytics
Health care analyticsHealth care analytics
Health care analytics
 
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHMHEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
HEART DISEASE PREDICTION USING NAIVE BAYES ALGORITHM
 
A Heart Disease Prediction Model using Decision Tree
A Heart Disease Prediction Model using Decision TreeA Heart Disease Prediction Model using Decision Tree
A Heart Disease Prediction Model using Decision Tree
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
 
IRJET - Effective Heart Disease Prediction using Distinct Machine Learning Te...
IRJET - Effective Heart Disease Prediction using Distinct Machine Learning Te...IRJET - Effective Heart Disease Prediction using Distinct Machine Learning Te...
IRJET - Effective Heart Disease Prediction using Distinct Machine Learning Te...
 
DISEASE PREDICTION SYSTEM USING DATA MINING
DISEASE PREDICTION SYSTEM USING  DATA MININGDISEASE PREDICTION SYSTEM USING  DATA MINING
DISEASE PREDICTION SYSTEM USING DATA MINING
 
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning AlgorithmsIRJET- Diabetes Diagnosis using Machine Learning Algorithms
IRJET- Diabetes Diagnosis using Machine Learning Algorithms
 
Detection of heart diseases by data mining
Detection of heart diseases by data miningDetection of heart diseases by data mining
Detection of heart diseases by data mining
 
Diabetes prediction using different machine learning approaches
Diabetes prediction using different machine learning approachesDiabetes prediction using different machine learning approaches
Diabetes prediction using different machine learning approaches
 
PPT.pptx
PPT.pptxPPT.pptx
PPT.pptx
 
Heart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means ClassifierHeart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means Classifier
 
Android Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction SystemAndroid Based Questionnaires Application for Heart Disease Prediction System
Android Based Questionnaires Application for Heart Disease Prediction System
 
PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES
PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUESPREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES
PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES
 
IRJET- Diabetes Prediction using Machine Learning
IRJET- Diabetes Prediction using Machine LearningIRJET- Diabetes Prediction using Machine Learning
IRJET- Diabetes Prediction using Machine Learning
 
Machine Learning for Disease Prediction
Machine Learning for Disease PredictionMachine Learning for Disease Prediction
Machine Learning for Disease Prediction
 
Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
Disease Prediction by Machine Learning Over Big Data From Healthcare CommunitiesDisease Prediction by Machine Learning Over Big Data From Healthcare Communities
Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
 
Chronic Kidney Disease Prediction
Chronic Kidney Disease PredictionChronic Kidney Disease Prediction
Chronic Kidney Disease Prediction
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
 
HPPS: Heart Problem Prediction System using Machine Learning
HPPS: Heart Problem Prediction System using Machine LearningHPPS: Heart Problem Prediction System using Machine Learning
HPPS: Heart Problem Prediction System using Machine Learning
 

Andere mochten auch

1.PPT (1.PREDICTION OF DISEASES New)
1.PPT (1.PREDICTION OF DISEASES New)1.PPT (1.PREDICTION OF DISEASES New)
1.PPT (1.PREDICTION OF DISEASES New)
Jashvant Shah
 
Data mining seminar report
Data mining seminar reportData mining seminar report
Data mining seminar report
mayurik19
 
Hybrid Technique for Associative Classification of Heart Diseases
Hybrid Technique for Associative Classification of Heart DiseasesHybrid Technique for Associative Classification of Heart Diseases
Hybrid Technique for Associative Classification of Heart Diseases
Jagdeep Singh Malhi
 
Linear Regression Parameters
Linear Regression ParametersLinear Regression Parameters
Linear Regression Parameters
camposer
 
DATA MINING WITH WEKA
DATA MINING WITH WEKADATA MINING WITH WEKA
DATA MINING WITH WEKA
Shubham Gupta
 
Machine Learning with WEKA
Machine Learning with WEKAMachine Learning with WEKA
Machine Learning with WEKA
butest
 
Discovering knowledge using web structure mining
Discovering knowledge using web structure miningDiscovering knowledge using web structure mining
Discovering knowledge using web structure mining
Atul Khanna
 

Andere mochten auch (16)

Psdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heartPsdot 14 using data mining techniques in heart
Psdot 14 using data mining techniques in heart
 
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan PhdSMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
SMART HEALTH PREDICTION USING DATA MINING by Dr.Mahboob Khan Phd
 
Health Prediction System - an Artificial Intelligence Project 2015
Health Prediction System - an Artificial Intelligence Project 2015Health Prediction System - an Artificial Intelligence Project 2015
Health Prediction System - an Artificial Intelligence Project 2015
 
1.PPT (1.PREDICTION OF DISEASES New)
1.PPT (1.PREDICTION OF DISEASES New)1.PPT (1.PREDICTION OF DISEASES New)
1.PPT (1.PREDICTION OF DISEASES New)
 
Data mining seminar report
Data mining seminar reportData mining seminar report
Data mining seminar report
 
Secure shell protocol
Secure shell protocolSecure shell protocol
Secure shell protocol
 
Hybrid Technique for Associative Classification of Heart Diseases
Hybrid Technique for Associative Classification of Heart DiseasesHybrid Technique for Associative Classification of Heart Diseases
Hybrid Technique for Associative Classification of Heart Diseases
 
Secure Shell - a Presentation on Ethical Hacking
Secure Shell - a Presentation on Ethical HackingSecure Shell - a Presentation on Ethical Hacking
Secure Shell - a Presentation on Ethical Hacking
 
Linear Regression Parameters
Linear Regression ParametersLinear Regression Parameters
Linear Regression Parameters
 
DATA MINING WITH WEKA
DATA MINING WITH WEKADATA MINING WITH WEKA
DATA MINING WITH WEKA
 
Machine Learning with WEKA
Machine Learning with WEKAMachine Learning with WEKA
Machine Learning with WEKA
 
Secure SHell
Secure SHellSecure SHell
Secure SHell
 
Discovering knowledge using web structure mining
Discovering knowledge using web structure miningDiscovering knowledge using web structure mining
Discovering knowledge using web structure mining
 
Wireless heartattack detector
Wireless heartattack detectorWireless heartattack detector
Wireless heartattack detector
 
Secure Shell(ssh)
Secure Shell(ssh)Secure Shell(ssh)
Secure Shell(ssh)
 
SSH
SSHSSH
SSH
 

Ähnlich wie Survey on data mining techniques in heart disease prediction

A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
IAEME Publication
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
IAEME Publication
 
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
BRNSSPublicationHubI
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...
mlaij
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
mlaij
 
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-applicationAcute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Cemal Ardil
 
Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...
Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...
Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...
BASMAJUMAASALEHALMOH
 
javed_prethesis2608 on predcition of heart disease
javed_prethesis2608 on predcition of heart diseasejaved_prethesis2608 on predcition of heart disease
javed_prethesis2608 on predcition of heart disease
javed75
 

Ähnlich wie Survey on data mining techniques in heart disease prediction (20)

IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
IRJET -Improving the Accuracy of the Heart Disease Prediction using Hybrid Ma...
 
Heart failure prediction based on random forest algorithm using genetic algo...
Heart failure prediction based on random forest algorithm  using genetic algo...Heart failure prediction based on random forest algorithm  using genetic algo...
Heart failure prediction based on random forest algorithm using genetic algo...
 
Prediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A ReviewPrediction of Heart Disease Using Data Mining Techniques- A Review
Prediction of Heart Disease Using Data Mining Techniques- A Review
 
Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...Mining of medical data to identify risk factors of heart disease using freque...
Mining of medical data to identify risk factors of heart disease using freque...
 
A Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction TechniquesA Survey on Heart Disease Prediction Techniques
A Survey on Heart Disease Prediction Techniques
 
Comparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease PredictionComparing Data Mining Techniques used for Heart Disease Prediction
Comparing Data Mining Techniques used for Heart Disease Prediction
 
IRJET- The Prediction of Heart Disease using Naive Bayes Classifier
IRJET- The Prediction of Heart Disease using Naive Bayes ClassifierIRJET- The Prediction of Heart Disease using Naive Bayes Classifier
IRJET- The Prediction of Heart Disease using Naive Bayes Classifier
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
A data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networksA data mining approach for prediction of heart disease using neural networks
A data mining approach for prediction of heart disease using neural networks
 
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
Performance Evaluation of Data Mining Algorithm on Electronic Health Record o...
 
An Ill-identified Classification to Predict Cardiac Disease Using Data Cluste...
An Ill-identified Classification to Predict Cardiac Disease Using Data Cluste...An Ill-identified Classification to Predict Cardiac Disease Using Data Cluste...
An Ill-identified Classification to Predict Cardiac Disease Using Data Cluste...
 
Prediction of heart disease using neural network
Prediction of heart disease using neural networkPrediction of heart disease using neural network
Prediction of heart disease using neural network
 
Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...Ascendable Clarification for Coronary Illness Prediction using Classification...
Ascendable Clarification for Coronary Illness Prediction using Classification...
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...EVALUATING THE ACCURACY OF CLASSIFICATION  ALGORITHMS FOR DETECTING HEART DIS...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DIS...
 
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
EVALUATING THE ACCURACY OF CLASSIFICATION ALGORITHMS FOR DETECTING HEART DISE...
 
Heart Failure Prediction using Different Machine Learning Techniques
Heart Failure Prediction using Different Machine Learning TechniquesHeart Failure Prediction using Different Machine Learning Techniques
Heart Failure Prediction using Different Machine Learning Techniques
 
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-applicationAcute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
 
Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...
Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...
Deep Spectral Time‑Variant Feature Analytic Model for Cardiac Disease Predict...
 
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET-  	  Role of Different Data Mining Techniques for Predicting Heart DiseaseIRJET-  	  Role of Different Data Mining Techniques for Predicting Heart Disease
IRJET- Role of Different Data Mining Techniques for Predicting Heart Disease
 
javed_prethesis2608 on predcition of heart disease
javed_prethesis2608 on predcition of heart diseasejaved_prethesis2608 on predcition of heart disease
javed_prethesis2608 on predcition of heart disease
 

Kürzlich hochgeladen

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Kürzlich hochgeladen (20)

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 

Survey on data mining techniques in heart disease prediction

  • 1. INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 978-1-4673-5788-3/13/$31.00©2013IEEE 7 An Empirical Study on applying Data Mining Techniques for the Analysis and Prediction of Heart Disease Sivagowry .S#1, Dr. Durairaj. M*2 and Persia.A#3 #Research Scholar, School of Computer Science and Engg. Bharathidasan University, Trichy, Tamilnadu, India. *Assistant Professor, School of Computer Science and Engg. Bharathidasan University, Trichy, Tamilnadu, India Abstract— The health care environment is found to be rich in information, but poor in extracting knowledge from the information. This is because of the lack of effective analysis tool to discover hidden relationships and trends in them. By applying the data mining techniques, valuable knowledge can be extracted from the health care system. Heart disease is a group of condition affecting the structure and functions of heart and has many root causes. Heart disease is the leading cause of death in the world over past ten years. Researches have been made with many hybrid techniques for diagnosing heart disease. This paper deals with an overall review of application of data mining in heart disease prediction. Index Terms— Data Mining, Decision Tree, Naïve Bayes, Classification, Clustering. I. INTRODUCTION Data Mining is the exploration of large datasets to extract hidden and previously unknown patterns, relationships and knowledge that are difficult to detect with traditional statistics [17]. Data mining techniques are the result of a long process of research and product development. Data Mining is divided into two tasks such as Predictive Tasks and Descriptive Tasks. Predictive Tasks predict the value of a specific attribute based on other attribute. Classification, Regression and Deviation Deduction come under Predictive Tasks. Descriptive Tasks derive pattern that summarize the relationship between data. Clustering, Association Rule Mining and Sequential Pattern Discovery are coming under Descriptive Tasks.Data Mining involves few steps from raw data collection to some form of new knowledge. The iterative process consists of following steps like Data cleaning, Data Integration, Data Selection, Data transformation, Data Mining, Pattern Evaluation, and Knowledge the core for Knowledge Discovery Process Figure 1: Data Mining is the core of Knowledge Discovery Process Medical Data Mining is a domain of challenge which involves lot of imprecision and uncertainty. Provision of quality services at affordable cost is the major challenge faced in the health care organization. Poor clinical decision may lead to disastrous consequences. Health care data is massive. Clinical decisions are often made based on doctor’s experience rather than on the knowledge rich data hidden in the data base. This in some cases will result in errors, excessive medical cost which affects the quality of service to the patients [33]. Medical history data comprises of a number of tests essentials to diagnose a particular disease. It is possible to gain the advantage of Data mining in health care by employing it as an intelligent diagnostic tool. The researchers in the medical field identify and predict the disease with the aid of Data mining techniques [29]. II. HEART DISEASE The initial diagnosis of a heart attack is made by a combination of clinical symptoms and characteristic electrocardiogram (ECG) changes. An ECG is a recording of the electrical activity of the heart. Confirmation of a heart attack can only be made hours later through detection of elevated creatinine phosphokinase (CPK) in the blood. CPK is a muscle protein enzyme which is released into the blood circulation by dying heart muscles when their surrounding dissolves [3]. World Health Organization in the year 2003 reported that 29.2% of total global deaths are due to Cardio Vascular Disease (CVD). By the end of this year, CVD is expected to be the leading cause for deaths in developing countries due to change in life style, work culture and food habits. Hence, more careful and efficient methods of cardiac diseases and periodic examination are of high importance [1]. III. DATA SETS The Data set is taken from Data mining repository of University of California, Irvine (UCI). Data set from Cleveland data set, Hungary Data set, Switzerland Data set, long beach and Statlog data set are collected. Cleveland, Hungary, Switzerland and va long beach data set contains 76 attributes in all. But only 14 attributes are used. Among all those Cleveland data set and Statlog data set are the most
  • 2. INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 978-1-4673-5788-3/13/$31.00©2013IEEE 2 commonly used data set. Because all the other has missing values. The Figure 2 shows some sample of data set collected from the UCI repository. Figure 2: Sample Data Set Attributes Used No Name Description 1 Age Age in Years 2 Sex 1=male, 0=female 3 cp Chest pain type(1 = typical angina, 2 =atypical angina, 3 = non-anginal pain, 4 = asymptomatic) 4 trestbps Resting blood sugar(in mm Hg on admission to hospital) 5 chol Serum cholesterol in mg/dl 6 fbs Fasting blood sugar>120 mg/dl(1= true, 0=false) 7 restecg Resting electrocardiographic results(0 = normal, 1 = having ST-T wave abnormality, 2 = left ventricular hypertrophy) 8 thalach Maximum heart rate 9 exang Exercise induced angina 10 oldpeak ST depression induced by exercise relative to rest 11 slope Slope of the peak exercise ST segment (1=upsloping, 2=flat, 3= downsloping) 12 Ca Number of major vessels colored by fluoroscopy 13 thal 3= normal, 6=fixed defect, 7= reversible defect 14 num Class(0=healthy, 1=have heart disease) IV. DATA MINING TECHNIQUES USED IN HEART DISEASE PREDICTION Data mining techniques such as clustering, Classification, Regression and Association Rule mining are used in extracting knowledge from database. Medical data are mined by using the techniques above and the diagnosis of disease is carried out. Application of Data mining technique in medical data is as explained below: Data Mining and Association Rules An Association Rule is an implication expression of the form ;ҏҏY, where X and Y are disjoint sets (i.e) XŀY = Ø. The strength of Association Rule can be measured in terms of support and confidence. Support determines how often a rule is applicable to a given data set. Confidence determines how frequently the item set is used. Support (XҏҏY) = P (XŀY) Confidence (XҏҏY) = P (XŀY)/P(X). Minimum support and minimum confidence are the two thresh hold in Association Rule mining. Association Rule Mining is divided into two tasks like frequent item set generation and Rule generation. Carlos Ordonez and et al.,[7] used Association rules to predict the heart disease. They used a simple mapping algorithm. This algorithm uniformly treats attributes as numerical or categorical. This is used to transform medical records to a transaction format. An improved algorithm is used to mine the constrained association rules. A mapping table is constructed and attribute values are mapped to items. The Decision tree is found incapable in mining data because they automatically split numerical values [7]. But in medical fields the standard cutoffs are in the numerical format only. The split point chosen by the Decision tree are of little use only. And interpreting the experimental results using Decision tree is very difficult. Clustering was useful to have a global understanding of data. A constrained version of clustering focusing on projection of data could be useful, but it deserves further research. So all these factors justify the use of Association Rule in mining medical data. Deepika [11] have used Pruning-Classification Association Rule (PCAR), for mining Association Rule. PCAR comes from analyzing and considering of Apriori algorithm. PCAR deletes minimum frequency item with minimum frequency item sets. It deletes infrequent item from item sets. Then classifies item sets based on frequency of item sets and discovers frequent item sets. The below Figure 3 shows how the PCAR algorithm works [11].
  • 3. INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 978-1-4673-5788-3/13/$31.00©2013IEEE 3 Figure 3: Procedure of PCAR Algorithm B. Data Mining and Classification Usha Rani [38] has implemented artificial neural network in heart disease database using feed forward and back propagation algorithm. The experiment is conducted by considering single and multilayered neural network models. Parallelism is implemented at each neuron in all hidden and output layer to speed up the learning process. 13 neurons are set for input, each neuron corresponds to each attributes. Neural network provides satisfactory results for classification task. Training Samples Test Samples Classification Efficiency Single Layer Multi Layer 100 300 76% 82% 150 200 79.4% 83% 250 150 86.2% 89.3% 350 100 90.6% 94% Table 1: Experimental Results of Heart Disease Data set [37] In Asha Rajkumar [3] , the data mining Classification is based on supervised machine learning Algorithm. Tangara tool is used to classify the data and evaluated using 10 fold cross validation. The performance analysis of Naïve Bayes, K-nn, Decision List Algorithm is based on accuracy and time taken to build the model. Naïve bayes algorithm is considered to be better since it takes only some to calculate the accuracy than other algorithm. And also it resulted in lower error rates. Naïve Bayes algorithm gives 52.23% of accurate result. The below Table 2 shows the performance study of the algorithm. Table 2: Performance study of the Algorithm [3] In [24] R. Setthukkarasi,, proposed a novel neuro fuzzy technique to diagnose the severity of the disease from the set of the patient records. A generalized database is constructed for decision making from the reduced attributes set obtained through genetic algorithm. A four layered fuzzy neural network for efficient modeling and reasoning with temporal dependencies under uncertainity is used. A Decision Support Subsystem is constructed by extracting rules from the temporal patterns retrieved from the relationship between trained data set. Radial Basic Function (RBF) neural network is constructed with five input nodes, training and normalization in hidden layer and output layer with one output node. RBF is used to train large amount of data with very few inputs. Data preprocessing by genetic algorithm, generalized database generation, Dataset training by RBF fuzzy neural network, Rule generation, Query processing and Severity based diagnosis are the steps followed for predicting the heart disease. The Figure 4 shows the logical view of Rule generation Process [24]. Figure 4: Logical view of Rule Generation Process Rafiah Awang and Sellapan Palaniappan [25, 26] developed a prototype called Intelligent Heart Disease Prediction System (IHDPS) using data mining technique namely Decision Trees, Naïve Bayes and Neural Network. Each technique has its own strength in realizing the objective of data mining. Data Mining Extension (DMX), a SQL based query language is for building and accessing the models’ contents. IHDPS answers complex “what if” queries where decision support system cannot. The most common modeling objective used in Data mining is Classification and Prediction. Decision Tree and Neural network come under Classification and Regression, Association Rules and Clustering [26] comes under prediction. Naïve bayes is the most effective in heart disease prediction as it has the highest percentage of correct predictions (86.53%). Five Data mining rules are defined and evaluated using the three techniques namely Decision Trees, Naïve Bayes and Neural Network. Anbarasi.M and et Al., [1] has used Genetic Algorithm (GA) to determine the attributes for the diagnosis of heart disease. Future extraction is done using GA. Attributes number is reduced as 6. The Table 3 shows the reduced number of attributes using GA [1]. Algorithm used Accuracy Time taken Naïve Bayes 52.33% 609ms Decision List 52% 719ms KNN 45.67% 1000ms
  • 4. INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 978-1-4673-5788-3/13/$31.00©2013IEEE 4 Predictable attribute: Diagnosis Value Healthy: No heart disease Value Sick: Has Heart disease Reduced Input attributes: 1.Type - Chest Pain Type 2.Rbp - Resting blood pressure 3.Eia - Exercise induced angina 4.Oldpk - Old peak 5.Vsl - No. of vessels colored 6.Thal -Maximum heart rate achieved Table 3: Reduced Attributes list using GA [1] Three Classifiers are used for comparing the performance in predicting the heart disease. The classifiers taken for study are Naïve Bayes, Classification by Clustering and Decision Tree. Decision tree outperforms but it takes more time to build the model. Naïve bayes perform consistently before and after reduction of attributes. Classification via Clustering is very poor in its performance. Weka tool is used in evaluation the performance. Shantakumar B. Patil and et al.,[30, 31 ] clustered the data warehouse using K-means clustering algorithm after pre processing the data. Maximal Frequent Item Set Algorithm (MAFIA) is employed for the extraction of association rules from the clustered dataset besides performing efficiently when the database consists of very long item sets specifically. Multilayer perception Network (MLPNN) and Back propagation is used as training algorithm. Pseudo code for MAFIA [29]: MAFIA(C, MFI, Boolean IsHUT) { name HUT = C.head C.tail; if HUT is in MFI stop generation of children and return Count all children, use PEP to trim the tail, and recorder by increasing support, For each item i in C, trimmed_tail { IsHUT = whether i is the first item in the tail newNode = C I MAFIA (newNode, MFI, IsHUT)} if (IsHUT and all extensions are frequent) Stop search and go back up subtree If (C is a leaf and C.head is not in MFI) Add C.head toMFI } In Application of Data Mining Techniques in Health Care and Prediction of Heart Attack [34] One Dependency Augmented Naïve Bayes Classifier [17] (ODANB) and Naïve Credal Classifier 2 (NCC2) is used. The objective of this work is to examine the potential use of Classification based Data Mining techniques like Naïve Bayes, Rule based, Decision Tree and Artificial Neural Network. For the prediction of heart disease Naïve Bayes found to be the best when compared with ODANB. Text Mining can also be used in future to mine the vast amount of unstructured data available in health care databases. In [35], Subbulakshmi and et al., used Naïve Bayes for predicting Decision Support in heart disease prediction System (DSHDPS). It can serve as a training tool for nurses and medical students in diagnosing the heart disease. Naïve Bayes is found to be the best in heart disease prediction. It is used to create models with predictive capability. It provides new ways of exploring and understanding the data. Naïve Bayes is used to get efficient output. Enter Patient Record Naïve Bayes Data set Calculates probability of each Calculates yes or no probability Tells about risk Fig 5: Implementation of Naïve Bayes Algorithm on the patient data [35] In Naïve Bayesian Classification Approach in Health Care application, Bhuvaneshwari. R [6] proposed that Naïve Bayes Classification can be used as a best decision support system. In [15], Kavitha K.S used hybridization to train neural network using Genetic Algorithm. In this Feed- forward neural network and Back propagation is used as a learning algorithm. Chen A.H [10] used Artificial Neural Network (ANN) algorithm for classifying the heart disease based on input. Learning Vector Quantization (LVQ) is a prototype based supervised Classification Algorithm. Accuracy is 80%, Sensitivity is 85% and Specificity is 70%. To prove goodness, ROC Curve is also displayed. The below Figure 6 shows the computational steps of LAV model. Figure 6: The Computational Steps of LAV Model In [9] Chaitrali S. Dangare and Sulabha have added 2 more attributes such as obesity and smoking to the existing 13
  • 5. INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 978-1-4673-5788-3/13/$31.00©2013IEEE 5 attributes. Decision tree, Naïve Bayes and Artificial Neural Network has been tested with both 13 and 15 attributes. In both the case, the neural network showed better performance. The Figure 7 shows the Graphical Representation of Accuracy for each method. Figure 7: Graphical Representation of Accuracy for each method [9] Clustering, Time Series and Association Rule can also be used for Data Mining Classification. Text mining can be used to mine huge amount of unstructured data in future. In [20], Milan Kumari compares RIPPER, SVM, Decision Tree and Artificial Neural Network based on Sensitivity, Specificity, Accuracy, Error Rate, True Positive Rate and False Positive Rate. SVM predicts with least error rate and highest accuracy when compared with other techniques [19]. Future work is predicted to done using Meta models. In [23], Feed forward back propagation neural network is used as a classifier to distinguish between infected or non-infected person in both the cases. Neural network tool box from Matlab 7.9 is used to evaluate the performance of the proposed network. Levenburg-Marquardt back propagation algorithm is used to train the network. Neural network can be used for identifying the patients suffering Heart disease. A Novel Approach for Heart Disease diagnosis using Data Mining and Fuzzy Logic [21] reduces the number of attributes to reduce the number of tests for the patients. Decision Tree and Naïve Bayes combined with Fuzzy logic outplayed other Data Mining techniques in predicting Heart disease. In [32], Shouman and et. Al., used K-Nearest Neighbor (K-NN) in classification problem. Integrating voting with K-NN cannot enhance its accuracy in diagnosing the heart disease. When voting is applied to Decision Tree, different decision rules are extracted from each subset, which extract new knowledge leads to increasing accuracy in Decision Tree. But in K-NN no new knowledge is extracted, just distance between clusters in measured. Applying K-NN achieved an accuracy of 97.4% without voting. But voting in K-NN results in decrease in accuracy. C.Data Mining and Clustering In [4], Bala Sundar and et. al., used K-means Clustering Algorithm for the prediction of the heart disease. It is a method of cluster analysis which aims to partition ‘n’ observations to ‘k’ clusters. Euclidaen distance formula is used to minimize the sum of square of distance between data. Naïve Bayes is found to slow and Neural Network takes number of iterations in predicting the disease when compared using time and accuracy. Figure 8: Exponential accuracy of k-means with existing techniques The Figure 8 explains the exponential accuracy of k-means with existing techniques [4]. Santhi and et. Al., [28] proposes the performance of clustering and classification algorithm using heart disease data. Performance of classifiers of Bayes, functions, Lazy, Metammulti Boost AB, Rules, Trees, etc., are calculated. The performance of classification is calculated using cross validation test mode and clusers by mode of classes to evaluate clusters. Data preprocessing is done by attribute subset selection algorithm. Final result showed that Naïve Bayes tress has highest prediction accuracy than clustering Algorithm. V. CONCLUSION Heart disease prediction is a major challenge in the health care industry. Instead of going for a number of tests, predicting heart disease with less number of attributes is a challenging task in Data Mining. Existing literature shows that Classification task in Data Mining plays a vital role in heart disease prediction when compared with Clustering, Association Rule and Regression. In Classification Decision Tree outperforms in some cases, where Neural Network and Naïve Bayes outperforms in some other cases. Each technique has its own merits and demerits. When combined with each other or with Fuzzy logic, Heart disease prediction with Data Mining techniques will become most successful with less number of attributes. Text mining the medical data is another extension found in predicting the health care data. REFERENCES [1] Anbarasi.M, Anupriya and Iyengar “Enhanced Prediction of Heart Disease with Feature Subset Selection using Genetic Algorithm”,
  • 6. INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS ICICES 2013 978-1-4673-5788-3/13/$31.00©2013IEEE 6 International Journal of Engineering and Technology, Vol 2(10), 2010, pp 5370-5376. [2] Annoj P.K.,” Clinical decision support system: Risk level prediction of heart disease using Data Mining Algorithms”, Journal of King Saud University- Computer and Information Sciences, 2012,pp 27-40. [3] Asha Rajkumar and Mrs. Sophia Reena, “ Diagnosis of Heart Disease using Data Mining Algorithms, Global Journal of Computer Science and Technology, vol. 10(10), 2010, pp 38-43. [4] Bala Sundar V, “Development of Data Clustering Algorithm for predicting Heart”, IJCA, Vol 48(7), June 2012, pp 8-13. [5] Bhagyashree Ambulkar and Vaishali Borkar “Data Mining in Cloud Computing”, MPGINMC, Recent Trends in Computing, ISSN 0975- 8887,2012, pp 23-26. [6] Bhuvaneswari. R, “Naïve Bayesian Classification Approach in Health Care Application”, International Journal of Computer Science and Telecommunication, vol 3(1), Jan 2012, pp 106-112. [7] Carlos Ordonez, Edward Omincenski and Levien de Braal “Mining Constraint Association Rules to Predict Heart Disease”, Proceeding of 2001, IEEE International Conference of Data Mining, IEEE Computer Society, ISBN-0-7695-1119-8, 2001, pp: 433-440. [8] Cengiz colak.M , Cemiz colak and Hasan Kocatruk “Predicting coronary artery disease using different artificial neural network models”, CAD and Artificial neural network, pp 249-254, 2008. [9] Chaltrali S. Dangare and Sulabha, “Improved Study of Heart Disease Prediction System using Data Mining Classification Techniques”, IJCA, Vol 47(10), pp 44-48, June 2012. [10] Chen A.H., “HDPS: Heart Disease Prediction System”, Computing in Cardiology, ISSN 0276-6574, pp 557-560, 2011. [11] Deepika. N, “Association Rule for Classification of Heart Attack patients”, IJAEST, Vol 11(2), pp 253-257, 2011. [12] Jabbar M.A., “Knowledge discovery from mining association rules for Heart disease Prediction”, JATIT, Vol 41(2), pp 166-174, 2012. [13] Jyothi Soni, Uzma ansari and Dipesh Ansari “Intelligent and Effective Heart Disease Prediction System using Weighted Associate Classifer”, IJCSE, Vol 3(6), pp 2385-2392, June 2011. [14] K.Rajeswari, “Prediction of Risk Score for Heart Disease in India using Machine Intelligence”,IPCSIT, Vol 4, 2011. [15] Kavitha K.S, “Modeling and designing of evolutionary neural network for heart disease prediction”, IJCSI, Vol 7(5), pp 272-283, September 2010. [16] Latha Parthiban and R.Subramanian, “Intelligent Heart Disease Prediction System using CANFIS and Genetic Algorithm”, International Journal of Biological and Life Sciences, Vol 3(3), pp157- 160,2007. [17] Liangxiao. J, Harry.Z, Zhihua.C and Jiang.S “One Dependency Augmented Naïve Bayes”, ADMA, pp 186-194, 2005. [18] Mia Shouman, “Using data mining techniques in heart disease diagnosis and treatment”, 978-1-4673-0483-2, Japan-Egypt Conference on Electronics, Communications and Computers, pp 189-193, 2012. [19] Milan Kumari and Sunila Godara, “Review of Data Mining Classification Model in Cardio Vascular Disease diagnosis”, IJCA, 2011. [20] Milan Kumari and Sunila Godara, “Comparative Study of Data Mining Classification Methods in Cardio-Vascular Disease Prediction”, IJCST, Vol 2(2), June 2011. [21] Nidhi Bhatia and Kiran Jyothi, “A Novel Approach for heart disease diagnosis using Data Mining and Fuzzy logic”, IJCA, Vol 54(17), pp 16-21, September 2012. [22] Nithya N.S, Sarumathi. S and Dr. Duraisamy. K “ Assessment of the risk factors of Heart Attack using frequent feature Selection Method”, International Journal of Communications and Enggineering, Vol 1(1), ISSN 0988-0382, pp 127-133, March 2012. [23] Qeethara Kadhim Al. Shayea, “Artificial neural network in Medical Diagnosis”, IJCSI, Vol 3(2), March 2011. [24] R. Setthukkarase and Kannan “An Intelligent System for mining Temporal rules in Clinical database using Fuzzy neural network”,European Journal of Scientific Research, ISSN 1450-216, Vol 70(3), pp 386-395, 2012. [25] Rafiah Awang and Palaniappan. S “Intelligent Heart Disease Prediction System Using Data Mining techniques”, IJCSNS, Vol 8(8), pp 343-350, Aug 2008. [26] Rafiah Awang and Palaniappan. S “Web based Heart Disease Decision Support System using Data Mining Classification Modeling techniques” , Proceedings of iiWAS, pp 177-187, 2007. [27] Raghu. D.Dr, “Probability Based Heart Disease Prediction using Data Mining Techniques”, IJCST, Vol 2(4), pp 66-68, Dec 2011. [28] Santhi. P, “Improving the performance of Data Mining Algorithm in Health Care data”, IJCST, Vol 2(3), 2011. [29] Setiawan N.A, “ Rule Selection for Coronary Artery Disease Diagnosis Based on Rough Set” ,International Journal of Recent Trends in Engineering, Vol 2(5), pp 198-202, Dec 2009. [30] Shantakumar B.Patil, “Intelligent and Effective Heart Attack Prediction System using Data Mining and Artifical Neural Network”, European Journal of Scientific Research, Vol 31(4), pp 642-656, 2009. [31] Shanthakumar B. Patil, “Extraction of Significant patterns from Heart Disease Ware Houses for Heart Attack Prediction”, IJCSNS, Vol 9(2), pp 228-235, Feb 2009. [32] Shouman.M, Turner.T and Stocker.R, “Applying K-Nearest Neighbour in diagnosing Heart Disease Patients”, International Journal of Information and Education Technology, Vol 2(3), June 2012. [33] Siri Krishnan Wasan, Vasutha Bhatnagar and Harleen Kaur “The Impact of Data Mining techniques on medical diagnostics”, Data Science Journal, Vol 5(19), pp 119-126, October 2006. [34] Srinivas, Kavitha Rani and Dr. Govarthan, “Application of Data Mining Techniques in Health Care and Prediction of Heart Attack”, IJCSE, Vol 2(2), pp 250-255, 2010. [35] Subbulakshmi, Ramesh and Chinna Rao “Decision Support in Heart Disease Prediction System using Naïve Bayes”, IJCSE, ISSN 0976- 5166, Vol 2(2), May 2011. [36] Sudha.A, Gayathri.p and Jaishankar. N “Utilization of Data Mining Approaches for prediction of life Threatening Disease Survivability”, IJAC (0975-8887), Vol 14(17), March 2012. [37] Jyothi. S, Ujma.A, Dipesh. S and Sunita. S “Predictive Data Mining for Medical Diagnosis : An Overview of Heart Disease Prediction”, IJCA, Vol 17(8), pp 43-48, March 2011. [38] Usha. K Dr, “Analysis of Heart Disease Dataset using Neural network approach”, IJDKP, Vol 1(5), Sep 2011.