SlideShare a Scribd company logo
1 of 17
On Predicting and Analyzing Breast Cancer
using Data Mining Approach
Under the Supervision of
Suman Saha
Assistant Professor
Department of CSE
Bangladesh University of Businessand
Technology (BUBT)
Presented by
Md. Masud Rana Basunia
Ismot Ara Pervin
Md. Al Mahmud
Outline
1. Introduction
2. Review of Literature
3. Motivation
4. Proposed Diagram
5. Proposed Methodology
6. Result Analysis
7. Conclusion
Introduction
 A cancer that develops in breast tissue.
 One of the leading cancers for women compared to all other cancer.
 Early detection is the most effective way to reduce breast cancer
deaths.
 Data mining is the process of running powerful classification
techniques that extract useful information from data.
 The techniques enable to create a model that can learn from past
data and detect patterns.
 With a robustly validated classification model, chances of the right
prediction improve.
 It specially helps in interpretation of results for borderline cases.
Review of Literature
 Multi-boost SMO classification technique used for classifying breast
cancer.
 Applied KNN, logistic regression, and multivariate linear regression and
classified tumor type on Wisconsin dataset.
 Used the SPSS Clementine data mining tool and analyzed with various
kernel functions and parameters of the SVM.
 Experimented on breast cancer data using C5 algorithm with bagging
to predict breast cancer survivability.
 Applied the best tree, IBK and SMO to classify tumor type.
Motivation
 The correct designation in determining whether or not the tumor is
benign or malignant is important for saving lives.
 For a doctor, it needs quite a bit of time to classify breast cancer but
using the data mining approach can classify instantly.
 Normally it’s difficult to distinguish certain benign masses from
malignant lesions with mammography, where using the data mining
approach can detect effectively.
 It's also Cost-effective.
 The model could predict the cases within higher accuracy.
Proposed Diagram
Proposed Methodology
 Collecting the dataset.
 Preprocessing the dataset.
 Selecting best features.
 Applying classification technique.
 Evaluating model performance.
Proposed Methodology contd.
Dataset Information
Dataset was obtained from the UCI
Machine Learning Repository.
Dataset having 569 instances with 32
features.
Features are computed from a digitized
image of a fine needle aspirate(FNA) of
a breast mass.
Two classes as Malignant (Cancerous)
and Benign (Non-Cancerous).
Class distribution of Malignant: 212
(37.3%) and Benign: 357 (62.7%)
instances.
Proposed Methodology contd.
Dataset Preprocessing
Converted categorical values as numeric values using Label
Encoding.
Removed outliers using Interquartile Range (IQR) method.
Normalized the dataset using Standard Scaling method.
Proposed Methodology contd.
Feature Selection
Selected 20 top features using the
Univariate Feature Selection method.
The method calculated ch2 score for
each feature using the ch2 formula.
It’s improving the prediction
performance of the predictors.
Provided a faster and more cost-
effective predictors.
Proposed Methodology contd.
Proposed Classification Technique
 Firstly applied five classification techniques and chose four
classifiers in the base of accuracy.
 Then applied Stacking Classifier which is an ensemble method and
has two levels.
 In level 0, applied three classification techniques on the dataset
with 10-fold cross-validation and compute individual output.
 In level 1, applied a meta classifier that combines the previous
output and provides a final output.
Proposed Methodology contd.
Performance Model Evaluation
The evaluation of data mining classification technique’s performance
involves testing the proposed model.
Confusion matrix is used for evaluating the performance.
Different performance evaluation parameters as accuracy, ROC area,
precision, recall, F1 score are applied to evaluate the performance of
the classification technique.
Result Analysis
Decision Boundary for Classification Technique Comparison of Accuracy
Result Analysis contd.
Confusion Matrix of Stacking Classifier Error Rate for Training and Testing set
Result Analysis contd.
Classification Report for Classification Techniques
Conclusion
 The automatic prediction of breast cancer is significant to abate the
propensity against enlarging this disease.
 Data mining classification techniques play a vital role to predict breast
cancer.
 We have presented a comparative study of different classification
techniques for the detection of breast cancer.
 It has been observed that Stacking Classifier had an accuracy of
97.20% to determine benign or malignant tumor.
 The performance of the Stacking Classifier shows a high level compare
with other classifiers.
Thank You

More Related Content

What's hot

IRJET- Detection of Breast Cancer using Machine Learning Techniques
IRJET-  	  Detection of Breast Cancer using Machine Learning TechniquesIRJET-  	  Detection of Breast Cancer using Machine Learning Techniques
IRJET- Detection of Breast Cancer using Machine Learning TechniquesIRJET Journal
 
Breast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian NetworksBreast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian NetworksBayesia USA
 
Breast cancerdetection IE594 Project Report
Breast cancerdetection IE594 Project ReportBreast cancerdetection IE594 Project Report
Breast cancerdetection IE594 Project ReportASHISH MENKUDALE
 
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...IJERA Editor
 
Brain tumor segmentation based on local independent projection based classifi...
Brain tumor segmentation based on local independent projection based classifi...Brain tumor segmentation based on local independent projection based classifi...
Brain tumor segmentation based on local independent projection based classifi...ieeepondy
 
Predict Breast Cancer using Deep Learning
Predict Breast Cancer using Deep LearningPredict Breast Cancer using Deep Learning
Predict Breast Cancer using Deep LearningAyesha Shafique
 
Cancer detection using data mining
Cancer detection using data miningCancer detection using data mining
Cancer detection using data miningRishabhKumar283
 
A New Approach to the Detection of Mammogram Boundary
A New Approach to the Detection of Mammogram Boundary A New Approach to the Detection of Mammogram Boundary
A New Approach to the Detection of Mammogram Boundary IJECEIAES
 
Diagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set TheoryDiagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set TheoryIRJET Journal
 
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
Performance and Evaluation of Data Mining Techniques in Cancer DiagnosisPerformance and Evaluation of Data Mining Techniques in Cancer Diagnosis
Performance and Evaluation of Data Mining Techniques in Cancer DiagnosisIOSR Journals
 
Breast cancer detection using Artificial Neural Network
Breast cancer detection using Artificial Neural NetworkBreast cancer detection using Artificial Neural Network
Breast cancer detection using Artificial Neural NetworkSubroto Biswas
 
A Novel Approach for Breast Cancer Detection using Data Mining Techniques
A Novel Approach for Breast Cancer Detection using Data Mining TechniquesA Novel Approach for Breast Cancer Detection using Data Mining Techniques
A Novel Approach for Breast Cancer Detection using Data Mining Techniquesahmad abdelhafeez
 
Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...cseij
 
Breast cancer Detection using MATLAB
Breast cancer Detection using MATLABBreast cancer Detection using MATLAB
Breast cancer Detection using MATLABNupurRathi7
 
A Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast CancerA Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast Cancerrahulmonikasharma
 
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...ijcsit
 
Ensemble strategies for a medical diagnostic decision support system: A breas...
Ensemble strategies for a medical diagnostic decision support system: A breas...Ensemble strategies for a medical diagnostic decision support system: A breas...
Ensemble strategies for a medical diagnostic decision support system: A breas...dewisetiyana52
 

What's hot (19)

IRJET- Detection of Breast Cancer using Machine Learning Techniques
IRJET-  	  Detection of Breast Cancer using Machine Learning TechniquesIRJET-  	  Detection of Breast Cancer using Machine Learning Techniques
IRJET- Detection of Breast Cancer using Machine Learning Techniques
 
Breast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian NetworksBreast Cancer Diagnostics with Bayesian Networks
Breast Cancer Diagnostics with Bayesian Networks
 
Breast cancerdetection IE594 Project Report
Breast cancerdetection IE594 Project ReportBreast cancerdetection IE594 Project Report
Breast cancerdetection IE594 Project Report
 
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
Possibilistic Fuzzy C Means Algorithm For Mass classificaion In Digital Mammo...
 
Brain tumor segmentation based on local independent projection based classifi...
Brain tumor segmentation based on local independent projection based classifi...Brain tumor segmentation based on local independent projection based classifi...
Brain tumor segmentation based on local independent projection based classifi...
 
Predict Breast Cancer using Deep Learning
Predict Breast Cancer using Deep LearningPredict Breast Cancer using Deep Learning
Predict Breast Cancer using Deep Learning
 
Az4102375381
Az4102375381Az4102375381
Az4102375381
 
Cancer detection using data mining
Cancer detection using data miningCancer detection using data mining
Cancer detection using data mining
 
A New Approach to the Detection of Mammogram Boundary
A New Approach to the Detection of Mammogram Boundary A New Approach to the Detection of Mammogram Boundary
A New Approach to the Detection of Mammogram Boundary
 
01531
0153101531
01531
 
Diagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set TheoryDiagnosis of Cancer using Fuzzy Rough Set Theory
Diagnosis of Cancer using Fuzzy Rough Set Theory
 
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
Performance and Evaluation of Data Mining Techniques in Cancer DiagnosisPerformance and Evaluation of Data Mining Techniques in Cancer Diagnosis
Performance and Evaluation of Data Mining Techniques in Cancer Diagnosis
 
Breast cancer detection using Artificial Neural Network
Breast cancer detection using Artificial Neural NetworkBreast cancer detection using Artificial Neural Network
Breast cancer detection using Artificial Neural Network
 
A Novel Approach for Breast Cancer Detection using Data Mining Techniques
A Novel Approach for Breast Cancer Detection using Data Mining TechniquesA Novel Approach for Breast Cancer Detection using Data Mining Techniques
A Novel Approach for Breast Cancer Detection using Data Mining Techniques
 
Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...Breast cancer diagnosis via data mining performance analysis of seven differe...
Breast cancer diagnosis via data mining performance analysis of seven differe...
 
Breast cancer Detection using MATLAB
Breast cancer Detection using MATLABBreast cancer Detection using MATLAB
Breast cancer Detection using MATLAB
 
A Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast CancerA Comparative Study on the Methods Used for the Detection of Breast Cancer
A Comparative Study on the Methods Used for the Detection of Breast Cancer
 
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
 
Ensemble strategies for a medical diagnostic decision support system: A breas...
Ensemble strategies for a medical diagnostic decision support system: A breas...Ensemble strategies for a medical diagnostic decision support system: A breas...
Ensemble strategies for a medical diagnostic decision support system: A breas...
 

Similar to On Predicting and Analyzing Breast Cancer using Data Mining Approach

Classification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataClassification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataIIRindia
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONijscai
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONijscai
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONijscai
 
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...Kiogyf
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONijscai
 
IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...
IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...
IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...IRJET Journal
 
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...IRJET Journal
 
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...IRJET Journal
 
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...AIRCC Publishing Corporation
 
A Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer DetectionA Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer DetectionIRJET Journal
 
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...IRJET Journal
 
Breast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine LearningBreast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine LearningIRJET Journal
 
Modified fuzzy rough set technique with stacked autoencoder model for magneti...
Modified fuzzy rough set technique with stacked autoencoder model for magneti...Modified fuzzy rough set technique with stacked autoencoder model for magneti...
Modified fuzzy rough set technique with stacked autoencoder model for magneti...IJECEIAES
 
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...Shakas Technologies
 
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...IRJET Journal
 
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...IJTET Journal
 
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUESPREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUESIAEME Publication
 

Similar to On Predicting and Analyzing Breast Cancer using Data Mining Approach (20)

Classification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer DataClassification AlgorithmBased Analysis of Breast Cancer Data
Classification AlgorithmBased Analysis of Breast Cancer Data
 
Comparison of breast cancer classification models on Wisconsin dataset
Comparison of breast cancer classification models on Wisconsin  datasetComparison of breast cancer classification models on Wisconsin  dataset
Comparison of breast cancer classification models on Wisconsin dataset
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
A CLASSIFICATION MODEL ON TUMOR CANCER DISEASE BASED MUTUAL INFORMATION AND F...
 
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTIONSVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
SVM &GA-CLUSTERING BASED FEATURE SELECTION APPROACH FOR BREAST CANCER DETECTION
 
IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...
IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...
IRJET - A Conceptual Method for Breast Tumor Classification using SHAP Values ...
 
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...
Cervical Cancer Detection: An Enhanced Approach through Transfer Learning and...
 
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
 
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
A MODIFIED BINARY PSO BASED FEATURE SELECTION FOR AUTOMATIC LESION DETECTION ...
 
A Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer DetectionA Progressive Review on Early Stage Breast Cancer Detection
A Progressive Review on Early Stage Breast Cancer Detection
 
journals public
journals publicjournals public
journals public
 
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...
A Progressive Review: Early Stage Breast Cancer Detection using Ultrasound Im...
 
Breast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine LearningBreast Cancer Prediction using Machine Learning
Breast Cancer Prediction using Machine Learning
 
Modified fuzzy rough set technique with stacked autoencoder model for magneti...
Modified fuzzy rough set technique with stacked autoencoder model for magneti...Modified fuzzy rough set technique with stacked autoencoder model for magneti...
Modified fuzzy rough set technique with stacked autoencoder model for magneti...
 
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
 
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
A Comprehensive Evaluation of Machine Learning Approaches for Breast Cancer C...
 
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
A Classification of Cancer Diagnostics based on Microarray Gene Expression Pr...
 
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUESPREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
PREDICTION OF BREAST CANCER USING DATA MINING TECHNIQUES
 

Recently uploaded

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGSIVASHANKAR N
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesPrabhanshu Chaturvedi
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Recently uploaded (20)

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

On Predicting and Analyzing Breast Cancer using Data Mining Approach

  • 1. On Predicting and Analyzing Breast Cancer using Data Mining Approach Under the Supervision of Suman Saha Assistant Professor Department of CSE Bangladesh University of Businessand Technology (BUBT) Presented by Md. Masud Rana Basunia Ismot Ara Pervin Md. Al Mahmud
  • 2. Outline 1. Introduction 2. Review of Literature 3. Motivation 4. Proposed Diagram 5. Proposed Methodology 6. Result Analysis 7. Conclusion
  • 3. Introduction  A cancer that develops in breast tissue.  One of the leading cancers for women compared to all other cancer.  Early detection is the most effective way to reduce breast cancer deaths.  Data mining is the process of running powerful classification techniques that extract useful information from data.  The techniques enable to create a model that can learn from past data and detect patterns.  With a robustly validated classification model, chances of the right prediction improve.  It specially helps in interpretation of results for borderline cases.
  • 4. Review of Literature  Multi-boost SMO classification technique used for classifying breast cancer.  Applied KNN, logistic regression, and multivariate linear regression and classified tumor type on Wisconsin dataset.  Used the SPSS Clementine data mining tool and analyzed with various kernel functions and parameters of the SVM.  Experimented on breast cancer data using C5 algorithm with bagging to predict breast cancer survivability.  Applied the best tree, IBK and SMO to classify tumor type.
  • 5. Motivation  The correct designation in determining whether or not the tumor is benign or malignant is important for saving lives.  For a doctor, it needs quite a bit of time to classify breast cancer but using the data mining approach can classify instantly.  Normally it’s difficult to distinguish certain benign masses from malignant lesions with mammography, where using the data mining approach can detect effectively.  It's also Cost-effective.  The model could predict the cases within higher accuracy.
  • 7. Proposed Methodology  Collecting the dataset.  Preprocessing the dataset.  Selecting best features.  Applying classification technique.  Evaluating model performance.
  • 8. Proposed Methodology contd. Dataset Information Dataset was obtained from the UCI Machine Learning Repository. Dataset having 569 instances with 32 features. Features are computed from a digitized image of a fine needle aspirate(FNA) of a breast mass. Two classes as Malignant (Cancerous) and Benign (Non-Cancerous). Class distribution of Malignant: 212 (37.3%) and Benign: 357 (62.7%) instances.
  • 9. Proposed Methodology contd. Dataset Preprocessing Converted categorical values as numeric values using Label Encoding. Removed outliers using Interquartile Range (IQR) method. Normalized the dataset using Standard Scaling method.
  • 10. Proposed Methodology contd. Feature Selection Selected 20 top features using the Univariate Feature Selection method. The method calculated ch2 score for each feature using the ch2 formula. It’s improving the prediction performance of the predictors. Provided a faster and more cost- effective predictors.
  • 11. Proposed Methodology contd. Proposed Classification Technique  Firstly applied five classification techniques and chose four classifiers in the base of accuracy.  Then applied Stacking Classifier which is an ensemble method and has two levels.  In level 0, applied three classification techniques on the dataset with 10-fold cross-validation and compute individual output.  In level 1, applied a meta classifier that combines the previous output and provides a final output.
  • 12. Proposed Methodology contd. Performance Model Evaluation The evaluation of data mining classification technique’s performance involves testing the proposed model. Confusion matrix is used for evaluating the performance. Different performance evaluation parameters as accuracy, ROC area, precision, recall, F1 score are applied to evaluate the performance of the classification technique.
  • 13. Result Analysis Decision Boundary for Classification Technique Comparison of Accuracy
  • 14. Result Analysis contd. Confusion Matrix of Stacking Classifier Error Rate for Training and Testing set
  • 15. Result Analysis contd. Classification Report for Classification Techniques
  • 16. Conclusion  The automatic prediction of breast cancer is significant to abate the propensity against enlarging this disease.  Data mining classification techniques play a vital role to predict breast cancer.  We have presented a comparative study of different classification techniques for the detection of breast cancer.  It has been observed that Stacking Classifier had an accuracy of 97.20% to determine benign or malignant tumor.  The performance of the Stacking Classifier shows a high level compare with other classifiers.