SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Data Mining
     ADDBASE
What is data mining?
 The    process of extracting valid
  previously unknown, comprehensive,
  and actionable information from large
  databases and using it to make
  crucial business decision
 It    starts   by     developing     a
  representation of simple data. then
  extended to larger sets of data
  working on the premise that the larger
  data has a structure similar to the
Data mining Applications
 It is almost applicable in all areas
  whether it is for business or for
  science.
 Provides    different purpose and
  benefits   depending    where    this
  technique is applied.
Data mining Applications
Retail/Marketing
 Identify buying patterns of customers.
 Finding association among customer
  demographic characteristic.
 Predicting    response to mailing
  campaigns.
 Market basket analysis.
Data mining Applications
Banking
 Detecting patterns of fraudulent credit
  card use.
 Identifying loyal customers.
 Predicting customers likely to change
  their credit card affiliation.
 Determining credit card spending by
  customer groups.
Data mining Applications
Insurance
 Claims analysis.
 Predicting which customers will buy
  new policies.
Medicine
 Characterizing patient behavior to
  predict surgery visit.
 Identifying     successful       medical
  therapies for different illnesses.
Data mining Operations
4 main operations of data mining:
 Predictive modeling
 Database segmentation
 Link analysis
 Deviation detection
Data mining Operations
 Predictive   modeling
    Based observations to form a model of
     the important characteristics of some
     phenomenon.
 Database     segmentation
    Is about partitioning of database into an
     unknown number of segments or
     clusters of similar records.
Data mining Operations
 Link   analysis
    Based on links called associations
     between the individual records and set
     of records in a database.
 Deviation   detection
  Newest data mining operation
  Often a source of true discovery
   because it identifies outliers which
   express deviation.
Data mining Process
 Cross-IndustryStandard Process for
 Data Mining (CRISP-DM)
  Specifies a data of data mining process
   model that is not specific to any industry
   tool.
  Involved from unknown knowledge
   discovery processes used widely in
   industry and in direct response to user
   requirements.
Data mining Process (cont…)
 Major objectives of this specification are
  to make large data mining projects run
  more efficiently as well as to make them
  cheaper, more reliable and more
  manageable.
 A hierarchy process model
Data mining Process (cont…)
 The  process is divided into 6 different
  generic phases ranging from business
  understanding to deployment of
  project result.
 The phases of CRISP-DM model are:
  Business understanding
  Data understanding
  Data preparation
  Modeling
Data mining Process (cont…)
  Evaluation
  Deployment
 Business    understanding
    This phase is focuses on understanding
     the project objectives and requirements
     from the business point of view.
 Data   understanding
    This phase includes task for initial
     collection of the data and is concerned
     with establishing the main characteristics
Data mining Process (cont…)
   Data preparation
       This phase involves all the activities for
        constructing the final data set on which
        modeling tools can be applied directly.
   Modeling
       This phase is the actual data mining
        operation and involves selecting modeling
        techniques, selecting modeling parameters
        and assessing the model created.
Data mining Process (cont…)
   Evaluation
       This phase validates the model from the data
        analysis point of view.
       The model and the steps in modeling are
        verified within the context of achieving the
        business goals.
   Deployment
       This phase is all about generating report or as
        complex as implementing repeatable data
        mining processing across the enterprise.

Weitere ähnliche Inhalte

Was ist angesagt?

Data Mining – analyse Bank Marketing Data Set by WEKA.
Data Mining – analyse Bank Marketing Data Set by WEKA.Data Mining – analyse Bank Marketing Data Set by WEKA.
Data Mining – analyse Bank Marketing Data Set by WEKA.
Mateusz Brzoska
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
Rohit Kumar
 
Data Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data SetData Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data Set
Mateusz Brzoska
 
Data mining by_ashok
Data mining by_ashokData mining by_ashok
Data mining by_ashok
Ashok Kumar
 

Was ist angesagt? (20)

Data Mining & Data Warehousing
Data Mining & Data WarehousingData Mining & Data Warehousing
Data Mining & Data Warehousing
 
Application of data mining
Application of data miningApplication of data mining
Application of data mining
 
Data mining
Data miningData mining
Data mining
 
Datamining
DataminingDatamining
Datamining
 
Unit 4 Advanced Data Analytics
Unit 4 Advanced Data AnalyticsUnit 4 Advanced Data Analytics
Unit 4 Advanced Data Analytics
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data mining
 
Data Mining – analyse Bank Marketing Data Set by WEKA.
Data Mining – analyse Bank Marketing Data Set by WEKA.Data Mining – analyse Bank Marketing Data Set by WEKA.
Data Mining – analyse Bank Marketing Data Set by WEKA.
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
 
Data Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data SetData Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data Set
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Top Data Mining Techniques and Their Applications
Top Data Mining Techniques and Their ApplicationsTop Data Mining Techniques and Their Applications
Top Data Mining Techniques and Their Applications
 
Data mining by_ashok
Data mining by_ashokData mining by_ashok
Data mining by_ashok
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
Data Mining
Data MiningData Mining
Data Mining
 
Artificial Intelligence: Data Mining
Artificial Intelligence: Data MiningArtificial Intelligence: Data Mining
Artificial Intelligence: Data Mining
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
Importance of Data Mining
Importance of Data MiningImportance of Data Mining
Importance of Data Mining
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
 
Data Mining
Data MiningData Mining
Data Mining
 

Ähnlich wie Data mining (prefinals)

Using Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer SegmentationUsing Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer Segmentation
IJERA Editor
 

Ähnlich wie Data mining (prefinals) (20)

Data mining
Data miningData mining
Data mining
 
data analysis-mining
data analysis-miningdata analysis-mining
data analysis-mining
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
 
Using Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer SegmentationUsing Data Mining Techniques in Customer Segmentation
Using Data Mining Techniques in Customer Segmentation
 
Data Mining
Data MiningData Mining
Data Mining
 
What Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdfWhat Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdf
 
Data Mining.pptx
Data Mining.pptxData Mining.pptx
Data Mining.pptx
 
Data mining & data warehousing
Data mining & data warehousingData mining & data warehousing
Data mining & data warehousing
 
KDD assignmnt data.docx
KDD assignmnt data.docxKDD assignmnt data.docx
KDD assignmnt data.docx
 
Data Mining in Telecommunication Industry
Data Mining in Telecommunication IndustryData Mining in Telecommunication Industry
Data Mining in Telecommunication Industry
 
Data Mining
Data MiningData Mining
Data Mining
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
Prescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptxPrescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptx
 
Seminar Presentation
Seminar PresentationSeminar Presentation
Seminar Presentation
 
Data mining
Data miningData mining
Data mining
 
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKINGTHE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
 
Datamining
DataminingDatamining
Datamining
 
Datamining
DataminingDatamining
Datamining
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)
 
Data Mining
Data MiningData Mining
Data Mining
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

Data mining (prefinals)

  • 1. Data Mining ADDBASE
  • 2. What is data mining?  The process of extracting valid previously unknown, comprehensive, and actionable information from large databases and using it to make crucial business decision  It starts by developing a representation of simple data. then extended to larger sets of data working on the premise that the larger data has a structure similar to the
  • 3. Data mining Applications  It is almost applicable in all areas whether it is for business or for science.  Provides different purpose and benefits depending where this technique is applied.
  • 4. Data mining Applications Retail/Marketing  Identify buying patterns of customers.  Finding association among customer demographic characteristic.  Predicting response to mailing campaigns.  Market basket analysis.
  • 5. Data mining Applications Banking  Detecting patterns of fraudulent credit card use.  Identifying loyal customers.  Predicting customers likely to change their credit card affiliation.  Determining credit card spending by customer groups.
  • 6. Data mining Applications Insurance  Claims analysis.  Predicting which customers will buy new policies. Medicine  Characterizing patient behavior to predict surgery visit.  Identifying successful medical therapies for different illnesses.
  • 7. Data mining Operations 4 main operations of data mining:  Predictive modeling  Database segmentation  Link analysis  Deviation detection
  • 8. Data mining Operations  Predictive modeling  Based observations to form a model of the important characteristics of some phenomenon.  Database segmentation  Is about partitioning of database into an unknown number of segments or clusters of similar records.
  • 9. Data mining Operations  Link analysis  Based on links called associations between the individual records and set of records in a database.  Deviation detection  Newest data mining operation  Often a source of true discovery because it identifies outliers which express deviation.
  • 10. Data mining Process  Cross-IndustryStandard Process for Data Mining (CRISP-DM)  Specifies a data of data mining process model that is not specific to any industry tool.  Involved from unknown knowledge discovery processes used widely in industry and in direct response to user requirements.
  • 11. Data mining Process (cont…)  Major objectives of this specification are to make large data mining projects run more efficiently as well as to make them cheaper, more reliable and more manageable.  A hierarchy process model
  • 12. Data mining Process (cont…)  The process is divided into 6 different generic phases ranging from business understanding to deployment of project result.  The phases of CRISP-DM model are:  Business understanding  Data understanding  Data preparation  Modeling
  • 13. Data mining Process (cont…)  Evaluation  Deployment  Business understanding  This phase is focuses on understanding the project objectives and requirements from the business point of view.  Data understanding  This phase includes task for initial collection of the data and is concerned with establishing the main characteristics
  • 14. Data mining Process (cont…)  Data preparation  This phase involves all the activities for constructing the final data set on which modeling tools can be applied directly.  Modeling  This phase is the actual data mining operation and involves selecting modeling techniques, selecting modeling parameters and assessing the model created.
  • 15. Data mining Process (cont…)  Evaluation  This phase validates the model from the data analysis point of view.  The model and the steps in modeling are verified within the context of achieving the business goals.  Deployment  This phase is all about generating report or as complex as implementing repeatable data mining processing across the enterprise.