SlideShare ist ein Scribd-Unternehmen logo
1 von 5
Downloaden Sie, um offline zu lesen
VihangShah
Data mining
Introduction
Data mining is a process of retrieving data from huge database. Data mining is
automatically searching large data to discover patterns and trends that is different from
simple analysis. Data mining is also known as Knowledge Discovery in Data (KDD).
Data mining Process
Problem Definition
Problem definition in this stage the need of project, objective of project and
requirements are defined and from that the basic plan should be implement on primary
level.
Problem
Defination
Data Gathering
& Preparation
Model building
& Evaluation
Knowledge
Deployment
VihangShah
Data gathering & Preparation
As you know in earlier phase you collect all requirements in this phase the additional data or
some data be omitted for further phases. This is also a time to identify data quality problem.
In short data preparation can significantly improve the information that can be discovered
through data mining. The outcome of the data preparation is final data set.
Once the data sources are identified, they need to be selected, cleaned, constructed and
formatted into the desired form.
Model Building and evaluation
In this phase selection and apply various modeling techniques for retrieving optimal values.
The test will be generated to validate the quality and validity of the model. One or more
model are created and run on the prepared dataset.
Knowledge deployment
The knowledge or information which we gain from data mining process need to present in
such a way that it will be use when we need knowledge or information. In this phase the
plans for deployment, maintenance and monitoring have to be created for implementation
and also future supports.
What can data mining do and Not Do?
Do:-
 Data mining can help to find pattern and relationships within your data.
 Data mining help you to discover hidden information in your data.
 Data mining actually give optimize result from huge databases.
 Data mining can help you to analyze the data for future use.
VihangShah
Not Do:-
 Data mining cannot work automatically.
 Data mining cannot give you information about value of the information to your
organization.
 Data mining does not eliminate the need to know your business, to understand your
data.
Data Mining Technique
Data mining have basically six different techniques and that are Association, classification,
clustering, prediction, sequential pattern and decision tree.
Association
Association basically works on relation between items that why it also called relation
technique. It is used in marketing analysis to identify a set of customer’s frequently
purchase together.
Retailers are using association technique to research customer’s buying habits. Based on
historical sale data, retailers might found out that customers buy bread they also buy butter.
Classification
Classification is used to classify each item into predefined set of data or group. For example:
- We can apply classification in application that gives all records of employees who left the
company, predict who will probably leave the company in a future period.
Clustering
In clustering the classes are defined and the objects are put in each class, while in
classification technique object are assigned into predefined classes.
For example:- Consider book management in library there is wide range of book that having
a different topic. So now reader must have easy searching facility of books that having same
topics so for that we make a cluster that can keep books that have some kind of similarities
in one cluster or one shelf and label it with a meaningful name.
VihangShah
Prediction
Prediction is technique that predicts relationship between independent variable and
relationship between dependent and independent variables.
For instance the prediction technique can be used in sales to predict profit for the future if
we consider sale is an independent variable, profit could be a dependent variable.
Sequential Patterns
This technique seeks to discover or identity similar patterns, regular events or trends in
transaction data over a business period.
Decision Tree
It is most used technique of data mining because it is easy to understand. In this the root of
decision tree is a simple question or condition that has a multiple answers.
Each answer leads to a set of questions or conditions that help us determine the data.
Note: - we often combine two or more data mining techniques together to form an
appropriate process that meets the business needs.
Data mining Applications
 Data mining help in marketing such as it will used for analysis to provide information
on what product together, when they were bought and in what sequence and it will
also help to find customer’s behavior.
 Data mining help in banking/finance sector such as it will used to identify customer
loyalty by analyzing the data of customer’s purchasing activities and it will also help
retain credit card customers.
 Data mining help in health care and insurance sector such as it will analysis the
claims which medical procedures are claimed together and it will also forecasts
which customer will potentially purchase new policies.
NOTE: - Data mining is also used to analyze the data in many sectors.
VihangShah

Weitere ähnliche Inhalte

Was ist angesagt?

DSO528GroupProject-PortugueseBank
DSO528GroupProject-PortugueseBankDSO528GroupProject-PortugueseBank
DSO528GroupProject-PortugueseBank
Eric Esajian
 
Data mining financial services
Data mining financial servicesData mining financial services
Data mining financial services
Hprentice
 

Was ist angesagt? (20)

Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Data mining concepts
Data mining conceptsData mining concepts
Data mining concepts
 
Importance of Data Mining
Importance of Data MiningImportance of Data Mining
Importance of Data Mining
 
Bank market classification
Bank market classificationBank market classification
Bank market classification
 
KETL Quick guide to data analytics
KETL Quick guide to data analytics KETL Quick guide to data analytics
KETL Quick guide to data analytics
 
Application areas of data mining
Application areas of data miningApplication areas of data mining
Application areas of data mining
 
Data mining
Data miningData mining
Data mining
 
DSO528GroupProject-PortugueseBank
DSO528GroupProject-PortugueseBankDSO528GroupProject-PortugueseBank
DSO528GroupProject-PortugueseBank
 
Data analytics
Data analyticsData analytics
Data analytics
 
Key Principles Of Data Mining
Key Principles Of Data MiningKey Principles Of Data Mining
Key Principles Of Data Mining
 
Unit ii data analytics
Unit ii data analytics Unit ii data analytics
Unit ii data analytics
 
Teaching Descriptive Analytics, Customer Profiling and Clustering
Teaching Descriptive Analytics, Customer Profiling and ClusteringTeaching Descriptive Analytics, Customer Profiling and Clustering
Teaching Descriptive Analytics, Customer Profiling and Clustering
 
Data Mining: Application and trends in data mining
Data Mining: Application and trends in data miningData Mining: Application and trends in data mining
Data Mining: Application and trends in data mining
 
Knowledge Discovery and Data Mining
Knowledge Discovery and Data MiningKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining
 
Business analytics
Business analyticsBusiness analytics
Business analytics
 
Data mining financial services
Data mining financial servicesData mining financial services
Data mining financial services
 
Data Mining Techniques
Data Mining TechniquesData Mining Techniques
Data Mining Techniques
 
Data Mining and Data Warehouse
Data Mining and Data WarehouseData Mining and Data Warehouse
Data Mining and Data Warehouse
 
Data Mining
Data Mining Data Mining
Data Mining
 

Andere mochten auch (7)

Man's heart
Man's heartMan's heart
Man's heart
 
Elementary Concepts of data minig
Elementary Concepts of data minigElementary Concepts of data minig
Elementary Concepts of data minig
 
Data mining
Data miningData mining
Data mining
 
Data minig
Data minig Data minig
Data minig
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 

Ähnlich wie Data Mining

Data Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptxData Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptx
hp41112004
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
Tony Nguyen
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
Luis Goldster
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
James Wong
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
Harry Potter
 

Ähnlich wie Data Mining (20)

Presentation in Strategic Plannin and Management.pptx
Presentation in Strategic Plannin and Management.pptxPresentation in Strategic Plannin and Management.pptx
Presentation in Strategic Plannin and Management.pptx
 
Data Analysis - Approach & Techniques
Data Analysis - Approach & TechniquesData Analysis - Approach & Techniques
Data Analysis - Approach & Techniques
 
what is ..how to process types and methods involved in data analysis
what is ..how to process types and methods involved in data analysiswhat is ..how to process types and methods involved in data analysis
what is ..how to process types and methods involved in data analysis
 
Data Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptxData Mining Presentation for College Harsh.pptx
Data Mining Presentation for College Harsh.pptx
 
Data Mining
Data MiningData Mining
Data Mining
 
DataMining Techniq
DataMining TechniqDataMining Techniq
DataMining Techniq
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
 
Data and Information Visualization part 2.pptx
Data and Information Visualization part 2.pptxData and Information Visualization part 2.pptx
Data and Information Visualization part 2.pptx
 
Datamining
DataminingDatamining
Datamining
 
Datamining
DataminingDatamining
Datamining
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data mining
 
Data mining & data warehousing
Data mining & data warehousingData mining & data warehousing
Data mining & data warehousing
 
What Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdfWhat Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdf
 
leewayhertz.com-Data analysis workflow using Scikit-learn.pdf
leewayhertz.com-Data analysis workflow using Scikit-learn.pdfleewayhertz.com-Data analysis workflow using Scikit-learn.pdf
leewayhertz.com-Data analysis workflow using Scikit-learn.pdf
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
 
Business analytics and data mining
Business analytics and data miningBusiness analytics and data mining
Business analytics and data mining
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Data Mining

  • 1. VihangShah Data mining Introduction Data mining is a process of retrieving data from huge database. Data mining is automatically searching large data to discover patterns and trends that is different from simple analysis. Data mining is also known as Knowledge Discovery in Data (KDD). Data mining Process Problem Definition Problem definition in this stage the need of project, objective of project and requirements are defined and from that the basic plan should be implement on primary level. Problem Defination Data Gathering & Preparation Model building & Evaluation Knowledge Deployment
  • 2. VihangShah Data gathering & Preparation As you know in earlier phase you collect all requirements in this phase the additional data or some data be omitted for further phases. This is also a time to identify data quality problem. In short data preparation can significantly improve the information that can be discovered through data mining. The outcome of the data preparation is final data set. Once the data sources are identified, they need to be selected, cleaned, constructed and formatted into the desired form. Model Building and evaluation In this phase selection and apply various modeling techniques for retrieving optimal values. The test will be generated to validate the quality and validity of the model. One or more model are created and run on the prepared dataset. Knowledge deployment The knowledge or information which we gain from data mining process need to present in such a way that it will be use when we need knowledge or information. In this phase the plans for deployment, maintenance and monitoring have to be created for implementation and also future supports. What can data mining do and Not Do? Do:-  Data mining can help to find pattern and relationships within your data.  Data mining help you to discover hidden information in your data.  Data mining actually give optimize result from huge databases.  Data mining can help you to analyze the data for future use.
  • 3. VihangShah Not Do:-  Data mining cannot work automatically.  Data mining cannot give you information about value of the information to your organization.  Data mining does not eliminate the need to know your business, to understand your data. Data Mining Technique Data mining have basically six different techniques and that are Association, classification, clustering, prediction, sequential pattern and decision tree. Association Association basically works on relation between items that why it also called relation technique. It is used in marketing analysis to identify a set of customer’s frequently purchase together. Retailers are using association technique to research customer’s buying habits. Based on historical sale data, retailers might found out that customers buy bread they also buy butter. Classification Classification is used to classify each item into predefined set of data or group. For example: - We can apply classification in application that gives all records of employees who left the company, predict who will probably leave the company in a future period. Clustering In clustering the classes are defined and the objects are put in each class, while in classification technique object are assigned into predefined classes. For example:- Consider book management in library there is wide range of book that having a different topic. So now reader must have easy searching facility of books that having same topics so for that we make a cluster that can keep books that have some kind of similarities in one cluster or one shelf and label it with a meaningful name.
  • 4. VihangShah Prediction Prediction is technique that predicts relationship between independent variable and relationship between dependent and independent variables. For instance the prediction technique can be used in sales to predict profit for the future if we consider sale is an independent variable, profit could be a dependent variable. Sequential Patterns This technique seeks to discover or identity similar patterns, regular events or trends in transaction data over a business period. Decision Tree It is most used technique of data mining because it is easy to understand. In this the root of decision tree is a simple question or condition that has a multiple answers. Each answer leads to a set of questions or conditions that help us determine the data. Note: - we often combine two or more data mining techniques together to form an appropriate process that meets the business needs. Data mining Applications  Data mining help in marketing such as it will used for analysis to provide information on what product together, when they were bought and in what sequence and it will also help to find customer’s behavior.  Data mining help in banking/finance sector such as it will used to identify customer loyalty by analyzing the data of customer’s purchasing activities and it will also help retain credit card customers.  Data mining help in health care and insurance sector such as it will analysis the claims which medical procedures are claimed together and it will also forecasts which customer will potentially purchase new policies. NOTE: - Data mining is also used to analyze the data in many sectors.