SlideShare ist ein Scribd-Unternehmen logo
1 von 35
LOGO   www.themegallery.com




                               DATA MINING
                              Dayanand Academy of Management Studies
LOGO
www.themegallery.com




                                                  Contents


                       1   Data Mining Introduction

                       2   Data Mining Procedures

                       3   Data Mining Techniques

                       4   Data Mining Application
LOGO   www.themegallery.com




                              Data Mining
LOGO




       Introduction
www.themegallery.com




                                            Intoduction


What is Data Mining?
        Data mining is the process of extracting
         meaningful piece of information from Data
         warehouses , which can be useful for
         maximizing profit , fraud detection , marketing
         perspective and scientific research.
www.themegallery.com




Data Warehouses:
     According to Stanford University,
   "A Data Warehouse is a repository of integrated
    information, available for queries and analysis. Data
    and information are extracted from heterogeneous
    sources as they are generated .This makes it much
    easier and more efficient to run queries over data
    that originally came from different sources."
www.themegallery.com




                                    Data Minining Steps



              Fourth Step                  Knowledge Deployment


              Third Step                 Model Building


            Second Step            Data Gathering


            First Step      Problem Definition
www.themegallery.com




                               Data Mining Procedures

    Problem Definition:-
         Data mining project focuses on understanding the
         objectives and requirements of a particular project of
         business. The Project must be specified from a business
         point of view. After that it can be formulated as a data
         mining problem and develop a preliminary.

    Data Gathering & Preparation:-
         This task involves data collection and exploration. It can
         be done by Removing unnecessary information
         , Detecting Data Duplicity and supplying some new
         information.
www.themegallery.com




                                Data Mining Procedures

    Model Building and Evaluation:-
         In this phase, various Modeling Techniques can be applied
         to build the data model which is likely to be sufficient with
         the requirement and then An Evaluation can be done to
         compare the current model with the originally stated project
         goal.
    Knowledge Deployment:-
         Knowledge deployment is the use of data mining within a
         target environment. In the deployment phase, insight and
         actionable information can be derived from data.
www.themegallery.com




                       History of Data Mining Techniques

                1950             1960’s            1980’s

     • Neural                 • Decision       • Support
       Networks                 Trees            Vector
     • Clustering                                Machine

                       1999                    2004

      • Cross Industry Standard       • Java Data Mining
        Platform Data Mining            Package (JDM 1.0)
        Package (Crisp DM 1.0)
LOGO   www.themegallery.com




                              Data Mining
LOGO




Neural Networks
www.themegallery.com




                                     Neural Networks:-


     Neural networks are non-linear statistical data modeling
      tools. They can be used to model complex relationships
      between inputs and outputs or to find patterns in data.
      Using neural networks as a tool, data warehousing firms
      are extracting information from datasets in the process
      known as data mining.
     Neural network is a techniques derived from artificial
      intelligence research that uses generalized regression
      and provide methods to carry it out.
     It is self adapted and it uses learning method.
www.themegallery.com




                       Processing of Neural Networks

     Input data is presented to the
      network and propagated
      through the network until it
      reaches the output layer. The
      predicted output is subtracted
      from the actual output and an
      error value for the networks is
      calculated through supervised
      learning.
     Once back propagation has
      finished, the forward process
      starts again, and this cycle is
      continued until the error
      between predicted and actual
      outputs is minimized.
LOGO   www.themegallery.com




                              Data Mining
LOGO




                  Clustering
www.themegallery.com




                                      Clustering

    Clustering is used to segment the data.
     Clustering models segment records into groups
     that are similar to each other which is totally
     distinct from other groups.
    Typical Applications of Clustering are Online
     Document Classification and to cluster web log
     data to discover groups of similar access
     patterns. Pattern Recognition, Spatial Data
     Analysis and Image processing are other
     applications in Scientific areas.
www.themegallery.com




                       Clustering
LOGO   www.themegallery.com




                              Data Mining
LOGO




 Decision Trees
www.themegallery.com




                                     Decision Trees

    The Decision Tree algorithm is based on
     conditional probabilities. Decision trees
     generate rules. A rule is a conditional statement
     that can easily be understood by humans and
     easily used within a database to identify a set of
     records.
    The Decision Tree algorithm produces accurate
     and interpretable models with relatively little
     user intervention. The algorithm can be used for
     both binary and multi-class classification
     problems.
www.themegallery.com




                                                     Decision Trees




  Node 1 sows about married persons and 0 describes single persons.
  Node 1 has 712 records (cases). Of these, 382 have a target of 0 (not
  likely to increase spending), and 330 have a target of 1 (likely to increase
  spending).
LOGO   www.themegallery.com




                              Data Mining
LOGO




 Support Vector
   Machines
www.themegallery.com




                          Support Vector Machine

    An optimal Defined Surface.
    Linear and non linear Input Space.
    Linear or High Dimension Feature Space which
     is specially defined Kernel function.
    SVM involves the fitting of a hyper plane such
     that the largest margin is formed between 2
     classes of vectors while minimizing the effects
     of classification errors so that we can classified
     in to groups.
www.themegallery.com




                       Support Vector Machine
www.themegallery.com




                       Support Vector Used For

    Classification

    Regression

    Unsupervised Learning and
     supervised learning.
LOGO   www.themegallery.com




                              Data Mining
LOGO




           JAVA
        Data Mining
www.themegallery.com




                       JAVA Data Mining
www.themegallery.com




                       Facilities by JDM 1.0 Package
www.themegallery.com




                            Parallel Processing

                       Resources
              R1       R2     R3        R4

                       Processors
               P1      P2     P3        P4

                        Output
              O1       O2     O3        O4
www.themegallery.com




                            Distributed Computing

                       Processors
              P1       P2     P3          P4

                       Http Request

                       Resources
              R1       R2        R3       R4
LOGO   www.themegallery.com




                              Data Mining
LOGO




         Cross Industry
       Standard Platform
          Data Mining
www.themegallery.com




                                                           Crisp DM 1.0

                                      Business
                                    Understanding

                                                        Data
                       Deployment
                                                    Understanding



                                      Data
                                                        Data
                       Evaluation
                                                     Preparation


                                    Data Modeling
LOGO   www.themegallery.com




                              Data Mining
LOGO




        Data Mining
        Applications
www.themegallery.com




                                  Data Mining Applications


    Online Searching                          Business




       Spatial                       Data
                                                   Science
     Data Mining                     Mining


                       Security               Marketing
www.themegallery.com




                              Data Mining Applications

    BUSSINESS PRECPECTIVE:-
          Data mining helps business to extract information from
         resources such as print media, television, internet,
         investment. Data mining tools predicts future trend and
         behavior allowing business to make proactive
         knowledge driven decision for increasing revenue, profit
         of the company.
    SCIENCTIFIC PRECPECTIVE:-
         Practical perspective describe how techniques from
         data mining can be used to address and resolve the
         modern problem in science and engineering
         domains.
www.themegallery.com




                              Data Mining Applications

    SECURITY PRECPECTIVE:-
          To prevent or detect for fraud such as showing wrong
         geographical domain and to identify stolen credit card by
         transaction history. Data Mining can help to make online
         transactions more secure and reliable by analyzing
         previous transaction records.
    SPATIAL DATA MINING:-
          Geo-marketing companies doing customer segmentation
         based on spatial location through data mining by mining
         the purchase and subscription history .
www.themegallery.com




    WEBSITE PROMOTION:-
         Web owner can attract most number of visitors by
         mining their data and then modifying their layout on the
         basis of extracted information.
LOGO   www.themegallery.com




LOGO                          Add your company slogan

Weitere ähnliche Inhalte

Was ist angesagt?

Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discoveryHoang Nguyen
 
Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.SlideTeam.net
 
Data mining process powerpoint presentation templates
Data mining process powerpoint presentation templatesData mining process powerpoint presentation templates
Data mining process powerpoint presentation templatesSlideTeam.net
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data MiningSi Krishan
 
Datamining - On What Kind of Data
Datamining - On What Kind of DataDatamining - On What Kind of Data
Datamining - On What Kind of Datawina wulansari
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentationmillerca2
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Gihan Wikramanayake
 
Lecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data MiningLecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data MiningPier Luca Lanzi
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNRDatiGovIT
 
Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1DanWooster1
 

Was ist angesagt? (20)

Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discovery
 
Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.
 
Data mining process powerpoint presentation templates
Data mining process powerpoint presentation templatesData mining process powerpoint presentation templates
Data mining process powerpoint presentation templates
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Datamining - On What Kind of Data
Datamining - On What Kind of DataDatamining - On What Kind of Data
Datamining - On What Kind of Data
 
Lecture - Data Mining
Lecture - Data MiningLecture - Data Mining
Lecture - Data Mining
 
Analytics and Data Mining Industry Overview
Analytics and Data Mining Industry OverviewAnalytics and Data Mining Industry Overview
Analytics and Data Mining Industry Overview
 
Introduction data mining
Introduction data miningIntroduction data mining
Introduction data mining
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
 
Introduction
IntroductionIntroduction
Introduction
 
STI Summit 2011 - Digital Worlds
STI Summit 2011 - Digital WorldsSTI Summit 2011 - Digital Worlds
STI Summit 2011 - Digital Worlds
 
Data mining
Data miningData mining
Data mining
 
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
Application of Data Warehousing & Data Mining to Exploitation for Supporting ...
 
Lecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data MiningLecture 02 Machine Learning For Data Mining
Lecture 02 Machine Learning For Data Mining
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNR
 
Ch03
Ch03Ch03
Ch03
 
Huguk lily
Huguk lilyHuguk lily
Huguk lily
 
Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1Upstate CSCI 525 Data Mining Chapter 1
Upstate CSCI 525 Data Mining Chapter 1
 
Dwdm
DwdmDwdm
Dwdm
 
Graph
GraphGraph
Graph
 

Andere mochten auch

Understanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing AgeUnderstanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing AgeSherman Mohr Jr.
 
Social Media Mining: An Introduction
Social Media Mining: An IntroductionSocial Media Mining: An Introduction
Social Media Mining: An IntroductionAli Abbasi
 
Mining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing InsightsMining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing Insightsjuliannacole
 
Social Media Data Mining
Social Media Data MiningSocial Media Data Mining
Social Media Data MiningRyan Reede
 
Data mining on Social Media
Data mining on Social MediaData mining on Social Media
Data mining on Social Mediahome
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBYTE Project
 
Social Media Mining - Chapter 5 (Data Mining Essentials)
Social Media Mining - Chapter 5 (Data Mining Essentials)Social Media Mining - Chapter 5 (Data Mining Essentials)
Social Media Mining - Chapter 5 (Data Mining Essentials)SocialMediaMining
 
Social media mining hicss 46 part 1
Social media mining   hicss 46 part 1Social media mining   hicss 46 part 1
Social media mining hicss 46 part 1Dave King
 
Data mining for social media
Data mining for social mediaData mining for social media
Data mining for social mediarangesharp
 
DIY basic Facebook data mining
DIY basic Facebook data miningDIY basic Facebook data mining
DIY basic Facebook data miningSTEM/MARK
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisDatamining Tools
 
Data mining in social network
Data mining in social networkData mining in social network
Data mining in social networkakash_mishra
 
Social media mining PPT
Social media mining PPTSocial media mining PPT
Social media mining PPTChhavi Mathur
 

Andere mochten auch (15)

Social Media Mining and Analytics
Social Media Mining and AnalyticsSocial Media Mining and Analytics
Social Media Mining and Analytics
 
Understanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing AgeUnderstanding Data Mining in the Social Media Marketing Age
Understanding Data Mining in the Social Media Marketing Age
 
Social Media Mining: An Introduction
Social Media Mining: An IntroductionSocial Media Mining: An Introduction
Social Media Mining: An Introduction
 
Mining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing InsightsMining Social media for Product Development & Marketing Insights
Mining Social media for Product Development & Marketing Insights
 
Social Media Data Mining
Social Media Data MiningSocial Media Data Mining
Social Media Data Mining
 
Data mining on Social Media
Data mining on Social MediaData mining on Social Media
Data mining on Social Media
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency Management
 
Social Media Mining - Chapter 5 (Data Mining Essentials)
Social Media Mining - Chapter 5 (Data Mining Essentials)Social Media Mining - Chapter 5 (Data Mining Essentials)
Social Media Mining - Chapter 5 (Data Mining Essentials)
 
Social media mining hicss 46 part 1
Social media mining   hicss 46 part 1Social media mining   hicss 46 part 1
Social media mining hicss 46 part 1
 
Data mining for social media
Data mining for social mediaData mining for social media
Data mining for social media
 
DIY basic Facebook data mining
DIY basic Facebook data miningDIY basic Facebook data mining
DIY basic Facebook data mining
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
 
Data mining in social network
Data mining in social networkData mining in social network
Data mining in social network
 
Social Data Mining
Social Data MiningSocial Data Mining
Social Data Mining
 
Social media mining PPT
Social media mining PPTSocial media mining PPT
Social media mining PPT
 

Ähnlich wie Data Mining

Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research ManagementIDT Partners
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesLars E Martinsson
 
Product overview 6.0 v.1.0
Product overview 6.0 v.1.0Product overview 6.0 v.1.0
Product overview 6.0 v.1.0Gianluigi Riccio
 
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier DataOff-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier DataHostedbyConfluent
 
From open data to API-driven business
From open data to API-driven businessFrom open data to API-driven business
From open data to API-driven businessOpenDataSoft
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGENeeraj Goswami
 
Big data and computing grid
Big data and computing gridBig data and computing grid
Big data and computing gridThang Nguyen
 
Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2David Linthicum
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Denodo
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOADemed L'Her
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceeRic Choo
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Kun Le
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesDenodo
 
Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...
Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...
Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...InSync2011
 
MS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.docMS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.docbutest
 
A Data Fabric for All Things Intelligent
A Data Fabric for All Things IntelligentA Data Fabric for All Things Intelligent
A Data Fabric for All Things IntelligentDenodo
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...Neo4j
 

Ähnlich wie Data Mining (20)

Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research Management
 
Data mining & column stores
Data mining & column storesData mining & column stores
Data mining & column stores
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
 
Product overview 6.0 v.1.0
Product overview 6.0 v.1.0Product overview 6.0 v.1.0
Product overview 6.0 v.1.0
 
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier DataOff-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier Data
 
From open data to API-driven business
From open data to API-driven businessFrom open data to API-driven business
From open data to API-driven business
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
 
Big data and computing grid
Big data and computing gridBig data and computing grid
Big data and computing grid
 
Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2Getting Cloud Architecture Right the First Time Ver 2
Getting Cloud Architecture Right the First Time Ver 2
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOA
 
Data mining
Data miningData mining
Data mining
 
Datumize Deck 2019
Datumize Deck 2019 Datumize Deck 2019
Datumize Deck 2019
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...
Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...
Database & Technology 2 _ Damien Bootsma _ best Practices for capturing meta ...
 
MS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.docMS Word file resumes16869r.doc.doc
MS Word file resumes16869r.doc.doc
 
A Data Fabric for All Things Intelligent
A Data Fabric for All Things IntelligentA Data Fabric for All Things Intelligent
A Data Fabric for All Things Intelligent
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
 

Data Mining

  • 1. LOGO www.themegallery.com DATA MINING Dayanand Academy of Management Studies LOGO
  • 2. www.themegallery.com Contents 1 Data Mining Introduction 2 Data Mining Procedures 3 Data Mining Techniques 4 Data Mining Application
  • 3. LOGO www.themegallery.com Data Mining LOGO Introduction
  • 4. www.themegallery.com Intoduction What is Data Mining?  Data mining is the process of extracting meaningful piece of information from Data warehouses , which can be useful for maximizing profit , fraud detection , marketing perspective and scientific research.
  • 5. www.themegallery.com Data Warehouses: According to Stanford University, "A Data Warehouse is a repository of integrated information, available for queries and analysis. Data and information are extracted from heterogeneous sources as they are generated .This makes it much easier and more efficient to run queries over data that originally came from different sources."
  • 6. www.themegallery.com Data Minining Steps Fourth Step Knowledge Deployment Third Step Model Building Second Step Data Gathering First Step Problem Definition
  • 7. www.themegallery.com Data Mining Procedures Problem Definition:- Data mining project focuses on understanding the objectives and requirements of a particular project of business. The Project must be specified from a business point of view. After that it can be formulated as a data mining problem and develop a preliminary. Data Gathering & Preparation:- This task involves data collection and exploration. It can be done by Removing unnecessary information , Detecting Data Duplicity and supplying some new information.
  • 8. www.themegallery.com Data Mining Procedures Model Building and Evaluation:- In this phase, various Modeling Techniques can be applied to build the data model which is likely to be sufficient with the requirement and then An Evaluation can be done to compare the current model with the originally stated project goal. Knowledge Deployment:- Knowledge deployment is the use of data mining within a target environment. In the deployment phase, insight and actionable information can be derived from data.
  • 9. www.themegallery.com History of Data Mining Techniques 1950 1960’s 1980’s • Neural • Decision • Support Networks Trees Vector • Clustering Machine 1999 2004 • Cross Industry Standard • Java Data Mining Platform Data Mining Package (JDM 1.0) Package (Crisp DM 1.0)
  • 10. LOGO www.themegallery.com Data Mining LOGO Neural Networks
  • 11. www.themegallery.com Neural Networks:-  Neural networks are non-linear statistical data modeling tools. They can be used to model complex relationships between inputs and outputs or to find patterns in data. Using neural networks as a tool, data warehousing firms are extracting information from datasets in the process known as data mining.  Neural network is a techniques derived from artificial intelligence research that uses generalized regression and provide methods to carry it out.  It is self adapted and it uses learning method.
  • 12. www.themegallery.com Processing of Neural Networks  Input data is presented to the network and propagated through the network until it reaches the output layer. The predicted output is subtracted from the actual output and an error value for the networks is calculated through supervised learning.  Once back propagation has finished, the forward process starts again, and this cycle is continued until the error between predicted and actual outputs is minimized.
  • 13. LOGO www.themegallery.com Data Mining LOGO Clustering
  • 14. www.themegallery.com Clustering Clustering is used to segment the data. Clustering models segment records into groups that are similar to each other which is totally distinct from other groups. Typical Applications of Clustering are Online Document Classification and to cluster web log data to discover groups of similar access patterns. Pattern Recognition, Spatial Data Analysis and Image processing are other applications in Scientific areas.
  • 15. www.themegallery.com Clustering
  • 16. LOGO www.themegallery.com Data Mining LOGO Decision Trees
  • 17. www.themegallery.com Decision Trees The Decision Tree algorithm is based on conditional probabilities. Decision trees generate rules. A rule is a conditional statement that can easily be understood by humans and easily used within a database to identify a set of records. The Decision Tree algorithm produces accurate and interpretable models with relatively little user intervention. The algorithm can be used for both binary and multi-class classification problems.
  • 18. www.themegallery.com Decision Trees Node 1 sows about married persons and 0 describes single persons. Node 1 has 712 records (cases). Of these, 382 have a target of 0 (not likely to increase spending), and 330 have a target of 1 (likely to increase spending).
  • 19. LOGO www.themegallery.com Data Mining LOGO Support Vector Machines
  • 20. www.themegallery.com Support Vector Machine An optimal Defined Surface. Linear and non linear Input Space. Linear or High Dimension Feature Space which is specially defined Kernel function. SVM involves the fitting of a hyper plane such that the largest margin is formed between 2 classes of vectors while minimizing the effects of classification errors so that we can classified in to groups.
  • 21. www.themegallery.com Support Vector Machine
  • 22. www.themegallery.com Support Vector Used For Classification Regression Unsupervised Learning and supervised learning.
  • 23. LOGO www.themegallery.com Data Mining LOGO JAVA Data Mining
  • 24. www.themegallery.com JAVA Data Mining
  • 25. www.themegallery.com Facilities by JDM 1.0 Package
  • 26. www.themegallery.com Parallel Processing Resources R1 R2 R3 R4 Processors P1 P2 P3 P4 Output O1 O2 O3 O4
  • 27. www.themegallery.com Distributed Computing Processors P1 P2 P3 P4 Http Request Resources R1 R2 R3 R4
  • 28. LOGO www.themegallery.com Data Mining LOGO Cross Industry Standard Platform Data Mining
  • 29. www.themegallery.com Crisp DM 1.0 Business Understanding Data Deployment Understanding Data Data Evaluation Preparation Data Modeling
  • 30. LOGO www.themegallery.com Data Mining LOGO Data Mining Applications
  • 31. www.themegallery.com Data Mining Applications Online Searching Business Spatial Data Science Data Mining Mining Security Marketing
  • 32. www.themegallery.com Data Mining Applications BUSSINESS PRECPECTIVE:- Data mining helps business to extract information from resources such as print media, television, internet, investment. Data mining tools predicts future trend and behavior allowing business to make proactive knowledge driven decision for increasing revenue, profit of the company. SCIENCTIFIC PRECPECTIVE:- Practical perspective describe how techniques from data mining can be used to address and resolve the modern problem in science and engineering domains.
  • 33. www.themegallery.com Data Mining Applications SECURITY PRECPECTIVE:- To prevent or detect for fraud such as showing wrong geographical domain and to identify stolen credit card by transaction history. Data Mining can help to make online transactions more secure and reliable by analyzing previous transaction records. SPATIAL DATA MINING:- Geo-marketing companies doing customer segmentation based on spatial location through data mining by mining the purchase and subscription history .
  • 34. www.themegallery.com WEBSITE PROMOTION:- Web owner can attract most number of visitors by mining their data and then modifying their layout on the basis of extracted information.
  • 35. LOGO www.themegallery.com LOGO Add your company slogan