SlideShare a Scribd company logo
1 of 21
A REVIEW ON DATA MINING

1   Er.Nancy, perusing M-tech in CSE (2012-14), GNDEC,
                      Ludhiana, India.
ABSTRACT

   Data Mining is an approach to discover or extract
    knowledge from Databases.
   Generally to store databases, enterprises make data
    warehouses and data marts.
   Data warehouses and data marts contain large
    mountains of data.
   The mountains represent valuable resource to the
    enterprise.
   Due to extracting knowledge from large data
    warehouses or depositories, data mining plays great
    role in various fields of machine learning.
   Data mining is used in education, medical, scientific,
    business fields. Various algorithms and programs are
                                                             2
    used for data mining approach.[1]
INTRODUCTION
   Now the world of information technology, all things are
    become automated.
   Information technology is used in every field of human
    life such as business, engineering, medical,
    mathematical, scientific.
   All these fields of human life have lead to the large
    volume of data storage in various formats such as
    records, files, documents, images, sounds, recordings,
    videos and many new data.
   Collection of related data is also known as database. To
    extract correct data from large databases, the proper
    mechanism is used, that mechanism is also known as
    DATA MINING. It is a knowledge discovery technique
    (KDD).[2]                                                  3
PROCESS
 (1) Selection
 (2) Pre-processing

 (3) Transformation

 (4) Data Mining

 (5) Interpretation/Evaluation.[2,3]




                                        4
5
PROCEDURE OF DATA MINING
   1. Data cleaning: It is also known as data cleansing; in
    this phase noise data and irrelevant data are removed
    from the collection.

   2. Data integration: In this stage, multiple data sources,
    often heterogeneous, are combined in a common
    source.

   3. Data selection: The data relevant to the analysis is
    decided on and retrieved from the data collection.

   4. Data transformation: It is also known as data
    consolidation; in this phase the selected data is
    transformed into forms appropriate for the mining
    procedure.                                                   6
CONTD…….
   5. Data mining: It is the crucial step in which clever
    techniques are applied to extract potentially useful patterns.


   6. Pattern evaluation: In this step, interesting patterns
    representing knowledge are identified based on given
    measures.

   7. Knowledge representation: It is the final phase in
    which the discovered knowledge is visually presented to
    the user. This essential step uses visualization
    techniques to help users understand and interpret the
    data mining results.[4,2]
                                                                     7
DATA MINING LIFE CYCLE

   The Cross Industry Standard Process for Data
    Mining (CRISP-DM) which defines six phases:

 (1) Business Understanding
 (2) Data Understanding

 (3) Data Preparation

 (4) Modeling

 (5) Evaluation

 (6) Deployment.[6]


                                                   8
CONTD……
   1. Business Understanding: This phase focuses on
    understanding the project objectives and requirements
    from a business perspective, then converting this
    knowledge into a data mining problem definition and a
    preliminary plan designed to achieve the objectives.

   2. Data Understanding: It starts with an initial data
    collection, to get familiar with the data, to identify data
    quality problems, to discover first insights into the data
    or to detect interesting subsets to form hypotheses for
    hidden information.

   3. Data Preparation: It covers all activities to construct
    the final dataset from the initial raw data.
                                                                  9
CONTD……
   4. Modeling: In this phase, various modeling techniques
    are selected and applied and their parameters are
    calibrated to optimal values.

   5. Evaluation: In this stage the model is thoroughly
    evaluated and reviewed. The steps executed to
    construct the model to be certain it properly achieves
    the business objectives.

   6. Deployment: The purpose of the model is to increase
    knowledge of the data, the knowledge gained will need
    to be organized and presented in a way that the
                                                              10
    customer can use it.[6,8]
TYPES OF DATA MINING SYSTEM
   Classification of data mining systems according to the
    type of data source mined: This classification is according to
    the type of data handled such as spatial data, multimedia
    Data, time-series data, text data, World Wide Web, etc.

    Classification of data mining systems according to the
    data model: This classification based on the data model
    involved such as relational database, object-oriented
    database, Data warehouse, transactional database, etc.

   Classification of data mining systems according to the
    kind of knowledge discovered: This classification based on
    the kind of knowledge discovered or data mining
    functionalities, such as characterization, discrimination,
    association, classification, clustering, etc
                                                                     11
CONTD……
   Classification of data mining systems according to
    mining techniques used: This classification is
    according to the data analysis approach used such as
    machine learning, Neural networks, genetic algorithms,
    statistics, visualization, database oriented or data
    warehouse-oriented, etc.[5,2]




                                                             12
DATA MINING METHODS

   On-Line Analytical             Factor analysis
    Processing (OLAP)              Neural Networks
   Classification                 Regression analysis
   Association Rule Mining        Structured data analysis
   Temporal Data Mining           Sequence mining
   Time Series Analysis           Text mining
   Spatial Mining,                Drug Discovery
   Anomaly/outlier/change         Exploratory data analysis
    detection
                                   Predictive analytics
   Association rule learning
                                   Web Mining
   Cluster analysis
                                   Data analysis etc.[1,8]     13
   Decision trees
DATA MINING APPLICATIONS

   Games
   Business
   Science and Engineering
   Human rights
   Spatial data mining.[5]




                                        14
CHALLENGES

 Sensor data mining
 Pattern mining

 Visual data mining

 Subject based data mining

 Music data mining

 The Digital Library retrieves.[1,2,5]




                                          15
CONCLUSION

   In this paper I briefly reviewed data mining, background
    of data mining, methods, life cycle model, and types of
    data mining, application of it.
   This review will be helpful to you to easily understand
    what data mining is, previously used data mining
    techniques.
   Now days use data mining techniques and also process
    of data mining.
   In this I completely, explain applications and types of
    data mining. [1,2,4,5]


                                                               16
FUTURE SCOPE

   Data mining is a very wide concept. It contains many
    concepts such as various algorithms to extract
    knowledge from large databases.
   Soft Computing techniques like Fuzzy logic, Neural
    Networks and Genetic Programming which contains
    Complex data objects Includes high dimensional, high
    speed data streams, sequence, noise in the time series,
    graph, Multi instance objects, Multi represented objects
    and temporal data etc.
   These are used in Business, Web, Medical diagnosis,
    Scientific and Research analysis fields (bio, remote
    sensing etc…), Social networking etc.[1,7]
                                                               17
REFERENCES
   1. WIKIPEDIA.Com

   2. Mr. S. P. Deshpande1 and Dr. V. M. Thakar
    International Journal of Distributed and Parallel systems
    (IJDPS) Vol.1, No.1, September 2010 DOI.

    3. A Review on Data mining from Past to the Future.
    Venkatadri.M Research Scholar, Dept. of Computer
    Science, Dravidian University, India. Lokanatha C.
    Reddy Professor, Dept. of Computer Science,
    International Journal of Computer Applications (0975 –
    8887) Volume 15– No.7, February 2011.
                                                                18
CONTD…..
   4. Data Mining for High Performance Data Cloud using
    Association Rule Mining.1 T.V.Mahendra 2N.Deepika
    3N.Keasava Rao Professor & HOD, IT, Narayana Engg.
    College, Nellore, AP, India.2 Sr.Assistant Professor,
    Dept. of ISE, New Horizon College of Engineering,
    Bangalore, India 3 Associate Professor, IT, Narayana
    Engg. College, Gudur, AP, India.


   5. Data Mining and KDD: A Shifting Mosaic By Joseph
    M. Firestone, Ph.D. White Paper No. Two March 12,
    1997 the Idea.

                                                            19
CONTD….
   6. Data mining and static’s: what’s the connection?
    Jerome h. Friedman.

   7.www.Google.Com.

   8.www. IEEEXPLORE.Com.




                                                          20
THANKS
21

More Related Content

What's hot

Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data AnalyticsEMC
 
CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologySergey Shelpuk
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analyticsUmasree Raghunath
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecasesSreenatha Reddy K R
 
Web scraping
Web scrapingWeb scraping
Web scrapingSelecto
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Tristan Baker
 
Data mining techniques unit 1
Data mining techniques  unit 1Data mining techniques  unit 1
Data mining techniques unit 1malathieswaran29
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data sciencebhavesh lande
 
Summary data visualization
Summary data visualizationSummary data visualization
Summary data visualizationNovita Sari
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsWay-Yen Lin
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEyad Manna
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...Simplilearn
 
Smart Data Slides: Machine Learning - Case Studies
Smart Data Slides: Machine Learning - Case StudiesSmart Data Slides: Machine Learning - Case Studies
Smart Data Slides: Machine Learning - Case StudiesDATAVERSITY
 

What's hot (20)

Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
 
Graph databases
Graph databasesGraph databases
Graph databases
 
CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodology
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
Web scraping
Web scrapingWeb scraping
Web scraping
 
Predictive Analytics - An Introduction
Predictive Analytics - An IntroductionPredictive Analytics - An Introduction
Predictive Analytics - An Introduction
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
 
Data Engineering Basics
Data Engineering BasicsData Engineering Basics
Data Engineering Basics
 
Data mining
Data mining Data mining
Data mining
 
Data mining techniques unit 1
Data mining techniques  unit 1Data mining techniques  unit 1
Data mining techniques unit 1
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Summary data visualization
Summary data visualizationSummary data visualization
Summary data visualization
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data Scientists
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
 
Smart Data Slides: Machine Learning - Case Studies
Smart Data Slides: Machine Learning - Case StudiesSmart Data Slides: Machine Learning - Case Studies
Smart Data Slides: Machine Learning - Case Studies
 

Viewers also liked

Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingCodemotion
 
Data Mining : Concepts
Data Mining : ConceptsData Mining : Concepts
Data Mining : ConceptsPragya Pandey
 
Data mining slides
Data mining slidesData mining slides
Data mining slidessmj
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Miningidnats
 

Viewers also liked (7)

Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
 
Data Mining : Concepts
Data Mining : ConceptsData Mining : Concepts
Data Mining : Concepts
 
Data Mining
Data MiningData Mining
Data Mining
 
Big Data v Data Mining
Big Data v Data MiningBig Data v Data Mining
Big Data v Data Mining
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 
Data mining
Data miningData mining
Data mining
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 

Similar to A review on data mining

DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEIJDKP
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)Kartik Kalpande Patil
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope IJCSEIT Journal
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.docbutest
 
A Comprehensive Study on Outlier Detection in Data Mining
A Comprehensive Study on Outlier Detection in Data MiningA Comprehensive Study on Outlier Detection in Data Mining
A Comprehensive Study on Outlier Detection in Data MiningBRNSSPublicationHubI
 
Unit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.pptUnit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.pptPadmajaLaksh
 
A LITERATURE REVIEW ON DATAMINING
A LITERATURE REVIEW ON DATAMININGA LITERATURE REVIEW ON DATAMINING
A LITERATURE REVIEW ON DATAMININGCarrie Romero
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESIJCSES Journal
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Reviewijdpsjournal
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective ApproachIRJET Journal
 
Introduction to dm and dw
Introduction to dm and dwIntroduction to dm and dw
Introduction to dm and dwANUSUYA T K
 
A Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The FutureA Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The FutureKaela Johnson
 

Similar to A review on data mining (20)

DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
 
Hi2413031309
Hi2413031309Hi2413031309
Hi2413031309
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope
 
dwdm unit 1.ppt
dwdm unit 1.pptdwdm unit 1.ppt
dwdm unit 1.ppt
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.doc
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
A Comprehensive Study on Outlier Detection in Data Mining
A Comprehensive Study on Outlier Detection in Data MiningA Comprehensive Study on Outlier Detection in Data Mining
A Comprehensive Study on Outlier Detection in Data Mining
 
Unit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.pptUnit 1 (Chapter-1) on data mining concepts.ppt
Unit 1 (Chapter-1) on data mining concepts.ppt
 
A LITERATURE REVIEW ON DATAMINING
A LITERATURE REVIEW ON DATAMININGA LITERATURE REVIEW ON DATAMINING
A LITERATURE REVIEW ON DATAMINING
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Review
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
 
isd314-01
isd314-01isd314-01
isd314-01
 
Introduction to dm and dw
Introduction to dm and dwIntroduction to dm and dw
Introduction to dm and dw
 
2 Data-mining process
2   Data-mining process2   Data-mining process
2 Data-mining process
 
A Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The FutureA Review On Data Mining From Past To The Future
A Review On Data Mining From Past To The Future
 
Datamining
DataminingDatamining
Datamining
 
Chapter 1. Introduction.ppt
Chapter 1. Introduction.pptChapter 1. Introduction.ppt
Chapter 1. Introduction.ppt
 

Recently uploaded

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 

Recently uploaded (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

A review on data mining

  • 1. A REVIEW ON DATA MINING 1 Er.Nancy, perusing M-tech in CSE (2012-14), GNDEC, Ludhiana, India.
  • 2. ABSTRACT  Data Mining is an approach to discover or extract knowledge from Databases.  Generally to store databases, enterprises make data warehouses and data marts.  Data warehouses and data marts contain large mountains of data.  The mountains represent valuable resource to the enterprise.  Due to extracting knowledge from large data warehouses or depositories, data mining plays great role in various fields of machine learning.  Data mining is used in education, medical, scientific, business fields. Various algorithms and programs are 2 used for data mining approach.[1]
  • 3. INTRODUCTION  Now the world of information technology, all things are become automated.  Information technology is used in every field of human life such as business, engineering, medical, mathematical, scientific.  All these fields of human life have lead to the large volume of data storage in various formats such as records, files, documents, images, sounds, recordings, videos and many new data.  Collection of related data is also known as database. To extract correct data from large databases, the proper mechanism is used, that mechanism is also known as DATA MINING. It is a knowledge discovery technique (KDD).[2] 3
  • 4. PROCESS  (1) Selection  (2) Pre-processing  (3) Transformation  (4) Data Mining  (5) Interpretation/Evaluation.[2,3] 4
  • 5. 5
  • 6. PROCEDURE OF DATA MINING  1. Data cleaning: It is also known as data cleansing; in this phase noise data and irrelevant data are removed from the collection.  2. Data integration: In this stage, multiple data sources, often heterogeneous, are combined in a common source.  3. Data selection: The data relevant to the analysis is decided on and retrieved from the data collection.  4. Data transformation: It is also known as data consolidation; in this phase the selected data is transformed into forms appropriate for the mining procedure. 6
  • 7. CONTD…….  5. Data mining: It is the crucial step in which clever techniques are applied to extract potentially useful patterns.  6. Pattern evaluation: In this step, interesting patterns representing knowledge are identified based on given measures.  7. Knowledge representation: It is the final phase in which the discovered knowledge is visually presented to the user. This essential step uses visualization techniques to help users understand and interpret the data mining results.[4,2] 7
  • 8. DATA MINING LIFE CYCLE  The Cross Industry Standard Process for Data Mining (CRISP-DM) which defines six phases:  (1) Business Understanding  (2) Data Understanding  (3) Data Preparation  (4) Modeling  (5) Evaluation  (6) Deployment.[6] 8
  • 9. CONTD……  1. Business Understanding: This phase focuses on understanding the project objectives and requirements from a business perspective, then converting this knowledge into a data mining problem definition and a preliminary plan designed to achieve the objectives.  2. Data Understanding: It starts with an initial data collection, to get familiar with the data, to identify data quality problems, to discover first insights into the data or to detect interesting subsets to form hypotheses for hidden information.  3. Data Preparation: It covers all activities to construct the final dataset from the initial raw data. 9
  • 10. CONTD……  4. Modeling: In this phase, various modeling techniques are selected and applied and their parameters are calibrated to optimal values.  5. Evaluation: In this stage the model is thoroughly evaluated and reviewed. The steps executed to construct the model to be certain it properly achieves the business objectives.  6. Deployment: The purpose of the model is to increase knowledge of the data, the knowledge gained will need to be organized and presented in a way that the 10 customer can use it.[6,8]
  • 11. TYPES OF DATA MINING SYSTEM  Classification of data mining systems according to the type of data source mined: This classification is according to the type of data handled such as spatial data, multimedia Data, time-series data, text data, World Wide Web, etc.  Classification of data mining systems according to the data model: This classification based on the data model involved such as relational database, object-oriented database, Data warehouse, transactional database, etc.  Classification of data mining systems according to the kind of knowledge discovered: This classification based on the kind of knowledge discovered or data mining functionalities, such as characterization, discrimination, association, classification, clustering, etc 11
  • 12. CONTD……  Classification of data mining systems according to mining techniques used: This classification is according to the data analysis approach used such as machine learning, Neural networks, genetic algorithms, statistics, visualization, database oriented or data warehouse-oriented, etc.[5,2] 12
  • 13. DATA MINING METHODS  On-Line Analytical  Factor analysis Processing (OLAP)  Neural Networks  Classification  Regression analysis  Association Rule Mining  Structured data analysis  Temporal Data Mining  Sequence mining  Time Series Analysis  Text mining  Spatial Mining,  Drug Discovery  Anomaly/outlier/change  Exploratory data analysis detection  Predictive analytics  Association rule learning  Web Mining  Cluster analysis  Data analysis etc.[1,8] 13  Decision trees
  • 14. DATA MINING APPLICATIONS  Games  Business  Science and Engineering  Human rights  Spatial data mining.[5] 14
  • 15. CHALLENGES  Sensor data mining  Pattern mining  Visual data mining  Subject based data mining  Music data mining  The Digital Library retrieves.[1,2,5] 15
  • 16. CONCLUSION  In this paper I briefly reviewed data mining, background of data mining, methods, life cycle model, and types of data mining, application of it.  This review will be helpful to you to easily understand what data mining is, previously used data mining techniques.  Now days use data mining techniques and also process of data mining.  In this I completely, explain applications and types of data mining. [1,2,4,5] 16
  • 17. FUTURE SCOPE  Data mining is a very wide concept. It contains many concepts such as various algorithms to extract knowledge from large databases.  Soft Computing techniques like Fuzzy logic, Neural Networks and Genetic Programming which contains Complex data objects Includes high dimensional, high speed data streams, sequence, noise in the time series, graph, Multi instance objects, Multi represented objects and temporal data etc.  These are used in Business, Web, Medical diagnosis, Scientific and Research analysis fields (bio, remote sensing etc…), Social networking etc.[1,7] 17
  • 18. REFERENCES  1. WIKIPEDIA.Com  2. Mr. S. P. Deshpande1 and Dr. V. M. Thakar International Journal of Distributed and Parallel systems (IJDPS) Vol.1, No.1, September 2010 DOI.  3. A Review on Data mining from Past to the Future. Venkatadri.M Research Scholar, Dept. of Computer Science, Dravidian University, India. Lokanatha C. Reddy Professor, Dept. of Computer Science, International Journal of Computer Applications (0975 – 8887) Volume 15– No.7, February 2011. 18
  • 19. CONTD…..  4. Data Mining for High Performance Data Cloud using Association Rule Mining.1 T.V.Mahendra 2N.Deepika 3N.Keasava Rao Professor & HOD, IT, Narayana Engg. College, Nellore, AP, India.2 Sr.Assistant Professor, Dept. of ISE, New Horizon College of Engineering, Bangalore, India 3 Associate Professor, IT, Narayana Engg. College, Gudur, AP, India.  5. Data Mining and KDD: A Shifting Mosaic By Joseph M. Firestone, Ph.D. White Paper No. Two March 12, 1997 the Idea. 19
  • 20. CONTD….  6. Data mining and static’s: what’s the connection? Jerome h. Friedman.  7.www.Google.Com.  8.www. IEEEXPLORE.Com. 20