SlideShare ist ein Scribd-Unternehmen logo
1 von 15
INTRODUCTION TO DATA MINING
TECHNIQUE
By – Pawneshwar Datt Rai
WHAT IS DATA MINING?
 Data mining is also called knowledge discovery and data
mining (KDD)
 Data mining is
 extraction of useful patterns from data sources, e.g.,
databases, texts, web, image.
 Patterns must be:
 valid, novel, potentially useful, understandable
This PPT presented By - Pawneshwar Datt Rai
EXAMPLE OF DISCOVERED PATTERNS
 Association rules:
“80% of customers who buy cheese and milk also buy
bread, and 5% of customers buy all of them together”
Cheese, Milk Bread [sup =5%, confid=80%]
This PPT presented By - Pawneshwar Datt Rai
MAIN DATA MINING TASKS
 Classification:
mining patterns that can classify future data into known
classes.
 Association rule mining
mining any rule of the form X  Y, where X and Y are
sets of data items.
 Clustering
identifying a set of similarity groups in the data
This PPT presented By - Pawneshwar Datt Rai
MAIN DATA MINING TASKS
 Sequential pattern mining:
A sequential rule: A B, says that event A will be
immediately followed by event B with a certain confidence
 Deviation detection:
discovering the most significant changes in data
 Data visualization: using graphical methods to show
patterns in data.
This PPT presented By - Pawneshwar Datt Rai
WHY IS DATA MINING IMPORTANT?
 Rapid computerization of businesses produce huge
amount of data
 How to make best use of data?
 A growing realization: knowledge discovered from
data can be used for competitive advantage.
This PPT presented By - Pawneshwar Datt Rai
WHY IS DATA MINING NECESSARY?
 Make use of your data assets
 There is a big gap from stored data to knowledge; and
the transition won’t occur automatically.
 Many interesting things you want to find cannot be found
using database queries
“find me people likely to buy my products”
“Who are likely to respond to my promotion”
This PPT presented By - Pawneshwar Datt Rai
WHY DATA MINING NOW?
 The data is abundant.
 The data is being warehoused.
 The computing power is affordable.
 The competitive pressure is strong.
 Data mining tools have become available
This PPT presented By - Pawneshwar Datt Rai
RELATED FIELDS
 Data mining is an emerging multi-disciplinary field:
Statistics
Machine learning
Databases
Information retrieval
Visualization
etc.
This PPT presented By - Pawneshwar Datt Rai
DATA MINING (KDD) PROCESS
 Understand the application domain
 Identify data sources and select target data
 Pre-process: cleaning, attribute selection
 Data mining to extract patterns or models
 Post-process: identifying interesting or useful patterns
 Incorporate patterns in real world tasks
This PPT presented By - Pawneshwar Datt Rai
DATA MINING APPLICATIONS
 Marketing, customer profiling and retention,
identifying potential customers, market
segmentation.
 Fraud detection
identifying credit card fraud, intrusion detection
 Scientific data analysis
 Text and web mining
 Any application that involves a large amount of data.
This PPT presented By - Pawneshwar Datt Rai
WEB DATA EXTRACTION
Data
region1
Data
region2
A data
record
A data
record
This PPT presented By - Pawneshwar Datt Rai
OPINION ANALYSIS
 Word-of-mouth on the Web
 The Web has dramatically changed the way that
consumers express their opinions.
 One can post reviews of products at merchant
sites, Web forums, discussion groups, blogs
 Techniques are being developed to exploit these
sources.
 Benefits of Review Analysis
 Potential Customer: No need to read many reviews
 Product manufacturer: market intelligence, product
benchmarking
This PPT presented By - Pawneshwar Datt Rai
FEATURE BASED ANALYSIS &
SUMMARIZATION
 Extracting product features (called Opinion Features) that
have been commented on by customers.
 Identifying opinion sentences in each review and
deciding whether each opinion sentence is positive or
negative.
 Summarizing and comparing results.
This PPT presented By - Pawneshwar Datt Rai
A Happy and Prosperous day to all friends.
This PPT presented By – Pawneshwar Datt Rai
ThisPPTpresentedBy-PawneshwarDattRai

Weitere ähnliche Inhalte

Was ist angesagt?

Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
pcherukumalla
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Simplilearn
 

Was ist angesagt? (20)

Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and work
 
Introduction To Analytics
Introduction To AnalyticsIntroduction To Analytics
Introduction To Analytics
 
Knowledge Discovery and Data Mining
Knowledge Discovery and Data MiningKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining
 
Data Mining : Concepts
Data Mining : ConceptsData Mining : Concepts
Data Mining : Concepts
 
Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
Data Mining
Data MiningData Mining
Data Mining
 
Data mining presentation.ppt
Data mining presentation.pptData mining presentation.ppt
Data mining presentation.ppt
 
Data mining
Data miningData mining
Data mining
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Application of data mining
Application of data miningApplication of data mining
Application of data mining
 
data mining
data miningdata mining
data mining
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 

Andere mochten auch

Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
Saif Ullah
 
Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.
SlideTeam.net
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
Neeraj Goswami
 

Andere mochten auch (15)

Transactional Data Mining
Transactional Data MiningTransactional Data Mining
Transactional Data Mining
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
 
Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques Chapter 08 Data Mining Techniques
Chapter 08 Data Mining Techniques
 
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & KamberChapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 8.3 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
 
Data mining and its applications!
Data mining and its applications!Data mining and its applications!
Data mining and its applications!
 
Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.Data mining process powerpoint ppt slides.
Data mining process powerpoint ppt slides.
 
Significance of Data Mining
Significance of Data MiningSignificance of Data Mining
Significance of Data Mining
 
Lecture 01 Data Mining
Lecture 01 Data MiningLecture 01 Data Mining
Lecture 01 Data Mining
 
Data Mining Overview
Data Mining OverviewData Mining Overview
Data Mining Overview
 
Decision trees
Decision treesDecision trees
Decision trees
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
 
Decision tree example problem
Decision tree example problemDecision tree example problem
Decision tree example problem
 
Decision Trees
Decision TreesDecision Trees
Decision Trees
 
Ch 1 Intro to Data Mining
Ch 1 Intro to Data MiningCh 1 Intro to Data Mining
Ch 1 Intro to Data Mining
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
 

Ähnlich wie Introduction to data mining technique

Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
Bala Iyer
 
Explainability for Natural Language Processing
Explainability for Natural Language ProcessingExplainability for Natural Language Processing
Explainability for Natural Language Processing
Yunyao Li
 

Ähnlich wie Introduction to data mining technique (20)

A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
 
Delivering Value Through Business Analytics
Delivering Value Through Business AnalyticsDelivering Value Through Business Analytics
Delivering Value Through Business Analytics
 
Bootstrap Big Data Webinar
Bootstrap Big Data WebinarBootstrap Big Data Webinar
Bootstrap Big Data Webinar
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
H2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.ioH2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.io
 
Data Science for Marketing
Data Science for MarketingData Science for Marketing
Data Science for Marketing
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
 
Operationalizing Customer Analytics with Azure and Power BI
Operationalizing Customer Analytics with Azure and Power BIOperationalizing Customer Analytics with Azure and Power BI
Operationalizing Customer Analytics with Azure and Power BI
 
Social Media Analytics
Social Media AnalyticsSocial Media Analytics
Social Media Analytics
 
Smart Data Module 2 d drive_own data
Smart Data Module 2 d drive_own dataSmart Data Module 2 d drive_own data
Smart Data Module 2 d drive_own data
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Data Mining
Data MiningData Mining
Data Mining
 
Riding the wave of analytics revolution
Riding the wave of analytics revolutionRiding the wave of analytics revolution
Riding the wave of analytics revolution
 
data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptx
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
 
Explainability for Natural Language Processing
Explainability for Natural Language ProcessingExplainability for Natural Language Processing
Explainability for Natural Language Processing
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
Big data - The next best thing
Big data - The next best thingBig data - The next best thing
Big data - The next best thing
 
Predictive Analytics in Practice - Breakfast Club 11th May 2017
Predictive Analytics in Practice - Breakfast Club 11th May 2017Predictive Analytics in Practice - Breakfast Club 11th May 2017
Predictive Analytics in Practice - Breakfast Club 11th May 2017
 

Mehr von Pawneshwar Datt Rai

Mehr von Pawneshwar Datt Rai (15)

Venture capital in india
Venture capital in indiaVenture capital in india
Venture capital in india
 
Service support process ppt
Service support process pptService support process ppt
Service support process ppt
 
Online payment
Online paymentOnline payment
Online payment
 
Quick Learning or Effective Learning
Quick Learning or Effective LearningQuick Learning or Effective Learning
Quick Learning or Effective Learning
 
Latter Writting
Latter WrittingLatter Writting
Latter Writting
 
Geometric series
Geometric seriesGeometric series
Geometric series
 
Facebook marketing
Facebook marketingFacebook marketing
Facebook marketing
 
Big data
Big dataBig data
Big data
 
Welcome All Of You
Welcome  All  Of  YouWelcome  All  Of  You
Welcome All Of You
 
Service Support Process PPT
Service Support Process PPTService Support Process PPT
Service Support Process PPT
 
Geometric Series
Geometric SeriesGeometric Series
Geometric Series
 
Big Data
Big DataBig Data
Big Data
 
Presentation on a Book
Presentation  on a BookPresentation  on a Book
Presentation on a Book
 
Learning
LearningLearning
Learning
 
Online Payment
Online PaymentOnline Payment
Online Payment
 

Kürzlich hochgeladen

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
cnajjemba
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
ptikerjasaptiker
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
q6pzkpark
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
vexqp
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 

Kürzlich hochgeladen (20)

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
SR-101-01012024-EN.docx  Federal Constitution  of the Swiss ConfederationSR-101-01012024-EN.docx  Federal Constitution  of the Swiss Confederation
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 

Introduction to data mining technique

  • 1. INTRODUCTION TO DATA MINING TECHNIQUE By – Pawneshwar Datt Rai
  • 2. WHAT IS DATA MINING?  Data mining is also called knowledge discovery and data mining (KDD)  Data mining is  extraction of useful patterns from data sources, e.g., databases, texts, web, image.  Patterns must be:  valid, novel, potentially useful, understandable This PPT presented By - Pawneshwar Datt Rai
  • 3. EXAMPLE OF DISCOVERED PATTERNS  Association rules: “80% of customers who buy cheese and milk also buy bread, and 5% of customers buy all of them together” Cheese, Milk Bread [sup =5%, confid=80%] This PPT presented By - Pawneshwar Datt Rai
  • 4. MAIN DATA MINING TASKS  Classification: mining patterns that can classify future data into known classes.  Association rule mining mining any rule of the form X  Y, where X and Y are sets of data items.  Clustering identifying a set of similarity groups in the data This PPT presented By - Pawneshwar Datt Rai
  • 5. MAIN DATA MINING TASKS  Sequential pattern mining: A sequential rule: A B, says that event A will be immediately followed by event B with a certain confidence  Deviation detection: discovering the most significant changes in data  Data visualization: using graphical methods to show patterns in data. This PPT presented By - Pawneshwar Datt Rai
  • 6. WHY IS DATA MINING IMPORTANT?  Rapid computerization of businesses produce huge amount of data  How to make best use of data?  A growing realization: knowledge discovered from data can be used for competitive advantage. This PPT presented By - Pawneshwar Datt Rai
  • 7. WHY IS DATA MINING NECESSARY?  Make use of your data assets  There is a big gap from stored data to knowledge; and the transition won’t occur automatically.  Many interesting things you want to find cannot be found using database queries “find me people likely to buy my products” “Who are likely to respond to my promotion” This PPT presented By - Pawneshwar Datt Rai
  • 8. WHY DATA MINING NOW?  The data is abundant.  The data is being warehoused.  The computing power is affordable.  The competitive pressure is strong.  Data mining tools have become available This PPT presented By - Pawneshwar Datt Rai
  • 9. RELATED FIELDS  Data mining is an emerging multi-disciplinary field: Statistics Machine learning Databases Information retrieval Visualization etc. This PPT presented By - Pawneshwar Datt Rai
  • 10. DATA MINING (KDD) PROCESS  Understand the application domain  Identify data sources and select target data  Pre-process: cleaning, attribute selection  Data mining to extract patterns or models  Post-process: identifying interesting or useful patterns  Incorporate patterns in real world tasks This PPT presented By - Pawneshwar Datt Rai
  • 11. DATA MINING APPLICATIONS  Marketing, customer profiling and retention, identifying potential customers, market segmentation.  Fraud detection identifying credit card fraud, intrusion detection  Scientific data analysis  Text and web mining  Any application that involves a large amount of data. This PPT presented By - Pawneshwar Datt Rai
  • 12. WEB DATA EXTRACTION Data region1 Data region2 A data record A data record This PPT presented By - Pawneshwar Datt Rai
  • 13. OPINION ANALYSIS  Word-of-mouth on the Web  The Web has dramatically changed the way that consumers express their opinions.  One can post reviews of products at merchant sites, Web forums, discussion groups, blogs  Techniques are being developed to exploit these sources.  Benefits of Review Analysis  Potential Customer: No need to read many reviews  Product manufacturer: market intelligence, product benchmarking This PPT presented By - Pawneshwar Datt Rai
  • 14. FEATURE BASED ANALYSIS & SUMMARIZATION  Extracting product features (called Opinion Features) that have been commented on by customers.  Identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative.  Summarizing and comparing results. This PPT presented By - Pawneshwar Datt Rai
  • 15. A Happy and Prosperous day to all friends. This PPT presented By – Pawneshwar Datt Rai ThisPPTpresentedBy-PawneshwarDattRai