SlideShare a Scribd company logo
1 of 47
Data Mining
In SSAS
Ram Kedem
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
What is Data Mining ?
• the computational process of discovering patterns in large
data sets involving methods at the intersection of artificial
intelligence, machine learning, statistics, and database
systems.
• The overall goal of the data mining process is to extract
information from a data set and transform it into an
understandable structure for further use.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Mining Process
• Cross-Industry Standard Process for Data Mining (CRISP-DM)
consists of six cyclic phases :
• Business Understanding
• Data Understanding
• Data Preparation
• Modeling
• Evaluation
• Deployment
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Mining Algorithms
Covered In This Lesson
• Decision Trees
• Clustering
• Time Series
• Association rules
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Decision Trees
• Decision Trees
• a tree-like graph or model of decisions and their possible
consequences, including chance event outcomes, resource costs
• Very common, used for classification.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Clustering
• Clustering
• grouping a set of objects in such a way that objects in the same
group (called a cluster) are more similar (in some sense or
another) to each other than to those in other groups (clusters)
• Widely used in areas of fraud detection
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Validation Sample
• http://www.sqlserverdatamining.com/ssdm/
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Data Validation Sample
• http://www.sqlserverdatamining.com/DataValidation/default.
aspx
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Time Series
• Time Series
• A time series is a collection of observations made chronologically,
Used to predict a number of some sort.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Association rules
• Association rules
• Association rules are if/then statements that help uncover
relationships between seemingly unrelated data in a relational
database or other information repository.
Create New Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Define your Data Source
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Define your DSV
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create new mining structure
• Right click on Mining Structures and choose New Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select the Definition Method
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create the Mining Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select Data Source View
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Table Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Training Data
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Suggest Related Columns
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Columns Content & Data Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create Testing Set
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Completing the Wizard
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
View Mining Model
• Deploy and Process the Mining Model.
• View the Mining Model in Mining Models Tab
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Adding Mining Models
• It is possible to add more mining models by right-clicking at
the Mining models area and choosing New Mining Model.
• Deploy and Process once finished.
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Viewer
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Dependency Network
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Accuracy Chart - Input
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Lift Chart
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Profit Chart
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Profit Chart
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Prediction
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Prediction
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Query Results View
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Order the Data Differently
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Singleton Query
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Singleton Query
Time Series Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Time Series Mining Structure
• Right click on Mining Structures and choose New Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select Definition Method
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Create the Data Mining
Structure
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Select DSV
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Table Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Training Data
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Specify Columns Content and
Data Types
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Complete the Wizard
Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent.
ramkedem.com
Mining Model Viewer

More Related Content

What's hot

Data Warehouse Design Considerations
Data Warehouse Design ConsiderationsData Warehouse Design Considerations
Data Warehouse Design ConsiderationsRam Kedem
 
Introduction to Databases
Introduction to DatabasesIntroduction to Databases
Introduction to DatabasesRam Kedem
 
SSRS Basic Parameters
SSRS Basic ParametersSSRS Basic Parameters
SSRS Basic ParametersRam Kedem
 
Data mining In SSAS
Data mining In SSASData mining In SSAS
Data mining In SSASRam Kedem
 
SSRS Conditional Formatting
SSRS Conditional FormattingSSRS Conditional Formatting
SSRS Conditional FormattingRam Kedem
 
SSRS Calculated Fields
SSRS Calculated FieldsSSRS Calculated Fields
SSRS Calculated FieldsRam Kedem
 
Bliblidotcom - SASS Introduction
Bliblidotcom - SASS IntroductionBliblidotcom - SASS Introduction
Bliblidotcom - SASS IntroductionIrfan Maulana
 
Html tag presentation
Html tag presentationHtml tag presentation
Html tag presentationkfalgout
 

What's hot (10)

Data Warehouse Design Considerations
Data Warehouse Design ConsiderationsData Warehouse Design Considerations
Data Warehouse Design Considerations
 
Introduction to Databases
Introduction to DatabasesIntroduction to Databases
Introduction to Databases
 
SSRS Basic Parameters
SSRS Basic ParametersSSRS Basic Parameters
SSRS Basic Parameters
 
SSRS Matrix
SSRS MatrixSSRS Matrix
SSRS Matrix
 
Data mining In SSAS
Data mining In SSASData mining In SSAS
Data mining In SSAS
 
SSRS Gauges
SSRS GaugesSSRS Gauges
SSRS Gauges
 
SSRS Conditional Formatting
SSRS Conditional FormattingSSRS Conditional Formatting
SSRS Conditional Formatting
 
SSRS Calculated Fields
SSRS Calculated FieldsSSRS Calculated Fields
SSRS Calculated Fields
 
Bliblidotcom - SASS Introduction
Bliblidotcom - SASS IntroductionBliblidotcom - SASS Introduction
Bliblidotcom - SASS Introduction
 
Html tag presentation
Html tag presentationHtml tag presentation
Html tag presentation
 

Similar to Data Mining in SSAS with Decision Trees, Clustering & Time Series

SSIS Data Flow Tasks
SSIS Data Flow Tasks SSIS Data Flow Tasks
SSIS Data Flow Tasks Ram Kedem
 
Data Warehouse Basics
Data Warehouse BasicsData Warehouse Basics
Data Warehouse BasicsRam Kedem
 
Deploy SSRS Project - SQL Server 2014
Deploy SSRS Project - SQL Server 2014Deploy SSRS Project - SQL Server 2014
Deploy SSRS Project - SQL Server 2014Ram Kedem
 
CiviCRM for Government NAI
CiviCRM for Government NAICiviCRM for Government NAI
CiviCRM for Government NAInuams
 
Open source Cloud Automation Platform
Open source Cloud Automation PlatformOpen source Cloud Automation Platform
Open source Cloud Automation PlatformKishore Neelamegam
 
Microservices for Architects - Atlanta 2018-03-28
Microservices for Architects - Atlanta 2018-03-28Microservices for Architects - Atlanta 2018-03-28
Microservices for Architects - Atlanta 2018-03-28Derek Ashmore
 
How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...
How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...
How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...Amazon Web Services
 
SolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBA
SolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBASolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBA
SolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBARazorleaf Corporation
 
Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014Barbara Aichinger
 
Microservices for architects los angeles-2016-07-16
Microservices for architects los angeles-2016-07-16Microservices for architects los angeles-2016-07-16
Microservices for architects los angeles-2016-07-16Derek Ashmore
 
From Monolithic to Modern Apps: Best Practices
From Monolithic to Modern Apps: Best PracticesFrom Monolithic to Modern Apps: Best Practices
From Monolithic to Modern Apps: Best PracticesTom Laszewski
 
Should you be hosting your DAM software on-premise or in the cloud?
Should you be hosting your DAM software on-premise or in the cloud?Should you be hosting your DAM software on-premise or in the cloud?
Should you be hosting your DAM software on-premise or in the cloud?ResourceSpace
 
Comparing JVM Web Frameworks - February 2014
Comparing JVM Web Frameworks - February 2014Comparing JVM Web Frameworks - February 2014
Comparing JVM Web Frameworks - February 2014Matt Raible
 
Scribe insight 03 elevating performance
Scribe insight 03   elevating performanceScribe insight 03   elevating performance
Scribe insight 03 elevating performanceScribe Software Corp.
 
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...Amazon Web Services
 
The Big 3: Top Mistakes to Avoid on Your Lead Gen Landing Pages
The Big 3: Top Mistakes to Avoid on Your Lead Gen Landing PagesThe Big 3: Top Mistakes to Avoid on Your Lead Gen Landing Pages
The Big 3: Top Mistakes to Avoid on Your Lead Gen Landing PagesMarketo
 

Similar to Data Mining in SSAS with Decision Trees, Clustering & Time Series (20)

SSIS Data Flow Tasks
SSIS Data Flow Tasks SSIS Data Flow Tasks
SSIS Data Flow Tasks
 
Data Warehouse Basics
Data Warehouse BasicsData Warehouse Basics
Data Warehouse Basics
 
SSRS Groups
SSRS GroupsSSRS Groups
SSRS Groups
 
Deploy SSRS Project - SQL Server 2014
Deploy SSRS Project - SQL Server 2014Deploy SSRS Project - SQL Server 2014
Deploy SSRS Project - SQL Server 2014
 
CiviCRM for Government NAI
CiviCRM for Government NAICiviCRM for Government NAI
CiviCRM for Government NAI
 
Open source Cloud Automation Platform
Open source Cloud Automation PlatformOpen source Cloud Automation Platform
Open source Cloud Automation Platform
 
Microservices for Architects - Atlanta 2018-03-28
Microservices for Architects - Atlanta 2018-03-28Microservices for Architects - Atlanta 2018-03-28
Microservices for Architects - Atlanta 2018-03-28
 
How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...
How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...
How to Effectively Plan for Disaster Recovery on AWS (CMP204-S) - AWS re:Inve...
 
SolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBA
SolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBASolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBA
SolidWorks Design Automation Using the SolidWorks API, Microsoft Excel and VBA
 
Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014Barbara_Aichinger_Server_Forum_2014
Barbara_Aichinger_Server_Forum_2014
 
Microservices for architects los angeles-2016-07-16
Microservices for architects los angeles-2016-07-16Microservices for architects los angeles-2016-07-16
Microservices for architects los angeles-2016-07-16
 
From Monolithic to Modern Apps: Best Practices
From Monolithic to Modern Apps: Best PracticesFrom Monolithic to Modern Apps: Best Practices
From Monolithic to Modern Apps: Best Practices
 
Well-Architected Bootcamp
Well-Architected BootcampWell-Architected Bootcamp
Well-Architected Bootcamp
 
Should you be hosting your DAM software on-premise or in the cloud?
Should you be hosting your DAM software on-premise or in the cloud?Should you be hosting your DAM software on-premise or in the cloud?
Should you be hosting your DAM software on-premise or in the cloud?
 
Comparing JVM Web Frameworks - February 2014
Comparing JVM Web Frameworks - February 2014Comparing JVM Web Frameworks - February 2014
Comparing JVM Web Frameworks - February 2014
 
Scribe insight 03 elevating performance
Scribe insight 03   elevating performanceScribe insight 03   elevating performance
Scribe insight 03 elevating performance
 
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
 
SAP License Services by Crayon Software Experts
SAP License Services by Crayon Software ExpertsSAP License Services by Crayon Software Experts
SAP License Services by Crayon Software Experts
 
The Big 3: Top Mistakes to Avoid on Your Lead Gen Landing Pages
The Big 3: Top Mistakes to Avoid on Your Lead Gen Landing PagesThe Big 3: Top Mistakes to Avoid on Your Lead Gen Landing Pages
The Big 3: Top Mistakes to Avoid on Your Lead Gen Landing Pages
 
AWS Well-Architected Workshop
AWS Well-Architected WorkshopAWS Well-Architected Workshop
AWS Well-Architected Workshop
 

More from Ram Kedem

Impala use case @ edge
Impala use case @ edgeImpala use case @ edge
Impala use case @ edgeRam Kedem
 
Advanced SQL Webinar
Advanced SQL WebinarAdvanced SQL Webinar
Advanced SQL WebinarRam Kedem
 
Managing oracle Database Instance
Managing oracle Database InstanceManaging oracle Database Instance
Managing oracle Database InstanceRam Kedem
 
DDL Practice (Hebrew)
DDL Practice (Hebrew)DDL Practice (Hebrew)
DDL Practice (Hebrew)Ram Kedem
 
DML Practice (Hebrew)
DML Practice (Hebrew)DML Practice (Hebrew)
DML Practice (Hebrew)Ram Kedem
 
Exploring Oracle Database Architecture (Hebrew)
Exploring Oracle Database Architecture (Hebrew)Exploring Oracle Database Architecture (Hebrew)
Exploring Oracle Database Architecture (Hebrew)Ram Kedem
 
Introduction to SQL
Introduction to SQLIntroduction to SQL
Introduction to SQLRam Kedem
 
Pig - Processing XML data
Pig - Processing XML dataPig - Processing XML data
Pig - Processing XML dataRam Kedem
 
MSSQL Server - Automation
MSSQL Server - AutomationMSSQL Server - Automation
MSSQL Server - AutomationRam Kedem
 
Lesson 5 security
Lesson 5   securityLesson 5   security
Lesson 5 securityRam Kedem
 

More from Ram Kedem (10)

Impala use case @ edge
Impala use case @ edgeImpala use case @ edge
Impala use case @ edge
 
Advanced SQL Webinar
Advanced SQL WebinarAdvanced SQL Webinar
Advanced SQL Webinar
 
Managing oracle Database Instance
Managing oracle Database InstanceManaging oracle Database Instance
Managing oracle Database Instance
 
DDL Practice (Hebrew)
DDL Practice (Hebrew)DDL Practice (Hebrew)
DDL Practice (Hebrew)
 
DML Practice (Hebrew)
DML Practice (Hebrew)DML Practice (Hebrew)
DML Practice (Hebrew)
 
Exploring Oracle Database Architecture (Hebrew)
Exploring Oracle Database Architecture (Hebrew)Exploring Oracle Database Architecture (Hebrew)
Exploring Oracle Database Architecture (Hebrew)
 
Introduction to SQL
Introduction to SQLIntroduction to SQL
Introduction to SQL
 
Pig - Processing XML data
Pig - Processing XML dataPig - Processing XML data
Pig - Processing XML data
 
MSSQL Server - Automation
MSSQL Server - AutomationMSSQL Server - Automation
MSSQL Server - Automation
 
Lesson 5 security
Lesson 5   securityLesson 5   security
Lesson 5 security
 

Recently uploaded

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 

Data Mining in SSAS with Decision Trees, Clustering & Time Series

  • 2. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com What is Data Mining ? • the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. • The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
  • 3. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Mining Process • Cross-Industry Standard Process for Data Mining (CRISP-DM) consists of six cyclic phases : • Business Understanding • Data Understanding • Data Preparation • Modeling • Evaluation • Deployment
  • 4. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Mining Algorithms Covered In This Lesson • Decision Trees • Clustering • Time Series • Association rules
  • 5. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Decision Trees • Decision Trees • a tree-like graph or model of decisions and their possible consequences, including chance event outcomes, resource costs • Very common, used for classification.
  • 6. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Clustering • Clustering • grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense or another) to each other than to those in other groups (clusters) • Widely used in areas of fraud detection
  • 7. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Validation Sample • http://www.sqlserverdatamining.com/ssdm/
  • 8. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Data Validation Sample • http://www.sqlserverdatamining.com/DataValidation/default. aspx
  • 9. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Time Series • Time Series • A time series is a collection of observations made chronologically, Used to predict a number of some sort.
  • 10. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Association rules • Association rules • Association rules are if/then statements that help uncover relationships between seemingly unrelated data in a relational database or other information repository.
  • 12. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Define your Data Source
  • 13. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Define your DSV
  • 14. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create new mining structure • Right click on Mining Structures and choose New Mining Structure
  • 15. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select the Definition Method
  • 16. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create the Mining Structure
  • 17. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select Data Source View
  • 18. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Table Types
  • 19. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Training Data
  • 20. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Suggest Related Columns
  • 21. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Columns Content & Data Types
  • 22. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create Testing Set
  • 23. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Completing the Wizard
  • 24. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com View Mining Model • Deploy and Process the Mining Model. • View the Mining Model in Mining Models Tab
  • 25. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Adding Mining Models • It is possible to add more mining models by right-clicking at the Mining models area and choosing New Mining Model. • Deploy and Process once finished.
  • 26. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Viewer
  • 27. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Dependency Network
  • 28. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Accuracy Chart - Input
  • 29. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Lift Chart
  • 30. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Profit Chart
  • 31. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Profit Chart
  • 32. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Prediction
  • 33. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Prediction
  • 34. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Query Results View
  • 35. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Order the Data Differently
  • 36. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Singleton Query
  • 37. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Singleton Query
  • 39. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Time Series Mining Structure • Right click on Mining Structures and choose New Mining Structure
  • 40. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select Definition Method
  • 41. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Create the Data Mining Structure
  • 42. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Select DSV
  • 43. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Table Types
  • 44. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Training Data
  • 45. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Specify Columns Content and Data Types
  • 46. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Complete the Wizard
  • 47. Copyright 2014 © Ram Kedem. All rights reserved. Not to be reproduced without written consent. ramkedem.com Mining Model Viewer