SlideShare ist ein Scribd-Unternehmen logo
1 von 32
AGILE DATA MINING 
WITH DATA VAULT 2.0 
Timo Cirkel, Michael Olschimke 
Dörffler & Partner GmbH
Introduction 
Background 
Example 
Conclusion 
AGENDA 
Agile 12.02.2014 Data Mining with Data Vault 2.0 2
INTRODUCTION 
Agile Data Mining with DataVault 2.0 
Agile 12.02.2014 Data Mining with Data Vault 2.0 3
TIMO CIRKEL 
BI-Consultant 
Certified Data Vault 2.0 Practitioner 
Analysis Of Policyholders 
Specialized inCRM, Software Development, 
DWHAutomation 
Industries: Insurance, Energy 
B. Sc. Business Informatics 
12.02.2014 Agile Data Mining with Data Vault 2.0 4
MICHAEL OLSCHIMKE 
Senior BI-Consultant 
Certified Data Vault 2.0 Practitioner 
Official Data Vault 2.0 Trainer in Europe 
AssociateTeacher University of Hannover 
Specializing in Data Vault 2.0, Data Mining, 
CRM, project management 
Industries: Insurance, Automotive, Retail, 
Public Sector, Non-Profits 
12.02.2014 Agile Data Mining with Data Vault 2.0 5
• Medium-sized consulting firm 
• Official Partner of Dan Linstedt In 
Europe 
• Consulting, Training, 
Implementation 
• Industries: 
• Insurance 
• Automotive 
• Banks 
• Trade 
• Pharmaceuticals 
• Telecommunications 
DÖRFFLER & PARTNER GMBH 
12.02.2014 Agile Data Mining With Data Vault 2.0 6
BACKGROUND 
Agile Data Mining with DataVault 2.0 
Agile 12.02.2014 Data Mining with Data Vault 2.0 7
DATA MINING PROJECT IN THE VGH 
Motor insurance 
Customer segmentation 
A first datamining pilot, therefore: 
No specific requirements 
Vision is developed during project 
Agile Project Methodology 
Close co-operation with business 
12.02.2014 Agile Data Mining with Data Vault 2.0 8
• Extracting 
information from 
existing data and 
Patterns 
• Four (large) 
categories: 
• Segmentation 
• Classification 
• Prediction 
• Association 
• Wide range of 
available algorithms 
and methods 
DATA MINING PROJECTS 
"The term Data Mining ... describes 
the extraction implicitly existing, 
non-trivial and useful knowledge 
from large, dynamic, relatively 
complex structured data." 
Datenbank 
Anwendung 
Anwender 
Data-Mining- 
Techniken 
Aussagen, Regeln & 
Informationen 
Data Dictionary 
Fachwissen 
12.02.2014 Agile Data Mining with Data Vault 2.0 9
DATA VAULT 2.0 MODELING 
Surrogate 
Key 
Business 
Keys 
Foreign Keys 
Descriptors 
In accordance with its own representation Linstedt, 2014 
12.02.2014 Agile Data Mining with Data Vault 2.0 10
DATA VAULT 2.0 METHODOLOGY 
Data Vault 
2.0 
Methodology 
Six 
Sigma 
TQM 
Scrum CMMI 
PMP 
SDLC 
12.02.2014 Agile Data Mining with Data Vault 2.0 11
DATA VAULT 2.0 METHODOLOGY FOR DATA MINING 
Advantages 
• Agile project management for DWH projects 
• Automation and generation 
• Rapid adoption to changes in the model 
• Incremental build-out = incremental cost control 
• Targeted delivery = two week sprints 
• Predictable and measurable results 
Disadvantages 
• Focus on loading of raw data and the production 
of information 
• Not many data mining references 
• Many concepts in the methodology are not 
applicable for data mining projects 
• Difficult scaling of team sizes in data mining 
projects 
12.02.2014 Agile Data Mining with Data Vault 2.0 12
CRISP-DM 
Own Representation in accordance with Chapman, et al. , 2000 
12.02.2014 Agile Data Mining with Data Vault 2.0 13
PROCESS MODEL 
Prozessmodell – VGH Kundensegmentierung 
ivv KTC D & P 
Daten in Data Vault 
Modell speichern 
Daten abziehen 
Algorithmus 
auswählen 
Segmentierung 
ausführen 
Ergebnis erzielt? 
Ja 
Ergebnis 
präsentieren 
Ergebnis ok? 
Ende 
Ja 
Start 
Gütefunktion 
erarbeiten 
SQL-Query erstellen 
Relevante VN-Attribute 
ermitteln 
Nein Formel ok? 
Ja 
Nein 
Algorithmen 
erforschen 
Nein 
Geeigneter 
Algorithmus 
gefunden? 
Ja 
Nein 
12.02.2014 Agile Data Mining with Data Vault 2.0 14
RAPIDMINER 
 Java-based 
data 
mining 
software 
 One of 
the most 
widely used 
data mining 
tools 
 Offers 
 Environment fo 
r control flow 
 Large number 
of algorithms 
 Large choice 
of data sources 
Overall CorporaTE Consultants Academics NGO / GOV'T 
© 2012 Rexer AnalYTICS 
12.02.2014 Agile Data Mining with Data Vault 2.0 15
EXAMPLE 
Agile Data Mining with DataVault 2.0 
Agile 12.02.2014 Data Mining with Data Vault 2.0 16
EXAMPLE 
 AdventureWorks-Database 
 Scenario: 
 Advertising campaign for a new bike 
 Identification of the target group 
 Solution: 
 Decision Tree 
 Identify relevant attributes in several iterations 
Lachev, 2005, p. 238ff 
Simple 
Example 
12.02.2014 Agile Data Mining with Data Vault 2.0 17
Agile Data Mining with Data Vault 2.0 18 
10066 Records 
Attribute 
Marital 
Status 
Gender 
Yearly 
Income 
Total 
Children 
Education 
Number Cars 
Owned 
Commute 
Distance 
Occupation 
House Owner 
Flag 
Age
ITERATION 1: DATA VAULT 2.0 MODEL 
English 
Education 
Numbers Cars 
Owned 
Gender 
Marital Status 
Sat 
Customer 
Hub 
Customer 
Customer Key 
Commute 
Distance 
Age 
House Owner 
Flag 
English 
Occupation 
Sat Category 
Product 
Category 
12.02.2014 Agile Data Mining with Data Vault 2.0 19
ITERATION 1: RAPIDMINER PROCESS 
Data Gathering 
Data preparation 
Modeling 
12.02.2014 Agile Data Mining with Data Vault 2.0 20
ITERATION 1: DECISIONTREE MODEL 
12.02.2014 Agile Data Mining with Data Vault 2.0 21
ITERATION 1: RESULTS 
12.02.2014 Agile Data Mining with Data Vault 2.0 22
ITERATION 2: DATA VAULT 2.0 MODEL 
English 
Education 
Numbers Cars 
Owned 
Gender 
Marital Status 
Sat 
Customer 
Hub 
Customer 
Sat Customer 
Income 
Customer Key 
Commute 
Distance 
Age 
House Owner 
Flag 
English 
Occupation 
Sat Customer 
Children 
Sat Category 
Total 
Children 
Yearly 
Income 
Product 
Category 
12.02.2014 Agile Data Mining with Data Vault 2.0 23
ITERATION 2: RAPIDMINER PROCESS 
Data Gathering 
Preparation Modeling 
12.02.2014 Agile Data Mining with Data Vault 2.0 24
ITERATION 2: RESULTS 
+4.01% 
12.02.2014 Agile Data Mining with Data Vault 2.0 25
ITERATION 3: DATA VAULT 2.0 MODEL 
English 
Education 
Numbers Cars 
Owned 
Gender 
Marital Status 
Sat 
Customer 
Hub 
Customer 
Sat Customer 
Income 
Customer Key 
Commute 
Distance 
Age 
House Owner 
Flag 
English 
Occupation 
Sat Customer 
Children 
Sat Category 
Total 
Children 
Yearly 
Income 
Product 
Category 
Commute 
Distance Miles 
CSat Customer 
Distance 
12.02.2014 Agile Data Mining with Data Vault 2.0 26
ITERATION 3: RAPIDMINER PROCESS 
Data Gathering 
Preparation Modeling 
12.02.2014 Agile Data Mining with Data Vault 2.0 27
ITERATION 3: RESULTS 
+0.12% 
12.02.2014 Agile Data Mining with Data Vault 2.0 28
CONCLUSIONS 
Agile Data Mining with DataVault 2.0 
Agile 12.02.2014 Data Mining with Data Vault 2.0 29
CONCLUSIONS 
 Data Vault is a flexible data 
model, with good support for agile project 
methodology 
 DataVault is not an additional hurdle in data mining 
projects 
 Additional attributes can be added at any time during 
the project, in an incremental fashion 
Business Vault: transparent data processing 
12.02.2014 Agile Data Mining with Data Vault 2.0 30
FURTHER INFORMATION 
Appears 
2015 
Available 
Www.doerffler.com WWW.datavault.de Www.learndatavault.com 
Appears 
2015 
12.02.2014 Agile Data Mining with Data Vault 2.0 31
Give us feedback 
Agile Data Mining with Data Vault 2.0 32 
Http://goo.gl/LGO4ze 
Source:Vasilijonline.com 
12.02.2014

Weitere ähnliche Inhalte

Was ist angesagt?

Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationDenodo
 
Data Architecture Brief Overview
Data Architecture Brief OverviewData Architecture Brief Overview
Data Architecture Brief OverviewHal Kalechofsky
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Essential Reference and Master Data Management
Essential Reference and Master Data ManagementEssential Reference and Master Data Management
Essential Reference and Master Data ManagementDATAVERSITY
 
Master Data Management methodology
Master Data Management methodologyMaster Data Management methodology
Master Data Management methodologyDatabase Architechs
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingDaniel Upton
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault ModelingKent Graziano
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeKent Graziano
 
How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model DATUM LLC
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
Sample - Data Warehouse Requirements
Sample -  Data Warehouse RequirementsSample -  Data Warehouse Requirements
Sample - Data Warehouse RequirementsDavid Walker
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesDATAVERSITY
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for DinnerKent Graziano
 
Slides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data GovernanceSlides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data GovernanceDATAVERSITY
 
Accelerate and modernize your data pipelines
Accelerate and modernize your data pipelinesAccelerate and modernize your data pipelines
Accelerate and modernize your data pipelinesPaul Van Siclen
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...HostedbyConfluent
 

Was ist angesagt? (20)

Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 
Data Architecture Brief Overview
Data Architecture Brief OverviewData Architecture Brief Overview
Data Architecture Brief Overview
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Essential Reference and Master Data Management
Essential Reference and Master Data ManagementEssential Reference and Master Data Management
Essential Reference and Master Data Management
 
Master Data Management methodology
Master Data Management methodologyMaster Data Management methodology
Master Data Management methodology
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and Modelstorming
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Sample - Data Warehouse Requirements
Sample -  Data Warehouse RequirementsSample -  Data Warehouse Requirements
Sample - Data Warehouse Requirements
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Slides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data GovernanceSlides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data Governance
 
Accelerate and modernize your data pipelines
Accelerate and modernize your data pipelinesAccelerate and modernize your data pipelines
Accelerate and modernize your data pipelines
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
 

Ähnlich wie Agile Data Mining with Data Vault 2.0 Techniques

Building Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New NormalBuilding Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New NormalDenodo
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Yellowfin
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Denodo
 
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...IngridBuenaventura
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)Denodo
 
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)Denodo
 
Slides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudSlides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudDATAVERSITY
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB
 
Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)Denodo
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Denodo
 
Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Arcadia Data
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItDenodo
 
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL DatabaseNuoDB
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Denodo
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH ModernizationTrivadis
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 

Ähnlich wie Agile Data Mining with Data Vault 2.0 Techniques (20)

Building Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New NormalBuilding Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New Normal
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
 
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)
 
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
 
Slides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudSlides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-Cloud
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
 
Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need It
 
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH Modernization
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 

Mehr von Michael Olschimke

Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0Michael Olschimke
 
Introduction to Salesforce CRM Reporting
Introduction to Salesforce CRM ReportingIntroduction to Salesforce CRM Reporting
Introduction to Salesforce CRM ReportingMichael Olschimke
 
Introduction to Google Analytics
Introduction to Google AnalyticsIntroduction to Google Analytics
Introduction to Google AnalyticsMichael Olschimke
 
Business Concepts for Mobile Applications
Business Concepts for Mobile ApplicationsBusiness Concepts for Mobile Applications
Business Concepts for Mobile ApplicationsMichael Olschimke
 
Technology Concepts for Mobile Applications
Technology Concepts for Mobile ApplicationsTechnology Concepts for Mobile Applications
Technology Concepts for Mobile ApplicationsMichael Olschimke
 
Ethische Entscheidungskompetenz
Ethische EntscheidungskompetenzEthische Entscheidungskompetenz
Ethische EntscheidungskompetenzMichael Olschimke
 

Mehr von Michael Olschimke (9)

Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0
 
Introduction to Salesforce CRM Reporting
Introduction to Salesforce CRM ReportingIntroduction to Salesforce CRM Reporting
Introduction to Salesforce CRM Reporting
 
Introduction to Google Analytics
Introduction to Google AnalyticsIntroduction to Google Analytics
Introduction to Google Analytics
 
Visual Data Vault
Visual Data VaultVisual Data Vault
Visual Data Vault
 
Introduction to Piwik
Introduction to PiwikIntroduction to Piwik
Introduction to Piwik
 
Business Concepts for Mobile Applications
Business Concepts for Mobile ApplicationsBusiness Concepts for Mobile Applications
Business Concepts for Mobile Applications
 
Technology Concepts for Mobile Applications
Technology Concepts for Mobile ApplicationsTechnology Concepts for Mobile Applications
Technology Concepts for Mobile Applications
 
Ethische Entscheidungskompetenz
Ethische EntscheidungskompetenzEthische Entscheidungskompetenz
Ethische Entscheidungskompetenz
 
Data Modeling Zone 2013
Data Modeling Zone 2013Data Modeling Zone 2013
Data Modeling Zone 2013
 

Kürzlich hochgeladen

Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 

Kürzlich hochgeladen (20)

Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 

Agile Data Mining with Data Vault 2.0 Techniques

  • 1. AGILE DATA MINING WITH DATA VAULT 2.0 Timo Cirkel, Michael Olschimke Dörffler & Partner GmbH
  • 2. Introduction Background Example Conclusion AGENDA Agile 12.02.2014 Data Mining with Data Vault 2.0 2
  • 3. INTRODUCTION Agile Data Mining with DataVault 2.0 Agile 12.02.2014 Data Mining with Data Vault 2.0 3
  • 4. TIMO CIRKEL BI-Consultant Certified Data Vault 2.0 Practitioner Analysis Of Policyholders Specialized inCRM, Software Development, DWHAutomation Industries: Insurance, Energy B. Sc. Business Informatics 12.02.2014 Agile Data Mining with Data Vault 2.0 4
  • 5. MICHAEL OLSCHIMKE Senior BI-Consultant Certified Data Vault 2.0 Practitioner Official Data Vault 2.0 Trainer in Europe AssociateTeacher University of Hannover Specializing in Data Vault 2.0, Data Mining, CRM, project management Industries: Insurance, Automotive, Retail, Public Sector, Non-Profits 12.02.2014 Agile Data Mining with Data Vault 2.0 5
  • 6. • Medium-sized consulting firm • Official Partner of Dan Linstedt In Europe • Consulting, Training, Implementation • Industries: • Insurance • Automotive • Banks • Trade • Pharmaceuticals • Telecommunications DÖRFFLER & PARTNER GMBH 12.02.2014 Agile Data Mining With Data Vault 2.0 6
  • 7. BACKGROUND Agile Data Mining with DataVault 2.0 Agile 12.02.2014 Data Mining with Data Vault 2.0 7
  • 8. DATA MINING PROJECT IN THE VGH Motor insurance Customer segmentation A first datamining pilot, therefore: No specific requirements Vision is developed during project Agile Project Methodology Close co-operation with business 12.02.2014 Agile Data Mining with Data Vault 2.0 8
  • 9. • Extracting information from existing data and Patterns • Four (large) categories: • Segmentation • Classification • Prediction • Association • Wide range of available algorithms and methods DATA MINING PROJECTS "The term Data Mining ... describes the extraction implicitly existing, non-trivial and useful knowledge from large, dynamic, relatively complex structured data." Datenbank Anwendung Anwender Data-Mining- Techniken Aussagen, Regeln & Informationen Data Dictionary Fachwissen 12.02.2014 Agile Data Mining with Data Vault 2.0 9
  • 10. DATA VAULT 2.0 MODELING Surrogate Key Business Keys Foreign Keys Descriptors In accordance with its own representation Linstedt, 2014 12.02.2014 Agile Data Mining with Data Vault 2.0 10
  • 11. DATA VAULT 2.0 METHODOLOGY Data Vault 2.0 Methodology Six Sigma TQM Scrum CMMI PMP SDLC 12.02.2014 Agile Data Mining with Data Vault 2.0 11
  • 12. DATA VAULT 2.0 METHODOLOGY FOR DATA MINING Advantages • Agile project management for DWH projects • Automation and generation • Rapid adoption to changes in the model • Incremental build-out = incremental cost control • Targeted delivery = two week sprints • Predictable and measurable results Disadvantages • Focus on loading of raw data and the production of information • Not many data mining references • Many concepts in the methodology are not applicable for data mining projects • Difficult scaling of team sizes in data mining projects 12.02.2014 Agile Data Mining with Data Vault 2.0 12
  • 13. CRISP-DM Own Representation in accordance with Chapman, et al. , 2000 12.02.2014 Agile Data Mining with Data Vault 2.0 13
  • 14. PROCESS MODEL Prozessmodell – VGH Kundensegmentierung ivv KTC D & P Daten in Data Vault Modell speichern Daten abziehen Algorithmus auswählen Segmentierung ausführen Ergebnis erzielt? Ja Ergebnis präsentieren Ergebnis ok? Ende Ja Start Gütefunktion erarbeiten SQL-Query erstellen Relevante VN-Attribute ermitteln Nein Formel ok? Ja Nein Algorithmen erforschen Nein Geeigneter Algorithmus gefunden? Ja Nein 12.02.2014 Agile Data Mining with Data Vault 2.0 14
  • 15. RAPIDMINER  Java-based data mining software  One of the most widely used data mining tools  Offers  Environment fo r control flow  Large number of algorithms  Large choice of data sources Overall CorporaTE Consultants Academics NGO / GOV'T © 2012 Rexer AnalYTICS 12.02.2014 Agile Data Mining with Data Vault 2.0 15
  • 16. EXAMPLE Agile Data Mining with DataVault 2.0 Agile 12.02.2014 Data Mining with Data Vault 2.0 16
  • 17. EXAMPLE  AdventureWorks-Database  Scenario:  Advertising campaign for a new bike  Identification of the target group  Solution:  Decision Tree  Identify relevant attributes in several iterations Lachev, 2005, p. 238ff Simple Example 12.02.2014 Agile Data Mining with Data Vault 2.0 17
  • 18. Agile Data Mining with Data Vault 2.0 18 10066 Records Attribute Marital Status Gender Yearly Income Total Children Education Number Cars Owned Commute Distance Occupation House Owner Flag Age
  • 19. ITERATION 1: DATA VAULT 2.0 MODEL English Education Numbers Cars Owned Gender Marital Status Sat Customer Hub Customer Customer Key Commute Distance Age House Owner Flag English Occupation Sat Category Product Category 12.02.2014 Agile Data Mining with Data Vault 2.0 19
  • 20. ITERATION 1: RAPIDMINER PROCESS Data Gathering Data preparation Modeling 12.02.2014 Agile Data Mining with Data Vault 2.0 20
  • 21. ITERATION 1: DECISIONTREE MODEL 12.02.2014 Agile Data Mining with Data Vault 2.0 21
  • 22. ITERATION 1: RESULTS 12.02.2014 Agile Data Mining with Data Vault 2.0 22
  • 23. ITERATION 2: DATA VAULT 2.0 MODEL English Education Numbers Cars Owned Gender Marital Status Sat Customer Hub Customer Sat Customer Income Customer Key Commute Distance Age House Owner Flag English Occupation Sat Customer Children Sat Category Total Children Yearly Income Product Category 12.02.2014 Agile Data Mining with Data Vault 2.0 23
  • 24. ITERATION 2: RAPIDMINER PROCESS Data Gathering Preparation Modeling 12.02.2014 Agile Data Mining with Data Vault 2.0 24
  • 25. ITERATION 2: RESULTS +4.01% 12.02.2014 Agile Data Mining with Data Vault 2.0 25
  • 26. ITERATION 3: DATA VAULT 2.0 MODEL English Education Numbers Cars Owned Gender Marital Status Sat Customer Hub Customer Sat Customer Income Customer Key Commute Distance Age House Owner Flag English Occupation Sat Customer Children Sat Category Total Children Yearly Income Product Category Commute Distance Miles CSat Customer Distance 12.02.2014 Agile Data Mining with Data Vault 2.0 26
  • 27. ITERATION 3: RAPIDMINER PROCESS Data Gathering Preparation Modeling 12.02.2014 Agile Data Mining with Data Vault 2.0 27
  • 28. ITERATION 3: RESULTS +0.12% 12.02.2014 Agile Data Mining with Data Vault 2.0 28
  • 29. CONCLUSIONS Agile Data Mining with DataVault 2.0 Agile 12.02.2014 Data Mining with Data Vault 2.0 29
  • 30. CONCLUSIONS  Data Vault is a flexible data model, with good support for agile project methodology  DataVault is not an additional hurdle in data mining projects  Additional attributes can be added at any time during the project, in an incremental fashion Business Vault: transparent data processing 12.02.2014 Agile Data Mining with Data Vault 2.0 30
  • 31. FURTHER INFORMATION Appears 2015 Available Www.doerffler.com WWW.datavault.de Www.learndatavault.com Appears 2015 12.02.2014 Agile Data Mining with Data Vault 2.0 31
  • 32. Give us feedback Agile Data Mining with Data Vault 2.0 32 Http://goo.gl/LGO4ze Source:Vasilijonline.com 12.02.2014

Hinweis der Redaktion

  1. In This Slides Only The logos Replace. To Try it out New Design /Discuss Have We No Time
  2. Short On the DM Project In The VGH Comment. On the BI Spectrum Article Point out Objectives The Project Used Tools. Crisp-DM Used. Etc. GGF. For more Slides Open Name The insurance? No specific requirements Attributes evolve over time "Customer" does not exactly define first Only private clients or companies? Policyholders or vehicle owners? What kinds of contracts? How are "good" customers?
  3. Hubs, Left, Satellite Short Explains With VDV. Take a look at In the Folder Sources, There Can You You Use.
  4. We can no data and Findings of the VGH present Therefore to avoid AdventureWorks Setup took over from book
  5. Short On Adenture Works DW Comment Background Information Model of the Relevant Tables 25 Attributes, 500k Records
  6. On the First DV model Comment.
  7. Demo in Rapidminer Also On Measures Comment (Accuracy, Or Precision/recall).  On Best Graphically In Rm Represent.
  8. Scatter Matrix Confusion matrix (performance matrix).
  9. On the Changes The DV Model Comment. Show As The Then Looks like.  Changes Comprehensible Make (On Animations)
  10. Demo in Rapidminer Also On Measures Comment (Accuracy, Or Precision/recall).  On Best Graphically In Rm Represent.
  11. On the Changes The DV Model Comment. Show As The Then Looks like.  Changes Comprehensible Make (On Animations)
  12. Demo in Rapidminer Also On Measures Comment (Accuracy, Or Precision/recall).  On Best Graphically In Rm Represent.
  13. What Are The Benefits From Approach? Reference The VGH Project Take, But Also On the demo
  14. TBC: Link Revise (Make I)