SlideShare ist ein Scribd-Unternehmen logo
1 von 10
Using SLE for creation of Data Warehouses
22.11.2015
1
Yvette Teiken
OFFIS Institute for Information Technology,
Escherweg 2, 26121 Oldenburg, Germany
yvette.teiken@offis.de
Problem Description and Motivation I
► Goal of a Data Warehouses:
► Perform complex analysis of all organizational data
► Used for decision support
► Time-variant
► Non-volatile
► Integrated data from different sources and different
formats in one integrated dataset
► Utilization of OLAP paradigm to allow easy analysis
and accessibility
► Addressed Problems in my thesis:
► Efficient creation of domain specific DWH
► Example of use:
► Health Reporting: preparation and presentation of
health relevant issues relating to population
2
22.11.2015
Problem Description and Motivation II
► Problems during DWH creation:
► No standardized process exists
► Documentation by many large documents
► Missing, distributed, inconsistent information
► A lot of schematic work performed during realization
► Many different user roles involved
► Initial build-up is a complex task
► Expected benefits:
► Faster realization of DWH
► Better documentation of whole creation process
► Not so well trained person can realize a DWH
3
22.11.2015
Analysis
organizational data
Define information
demand
Data source
transformation
Multidimensional
model
Data quality
Related Work
► Languages for covering aspects of DWH creation:
► Application Design for Analytical Processing Technologies (ADAPT)
► R2O mapping for relational databases
► InDaQu for Data quality
► MDA and DWA
► Rizzi et. al.: Modelling different aspects of DWHs
► Only deal with a certain aspect, not whole process
► My approach
► Use languages that cover the whole process of DWH creation
► Integrated through a common metamodel
► Deal with multidimensional structures
► Transformations generating large parts of the DWH
► Process model that orders different aspects and connect and refined
4
22.11.2015
Proposed Solution I
► Idea: Describe DWH with SLE techniques, generate
semi-automatic DWH
► Decompose DWH in different aspects, describe each
aspect with a language:
► Aspects:
► Data Sources Schemas: Subject, the
representation, and technical accessibility of sources
► Data Source Transformation: Use existing
languages like R2O
► Analysis Schema: Multidimensional data models,
based on ADAPT
► Measures: Mathematical functions on
multidimensional data
► Hierarchy: Central aspect, complex tree structures
► Data Quality: Integrate consistency constraints
(InDaQu)
5
22.11.2015
Example
► Hospital markt analysis:
► Find out percentages of birth
► Measure:
►
► Data Source Schema:
► Own Cases: Hospital information system: „§21 Data“
► All Cases: Buy from external source
6
22.11.2015
AllCases
OwnCases
eOfBirthMarketShar 
Name Typ Arity
Id of
Insurance
Numeric 10
Year of Birth Numeric 4
Month of
Birth
Numeric 2
Gender String 1
PLZ Numeric 5
Start date Numeric 12
Reason of
admisson
String 1
End date String 12
Age in years String 3
DRG String 4
Example
► Analysis Schema:
► Generated relational schema
7
22.11.2015
Example
Own Cases
start date
Reason of admisson
year of Birth
DRG
Gender
Id of Insurence
month of Birth
End date
age in years
PLZ
8
22.11.2015
Target
schema
day
ICD
Year
DRG
Gender
=new Datetime(Q[10,11],Q[4,5],Q[0-3])
(G==m  M || G==w  F)
► Data Source Transformation:
► Consistency Rules:
► ICD=O10-O16 & G=M  invalid
► DRG=O01F & G=M  invalid
Current Status
► Already done
► Analysis Schema DSL
► Hierarchy DSL
► Data Quality DSL
► Transformations for Data Integration and Cubes
► Integrated Metamodel for these aspects
► Left to be done
► Data Source Schema
► Measures
► Data Source Transformation
► Integrate these aspects
9
22.11.2015
Research Method and Conclusion
► Research Method
► Validation via implementation
► Described languages, metamodels, and transformations on basis of the
MUSTANG platform
► Ability to generate a configuration for a DWH
► Conclusion
► Experts can design and analyze all aspects of the DWH independently in DSLs
► Enables semi-automatic DWH creation
► Makes development faster
10
22.11.2015

Weitere ähnliche Inhalte

Was ist angesagt?

Data vault
Data vaultData vault
Data vaultJisc
 
Machine Learning in the Data Science Context
Machine Learning in the Data Science ContextMachine Learning in the Data Science Context
Machine Learning in the Data Science Contextsisira samarasinghe
 
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]Dr.-Ing. Thomas Hartmann
 
Design | expose ap is with cqr
Design | expose ap is with cqrDesign | expose ap is with cqr
Design | expose ap is with cqrJabar Asadi
 
ROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackMartin Voigt
 
Design | expose ap is with cqrs
Design | expose ap is with cqrsDesign | expose ap is with cqrs
Design | expose ap is with cqrselazhiA
 
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...Statistisk sentralbyrå
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analyticsSteven Hugo
 
Cortex - NOAH19 Berlin
Cortex - NOAH19 BerlinCortex - NOAH19 Berlin
Cortex - NOAH19 BerlinNOAH Advisors
 
Collecting and Making Sense of Diverse Data at WayUp
Collecting and Making Sense of Diverse Data at WayUpCollecting and Making Sense of Diverse Data at WayUp
Collecting and Making Sense of Diverse Data at WayUpHarlan Harris
 
Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)
Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)
Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)Geneva Declaration
 
datavirtuality - Beyond the data lake
datavirtuality - Beyond the data lake  datavirtuality - Beyond the data lake
datavirtuality - Beyond the data lake Dataconomy Media
 
How does Linked Open Data change the publishing landscape?
How does Linked Open Data change the publishing landscape?How does Linked Open Data change the publishing landscape?
How does Linked Open Data change the publishing landscape?Quentin Reul
 
L21 sharing data using data synchronization
L21 sharing data using data synchronizationL21 sharing data using data synchronization
L21 sharing data using data synchronizationNaresh Kumar SAHU
 
Adopting linked data principles for accelerating business transformation proc...
Adopting linked data principles for accelerating business transformation proc...Adopting linked data principles for accelerating business transformation proc...
Adopting linked data principles for accelerating business transformation proc...Quentin Reul
 
"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtuality
"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtuality"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtuality
"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtualityDataconomy Media
 
Big data in Food sector
Big data in Food sectorBig data in Food sector
Big data in Food sectorShamim Hossain
 
Workflows for machine-actionable Research Data Management Planning
Workflows for machine-actionable Research Data Management PlanningWorkflows for machine-actionable Research Data Management Planning
Workflows for machine-actionable Research Data Management PlanningSimonOblasser
 
Attract The Best and Save Costs
Attract The Best and Save CostsAttract The Best and Save Costs
Attract The Best and Save CostsJisc
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DLaura Berry
 

Was ist angesagt? (20)

Data vault
Data vaultData vault
Data vault
 
Machine Learning in the Data Science Context
Machine Learning in the Data Science ContextMachine Learning in the Data Science Context
Machine Learning in the Data Science Context
 
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
 
Design | expose ap is with cqr
Design | expose ap is with cqrDesign | expose ap is with cqr
Design | expose ap is with cqr
 
ROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data Stack
 
Design | expose ap is with cqrs
Design | expose ap is with cqrsDesign | expose ap is with cqrs
Design | expose ap is with cqrs
 
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analytics
 
Cortex - NOAH19 Berlin
Cortex - NOAH19 BerlinCortex - NOAH19 Berlin
Cortex - NOAH19 Berlin
 
Collecting and Making Sense of Diverse Data at WayUp
Collecting and Making Sense of Diverse Data at WayUpCollecting and Making Sense of Diverse Data at WayUp
Collecting and Making Sense of Diverse Data at WayUp
 
Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)
Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)
Enrico Bisogno - United Nations Office on Drugs and Crime (UNODC)
 
datavirtuality - Beyond the data lake
datavirtuality - Beyond the data lake  datavirtuality - Beyond the data lake
datavirtuality - Beyond the data lake
 
How does Linked Open Data change the publishing landscape?
How does Linked Open Data change the publishing landscape?How does Linked Open Data change the publishing landscape?
How does Linked Open Data change the publishing landscape?
 
L21 sharing data using data synchronization
L21 sharing data using data synchronizationL21 sharing data using data synchronization
L21 sharing data using data synchronization
 
Adopting linked data principles for accelerating business transformation proc...
Adopting linked data principles for accelerating business transformation proc...Adopting linked data principles for accelerating business transformation proc...
Adopting linked data principles for accelerating business transformation proc...
 
"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtuality
"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtuality"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtuality
"Beyond the Data Lake", Matthias Korn, Technical Consultant at datavirtuality
 
Big data in Food sector
Big data in Food sectorBig data in Food sector
Big data in Food sector
 
Workflows for machine-actionable Research Data Management Planning
Workflows for machine-actionable Research Data Management PlanningWorkflows for machine-actionable Research Data Management Planning
Workflows for machine-actionable Research Data Management Planning
 
Attract The Best and Save Costs
Attract The Best and Save CostsAttract The Best and Save Costs
Attract The Best and Save Costs
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
 

Andere mochten auch

Introduction to the NLP Meta Model - NLP Business Coaching Series
Introduction to the NLP Meta Model - NLP Business Coaching SeriesIntroduction to the NLP Meta Model - NLP Business Coaching Series
Introduction to the NLP Meta Model - NLP Business Coaching SeriesFiona Campbell
 
The Physical Interface
The Physical InterfaceThe Physical Interface
The Physical InterfaceJosh Clark
 
Mobile Is Eating the World (2016)
Mobile Is Eating the World (2016)Mobile Is Eating the World (2016)
Mobile Is Eating the World (2016)a16z
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with DataSeth Familian
 

Andere mochten auch (6)

Diseño de un Datamart
Diseño de un DatamartDiseño de un Datamart
Diseño de un Datamart
 
Introduction to the NLP Meta Model - NLP Business Coaching Series
Introduction to the NLP Meta Model - NLP Business Coaching SeriesIntroduction to the NLP Meta Model - NLP Business Coaching Series
Introduction to the NLP Meta Model - NLP Business Coaching Series
 
Displaying Data
Displaying DataDisplaying Data
Displaying Data
 
The Physical Interface
The Physical InterfaceThe Physical Interface
The Physical Interface
 
Mobile Is Eating the World (2016)
Mobile Is Eating the World (2016)Mobile Is Eating the World (2016)
Mobile Is Eating the World (2016)
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with Data
 

Ähnlich wie Using SLE for creation of data warehouses

F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016
F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016
F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016evaminerva
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLHJisc
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)Piet J.H. Daas
 
Tutorial Data Management and workflows
Tutorial Data Management and workflowsTutorial Data Management and workflows
Tutorial Data Management and workflowsSSSW
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Denodo
 
Making data typing efforts or automatically detecting data types for automat...
Making data typing efforts or automatically detecting data types  for automat...Making data typing efforts or automatically detecting data types  for automat...
Making data typing efforts or automatically detecting data types for automat...National Institute of Informatics
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
HETT Conference Olympic Central 2014 Integrating Healthcare Delivery
HETT Conference Olympic Central 2014 Integrating Healthcare DeliveryHETT Conference Olympic Central 2014 Integrating Healthcare Delivery
HETT Conference Olympic Central 2014 Integrating Healthcare DeliveryElmar Flamme
 
Best Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationBest Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationOSTHUS
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Kees van Bochove
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Paul Groth
 
Open Data in Trinidad and Tobago: presentation to developers
Open Data in Trinidad and Tobago: presentation to developers Open Data in Trinidad and Tobago: presentation to developers
Open Data in Trinidad and Tobago: presentation to developers Andrew Stott
 
Seminaire bigdata23102014
Seminaire bigdata23102014Seminaire bigdata23102014
Seminaire bigdata23102014Raja Chiky
 
Alive and kicking! Keeping data re-usable in the European Values Study
Alive and kicking! Keeping data re-usable in the European Values StudyAlive and kicking! Keeping data re-usable in the European Values Study
Alive and kicking! Keeping data re-usable in the European Values StudyCESSDA Training
 
What is SEND?
What is SEND?What is SEND?
What is SEND?Covance
 
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...Health Informatics New Zealand
 
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...William Gunn
 

Ähnlich wie Using SLE for creation of data warehouses (20)

F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016
F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016
F2 kepa rodriguez_ehri_integration_retrieva_minerva_2016
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLH
 
Dive deep into your Data Pools
Dive deep into your Data PoolsDive deep into your Data Pools
Dive deep into your Data Pools
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
 
Tutorial Data Management and workflows
Tutorial Data Management and workflowsTutorial Data Management and workflows
Tutorial Data Management and workflows
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
 
Making data typing efforts or automatically detecting data types for automat...
Making data typing efforts or automatically detecting data types  for automat...Making data typing efforts or automatically detecting data types  for automat...
Making data typing efforts or automatically detecting data types for automat...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
HETT Conference Olympic Central 2014 Integrating Healthcare Delivery
HETT Conference Olympic Central 2014 Integrating Healthcare DeliveryHETT Conference Olympic Central 2014 Integrating Healthcare Delivery
HETT Conference Olympic Central 2014 Integrating Healthcare Delivery
 
Best Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationBest Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data Curation
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
Open Data in Trinidad and Tobago: presentation to developers
Open Data in Trinidad and Tobago: presentation to developers Open Data in Trinidad and Tobago: presentation to developers
Open Data in Trinidad and Tobago: presentation to developers
 
Seminaire bigdata23102014
Seminaire bigdata23102014Seminaire bigdata23102014
Seminaire bigdata23102014
 
Brislinger, Recker: Keeping data re-usable in the evs
Brislinger, Recker: Keeping data re-usable in the evsBrislinger, Recker: Keeping data re-usable in the evs
Brislinger, Recker: Keeping data re-usable in the evs
 
Alive and kicking! Keeping data re-usable in the European Values Study
Alive and kicking! Keeping data re-usable in the European Values StudyAlive and kicking! Keeping data re-usable in the European Values Study
Alive and kicking! Keeping data re-usable in the European Values Study
 
What is SEND?
What is SEND?What is SEND?
What is SEND?
 
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
 
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
 

Mehr von Yvette Teiken

Angular von 0 auf 100
Angular von 0 auf 100Angular von 0 auf 100
Angular von 0 auf 100Yvette Teiken
 
BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...
BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...
BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...Yvette Teiken
 
Datenanalyse in der Praxis
Datenanalyse in der PraxisDatenanalyse in der Praxis
Datenanalyse in der PraxisYvette Teiken
 
Praktisches Selbst- und Zeitmanagement in der Wissensgesellschaft
Praktisches Selbst- und Zeitmanagement in der WissensgesellschaftPraktisches Selbst- und Zeitmanagement in der Wissensgesellschaft
Praktisches Selbst- und Zeitmanagement in der WissensgesellschaftYvette Teiken
 
MIcrosoft Self Service BI
MIcrosoft Self Service BIMIcrosoft Self Service BI
MIcrosoft Self Service BIYvette Teiken
 
Microsoft on Big Data
Microsoft on Big DataMicrosoft on Big Data
Microsoft on Big DataYvette Teiken
 
Mobile Anwendungen mit Apache Cordova
Mobile Anwendungen mit Apache CordovaMobile Anwendungen mit Apache Cordova
Mobile Anwendungen mit Apache CordovaYvette Teiken
 
Microsoft Azure in der Praxis
Microsoft Azure in der PraxisMicrosoft Azure in der Praxis
Microsoft Azure in der PraxisYvette Teiken
 
Net ug oldenburg_2015_03_intro
Net ug oldenburg_2015_03_introNet ug oldenburg_2015_03_intro
Net ug oldenburg_2015_03_introYvette Teiken
 

Mehr von Yvette Teiken (9)

Angular von 0 auf 100
Angular von 0 auf 100Angular von 0 auf 100
Angular von 0 auf 100
 
BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...
BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...
BPW Vortragsabend: Praktisches Selbst- und Zeitmanagement in der Wissensgesel...
 
Datenanalyse in der Praxis
Datenanalyse in der PraxisDatenanalyse in der Praxis
Datenanalyse in der Praxis
 
Praktisches Selbst- und Zeitmanagement in der Wissensgesellschaft
Praktisches Selbst- und Zeitmanagement in der WissensgesellschaftPraktisches Selbst- und Zeitmanagement in der Wissensgesellschaft
Praktisches Selbst- und Zeitmanagement in der Wissensgesellschaft
 
MIcrosoft Self Service BI
MIcrosoft Self Service BIMIcrosoft Self Service BI
MIcrosoft Self Service BI
 
Microsoft on Big Data
Microsoft on Big DataMicrosoft on Big Data
Microsoft on Big Data
 
Mobile Anwendungen mit Apache Cordova
Mobile Anwendungen mit Apache CordovaMobile Anwendungen mit Apache Cordova
Mobile Anwendungen mit Apache Cordova
 
Microsoft Azure in der Praxis
Microsoft Azure in der PraxisMicrosoft Azure in der Praxis
Microsoft Azure in der Praxis
 
Net ug oldenburg_2015_03_intro
Net ug oldenburg_2015_03_introNet ug oldenburg_2015_03_intro
Net ug oldenburg_2015_03_intro
 

Kürzlich hochgeladen

Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...software pro Development
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfryanfarris8
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfVishalKumarJha10
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024Mind IT Systems
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 

Kürzlich hochgeladen (20)

Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 

Using SLE for creation of data warehouses

  • 1. Using SLE for creation of Data Warehouses 22.11.2015 1 Yvette Teiken OFFIS Institute for Information Technology, Escherweg 2, 26121 Oldenburg, Germany yvette.teiken@offis.de
  • 2. Problem Description and Motivation I ► Goal of a Data Warehouses: ► Perform complex analysis of all organizational data ► Used for decision support ► Time-variant ► Non-volatile ► Integrated data from different sources and different formats in one integrated dataset ► Utilization of OLAP paradigm to allow easy analysis and accessibility ► Addressed Problems in my thesis: ► Efficient creation of domain specific DWH ► Example of use: ► Health Reporting: preparation and presentation of health relevant issues relating to population 2 22.11.2015
  • 3. Problem Description and Motivation II ► Problems during DWH creation: ► No standardized process exists ► Documentation by many large documents ► Missing, distributed, inconsistent information ► A lot of schematic work performed during realization ► Many different user roles involved ► Initial build-up is a complex task ► Expected benefits: ► Faster realization of DWH ► Better documentation of whole creation process ► Not so well trained person can realize a DWH 3 22.11.2015 Analysis organizational data Define information demand Data source transformation Multidimensional model Data quality
  • 4. Related Work ► Languages for covering aspects of DWH creation: ► Application Design for Analytical Processing Technologies (ADAPT) ► R2O mapping for relational databases ► InDaQu for Data quality ► MDA and DWA ► Rizzi et. al.: Modelling different aspects of DWHs ► Only deal with a certain aspect, not whole process ► My approach ► Use languages that cover the whole process of DWH creation ► Integrated through a common metamodel ► Deal with multidimensional structures ► Transformations generating large parts of the DWH ► Process model that orders different aspects and connect and refined 4 22.11.2015
  • 5. Proposed Solution I ► Idea: Describe DWH with SLE techniques, generate semi-automatic DWH ► Decompose DWH in different aspects, describe each aspect with a language: ► Aspects: ► Data Sources Schemas: Subject, the representation, and technical accessibility of sources ► Data Source Transformation: Use existing languages like R2O ► Analysis Schema: Multidimensional data models, based on ADAPT ► Measures: Mathematical functions on multidimensional data ► Hierarchy: Central aspect, complex tree structures ► Data Quality: Integrate consistency constraints (InDaQu) 5 22.11.2015
  • 6. Example ► Hospital markt analysis: ► Find out percentages of birth ► Measure: ► ► Data Source Schema: ► Own Cases: Hospital information system: „§21 Data“ ► All Cases: Buy from external source 6 22.11.2015 AllCases OwnCases eOfBirthMarketShar  Name Typ Arity Id of Insurance Numeric 10 Year of Birth Numeric 4 Month of Birth Numeric 2 Gender String 1 PLZ Numeric 5 Start date Numeric 12 Reason of admisson String 1 End date String 12 Age in years String 3 DRG String 4
  • 7. Example ► Analysis Schema: ► Generated relational schema 7 22.11.2015
  • 8. Example Own Cases start date Reason of admisson year of Birth DRG Gender Id of Insurence month of Birth End date age in years PLZ 8 22.11.2015 Target schema day ICD Year DRG Gender =new Datetime(Q[10,11],Q[4,5],Q[0-3]) (G==m  M || G==w  F) ► Data Source Transformation: ► Consistency Rules: ► ICD=O10-O16 & G=M  invalid ► DRG=O01F & G=M  invalid
  • 9. Current Status ► Already done ► Analysis Schema DSL ► Hierarchy DSL ► Data Quality DSL ► Transformations for Data Integration and Cubes ► Integrated Metamodel for these aspects ► Left to be done ► Data Source Schema ► Measures ► Data Source Transformation ► Integrate these aspects 9 22.11.2015
  • 10. Research Method and Conclusion ► Research Method ► Validation via implementation ► Described languages, metamodels, and transformations on basis of the MUSTANG platform ► Ability to generate a configuration for a DWH ► Conclusion ► Experts can design and analyze all aspects of the DWH independently in DSLs ► Enables semi-automatic DWH creation ► Makes development faster 10 22.11.2015

Hinweis der Redaktion

  1. OFFIS