SlideShare a Scribd company logo
1 of 28
Download to read offline
25568 Genesee Trail Rd
                                                                                      Golden, Colorado 80401
                                                                                          (303) 526-0340


 Data Vault Modeling and Approach    DW2.0 and Unstructured Data    Master Data Management and Metadata




   Data Warehousing Agility

                     BI-Event             May 17
                                          Hans Hultgren
© 2011 Genesee Academy, LLC
    25568 Genesee Trail Rd
    Golden, Colorado 80401


    © 2011 Genesee Academy, LLC
Welcome


 •   Definition of agility
 •   Types of agility
 •   Discuss current approaches
 •   Hyper-agility
 •   Observations from the field

     – Also topics of operational data warehousing, operational bi, agile project
       management techniques, agility oriented tools, and operational integration
Data Warehouse Agility

    • Agility

        – The overall measure of adaptability in terms of speed
          & scope.

        – Overall performance in adapting to change.

    NOTE: Not warehouse machine throughput, near real time (NRT)
      processing, and operational DW performance…

              Ability of the data warehouse to adapt to change
                                    Versus
             Performance of an existing (steady state) warehouse
Data Warehouse Agility
    • Agility
       – Agile in IT
          • Agile Project Management
          • Agile Software Development
                – Agile Manifesto
          We are uncovering better ways of developing software by doing it and helping others do it.
          Through this work we have come to value:
           Individuals and interactions            over         processes and tools
           Working software                        over         comprehensive documentation
           Customer collaboration                  over          contract negotiation
           Responding to change                    over         following a plan
          That is, while there is value in the items on the right, we value the items on the left more.

          • Agile Modeling Driven Design (AMDD)
          • Test-Driven Design (TDD)
Data Warehouse Agility
    • Agility in the Data Warehouse
       – Agility in terms of Data Warehousing is related to the ability to
         build incrementally.
       – The approach today is more concerned with the development
         of a business intelligence, data warehousing program – the
         capability to increment (adapt and grow).
       – Since the business is always changing (new reporting needs,
         new business processes, new business units, new data
         sources, etc.) the EDW program is an ongoing initiative that
         needs to focus on adapting to these changes.
       – Note: distinguish between operational integration and data
         warehousing.
Types of Data Warehouse Agility
                                      Change DW
        New Source
                                                  New Mart

                     Data Warehouse



                New Attribute


                         New Subject Area
Types of Data Warehouse Agility
  – Presentation Layer Agility – ability to adapt to new business requirements
    based on existing data elements in the EDW.
        • Bottom Line: Ability to quickly and flexibly spin off new data marts
  – New Data Source Agility – ability to assimilate new data sources into the
    EDW architecture from stage to CDW+ and existing data marts.
        • Bottom Line: Ability to quickly adapt to new data sources   * using existing structures

  – New Attribute Agility – ability to absorb new attributes into the EDW
    architecture such that they can be loaded from the sources and integrate
    new attributes in terms of business context.
        • Bottom Line: Ability to quickly incorporate new attributes in the EDW and
          apply business context to these attributes
  – EDW Machine Agility – ability of the EDW machine (business and
    technical) to accommodate a new subject area from stage to mart.
        • Bottom Line: EDW response time; a function of people, process & tools
  – Changes in the DW – ability to absorb other changes such as
    integration logic, mappings, and business rules.                                                Current




  © 2011 Genesee Academy, LLC
Presentation Layer Agility

  – Presentation Layer Agility - ability to adapt to new business requirements
    based on existing data elements in the EDW.
        • Bottom Line: Ability to quickly and flexibly spin off new data marts


  – In this layer, agility is measured as a function of the time it takes to design,
    construct and deliver a new data mart.
  – Variables in this layer include:
        •   Strength of the BI team to capture requirements and define data mart.
        •   Ability of ETL integration team to understand EDW model and mart.
        •   Strength and repeatability of ETL processes for sourcing the EDW.
        •   Strength and repeatability of ETL development, testing and delivery.
  – Constraints:
        • Dependent upon the existence of the data in the EDW.
        • Dependent upon the level of business alignment of the data in the EDW.


  © 2011 Genesee Academy, LLC
New Data Source Agility

  – New Data Source Agility - ability to assimilate new data sources into the
    EDW architecture from stage to CDW+ and existing data marts.
        • Bottom Line: Ability to quickly adapt to new data sources   * using existing structures



  – In this layer, agility is measured as a function of the time it takes to design,
    model, build and load data into the EDW from a new source.
  – Variables in this layer include:
        •   Strength of the DW team to design the required model changes.
        •   Strength and repeatability of EDW development, testing and delivery.
        •   Ability of ETL integration team to understand new EDW model.
        •   Strength and repeatability of ETL processes for mapping and loading new
            source into the EDW.
  – Constraints:
        • Level of alignment of the new source data with the existing model.
        • Dependent upon the level of business alignment with the data in the EDW


  © 2011 Genesee Academy, LLC
New Attribute Technical Agility

  – New Attribute (Technical) Agility - ability to absorb new attributes into
    the EDW architecture such that they can be loaded from the sources.
        • Bottom Line: Ability to quickly incorporate new attributes in the EDW


  – In this layer, agility is measured as a function of the time it takes to design,
    map, add and load a new attribute from a source.
  – Variables in this layer include:
        •   Strength of the DW team to design the required model changes.
        •   Strength and repeatability of EDW development, testing and delivery.
        •   Ability of ETL integration team to understand new EDW attribute(s).
        •   Strength and repeatability of ETL processes for mapping and loading new
            source attributes into the EDW.
  – Constraints:
        • Level of alignment of the new attribute with the existing model.
        • Dependent upon business context being defined.

  © 2011 Genesee Academy, LLC
New Attribute Business Context

  – New Attribute (Business) Context Agility - ability to integrate new
    attributes in terms of business context.
        • Bottom Line: Ability to quickly apply business context to new attributes


  – In this layer, agility is measured as a function of the time it takes to align
    business context with a new attribute from a source.
  – Variables in this layer include:
        • Ability of the BI / DW team to accurately assess the business context of the
          new source attribute.
  – Constraints:
        • Level of alignment of the new attribute with the existing model.
        • Dependent upon the level of business alignment with the data in the EDW




  © 2011 Genesee Academy, LLC
EDW Machine Agility

  – EDW Machine Agility – ability of the EDW machine (business and
    technical) to accommodate a new subject area from stage to mart.
        • Bottom Line: EDW response time; a function of people, process & tools


  – In this layer, agility is measured as an overall function of the EDW machine
    to integrate a new subject area from stage to mart.
  – Variables in this layer include:
        •   Strength of the BI / DW development team.
        •   Strength and repeatability of EDW development, testing and delivery.
        •   Strength and ability of ETL integration team.
        •   Strength and repeatability of all BI / DW processes.
  – Constraints:
        • Executive sponsorship of the EDW program.
        • Well defined organizational structure for BIW, BICC, Architecture and
          Governance.

  © 2011 Genesee Academy, LLC
CURRENT APPROACHES
DW Agility Current Approaches

     – Incremental Data Warehouse Development
            • Data Vault modeling, 2G, Anchor, etc.

     – Agile BI Programs (People, Process, Models & Data)
            • Methodologies (Centennium, Platon, etc.)
            • Templates, Tools & Automation (Wherescape, etc.)


     – Alternate & New Paradigms for the Agile DW




  © 2011 Genesee Academy, LLC
DW Agility Components

     – Absorb Changes
            • Capture the Change
            • Understand the Change


     – A major constraint on agility is the required data
       warehouse modeling changes...
            • So we can capture the data (create the buckets)
            • So we can understand the data (context, meaning)
                  – Align to business keys, classify, describe (metadata)




  © 2011 Genesee Academy, LLC
Data Warehouse Agility
    • Why create a Data Model for the DW?

    • Model Data versus Meaning?

       –   Separate the capture of data from the meaning?
       –   The structure of a table versus the semantics
       –   Business meaning versus data loading
       –   As XML is to EDI
HYPER AGILITY AND THE
 NAME VALUE PAIR (NVP)
Concept of Name/Value Pair


   Cust_ID     Lname      Fname        Add        City    State    Zip     Bdate
  121202     Lundquist   Carl      22 Bird St   NYC       NY      98291   10/9/1977
  123335     Dahlgren    Eva       7 Academy    Madison   NJ      07940   2/12/1982
  139090     Lundberg    Scott     444 7th St   Tuborg    MN      70098   4/22/1988
  119944     Hultquist   Darla     17 South     Randolf   PA      91121   9/22/1967
  120334     Forsberg    Sven      117 East A   NYC       NY      98292   8/19/1976



 Each Value or ”data item” (record value for each attribute), is provided in a
 List format paired with the corresponding Name or ”field name” (column
 header) from the normalized table structure.

                                 Moving to Name / Value Pair…
Concept of Name/Value Pair


 Name                                                                                      Value
        Cust_ID     Lname      Fname       Add        City     State     Zip      Bdate
      121202      Lundquist   Carl     22 Bird St   NYC       NY       98291   10/9/1977
        Cust_ID     Lname      Fname       Add        City     State     Zip      Bdate
      123335      Dahlgren    Eva      7 Academy    Madison   NJ       07940   2/12/1982
        Cust_ID     Lname      Fname       Add        City     State     Zip      Bdate
      139090      Lundberg    Scott    444 7th St   Tuborg    MN       70098   4/22/1988
        Cust_ID     Lname      Fname       Add        City     State     Zip      Bdate
      119944      Hultquist   Darla    17 South     Randolf   PA       91121   9/22/1967
        Cust_ID     Lname      Fname       Add        City     State     Zip      Bdate

      120334      Forsberg    Sven     117 East A   NYC       NY       98292   8/19/1976
Moving to Name/Value Pair

      Cust_ID     Lname      Fname         Add        City   State    Zip     Bdate

     121202     Lundquist   Carl      22 Bird St   NYC       NY      98291   10/9/1977

     123335     Dahlgren    Eva       7 Academy    Madison   NJ      07940   2/12/1982

     139090     Lundberg    Scott     444 7th St   Tuborg    MN      70098   4/22/1988

     119944     Hultquist   Darla     17 South     Randolf   PA      91121   9/22/1967

     120334     Forsberg    Sven      117 East A   NYC       NY      98292   8/19/1976




                                                                                             V
                                                                                         N
                                                                                             A
                                                                                         A
                                                                                             L
                                                                                         M
                                                                                             U
                                                                                         E
                                                                                             E
                                    Transpose
                                    …with column headings…
Name          Value
Cust_ID
Lname
              121202
              Lundquist
                             Name/Value Pair
Fname         Carl
Add           22 Bird St
City          NYC
State         NY
Zip           98291
Bdate         10/9/1977
Cust_ID       123335
Lname         Dahlgren
Fname         Eva
Add           7 Academy
City          Madison
State         NJ
Zip           7940
Bdate         2/12/1982
Cust_ID       139090
Lname         Lundberg
Fname         Scott
Name         Value
Cust_ID       121202
Lname         Lundquist
Fname         Carl
Add           22 Bird St
                               The concept of the ”record” is effectively
City          NYC
                               lost in this transformation.
State         NY
Zip           98291         Now a RECORD is a set of Name/Value Pair
Bdate         10/9/1977     instances…
Cust_ID       123335
Lname         Dahlgren
                                CON      Lose resolution on the record.
Fname         Eva
Add           7 Academy
City          Madison
State         NJ
Zip           7940
Bdate         2/12/1982
Cust_ID       139090
Lname         Lundberg
Fname         Scott
Name         Value
Cust_ID       121202
Lname         Lundquist
Fname         Carl
Add           22 Bird St
City          NYC
State         NY
Zip           98291
Bdate         10/9/1977
Cust_ID       123335
Lname         Dahlgren
Fname         Eva           Also, the attributes are not defined in
Add           7 Academy     advance – we don’t know what to expect and
City          Madison       we can’t check for attribute meaning,
State         NJ            definitions, domain values or data types.
Zip           7940
Bdate         2/12/1982
                                CON     Attributes are not pre-defined.
Cust_ID       139090
Lname         Lundberg
Fname         Scott
Name          Value
Cust_ID       121202
Lname         Lundquist
Fname         Carl
Add           22 Bird St
                             New attributes that are introduced into the
City          NYC
                             source feed are added instantly to the DW.
State         NY
                             There is no modeling delay, no code
Zip           98291
                             change, and no ETL impact…
Bdate         10/9/1977
CustClass     Big
Cust_ID       123335           PRO      Absorb new attributes instantly.
Lname         Dahlgren
Fname         Eva
Add           7 Academy
City          Madison
State         NJ
Zip           7940
Bdate         2/12/1982
CustClass     Small
Cust_ID       139090
Hyper Agility
• The solution to deal with these issues requires a further level of
  abstraction which in effect moves the persisted (historized,
  permanent, integrated) data store even further away from the
  business context that it is intended to represent.
• The DW model – the data model itself – is then not readable (not
  understandable). In fact ETL professionals will also find themselves
  further removed from this model. To the extent that a model is
  intuitive, self-descriptive, and aligned with business meaning, this
  approach takes a step in the other direction.
• Moving towards addressing these business driven agility
  requirements casues the model itself to move much further away
  (an order of magnitude away) from the business. So far as to
  become effectively a technical solution utilizing only abstract
  representations.
Hyper Agility

• The context – the meaning of the data – will in these cases need to
  be managed in a different way.
• This can include a form of persisted and historized metadata
  concerning the mappings and business rules. In effect a form of
  EAI within the DW.
• Or it might include a more traditional secondary DW layer.
DW AGILITY SUMMARY

 • Consider specific Agility Requirements

 • Classify Agility Types and consider Alternatives

 • Distinguish between operational integration and DW

 • Look to modeling techniques optimized for Data Warehouse

 • Look at entire picture – people, process, models and data

 • Consider specific methodologies, templates and tools

 • Determine if hyper agility is a requirement
Questions?



                                www.GeneseeAcademy.com



                              CDVDM Certification Seminar



                                     June 23-24
                                    October 27-28

© 2011 Genesee Academy, LLC                                 Hans@GeneseeAcademy.com
    25568 Genesee Trail Rd                                     USA +1 303.526.0340
    Golden, Colorado 80401                                     Sverige 070 250 2102

                                                                                 28

More Related Content

What's hot

Data platform architecture
Data platform architectureData platform architecture
Data platform architectureSudheer Kondla
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureDatabricks
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data EngineeringC4Media
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault ModelingKent Graziano
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookJames Serra
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouseJames Serra
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
Speeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT ApproachSpeeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT ApproachDatabricks
 
Achieving Lakehouse Models with Spark 3.0
Achieving Lakehouse Models with Spark 3.0Achieving Lakehouse Models with Spark 3.0
Achieving Lakehouse Models with Spark 3.0Databricks
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScapeData Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScapeWhereScape
 
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsBuilding the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsKhalid Salama
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data GovernanceTuba Yaman Him
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 

What's hot (20)

Data platform architecture
Data platform architectureData platform architecture
Data platform architecture
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse Architecture
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data Engineering
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Data Vault and DW2.0
Data Vault and DW2.0Data Vault and DW2.0
Data Vault and DW2.0
 
Speeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT ApproachSpeeding Time to Insight with a Modern ELT Approach
Speeding Time to Insight with a Modern ELT Approach
 
Data mesh
Data meshData mesh
Data mesh
 
Achieving Lakehouse Models with Spark 3.0
Achieving Lakehouse Models with Spark 3.0Achieving Lakehouse Models with Spark 3.0
Achieving Lakehouse Models with Spark 3.0
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScapeData Vault 2.0 DeMystified with Dan Linstedt and WhereScape
Data Vault 2.0 DeMystified with Dan Linstedt and WhereScape
 
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsBuilding the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake Analytics
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data Governance
 
Operational Data Vault
Operational Data VaultOperational Data Vault
Operational Data Vault
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 

Viewers also liked

Data Vault ReConnect Speed Presenting AM Part One
Data Vault ReConnect Speed Presenting AM Part OneData Vault ReConnect Speed Presenting AM Part One
Data Vault ReConnect Speed Presenting AM Part OneHans Hultgren
 
Data Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part FourData Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part FourHans Hultgren
 
Guru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesGuru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesCGI
 
Data Vault ReConnect Speed Presenting PM Part Three
Data Vault ReConnect Speed Presenting PM Part ThreeData Vault ReConnect Speed Presenting PM Part Three
Data Vault ReConnect Speed Presenting PM Part ThreeHans Hultgren
 
Data vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopData vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopjohannesvdb
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultDaniel Upton
 
Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Andreas Buckenhofer
 
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Andreas Buckenhofer
 
Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)Kent Graziano
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingDaniel Upton
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Empowered Holdings, LLC
 
Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Andreas Buckenhofer
 
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieCDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieAndreas Buckenhofer
 
Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileDaniel Upton
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationVishal Kumar
 
Data Modeling for Big Data
Data Modeling for Big DataData Modeling for Big Data
Data Modeling for Big DataDATAVERSITY
 

Viewers also liked (18)

Data Vault ReConnect Speed Presenting AM Part One
Data Vault ReConnect Speed Presenting AM Part OneData Vault ReConnect Speed Presenting AM Part One
Data Vault ReConnect Speed Presenting AM Part One
 
Big Data Modeling
Big Data ModelingBig Data Modeling
Big Data Modeling
 
Data Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part FourData Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part Four
 
Guru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesGuru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best Practices
 
Data Vault ReConnect Speed Presenting PM Part Three
Data Vault ReConnect Speed Presenting PM Part ThreeData Vault ReConnect Speed Presenting PM Part Three
Data Vault ReConnect Speed Presenting PM Part Three
 
Data vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopData vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshop
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data Vault
 
Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)
 
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 4 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
 
Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and Modelstorming
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
Part 2 - Data Warehousing Lecture at BW Cooperative State University (DHBW)
 
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieCDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
 
Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes Agile
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
 
Data Modeling for Big Data
Data Modeling for Big DataData Modeling for Big Data
Data Modeling for Big Data
 
Agile KPIs
Agile KPIsAgile KPIs
Agile KPIs
 

Similar to Data Warehouse Agility Array Conference2011

Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies SnapLogic
 
Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...
Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...
Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...TeamQuest Corporation
 
Vijay_Kr_Singh_Oracle_SQL_PLSQL_Developer
Vijay_Kr_Singh_Oracle_SQL_PLSQL_DeveloperVijay_Kr_Singh_Oracle_SQL_PLSQL_Developer
Vijay_Kr_Singh_Oracle_SQL_PLSQL_DeveloperVijay Kumar Singh
 
Resume - Deepak v.s
Resume -  Deepak v.sResume -  Deepak v.s
Resume - Deepak v.sDeepak V S
 
Bringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationBringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationDATAVERSITY
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse OptimizationCloudera, Inc.
 
The final frontier
The final frontierThe final frontier
The final frontierTerry Bunio
 
SP1740_Vivek Kumar_Speridian
SP1740_Vivek Kumar_SperidianSP1740_Vivek Kumar_Speridian
SP1740_Vivek Kumar_Speridianvivek kumar
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopDavid Yahalom
 
rizwan cse exp resume
rizwan cse exp resumerizwan cse exp resume
rizwan cse exp resumeshaik rizwan
 
Ashish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_AnalystAshish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_AnalystAshish Maheshwari
 
Copy of Alok_Singh_CV
Copy of Alok_Singh_CVCopy of Alok_Singh_CV
Copy of Alok_Singh_CVAlok Singh
 

Similar to Data Warehouse Agility Array Conference2011 (20)

Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies
 
Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...
Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...
Optimizing IT Costs & Services With Big Data (Little Effort!) - Case Studies ...
 
Vijay_Kr_Singh_Oracle_SQL_PLSQL_Developer
Vijay_Kr_Singh_Oracle_SQL_PLSQL_DeveloperVijay_Kr_Singh_Oracle_SQL_PLSQL_Developer
Vijay_Kr_Singh_Oracle_SQL_PLSQL_Developer
 
Resume - Deepak v.s
Resume -  Deepak v.sResume -  Deepak v.s
Resume - Deepak v.s
 
parthiban Loganathan
parthiban Loganathanparthiban Loganathan
parthiban Loganathan
 
Bringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and IntegrationBringing Agility and Flexibility to Data Design and Integration
Bringing Agility and Flexibility to Data Design and Integration
 
ShashankJainMSBI
ShashankJainMSBIShashankJainMSBI
ShashankJainMSBI
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse Optimization
 
Accelerating Data Warehouse Modernization
Accelerating Data Warehouse ModernizationAccelerating Data Warehouse Modernization
Accelerating Data Warehouse Modernization
 
The final frontier
The final frontierThe final frontier
The final frontier
 
SP1740_Vivek Kumar_Speridian
SP1740_Vivek Kumar_SperidianSP1740_Vivek Kumar_Speridian
SP1740_Vivek Kumar_Speridian
 
Resume_of_Vasudevan - Hadoop
Resume_of_Vasudevan - HadoopResume_of_Vasudevan - Hadoop
Resume_of_Vasudevan - Hadoop
 
Ketan Resume
Ketan ResumeKetan Resume
Ketan Resume
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera Hadoop
 
rizwan cse exp resume
rizwan cse exp resumerizwan cse exp resume
rizwan cse exp resume
 
Ashish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_AnalystAshish_Maheshwari_Data_Analyst
Ashish_Maheshwari_Data_Analyst
 
ChakravarthyUppara
ChakravarthyUpparaChakravarthyUppara
ChakravarthyUppara
 
Vikas sogani
Vikas soganiVikas sogani
Vikas sogani
 
Resume_Gulley_Oct7_2016
Resume_Gulley_Oct7_2016Resume_Gulley_Oct7_2016
Resume_Gulley_Oct7_2016
 
Copy of Alok_Singh_CV
Copy of Alok_Singh_CVCopy of Alok_Singh_CV
Copy of Alok_Singh_CV
 

Recently uploaded

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 

Recently uploaded (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

Data Warehouse Agility Array Conference2011

  • 1. 25568 Genesee Trail Rd Golden, Colorado 80401 (303) 526-0340  Data Vault Modeling and Approach  DW2.0 and Unstructured Data  Master Data Management and Metadata Data Warehousing Agility BI-Event May 17 Hans Hultgren © 2011 Genesee Academy, LLC 25568 Genesee Trail Rd Golden, Colorado 80401 © 2011 Genesee Academy, LLC
  • 2. Welcome • Definition of agility • Types of agility • Discuss current approaches • Hyper-agility • Observations from the field – Also topics of operational data warehousing, operational bi, agile project management techniques, agility oriented tools, and operational integration
  • 3. Data Warehouse Agility • Agility – The overall measure of adaptability in terms of speed & scope. – Overall performance in adapting to change. NOTE: Not warehouse machine throughput, near real time (NRT) processing, and operational DW performance… Ability of the data warehouse to adapt to change Versus Performance of an existing (steady state) warehouse
  • 4. Data Warehouse Agility • Agility – Agile in IT • Agile Project Management • Agile Software Development – Agile Manifesto We are uncovering better ways of developing software by doing it and helping others do it. Through this work we have come to value: Individuals and interactions over processes and tools Working software over comprehensive documentation Customer collaboration over contract negotiation Responding to change over following a plan That is, while there is value in the items on the right, we value the items on the left more. • Agile Modeling Driven Design (AMDD) • Test-Driven Design (TDD)
  • 5. Data Warehouse Agility • Agility in the Data Warehouse – Agility in terms of Data Warehousing is related to the ability to build incrementally. – The approach today is more concerned with the development of a business intelligence, data warehousing program – the capability to increment (adapt and grow). – Since the business is always changing (new reporting needs, new business processes, new business units, new data sources, etc.) the EDW program is an ongoing initiative that needs to focus on adapting to these changes. – Note: distinguish between operational integration and data warehousing.
  • 6. Types of Data Warehouse Agility Change DW New Source New Mart Data Warehouse New Attribute New Subject Area
  • 7. Types of Data Warehouse Agility – Presentation Layer Agility – ability to adapt to new business requirements based on existing data elements in the EDW. • Bottom Line: Ability to quickly and flexibly spin off new data marts – New Data Source Agility – ability to assimilate new data sources into the EDW architecture from stage to CDW+ and existing data marts. • Bottom Line: Ability to quickly adapt to new data sources * using existing structures – New Attribute Agility – ability to absorb new attributes into the EDW architecture such that they can be loaded from the sources and integrate new attributes in terms of business context. • Bottom Line: Ability to quickly incorporate new attributes in the EDW and apply business context to these attributes – EDW Machine Agility – ability of the EDW machine (business and technical) to accommodate a new subject area from stage to mart. • Bottom Line: EDW response time; a function of people, process & tools – Changes in the DW – ability to absorb other changes such as integration logic, mappings, and business rules. Current © 2011 Genesee Academy, LLC
  • 8. Presentation Layer Agility – Presentation Layer Agility - ability to adapt to new business requirements based on existing data elements in the EDW. • Bottom Line: Ability to quickly and flexibly spin off new data marts – In this layer, agility is measured as a function of the time it takes to design, construct and deliver a new data mart. – Variables in this layer include: • Strength of the BI team to capture requirements and define data mart. • Ability of ETL integration team to understand EDW model and mart. • Strength and repeatability of ETL processes for sourcing the EDW. • Strength and repeatability of ETL development, testing and delivery. – Constraints: • Dependent upon the existence of the data in the EDW. • Dependent upon the level of business alignment of the data in the EDW. © 2011 Genesee Academy, LLC
  • 9. New Data Source Agility – New Data Source Agility - ability to assimilate new data sources into the EDW architecture from stage to CDW+ and existing data marts. • Bottom Line: Ability to quickly adapt to new data sources * using existing structures – In this layer, agility is measured as a function of the time it takes to design, model, build and load data into the EDW from a new source. – Variables in this layer include: • Strength of the DW team to design the required model changes. • Strength and repeatability of EDW development, testing and delivery. • Ability of ETL integration team to understand new EDW model. • Strength and repeatability of ETL processes for mapping and loading new source into the EDW. – Constraints: • Level of alignment of the new source data with the existing model. • Dependent upon the level of business alignment with the data in the EDW © 2011 Genesee Academy, LLC
  • 10. New Attribute Technical Agility – New Attribute (Technical) Agility - ability to absorb new attributes into the EDW architecture such that they can be loaded from the sources. • Bottom Line: Ability to quickly incorporate new attributes in the EDW – In this layer, agility is measured as a function of the time it takes to design, map, add and load a new attribute from a source. – Variables in this layer include: • Strength of the DW team to design the required model changes. • Strength and repeatability of EDW development, testing and delivery. • Ability of ETL integration team to understand new EDW attribute(s). • Strength and repeatability of ETL processes for mapping and loading new source attributes into the EDW. – Constraints: • Level of alignment of the new attribute with the existing model. • Dependent upon business context being defined. © 2011 Genesee Academy, LLC
  • 11. New Attribute Business Context – New Attribute (Business) Context Agility - ability to integrate new attributes in terms of business context. • Bottom Line: Ability to quickly apply business context to new attributes – In this layer, agility is measured as a function of the time it takes to align business context with a new attribute from a source. – Variables in this layer include: • Ability of the BI / DW team to accurately assess the business context of the new source attribute. – Constraints: • Level of alignment of the new attribute with the existing model. • Dependent upon the level of business alignment with the data in the EDW © 2011 Genesee Academy, LLC
  • 12. EDW Machine Agility – EDW Machine Agility – ability of the EDW machine (business and technical) to accommodate a new subject area from stage to mart. • Bottom Line: EDW response time; a function of people, process & tools – In this layer, agility is measured as an overall function of the EDW machine to integrate a new subject area from stage to mart. – Variables in this layer include: • Strength of the BI / DW development team. • Strength and repeatability of EDW development, testing and delivery. • Strength and ability of ETL integration team. • Strength and repeatability of all BI / DW processes. – Constraints: • Executive sponsorship of the EDW program. • Well defined organizational structure for BIW, BICC, Architecture and Governance. © 2011 Genesee Academy, LLC
  • 14. DW Agility Current Approaches – Incremental Data Warehouse Development • Data Vault modeling, 2G, Anchor, etc. – Agile BI Programs (People, Process, Models & Data) • Methodologies (Centennium, Platon, etc.) • Templates, Tools & Automation (Wherescape, etc.) – Alternate & New Paradigms for the Agile DW © 2011 Genesee Academy, LLC
  • 15. DW Agility Components – Absorb Changes • Capture the Change • Understand the Change – A major constraint on agility is the required data warehouse modeling changes... • So we can capture the data (create the buckets) • So we can understand the data (context, meaning) – Align to business keys, classify, describe (metadata) © 2011 Genesee Academy, LLC
  • 16. Data Warehouse Agility • Why create a Data Model for the DW? • Model Data versus Meaning? – Separate the capture of data from the meaning? – The structure of a table versus the semantics – Business meaning versus data loading – As XML is to EDI
  • 17. HYPER AGILITY AND THE NAME VALUE PAIR (NVP)
  • 18. Concept of Name/Value Pair Cust_ID Lname Fname Add City State Zip Bdate 121202 Lundquist Carl 22 Bird St NYC NY 98291 10/9/1977 123335 Dahlgren Eva 7 Academy Madison NJ 07940 2/12/1982 139090 Lundberg Scott 444 7th St Tuborg MN 70098 4/22/1988 119944 Hultquist Darla 17 South Randolf PA 91121 9/22/1967 120334 Forsberg Sven 117 East A NYC NY 98292 8/19/1976 Each Value or ”data item” (record value for each attribute), is provided in a List format paired with the corresponding Name or ”field name” (column header) from the normalized table structure. Moving to Name / Value Pair…
  • 19. Concept of Name/Value Pair Name Value Cust_ID Lname Fname Add City State Zip Bdate 121202 Lundquist Carl 22 Bird St NYC NY 98291 10/9/1977 Cust_ID Lname Fname Add City State Zip Bdate 123335 Dahlgren Eva 7 Academy Madison NJ 07940 2/12/1982 Cust_ID Lname Fname Add City State Zip Bdate 139090 Lundberg Scott 444 7th St Tuborg MN 70098 4/22/1988 Cust_ID Lname Fname Add City State Zip Bdate 119944 Hultquist Darla 17 South Randolf PA 91121 9/22/1967 Cust_ID Lname Fname Add City State Zip Bdate 120334 Forsberg Sven 117 East A NYC NY 98292 8/19/1976
  • 20. Moving to Name/Value Pair Cust_ID Lname Fname Add City State Zip Bdate 121202 Lundquist Carl 22 Bird St NYC NY 98291 10/9/1977 123335 Dahlgren Eva 7 Academy Madison NJ 07940 2/12/1982 139090 Lundberg Scott 444 7th St Tuborg MN 70098 4/22/1988 119944 Hultquist Darla 17 South Randolf PA 91121 9/22/1967 120334 Forsberg Sven 117 East A NYC NY 98292 8/19/1976 V N A A L M U E E Transpose …with column headings…
  • 21. Name Value Cust_ID Lname 121202 Lundquist Name/Value Pair Fname Carl Add 22 Bird St City NYC State NY Zip 98291 Bdate 10/9/1977 Cust_ID 123335 Lname Dahlgren Fname Eva Add 7 Academy City Madison State NJ Zip 7940 Bdate 2/12/1982 Cust_ID 139090 Lname Lundberg Fname Scott
  • 22. Name Value Cust_ID 121202 Lname Lundquist Fname Carl Add 22 Bird St The concept of the ”record” is effectively City NYC lost in this transformation. State NY Zip 98291 Now a RECORD is a set of Name/Value Pair Bdate 10/9/1977 instances… Cust_ID 123335 Lname Dahlgren CON Lose resolution on the record. Fname Eva Add 7 Academy City Madison State NJ Zip 7940 Bdate 2/12/1982 Cust_ID 139090 Lname Lundberg Fname Scott
  • 23. Name Value Cust_ID 121202 Lname Lundquist Fname Carl Add 22 Bird St City NYC State NY Zip 98291 Bdate 10/9/1977 Cust_ID 123335 Lname Dahlgren Fname Eva Also, the attributes are not defined in Add 7 Academy advance – we don’t know what to expect and City Madison we can’t check for attribute meaning, State NJ definitions, domain values or data types. Zip 7940 Bdate 2/12/1982 CON Attributes are not pre-defined. Cust_ID 139090 Lname Lundberg Fname Scott
  • 24. Name Value Cust_ID 121202 Lname Lundquist Fname Carl Add 22 Bird St New attributes that are introduced into the City NYC source feed are added instantly to the DW. State NY There is no modeling delay, no code Zip 98291 change, and no ETL impact… Bdate 10/9/1977 CustClass Big Cust_ID 123335 PRO Absorb new attributes instantly. Lname Dahlgren Fname Eva Add 7 Academy City Madison State NJ Zip 7940 Bdate 2/12/1982 CustClass Small Cust_ID 139090
  • 25. Hyper Agility • The solution to deal with these issues requires a further level of abstraction which in effect moves the persisted (historized, permanent, integrated) data store even further away from the business context that it is intended to represent. • The DW model – the data model itself – is then not readable (not understandable). In fact ETL professionals will also find themselves further removed from this model. To the extent that a model is intuitive, self-descriptive, and aligned with business meaning, this approach takes a step in the other direction. • Moving towards addressing these business driven agility requirements casues the model itself to move much further away (an order of magnitude away) from the business. So far as to become effectively a technical solution utilizing only abstract representations.
  • 26. Hyper Agility • The context – the meaning of the data – will in these cases need to be managed in a different way. • This can include a form of persisted and historized metadata concerning the mappings and business rules. In effect a form of EAI within the DW. • Or it might include a more traditional secondary DW layer.
  • 27. DW AGILITY SUMMARY • Consider specific Agility Requirements • Classify Agility Types and consider Alternatives • Distinguish between operational integration and DW • Look to modeling techniques optimized for Data Warehouse • Look at entire picture – people, process, models and data • Consider specific methodologies, templates and tools • Determine if hyper agility is a requirement
  • 28. Questions? www.GeneseeAcademy.com CDVDM Certification Seminar June 23-24 October 27-28 © 2011 Genesee Academy, LLC Hans@GeneseeAcademy.com 25568 Genesee Trail Rd USA +1 303.526.0340 Golden, Colorado 80401 Sverige 070 250 2102 28