PRESENTATION
     ON
    DATA
WAREHOUSING



        Presented By:
        Jagnesh Chawla
        Manpreet Singh
        Mintu
CONTENTS:
 Meaning of data warehousing
 Benefits of data warehousing
 Problems
 Architecture of a data warehouse
 Main components
 Data flows
 Tools and technologies
 Data mart
MEANING:
   A data warehouse supports both data management
    and data analysis

   Goal: to integrate enterprise-wide corporate
    data into a single repository from which users can
    easily run queries
BENEFITS:
   A major benefit of data warehousing is a high
    return on investment

   Another is increased productivity of corporate
    decision-makers
PROBLEMS:
 Underestimation of resources for data loading
 Hidden problems with source systems
 Required data not captured
 Increased end-user demands
 Data homogenization
 High demand for resources
 Data ownership
 High maintenance
 Long-duration projects
 Complexity of integration
ARCHITECTURE:

   [Figure: Typical architecture of a data warehouse. Labeled components:
    operational data sources 1 to n; operational data store (ODS); load
    manager; warehouse manager; query manager; meta-data; detailed data;
    lightly summarized data; highly summarized data; DBMS; archive/backup
    data; end-user access tools (reporting, query, application development,
    and EIS (executive information system) tools; OLAP (online analytical
    processing) tools; data mining tools)]
MAIN COMPONENTS:
 Operational data sources: data for the DW is
  supplied from mainframe operational data held in
  first-generation hierarchical and network databases,
  departmental data held in proprietary file systems,
  private data held on workstations and private servers,
  and external systems such as the Internet,
  commercially available DBs, or DBs associated with
  an organization’s suppliers or customers
 Operational data store (ODS): a
  repository of current and integrated operational data
  used for analysis. It is often structured and supplied
  with data in the same way as the data warehouse, but
  may in fact simply act as a staging area for data to be
  moved into the warehouse
MAIN COMPONENTS:
 Query manager: also called the backend
 component, it performs all the operations
 associated with the management of user queries.
 The operations performed by this component
 include directing queries to the appropriate
 tables and scheduling the execution of queries
 End-user access tools: can be categorized into
 five main groups: data reporting and query tools,
 application development tools, executive
 information system (EIS) tools, online analytical
 processing (OLAP) tools, and data mining tools
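A rough sketch of the two query manager operations named above (directing queries to the appropriate tables and scheduling their execution). The table names, the grain labels, and the QueryManager class are hypothetical and chosen only for illustration; a real query manager is part of the warehouse product and is far more involved.

```python
import heapq

# Hypothetical catalogue: which table holds data at which level of detail.
TABLES = {
    "detail": "sales_detail",
    "light": "sales_by_day",
    "high": "sales_by_year",
}


def route(grain):
    """Direct a query to the table for the requested grain (detail by default)."""
    return TABLES.get(grain, TABLES["detail"])


class QueryManager:
    """Toy scheduler: lower priority number runs first, ties keep arrival order."""

    def __init__(self):
        self._queue = []
        self._arrival = 0

    def submit(self, priority, grain):
        # The arrival counter breaks ties so equal priorities run first-come.
        heapq.heappush(self._queue, (priority, self._arrival, route(grain)))
        self._arrival += 1

    def run_all(self):
        order = []
        while self._queue:
            _, _, table = heapq.heappop(self._queue)
            order.append(table)
        return order


qm = QueryManager()
qm.submit(2, "detail")
qm.submit(1, "high")
qm.submit(1, "light")
schedule = qm.run_all()
print(schedule)
```

Routing summary-level questions to the summarized tables is one plausible reason a warehouse keeps lightly and highly summarized data next to the detailed data.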
DATA FLOW:
 Inflow- The processes associated with the
  extraction, cleansing, and loading of the data
  from the source systems into the data warehouse
 Upflow- The processes associated with adding value
  to the data in the warehouse through
  summarizing, packaging, and
  distribution of the data
 Downflow- The processes associated with
  archiving and backing-up of data in the
  warehouse
DATA FLOW:
   Outflow- The processes associated with making the
    data available to the end-users




   Meta-flow- The processes associated with the
    management of the meta-data
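The five flows can be walked through on a toy scale with an in-memory SQLite database. This is only a sketch under assumptions: the table names, the sample rows, and the cut-off year for archiving are all made up for illustration.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
    CREATE TABLE detail   (region TEXT, year INTEGER, amount REAL);
    CREATE TABLE summary  (region TEXT, total REAL);
    CREATE TABLE archive  (region TEXT, year INTEGER, amount REAL);
    CREATE TABLE metadata (table_name TEXT, row_count INTEGER);
""")

# Inflow: extract rows from a (made-up) source, cleanse them, and load them.
source_rows = [(" north ", 2023, "100.0"), ("south", 2024, "250.5"), ("north", 2024, "50.0")]
clean_rows = [(r.strip().lower(), y, float(a)) for r, y, a in source_rows]
db.executemany("INSERT INTO detail VALUES (?, ?, ?)", clean_rows)

# Upflow: add value by summarizing the detailed data.
db.execute("INSERT INTO summary SELECT region, SUM(amount) FROM detail GROUP BY region")

# Downflow: archive older detailed data (the cut-off year is an assumption).
db.execute("INSERT INTO archive SELECT * FROM detail WHERE year < 2024")
db.execute("DELETE FROM detail WHERE year < 2024")

# Outflow: make the data available to end-users.
totals = dict(db.execute("SELECT region, total FROM summary"))

# Meta-flow: record meta-data about the warehouse tables.
for name in ("detail", "summary", "archive"):
    count = db.execute(f"SELECT COUNT(*) FROM {name}").fetchone()[0]
    db.execute("INSERT INTO metadata VALUES (?, ?)", (name, count))
counts = dict(db.execute("SELECT table_name, row_count FROM metadata"))

print(totals)
print(counts)
```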
   [Figure: Information flows of a data warehouse. Labeled flows: inflow
    (operational data sources and ODS into the load manager), upflow
    (detailed data to lightly and highly summarized data), downflow
    (to archive/backup data), outflow (through the query manager to
    end-user access tools), and meta-flow (meta-data)]
TOOLS AND TECHNOLOGIES:
   The critical steps in the construction of a data
    warehouse:


a. Extraction

b. Cleansing

c. Transformation
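The three steps can be sketched as three small functions applied in sequence. The CSV sample, the field names, and the cleansing and transformation rules below are assumptions made for illustration only; real rules depend on the source systems.

```python
import csv
import io

# Made-up export from a source system.
RAW = "id,name,amount\n1, Alice ,10.50\n2,,7\n3,Bob,abc\n4,Carol,3.25\n"


def extract(text):
    """Extraction: read records out of the source format."""
    return list(csv.DictReader(io.StringIO(text)))


def cleanse(rows):
    """Cleansing: drop records with a missing name or a non-numeric amount."""
    good = []
    for row in rows:
        name = row["name"].strip()
        try:
            amount = float(row["amount"])
        except ValueError:
            continue
        if name:
            good.append({"id": row["id"], "name": name, "amount": amount})
    return good


def transform(rows):
    """Transformation: convert to the target layout (integer ids, amounts in cents)."""
    return [(int(r["id"]), r["name"].upper(), round(r["amount"] * 100)) for r in rows]


result = transform(cleanse(extract(RAW)))
print(result)
```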
TOOLS AND TECHNOLOGIES:
   After the critical steps, loading the results into the
    target system can be carried out either by
    separate products or by a single product, which
    falls into one of these categories:

   code generators

   database data replication tools

   dynamic transformation engines
MANAGEMENT TOOLS:
   To manage the various types of meta-data and the
    day-to-day operations of the data warehouse, the
    administration and management tools must be
    capable of supporting the following tasks:

   Monitoring data loading from multiple sources

   Data quality and integrity checks

   Managing and updating meta-data

   Monitoring database performance to ensure efficient query
    response times and resource utilization
 Auditing data warehouse usage to provide user
  chargeback information
 Replicating, subsetting, and distributing data

 Maintaining efficient data storage management

 Purging data

 Archiving and backing-up data

 Implementing recovery following failure

 Security management
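As one small example from the task list above, a data quality and integrity check on a loaded batch might look like the sketch below. The function name, the field names, and the two rules (no missing required fields, no duplicate keys) are assumptions for illustration and do not describe any specific management tool.

```python
def check_batch(rows, key="id", required=("id", "amount")):
    """Return a list of (row_index, problem) pairs for one loaded batch."""
    problems = []
    seen = set()
    for i, row in enumerate(rows):
        # Integrity rule 1: every required field must be present.
        for field in required:
            if row.get(field) is None:
                problems.append((i, f"missing {field}"))
        # Integrity rule 2: the key must be unique within the batch.
        k = row.get(key)
        if k is not None:
            if k in seen:
                problems.append((i, f"duplicate {key}"))
            seen.add(k)
    return problems


batch = [
    {"id": 1, "amount": 10.0},
    {"id": 1, "amount": 12.0},
    {"id": 2, "amount": None},
    {"id": None, "amount": 5.0},
]
report = check_batch(batch)
print(report)
```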
DATA MART:
   A data mart is a subset of a data warehouse that
    supports the requirements of a particular
    department or business function

   The characteristics that differentiate data marts
    and data warehouses include:


   A data mart focuses on only the requirements of
    users associated with one department or business
    function
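The idea of a data mart as a departmental subset can be sketched with an in-memory SQLite database. The sales table, its columns, and the mart_ naming scheme are hypothetical; the sketch simply copies one department's rows into their own table.

```python
import sqlite3

# A made-up warehouse table holding rows for several departments.
wh = sqlite3.connect(":memory:")
wh.execute("CREATE TABLE sales (department TEXT, product TEXT, amount REAL)")
wh.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("marketing", "ads", 120.0), ("finance", "audit", 300.0), ("marketing", "events", 80.0)],
)


def build_data_mart(conn, department):
    """Copy only one department's rows into its own (hypothetical) mart table."""
    # The department name becomes part of a table name, so keep it simple.
    if not department.isidentifier():
        raise ValueError("department must be a simple name")
    table = f"mart_{department}"
    conn.execute(f"DROP TABLE IF EXISTS {table}")
    conn.execute(f"CREATE TABLE {table} (product TEXT, amount REAL)")
    conn.execute(
        f"INSERT INTO {table} SELECT product, amount FROM sales WHERE department = ?",
        (department,),
    )
    return table


mart = build_data_mart(wh, "marketing")
mart_rows = wh.execute(f"SELECT product, amount FROM {mart} ORDER BY product").fetchall()
print(mart, mart_rows)
```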
   [Figure: Typical data warehouse and data mart architecture. First tier:
    the data warehouse (operational data sources 1 to n, operational data
    store (ODS), load manager, warehouse manager, query manager, meta-data,
    detailed data, lightly and highly summarized data, DBMS, archive/backup
    data). Second tier: data marts holding summarized data in a relational
    database or a multi-dimensional database. Third tier: end-user access
    tools]
DATA MART ISSUES:
   Data mart functionality: the capabilities of data marts
    have increased with the growth in their popularity


   Data mart size: performance deteriorates as data
    marts grow in size, so the size of data marts needs to be
    reduced to gain improvements in performance


   Data mart load performance: two critical components are
    end-user response time and data loading performance. Loading
    can use incremental DB updating, so that only the cells affected
    by a change are updated and not the entire multi-dimensional
    database (MDDB) structure
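The difference between rebuilding the whole structure and updating only the affected cells can be sketched with a toy cube. The Cube class and its (region, month) cells are assumptions for illustration; a real MDDB stores and indexes its cells very differently.

```python
from collections import defaultdict


class Cube:
    """Toy multi-dimensional store: one total per (region, month) cell."""

    def __init__(self):
        self.cells = defaultdict(float)

    def full_rebuild(self, facts):
        """Recompute every cell from all facts."""
        self.cells.clear()
        for region, month, amount in facts:
            self.cells[(region, month)] += amount

    def apply_increment(self, new_facts):
        """Update only the cells affected by the new facts and report them."""
        affected = set()
        for region, month, amount in new_facts:
            self.cells[(region, month)] += amount
            affected.add((region, month))
        return affected


facts = [("north", "jan", 10.0), ("south", "jan", 20.0), ("north", "feb", 5.0)]
cube = Cube()
cube.full_rebuild(facts)
affected = cube.apply_increment([("north", "feb", 2.5)])
print(affected, cube.cells[("north", "feb")])
```

Here the increment touches one cell out of three, which is the saving the slide refers to.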
data warehousing

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseShanthi Mukkavilli
 
Fundamentals of Database system
Fundamentals of Database systemFundamentals of Database system
Fundamentals of Database systemphilipsinter
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etlAashish Rathod
 
Seminar datawarehousing
Seminar datawarehousingSeminar datawarehousing
Seminar datawarehousingKavisha Uniyal
 
Lesson 2 network database system
Lesson 2 network database systemLesson 2 network database system
Lesson 2 network database systemGiO Friginal
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional ModelingSunita Sahu
 
Lecture 01 introduction to database
Lecture 01 introduction to databaseLecture 01 introduction to database
Lecture 01 introduction to databaseemailharmeet
 
Machine Learning and AI at Oracle
Machine Learning and AI at OracleMachine Learning and AI at Oracle
Machine Learning and AI at OracleSandesh Rao
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEyad Manna
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Miningidnats
 

Was ist angesagt? (20)

Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Fundamentals of Database system
Fundamentals of Database systemFundamentals of Database system
Fundamentals of Database system
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
Active database
Active databaseActive database
Active database
 
Ppt
PptPpt
Ppt
 
Seminar datawarehousing
Seminar datawarehousingSeminar datawarehousing
Seminar datawarehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehousing ppt
Data warehousing pptData warehousing ppt
Data warehousing ppt
 
Lesson 2 network database system
Lesson 2 network database systemLesson 2 network database system
Lesson 2 network database system
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Data warehouse
Data warehouse Data warehouse
Data warehouse
 
Lecture 01 introduction to database
Lecture 01 introduction to databaseLecture 01 introduction to database
Lecture 01 introduction to database
 
Data models
Data modelsData models
Data models
 
Machine Learning and AI at Oracle
Machine Learning and AI at OracleMachine Learning and AI at Oracle
Machine Learning and AI at Oracle
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 

Andere mochten auch

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse conceptsobieefans
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingsumit621
 
Lecture 13
Lecture 13Lecture 13
Lecture 13Shani729
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data WarehousingAswathy S Nair
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Harish Chand
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Conceptsraulmisir
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationSunderland City Council
 
Types of databases
Types of databasesTypes of databases
Types of databasesPAQUIAAIZEL
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingJason S
 

Andere mochten auch (16)

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Lecture 13
Lecture 13Lecture 13
Lecture 13
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Database and types of databases
Database and types of databasesDatabase and types of databases
Database and types of databases
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)
 
Types of database
Types of databaseTypes of database
Types of database
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
 
wi-fi ppt
wi-fi pptwi-fi ppt
wi-fi ppt
 
Types dbms
Types dbmsTypes dbms
Types dbms
 
Types of databases
Types of databasesTypes of databases
Types of databases
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 

Ähnlich wie data warehousing

data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dworacle content
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research ManagementIDT Partners
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data WarehouseZalpa Rathod
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEZalpa Rathod
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Cana Ko
 
Informatica Interview Questions & Answers
Informatica Interview Questions & AnswersInformatica Interview Questions & Answers
Informatica Interview Questions & AnswersZaranTech LLC
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems divjeev
 
Establishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White PaperEstablishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White PaperJames Chi
 
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET Journal
 
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse EMC
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data SolutionsMark Kromer
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityDatabase Architechs
 
Introduction to data warehousing
Introduction to data warehousing   Introduction to data warehousing
Introduction to data warehousing Girish Dhareshwar
 

Ähnlich wie data warehousing (20)

data resource management
 data resource management data resource management
data resource management
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dw
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research Management
 
DW 101
DW 101DW 101
DW 101
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831
 
Informatica Interview Questions & Answers
Informatica Interview Questions & AnswersInformatica Interview Questions & Answers
Informatica Interview Questions & Answers
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems
 
Establishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White PaperEstablishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White Paper
 
Ch03
Ch03Ch03
Ch03
 
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
 
Data Management
Data ManagementData Management
Data Management
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data Quality
 
Introduction to data warehousing
Introduction to data warehousing   Introduction to data warehousing
Introduction to data warehousing
 

Kürzlich hochgeladen

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Kürzlich hochgeladen (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

data warehousing

  • 1. PRESENTATION ON DATA WAREHOUSING Presented By: Jagnesh Chawla Manpreet Singh Mintu
  • 2. CONTENTS:  Meaning Of data warehousing  Benefit of data warehousing  Problems  Architecture of data warehouse  Main components  Data flows  Tools and technologies  Data Mart
  • 3. MEANING:  Data warehouse is data management and data analysis  Goal: is to integrate enterprise wide corporate data into a single reository from which users can easily run queries
  • 4. BENEFITS:  The major benefit of data warehousing are high returns on investment.  Increased productivity of corporate decision- makers
  • 5. PROBLEMS:  Underestimation of resources for data loading  Hidden problems with source systems  Required data not captured  Increased end-user demands  Data homogenization  High demand for resources  Data ownership  High maintenance  Long-duration projects  Complexity of integration
  • 6. ARCHITECTURE: [Figure: Typical architecture of a data warehouse. Components shown: operational data sources 1 to n and the operational data store (ODS); load manager; warehouse manager; query manager; DBMS holding meta-data, detailed data, lightly summarized data and highly summarized data; archive/backup data; end-user access tools (reporting, query, application development and EIS (executive information system) tools, OLAP (online analytical processing) tools, and data mining tools).]
  • 7. MAIN COMPONENTS: Operational data sources: data for the DW is supplied from mainframe operational data held in first-generation hierarchical and network databases, departmental data held in proprietary file systems, private data held on workstations and private servers, and external systems such as the Internet, commercially available databases, or databases associated with an organization's suppliers or customers. Operational data store (ODS): a repository of current and integrated operational data used for analysis. It is often structured and supplied with data in the same way as the data warehouse, but may in fact simply act as a staging area for data to be moved into the warehouse.
  • 8. MAIN COMPONENTS: Query manager: also called the backend component, it performs all the operations associated with the management of user queries. These operations include directing queries to the appropriate tables and scheduling the execution of queries. End-user access tools: these can be categorized into five main groups: data reporting and query tools, application development tools, executive information system (EIS) tools, online analytical processing (OLAP) tools, and data mining tools.
  • 9. DATA FLOW: Inflow: the processes associated with the extraction, cleansing, and loading of the data from the source systems into the data warehouse. Upflow: the processes associated with adding value to the data in the warehouse through summarizing, packaging, and distribution of the data. Downflow: the processes associated with archiving and backing up the data in the warehouse.
  • 10. DATA FLOW: Outflow: the processes associated with making the data available to the end-users. Meta-flow: the processes associated with the management of the meta-data.
  • 11. [Figure: Information flows of a data warehouse. Shows inflow, upflow, downflow, outflow and meta-flow among the operational data sources, the operational data store (ODS), load manager, warehouse manager, query manager, DBMS (meta-data, detailed data, lightly and highly summarized data), archive/backup data, and the end-user access and data mining tools.]
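
The five flows named on slides 9 and 10 can be illustrated with a minimal sketch. It assumes a toy in-memory "warehouse" held in a Python dictionary; the structure, the function names and the sample rows are all hypothetical and stand in for real load, warehouse and query manager components.

```python
from collections import defaultdict

# Hypothetical in-memory warehouse; every name here is illustrative only.
warehouse = {"detailed": [], "summary": {}, "archive": [], "meta": {}}

def meta_flow(key, value):
    """Meta-flow: record meta-data describing the warehouse contents."""
    warehouse["meta"][key] = value

def inflow(source_rows):
    """Inflow: extract, cleanse and load rows from a source system."""
    cleaned = [
        {"region": r["region"].strip().title(), "amount": float(r["amount"])}
        for r in source_rows
        if r.get("region") and r.get("amount") is not None  # drop incomplete rows
    ]
    warehouse["detailed"].extend(cleaned)
    meta_flow("detailed_rows", len(warehouse["detailed"]))

def upflow():
    """Upflow: add value by summarizing the detailed data per region."""
    totals = defaultdict(float)
    for row in warehouse["detailed"]:
        totals[row["region"]] += row["amount"]
    warehouse["summary"] = dict(totals)
    meta_flow("summary_regions", len(totals))

def downflow():
    """Downflow: archive (back up) a copy of the detailed data."""
    warehouse["archive"] = [dict(row) for row in warehouse["detailed"]]
    meta_flow("archived_rows", len(warehouse["archive"]))

def outflow(region):
    """Outflow: make summarized data available to an end-user query."""
    return warehouse["summary"].get(region, 0.0)

inflow([
    {"region": " north ", "amount": "10.5"},
    {"region": "south", "amount": "4"},
    {"region": "", "amount": "99"},      # rejected: region missing
    {"region": "North", "amount": 2},
])
upflow()
downflow()
print(outflow("North"))  # 12.5
```

In a real warehouse each flow would be a scheduled process against a DBMS rather than a function call, but the direction of each flow is the same.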
  • 12. TOOLS AND TECHNOLOGIES:  The critical steps in the construction of a data warehouse: a. Extraction b. Cleansing c. Transformation
  • 13. TOOLS AND TECHNOLOGIES: After the critical steps, loading the results into the target system can be carried out either by separate products or by a single product. The tools fall into three categories: code generators; database data replication tools; dynamic transformation engines
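
As a rough illustration of the extraction, cleansing, transformation and loading steps from slides 12 and 13, the sketch below uses only the Python standard library. The CSV sample, the `sales` table and the cleansing rules (drop rows with no amount, drop exact duplicates) are assumptions made for this example; they are not taken from the slides or from any particular tool.

```python
import csv
import io
import sqlite3

# Hypothetical source extract; in practice this would come from a source system.
RAW_CSV = """customer,order_date,amount
alice,2024-01-05,100
BOB,2024-01-06,
alice,2024-01-05,100
carol,2024-02-01,250.50
"""

def extract(text):
    """Extraction: read raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))

def cleanse(rows):
    """Cleansing: drop rows with a missing amount and exact duplicates."""
    seen, clean = set(), []
    for row in rows:
        key = (row["customer"].lower(), row["order_date"], row["amount"])
        if not row["amount"] or key in seen:
            continue
        seen.add(key)
        clean.append(row)
    return clean

def transform(rows):
    """Transformation: normalize names, derive the month, convert types."""
    return [
        (row["customer"].lower(), row["order_date"][:7], float(row["amount"]))
        for row in rows
    ]

def load(conn, records):
    """Loading: write the transformed records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer TEXT, month TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()

conn = sqlite3.connect(":memory:")
load(conn, transform(cleanse(extract(RAW_CSV))))
rows = conn.execute(
    "SELECT customer, month, amount FROM sales ORDER BY customer"
).fetchall()
print(rows)  # [('alice', '2024-01', 100.0), ('carol', '2024-02', 250.5)]
```

Keeping each step as its own function mirrors the "separate products" option on the slide; a single product would bundle the same steps behind one interface.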
  • 14. MANAGEMENT TOOLS: To handle the various types of meta-data and the day-to-day operations of the data warehouse, the administration and management tools must be capable of supporting the following tasks: monitoring data loading from multiple sources; data quality and integrity checks; managing and updating meta-data; monitoring database performance to ensure efficient query response times and resource utilization
  • 15. Auditing data warehouse usage to provide user chargeback information; replicating, subsetting, and distributing data; maintaining efficient data storage management; purging data; archiving and backing up data; implementing recovery following failure; security management
  • 16. DATA MART: Data mart: a subset of a data warehouse that supports the requirements of a particular department or business function. The characteristics that differentiate data marts from data warehouses include: a data mart focuses only on the requirements of users associated with one department or business function
  • 17. [Figure: Typical data warehouse and data mart architecture. First tier: the data warehouse (operational data sources 1 to n, operational data store (ODS), load manager, warehouse manager, query manager, DBMS with meta-data, detailed, lightly summarized and highly summarized data, archive/backup data). Second tier: data marts holding summarized data in a relational database or a multi-dimensional database. Third tier: end-user access tools.]
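
The idea of a data mart as a departmental subset of the warehouse (slides 16 and 17) can be sketched with an in-memory SQLite database. The `warehouse_sales` and `marketing_mart` tables and their rows are invented for this example; a real data mart would usually live in its own database and be refreshed on a schedule.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Hypothetical detailed warehouse table covering several departments.
CREATE TABLE warehouse_sales (department TEXT, product TEXT, month TEXT, amount REAL);
INSERT INTO warehouse_sales VALUES
  ('marketing', 'widget', '2024-01', 120.0),
  ('marketing', 'widget', '2024-02', 80.0),
  ('finance',   'audit',  '2024-01', 300.0);

-- Data mart: summarized subset for one department only.
CREATE TABLE marketing_mart AS
  SELECT product, SUM(amount) AS total_amount
  FROM warehouse_sales
  WHERE department = 'marketing'
  GROUP BY product;
""")

mart_rows = conn.execute(
    "SELECT product, total_amount FROM marketing_mart"
).fetchall()
print(mart_rows)  # [('widget', 200.0)]
```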
  • 18. DATA MART ISSUES: Data mart functionality: the capabilities of data marts have increased with the growth in their popularity. Data mart size: performance deteriorates as data marts grow in size, so their size needs to be reduced to gain improvements in performance. Data mart load performance: the two critical components are end-user response time and data loading performance; incremental database updating can be used so that only the cells affected by a change are updated and not the entire MDDB (multi-dimensional database) structure.
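
The incremental updating mentioned under data mart load performance can be shown with a minimal sketch. It assumes the multi-dimensional structure is modelled as a dictionary keyed by (region, month) cells; the function names and sample facts are hypothetical. Only the cells touched by new facts are updated, and the result is checked against a full rebuild.

```python
from collections import defaultdict

def build_cube(facts):
    """Full rebuild: aggregate every fact into (region, month) cells."""
    cube = defaultdict(float)
    for region, month, amount in facts:
        cube[(region, month)] += amount
    return cube

def apply_increment(cube, new_facts):
    """Incremental load: update only the cells affected by the new facts."""
    touched = set()
    for region, month, amount in new_facts:
        cube[(region, month)] += amount
        touched.add((region, month))
    return touched

facts = [
    ("North", "2024-01", 100.0),
    ("South", "2024-01", 40.0),
    ("North", "2024-02", 60.0),
]
cube = build_cube(facts)

new_facts = [("North", "2024-02", 15.0), ("North", "2024-02", 5.0)]
touched = apply_increment(cube, new_facts)

# The incremental result matches a full rebuild, but only one cell was written.
assert cube == build_cube(facts + new_facts)
print(touched)                     # {('North', '2024-02')}
print(cube[("North", "2024-02")])  # 80.0
```

The cost of the incremental path grows with the number of new facts rather than with the size of the whole structure, which is the point the slide makes about load performance.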