SlideShare ist ein Scribd-Unternehmen logo
1 von 11
Data Warehousing
Lecture-8
Online Analytical Processing (OLAP)
1
Mamuna Fatima
DWH & OLAP
•Relationship between DWH &
OLAP
•Data Warehouse & OLAP go
together.
•Analysis supported by OLAP
2
Supporting the human thought process
How many such query sequences can be programmed in advance? 3
THOUGHT PROCESSTHOUGHT PROCESS QUERY SEQUENCEQUERY SEQUENCE
An enterprise wide fall in profit
Profit down by a large
percentage consistently during
last quarter only. Rest is OK
What is special about last
quarter ?
Products alone doing OK, but
North region is most
problematic.
What was the quarterly sales
during last year ??
What was the quarterly sales at
regional level during last year ??
What was the monthly sale for
last quarter group by products
What was the monthly sale of
products in north at store level
group by products purchased
OK. So the problem is the high
cost of products purchased
in north.
What was the quarterly sales at
product level during last year?
What was the monthly sale for
last quarter group by region
Analysis of last example
• Analysis is Ad-hoc
• Analysis is interactive (user driven)
• Analysis is iterative
– Answer to one question leads to a dozen more
• Analysis is directional
– Drill Down
– Roll Up
– Pivot
4
More in
subsequent
slides
Challenges…
• Not feasible to write predefined queries.
– Fails to remain user_driven (becomes programmer
driven).
– Fails to remain ad_hoc and hence is not interactive.
• Enable ad-hoc query support
– Business user can not build his/her own queries
(does not know SQL, should not know it).
– On_the_go SQL generation and execution too slow.
5
Challenges
• Contradiction
– Want to compute answers in advance, but don't
know the questions
• Solution
– Compute answers to “all” possible “queries”. But
how?
– NOTE: Queries are multidimensional aggregates at
some level
6
“All” possible queries (level aggregates)
7
Province Frontier Punjab...
Division MultanLahorePeshawarMardan ......
Lahore ... Gugranwala
City
Zone GulbergDefense ...
District LahorePeshawar
ALL ALLALL ALL
Where Does OLAP Fit In?
• It is a classification of applications, NOT a
database design technique.
• Analytical processing uses multi-level
aggregates, instead of record level access.
• Objective is to support very
I. fast
II. iterative and
III. ad-hoc decision-making.
8
Where does OLAP fit in?
9
Transaction
Data

Presentation
Tools
Reports
OLAP
Data Cube
(MOLAP)
Data
Loading

?
Decision
Maker
OLTP vs. OLAP
Feature OLTP OLAP
Level of data Detailed Aggregated
Amount of data per
transaction
Small Large
Views Pre-defined User-defined
Typical write
operation
Update, insert, delete Bulk insert
“age” of data Current (60-90 days) Historical 5-10 years and
also current
Number of users High Low-Med
Tables Flat tables Multi-Dimensional tables
Database size Med (109
B – 1012
B) High (1012
B – 1015
B)
Query Optimizing Requires experience Already “optimized”
Data availability High Low-Med
10
OLAP FASMI Test
Fast: Delivers information to the user at a fairly
constant rate. Most queries answered in under five
seconds.
Analysis: Performs basic numerical and statistical
analysis of the data, pre-defined by an application
developer or defined ad-hocly by the user.
Shared: Implements the security requirements
necessary for sharing potentially confidential data
across a large user population.
Multi-dimensional: The essential characteristic of
OLAP.
Information: Accesses all the data and information
necessary and relevant for the application, wherever
it may reside and not limited by volume.
...from the OLAP Report by Pendse and Creeth.
11

Weitere ähnliche Inhalte

Ähnlich wie Dwh lecture 10-olap

An Agile Approach to Machine Learning
An Agile Approach to Machine LearningAn Agile Approach to Machine Learning
An Agile Approach to Machine Learning
Randy Shoup
 
Market Research Meets Big Data Analytics for Business Transformation
Market Research Meets Big Data Analytics  for Business Transformation Market Research Meets Big Data Analytics  for Business Transformation
Market Research Meets Big Data Analytics for Business Transformation
Sally Sadosky
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisions
Vivastream
 

Ähnlich wie Dwh lecture 10-olap (20)

Lecture 10.ppt
Lecture 10.pptLecture 10.ppt
Lecture 10.ppt
 
Cs437 lecture 09
Cs437 lecture 09Cs437 lecture 09
Cs437 lecture 09
 
Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13
 
Kushal Data Warehousing PPT
Kushal Data Warehousing PPTKushal Data Warehousing PPT
Kushal Data Warehousing PPT
 
Business Intelligence Trends for University of Western Australia
Business Intelligence Trends for University of Western AustraliaBusiness Intelligence Trends for University of Western Australia
Business Intelligence Trends for University of Western Australia
 
Lecture 01.ppt
Lecture 01.pptLecture 01.ppt
Lecture 01.ppt
 
Lecture1
Lecture1Lecture1
Lecture1
 
05_Decision Support and OLAP.pdf
05_Decision Support and OLAP.pdf05_Decision Support and OLAP.pdf
05_Decision Support and OLAP.pdf
 
Online analytical processing
Online analytical processingOnline analytical processing
Online analytical processing
 
IS OLAP DEAD IN THE AGE OF BIG DATA?
IS OLAP DEAD IN THE AGE OF BIG DATA?IS OLAP DEAD IN THE AGE OF BIG DATA?
IS OLAP DEAD IN THE AGE OF BIG DATA?
 
Big Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesBig Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil Games
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
An Agile Approach to Machine Learning
An Agile Approach to Machine LearningAn Agile Approach to Machine Learning
An Agile Approach to Machine Learning
 
BI Introduction
BI IntroductionBI Introduction
BI Introduction
 
Selling Text Analytics to your boss
Selling Text Analytics to your bossSelling Text Analytics to your boss
Selling Text Analytics to your boss
 
Market Research Meets Big Data Analytics for Business Transformation
Market Research Meets Big Data Analytics  for Business Transformation Market Research Meets Big Data Analytics  for Business Transformation
Market Research Meets Big Data Analytics for Business Transformation
 
OLTPandOLAP.pdf
OLTPandOLAP.pdfOLTPandOLAP.pdf
OLTPandOLAP.pdf
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisions
 
Lecture 10
Lecture 10Lecture 10
Lecture 10
 
OLAP IN DATA MINING
OLAP IN DATA MININGOLAP IN DATA MINING
OLAP IN DATA MINING
 

Mehr von Sulman Ahmed

Data mining Basics and complete description onword
Data mining Basics and complete description onwordData mining Basics and complete description onword
Data mining Basics and complete description onword
Sulman Ahmed
 

Mehr von Sulman Ahmed (20)

Entrepreneurial Strategy Generating and Exploiting new entries
Entrepreneurial Strategy Generating and Exploiting new entriesEntrepreneurial Strategy Generating and Exploiting new entries
Entrepreneurial Strategy Generating and Exploiting new entries
 
Entrepreneurial Intentions and corporate entrepreneurship
Entrepreneurial Intentions and corporate entrepreneurshipEntrepreneurial Intentions and corporate entrepreneurship
Entrepreneurial Intentions and corporate entrepreneurship
 
Entrepreneurship main concepts and description
Entrepreneurship main concepts and descriptionEntrepreneurship main concepts and description
Entrepreneurship main concepts and description
 
Run time Verification using formal methods
Run time Verification using formal methodsRun time Verification using formal methods
Run time Verification using formal methods
 
Use of Formal Methods at Amazon Web Services
Use of Formal Methods at Amazon Web ServicesUse of Formal Methods at Amazon Web Services
Use of Formal Methods at Amazon Web Services
 
student learning App
student learning Appstudent learning App
student learning App
 
Software Engineering Economics Life Cycle.
Software Engineering Economics  Life Cycle.Software Engineering Economics  Life Cycle.
Software Engineering Economics Life Cycle.
 
Data mining Techniques
Data mining TechniquesData mining Techniques
Data mining Techniques
 
Rules of data mining
Rules of data miningRules of data mining
Rules of data mining
 
Rules of data mining
Rules of data miningRules of data mining
Rules of data mining
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining
 
Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description
 
Data mining Basics and complete description onword
Data mining Basics and complete description onwordData mining Basics and complete description onword
Data mining Basics and complete description onword
 
Dwh lecture-07-denormalization
Dwh lecture-07-denormalizationDwh lecture-07-denormalization
Dwh lecture-07-denormalization
 
Dwh lecture-06-normalization
Dwh lecture-06-normalizationDwh lecture-06-normalization
Dwh lecture-06-normalization
 
Dwh lecture 12-dm
Dwh lecture 12-dmDwh lecture 12-dm
Dwh lecture 12-dm
 
Dwh lecture 13-process dm
Dwh  lecture 13-process dmDwh  lecture 13-process dm
Dwh lecture 13-process dm
 
Dwh lecture 08-denormalization tech
Dwh   lecture 08-denormalization techDwh   lecture 08-denormalization tech
Dwh lecture 08-denormalization tech
 
Dwh lecture 07-denormalization
Dwh   lecture 07-denormalizationDwh   lecture 07-denormalization
Dwh lecture 07-denormalization
 
Wbs
WbsWbs
Wbs
 

Kürzlich hochgeladen

"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
mphochane1998
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Kürzlich hochgeladen (20)

AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
 
Bridge Jacking Design Sample Calculation.pptx
Bridge Jacking Design Sample Calculation.pptxBridge Jacking Design Sample Calculation.pptx
Bridge Jacking Design Sample Calculation.pptx
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Air Compressor reciprocating single stage
Air Compressor reciprocating single stageAir Compressor reciprocating single stage
Air Compressor reciprocating single stage
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 

Dwh lecture 10-olap

  • 1. Data Warehousing Lecture-8 Online Analytical Processing (OLAP) 1 Mamuna Fatima
  • 2. DWH & OLAP •Relationship between DWH & OLAP •Data Warehouse & OLAP go together. •Analysis supported by OLAP 2
  • 3. Supporting the human thought process How many such query sequences can be programmed in advance? 3 THOUGHT PROCESSTHOUGHT PROCESS QUERY SEQUENCEQUERY SEQUENCE An enterprise wide fall in profit Profit down by a large percentage consistently during last quarter only. Rest is OK What is special about last quarter ? Products alone doing OK, but North region is most problematic. What was the quarterly sales during last year ?? What was the quarterly sales at regional level during last year ?? What was the monthly sale for last quarter group by products What was the monthly sale of products in north at store level group by products purchased OK. So the problem is the high cost of products purchased in north. What was the quarterly sales at product level during last year? What was the monthly sale for last quarter group by region
  • 4. Analysis of last example • Analysis is Ad-hoc • Analysis is interactive (user driven) • Analysis is iterative – Answer to one question leads to a dozen more • Analysis is directional – Drill Down – Roll Up – Pivot 4 More in subsequent slides
  • 5. Challenges… • Not feasible to write predefined queries. – Fails to remain user_driven (becomes programmer driven). – Fails to remain ad_hoc and hence is not interactive. • Enable ad-hoc query support – Business user can not build his/her own queries (does not know SQL, should not know it). – On_the_go SQL generation and execution too slow. 5
  • 6. Challenges • Contradiction – Want to compute answers in advance, but don't know the questions • Solution – Compute answers to “all” possible “queries”. But how? – NOTE: Queries are multidimensional aggregates at some level 6
  • 7. “All” possible queries (level aggregates) 7 Province Frontier Punjab... Division MultanLahorePeshawarMardan ...... Lahore ... Gugranwala City Zone GulbergDefense ... District LahorePeshawar ALL ALLALL ALL
  • 8. Where Does OLAP Fit In? • It is a classification of applications, NOT a database design technique. • Analytical processing uses multi-level aggregates, instead of record level access. • Objective is to support very I. fast II. iterative and III. ad-hoc decision-making. 8
  • 9. Where does OLAP fit in? 9 Transaction Data  Presentation Tools Reports OLAP Data Cube (MOLAP) Data Loading  ? Decision Maker
  • 10. OLTP vs. OLAP Feature OLTP OLAP Level of data Detailed Aggregated Amount of data per transaction Small Large Views Pre-defined User-defined Typical write operation Update, insert, delete Bulk insert “age” of data Current (60-90 days) Historical 5-10 years and also current Number of users High Low-Med Tables Flat tables Multi-Dimensional tables Database size Med (109 B – 1012 B) High (1012 B – 1015 B) Query Optimizing Requires experience Already “optimized” Data availability High Low-Med 10
  • 11. OLAP FASMI Test Fast: Delivers information to the user at a fairly constant rate. Most queries answered in under five seconds. Analysis: Performs basic numerical and statistical analysis of the data, pre-defined by an application developer or defined ad-hocly by the user. Shared: Implements the security requirements necessary for sharing potentially confidential data across a large user population. Multi-dimensional: The essential characteristic of OLAP. Information: Accesses all the data and information necessary and relevant for the application, wherever it may reside and not limited by volume. ...from the OLAP Report by Pendse and Creeth. 11

Hinweis der Redaktion

  1. In olap nothing is online, it is online that it has requirement that ans should come under 5 sec, so an impression of online goes to user. Oltp sys are fully normalized and olap are highly denormalized. We will discuss the ways to make the olap. The concept of dwh without olap is impossible. When a dwh is developed, olap is used for analysis. Olap is an enviornment or tool which must need a dwh. Olap is not a physical implementation it is a framework, will discuss in detail.
  2. First see how the decision makers think and how could we fill it with the help of queries. The decision maker need the ans of questions so that he could take a decision and we have to mimic it with sql queries. A profit fall or drop occurs and decision maker wants to know why. So the corresponding query will be “what was the quarterly sales during last year” means he wants to know what was the sale in 4 quarters of last year. Then he wants to anlyse w.r.t region and further he see it from another angle that What was the quarterly sales at product level during last year? Now he see the data from a different angles, because it shouldn’t be the case that some informative area of data should not remain untouch. The ans of these queries will be displayed before him in the form of charts. So when he saw chart he found that rest is ok but the fall of profit occurred only in the last quarter. But then he wants to explore the data of last quarter and wants to explore it, so he say what is special about last quarter. So he say that what was the quarterly sales at product level during last year? And then he saw monthly sales w.r.t products. And then he saw products w.r.t region. So he identified the region which is problematic. Then he saw w.r.t another angle he say What was the monthly sale of products in north at store level group by products purchased So it revealed that the products purchased at north region was high but to meet the challenge of compititors it may not be feasible to sale the products on those prices, so the fall of profit occurs. Because when you take less profit automatically your gross profit reduces. You will remain in profit but its percentage will fall as compare to the previous years profit. So this is the thought process having support from the anallysis tool known as the olap. But here a question come that How many such query sequences can be programmed in advance? So can you write the sequence of queries, ans is no. because the questions coming in the mind of decision makers cant be told in advance, so he reveals by exploring from different angles. So don’t you need the speed, and also if you are a decision maker and wants to know the ans from IT people, and they say we will answer them after 2 months. So you will be frustrated. So now we have to design the syst by how your business runs.
  3. So last slide reveals. That query seq is adhoc, no one knows in adv which and how many queries will come first. So a major diff b/w olap and oltp sys. The tool is used by decision maker interactively i.e using mouse on screen. So an interactive process. It should be interactive otherwise thought process breaks. Another thing is the process is iterative, so you know the ans of one question then other then next. It has direction i.e. you aggregate or see the elements i.e. from week to month, or months to days. Piovting is the way to change the things in which respect you were seeing the data.
  4. You cannot program the queries in adv because you don’t know which queries will come, and you also don’t know how many queries are there. So these are challenges, so we can say, when a decision maker moves the mouse we track the mouse and generate the sql on the go. But suppose it is true, still you have to address the vldbs with billions of rows so how can you bring the answer within 5 secs.
  5. When we discuss analysis, we do not retrieve data at record level, decision maker wants to see the aggregates, then he needs to explore. So the all possible questions are actually all possible aggregates, and you give an enviorment to deciosn maker where he can explore them.
  6. Aggregates have hierarchy which could be drilled down.
  7. Olap is a framework, having different flavours. It has a few chractristics i.e the support of adhoc queires, which should be fast, and iterative, and a point missing that is interactive. So you have to see these chractristics in an application to consider it as the olap.
  8. A diagram from past. Here data from oltp systems loads in dwh and data is loaded in an implementation of olap here the implementation is multidimensional olap, it is a multidimensional data structure and we displayed 3 dimesions because cannot display four dimensions in an instant. Then generate graphical reports. The data cube contains the information based on the transaction data.
  9. Which tool or sys could considered as olap. They should have 5 chractristics. Shared: user must be able to see their specific data only. Multidimensional: databases or data structures should be multidimensional Info: it should not say I can show this info but not this. It should be able to show all info.