SlideShare a Scribd company logo
1 of 38
Business Intelligence 
(BI) 
Lecturer: PhD Taras V. Panchenko 
Associate Professor 
@ Theory and Technology for Programming Chair 
@ Cybernetics Faculty 
@ National Taras Shevchenko University of Kyiv
Introduction 
Computers are useless. 
They can only give you answers. 
Pablo Picasso 
… Meaning that asking questions and true 
creativity are things that computers aren't 
capable of yet.
BI – Def(s) 
Business intelligence (BI) is the ability to apprehend the 
interrelationships of presented facts in such a way as to 
guide action towards a desired goal. 
Hans Peter Luhn, IBM, 1958 
BI is the transformation of raw data into meaningful and 
useful information for business analysis purposes. 
Wikipedia
BI – Def(s) 
Business intelligence (BI) BI is the transformation of raw 
data into meaningful and useful information for business 
analysis purposes. 
• BI can handle enormous amounts of unstructured data 
to help identify, develop and otherwise create new 
strategic business opportunities 
• BI allows for the easy interpretation of volumes of data 
• Identifying new opportunities and implementing an 
effective strategy can provide a competitive market 
advantage and long-term stability 
Wikipedia
BI – Def(s) 
Business intelligence (BI) is an umbrella term that 
includes the applications, infrastructure and tools, and 
best practices that enable access to and analysis of 
information to improve and optimize decisions and 
performance. 
Gartner 
BI is a set of methodologies, processes, architectures, 
and technologies that leverage the output of information 
management processes for analysis, reporting, 
performance management, and information delivery. 
Research coverage includes executive dashboards as well 
as query and reporting tools. 
Forrester
The Problem 
• Try to model and analyze activity of the bank 
– Develop the model: clients, accounts, currency, 
transactions, … 
– Possible questions to system/model from analyst 
• Performance (analytical query speed) 
• Dynamic reports & ad-hoc analysis 
• or: analyze sales of products by regions in time 
• Is RDBMS the best solution? 
… for Multidimensional model …
Multidimensional Analysis 
• Multidimensional (hyper-)cube:
Problem of Relational Database Model 
• Most notably lacking has been the ability to 
consolidate, view, and analyze data according 
to multiple dimensions, in ways that make 
sense to one or more specific enterprise 
analysts at any given point in time. This 
requirement is called “multidimensional data 
analysis.” 
E.F. Codd
Limitations: lack of … analytics 
• Until recently, the end-user products that had 
been developed as front-ends to the relational 
DBMS provided very straightforward simplistic 
functionality. The query/report writers and 
spreadsheets have been extremely limited in the 
ways in which data (having already been 
retrieved from the DBMS) can be aggregated, 
summarized, consolidated, summed, viewed, and 
analyzed. 
E.F. Codd
BI (or – partially – OLAP) 
• Is the solution 
• The only one “point of truth” 
– Contains all information about business 
(… or any subject area) in one place 
• Gives analytical & reporting means 
– Speed (performance) 
– Flexibility (many instruments)
BI is about 
• Decision Support Systems 
• Business Analytics 
• Complex & Comprehensive, Intelligent 
Reporting 
• Multidimensional Analysis (real-time) 
• “OLTP -> OLAP” – is the part of strategy 
– OLAP is the core of BI
BI core: Multidimensional engine 
(model, storage) 
• Multidimensional (hyper-)cube:
ETL = OLTP  OLAP 
• OnLine Transaction Processing System 
– accounting of transactions 
(E)xtract 
(T)ransform 
(L)oad 
• OnLine Analytical Processing System 
– gives analytical, intelligence (to transactional data)
OLAP (~Def.) 
• Is an approach to answering multi-dimensional 
analytical queries swiftly 
• Technology for information processing for 
quick answering on multidimensional 
analytical queries 
• Allows consolidation and analysis of data in a 
multidimensional space 
• Is not stand-alone! 
– but based on OLTP data
OLAP Applications 
• Business reporting 
– Sales 
– Marketing 
– Management etc. 
• Financial Reporting 
• Budgeting 
• Forecasting 
• Planning 
• Business Process Management 
… in any business (any subject area)
The Difference 
• OLTP: Operations 
– RDBMS 
• Large number of 
short transactions 
• 3NF, ER-model 
– ACID 
Atomicity, Consistency, 
Isolation, Durability 
– Business Process 
– Online, real-time info 
• OLAP: Information 
– Multidimensional 
• Complex queries 
involve aggregations 
• Sparse n-dim. spaces 
– Aggregates 
precalculated 
– Analytical Data 
Warehouse 
– Large historical data 
storage
Transaction vs. Analytical Approach 
Transaction Systems Analytical Systems 
Technology OLTP OLAP 
Data visualization Grid (Table) Pivot Table 
End-user visual querying QBE Cube browsing 
(drill-down, slice & dice) 
Query language SQL MDX + XMLA
OLTP vs. OLAP 
OLTP System – Online Transaction Processing 
(Operational System) 
OLAP System – Online Analytical Processing 
(Data Warehouse) 
Source of 
data 
Operational data; OLTPs are the original source of 
the data. 
Consolidation data; OLAP data comes from the various 
OLTP Databases 
Purpose of 
data 
To control and run fundamental business tasks 
To help with planning, problem solving, and decision 
support 
What the 
data 
Reveals a snapshot of ongoing business 
processes 
Multi-dimensional views of various kinds of business 
activities 
Inserts and 
Updates 
Short and fast inserts and updates initiated by 
end users 
Periodic long-running batch jobs refresh the data 
Queries 
Relatively standardized and simple queries 
Returning relatively few records 
Often complex queries involving aggregations 
Processing 
Speed 
Typically very fast 
Depends on the amount of data involved; batch data 
refreshes and complex queries may take many hours; 
query speed can be improved by creating indexes 
Space 
Require-ments 
Can be relatively small if historical data is 
archived 
Larger due to the existence of aggregation structures and 
history data; requires more indexes than OLTP 
Database 
Design 
Highly normalized with many tables 
Typically de-normalized with fewer tables; use of star 
and/or snowflake schemas 
Backup and 
Recovery 
Backup religiously; operational data is critical to 
run the business, data loss is likely to entail 
significant monetary loss and legal liability 
Instead of regular backups, some environments may 
consider simply reloading the OLTP data as a recovery 
method
BI includes 
• ETL procedure (= Extract – Transform – Load) 
– often: Data Warehouse, via Data Marts 
• OLAP Multidimensional Storage & Engine 
– Ad-hoc questions & multi-purpose querying 
• Reporting 
– flexible, interactive, dynamic, effective, … 
• Data Mining 
– clustering, associations, trends (time analysis), 
predictions, …
BI core is OLAP 
(engine & storage) 
• Multidimensional (hyper-)cube:
OLAP Concepts 
• Flexible Information Synthesis 
• Multiple Data Dimensions / 
/ Consolidation Paths 
(i.e. Multidimensional Conceptual View)
Data Consolidation 
• Dimension hierarchy
OLAP Characteristics 
• Dynamic Data Analysis 
• Four Enterprise Data Models 
– Categorical, Exegetical, Contemplative, and 
Formulaic Models (укр.: накопичення фактів – 
інтерпретація – аналіз – висновки, 
моделювання, прогнози, …) 
• Common Enterprise Data 
• Synergistic Implementation
OLAP Server Mediating Role
OLAP Product Evaluation Rules 
1. Multidimensional Conceptual View 
2. Transparency 
3. Accessibility 
4. Consistent Reporting Performance 
5. Client-Server Architecture 
6. Generic Dimensionality 
7. Dynamic Sparse Matrix Handling 
8. Multi-User Support 
9. Unrestricted Cross-dimensional Operations 
10. Intuitive Data Manipulation 
11. Flexible Reporting 
12. Unlimited Dimensions and Aggregation Levels 
Codd E.F., Codd S.B., and Salley C.T., 
“Providing OLAP to User-Analysts: An IT Mandate”,
Main OLAP features 
• Drill-down 
– Drill-up 
– Drill-through 
• Slice and dice 
– Pivoting
OLAP Multi-dimensional Modes 
• MOLAP = Multi-dimensional 
– Pure OLAP 
• ROLAP = Relational 
– OLAP requests -> relational backend 
• HOLAP = Hybrid 
– Aggregates – MOLAP 
– Basic facts – ROLAP 
– Inconsistences possible!
OLAP cube example
OLAP cube slicing (1)
OLAP cube slicing (2)
OLAP dicing
OLAP drill-up  drill-down
OLAP pivoting
BI Solutions. Client Sales
BI Solutions. Sales by territory
BI Solutions. Plan/Sales Analysis
BI core = OLAP engine & storage 
• Multidimensional (hyper-)cube:
BI Introduction 
• Business Intelligence enhances business (or 
any other area) vision and understanding 
• OLAP is a core of BI 
• BI includes 
– ETL (OLTP  Data Warehouse with Data Marts  
OLAP) 
– OLAP Multidimensional Storage & Engine 
– Reporting (multi-purpose, comprehensive) 
– Data Mining (clustering, associations, trends (time 
analysis), predictions, …)

More Related Content

What's hot

What's hot (20)

初めてのデータ分析基盤構築をまかされた、その時何を考えておくと良いのか
初めてのデータ分析基盤構築をまかされた、その時何を考えておくと良いのか初めてのデータ分析基盤構築をまかされた、その時何を考えておくと良いのか
初めてのデータ分析基盤構築をまかされた、その時何を考えておくと良いのか
 
OLTP vs OLAP
OLTP vs OLAPOLTP vs OLAP
OLTP vs OLAP
 
データ分析基盤について
データ分析基盤についてデータ分析基盤について
データ分析基盤について
 
マルチクラウドDWH(Snowflake)のすすめ
マルチクラウドDWH(Snowflake)のすすめマルチクラウドDWH(Snowflake)のすすめ
マルチクラウドDWH(Snowflake)のすすめ
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
 
データウェアハウスモデリング入門(ダイジェスト版)(事前公開版)
データウェアハウスモデリング入門(ダイジェスト版)(事前公開版) データウェアハウスモデリング入門(ダイジェスト版)(事前公開版)
データウェアハウスモデリング入門(ダイジェスト版)(事前公開版)
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
Implementation Approach of Artifical Intelligence
Implementation Approach of Artifical IntelligenceImplementation Approach of Artifical Intelligence
Implementation Approach of Artifical Intelligence
 
210519smartcity101
210519smartcity101210519smartcity101
210519smartcity101
 
Présentation data vault et bi v20120508
Présentation data vault et bi v20120508Présentation data vault et bi v20120508
Présentation data vault et bi v20120508
 
全社のデータ活用を一段階上げる取り組み
全社のデータ活用を一段階上げる取り組み全社のデータ活用を一段階上げる取り組み
全社のデータ活用を一段階上げる取り組み
 
アサヒのデータ活用基盤を支えるデータ仮想化技術
アサヒのデータ活用基盤を支えるデータ仮想化技術アサヒのデータ活用基盤を支えるデータ仮想化技術
アサヒのデータ活用基盤を支えるデータ仮想化技術
 
Snowflake Architecture and Performance(db tech showcase Tokyo 2018)
Snowflake Architecture and Performance(db tech showcase Tokyo 2018)Snowflake Architecture and Performance(db tech showcase Tokyo 2018)
Snowflake Architecture and Performance(db tech showcase Tokyo 2018)
 
データ分析を支える技術 データ分析基盤再入門
データ分析を支える技術 データ分析基盤再入門データ分析を支える技術 データ分析基盤再入門
データ分析を支える技術 データ分析基盤再入門
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
データ分析基盤、どう作る?システム設計のポイント、教えます - Developers.IO 2019 (20191101)
データ分析基盤、どう作る?システム設計のポイント、教えます - Developers.IO 2019 (20191101)データ分析基盤、どう作る?システム設計のポイント、教えます - Developers.IO 2019 (20191101)
データ分析基盤、どう作る?システム設計のポイント、教えます - Developers.IO 2019 (20191101)
 
Landing Self Service Analytics using Microsoft Azure & Power BI
Landing Self Service Analytics using Microsoft Azure & Power BILanding Self Service Analytics using Microsoft Azure & Power BI
Landing Self Service Analytics using Microsoft Azure & Power BI
 
Business Intelligence - A Management Perspective
Business Intelligence - A Management PerspectiveBusiness Intelligence - A Management Perspective
Business Intelligence - A Management Perspective
 
データ分析基盤を支えるエンジニアリング
データ分析基盤を支えるエンジニアリングデータ分析基盤を支えるエンジニアリング
データ分析基盤を支えるエンジニアリング
 
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | EdurekaData Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
 

Viewers also liked

Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho
 
Datawarehousing and Business Intelligence
Datawarehousing and Business IntelligenceDatawarehousing and Business Intelligence
Datawarehousing and Business Intelligence
Prithwis Mukerjee
 
Distributed Database System
Distributed Database SystemDistributed Database System
Distributed Database System
Sulemang
 

Viewers also liked (11)

Olap Cube Design
Olap Cube DesignOlap Cube Design
Olap Cube Design
 
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
 
Litebi Summer School - Introduction to Business Intelligence
Litebi  Summer School - Introduction to Business IntelligenceLitebi  Summer School - Introduction to Business Intelligence
Litebi Summer School - Introduction to Business Intelligence
 
An Introduction To BI
An Introduction To BIAn Introduction To BI
An Introduction To BI
 
Datawarehouse & bi introduction
Datawarehouse & bi introductionDatawarehouse & bi introduction
Datawarehouse & bi introduction
 
Datawarehousing and Business Intelligence
Datawarehousing and Business IntelligenceDatawarehousing and Business Intelligence
Datawarehousing and Business Intelligence
 
Bi Applications - Oracle
Bi Applications - OracleBi Applications - Oracle
Bi Applications - Oracle
 
ERP & BI
ERP & BIERP & BI
ERP & BI
 
Distributed Database System
Distributed Database SystemDistributed Database System
Distributed Database System
 
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
Ramco ERP on Cloud - The Best Cloud Computing Solution Worldwide
 
Introduction to Business Intelligence
Introduction to Business IntelligenceIntroduction to Business Intelligence
Introduction to Business Intelligence
 

Similar to BI Introduction

Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)
Muhammad Fahad
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Pentaho
 
The final frontier v3
The final frontier v3The final frontier v3
The final frontier v3
Terry Bunio
 

Similar to BI Introduction (20)

Business Intelligence Architecture
Business Intelligence ArchitectureBusiness Intelligence Architecture
Business Intelligence Architecture
 
Introduction to data mining and data warehousing
Introduction to data mining and data warehousingIntroduction to data mining and data warehousing
Introduction to data mining and data warehousing
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Lecture1
Lecture1Lecture1
Lecture1
 
IT webinar 2016
IT webinar 2016IT webinar 2016
IT webinar 2016
 
Kushal Data Warehousing PPT
Kushal Data Warehousing PPTKushal Data Warehousing PPT
Kushal Data Warehousing PPT
 
Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)Business Intelligence Presentation 1 (15th March'16)
Business Intelligence Presentation 1 (15th March'16)
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
Data Warehouse approaches with Dynamics AX
Data Warehouse  approaches with Dynamics AXData Warehouse  approaches with Dynamics AX
Data Warehouse approaches with Dynamics AX
 
From Business Intelligence to Big Data - hack/reduce Dec 2014
From Business Intelligence to Big Data - hack/reduce Dec 2014From Business Intelligence to Big Data - hack/reduce Dec 2014
From Business Intelligence to Big Data - hack/reduce Dec 2014
 
The final frontier v3
The final frontier v3The final frontier v3
The final frontier v3
 
3dw
3dw3dw
3dw
 
3dw
3dw3dw
3dw
 

Recently uploaded

一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
pyhepag
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
RafigAliyev2
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
pyhepag
 

Recently uploaded (20)

一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptxMALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Machine Learning for Accident Severity Prediction
Machine Learning for Accident Severity PredictionMachine Learning for Accident Severity Prediction
Machine Learning for Accident Severity Prediction
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 

BI Introduction

  • 1. Business Intelligence (BI) Lecturer: PhD Taras V. Panchenko Associate Professor @ Theory and Technology for Programming Chair @ Cybernetics Faculty @ National Taras Shevchenko University of Kyiv
  • 2. Introduction Computers are useless. They can only give you answers. Pablo Picasso … Meaning that asking questions and true creativity are things that computers aren't capable of yet.
  • 3. BI – Def(s) Business intelligence (BI) is the ability to apprehend the interrelationships of presented facts in such a way as to guide action towards a desired goal. Hans Peter Luhn, IBM, 1958 BI is the transformation of raw data into meaningful and useful information for business analysis purposes. Wikipedia
  • 4. BI – Def(s) Business intelligence (BI) BI is the transformation of raw data into meaningful and useful information for business analysis purposes. • BI can handle enormous amounts of unstructured data to help identify, develop and otherwise create new strategic business opportunities • BI allows for the easy interpretation of volumes of data • Identifying new opportunities and implementing an effective strategy can provide a competitive market advantage and long-term stability Wikipedia
  • 5. BI – Def(s) Business intelligence (BI) is an umbrella term that includes the applications, infrastructure and tools, and best practices that enable access to and analysis of information to improve and optimize decisions and performance. Gartner BI is a set of methodologies, processes, architectures, and technologies that leverage the output of information management processes for analysis, reporting, performance management, and information delivery. Research coverage includes executive dashboards as well as query and reporting tools. Forrester
  • 6. The Problem • Try to model and analyze activity of the bank – Develop the model: clients, accounts, currency, transactions, … – Possible questions to system/model from analyst • Performance (analytical query speed) • Dynamic reports & ad-hoc analysis • or: analyze sales of products by regions in time • Is RDBMS the best solution? … for Multidimensional model …
  • 7. Multidimensional Analysis • Multidimensional (hyper-)cube:
  • 8. Problem of Relational Database Model • Most notably lacking has been the ability to consolidate, view, and analyze data according to multiple dimensions, in ways that make sense to one or more specific enterprise analysts at any given point in time. This requirement is called “multidimensional data analysis.” E.F. Codd
  • 9. Limitations: lack of … analytics • Until recently, the end-user products that had been developed as front-ends to the relational DBMS provided very straightforward simplistic functionality. The query/report writers and spreadsheets have been extremely limited in the ways in which data (having already been retrieved from the DBMS) can be aggregated, summarized, consolidated, summed, viewed, and analyzed. E.F. Codd
  • 10. BI (or – partially – OLAP) • Is the solution • The only one “point of truth” – Contains all information about business (… or any subject area) in one place • Gives analytical & reporting means – Speed (performance) – Flexibility (many instruments)
  • 11. BI is about • Decision Support Systems • Business Analytics • Complex & Comprehensive, Intelligent Reporting • Multidimensional Analysis (real-time) • “OLTP -> OLAP” – is the part of strategy – OLAP is the core of BI
  • 12. BI core: Multidimensional engine (model, storage) • Multidimensional (hyper-)cube:
  • 13. ETL = OLTP  OLAP • OnLine Transaction Processing System – accounting of transactions (E)xtract (T)ransform (L)oad • OnLine Analytical Processing System – gives analytical, intelligence (to transactional data)
  • 14. OLAP (~Def.) • Is an approach to answering multi-dimensional analytical queries swiftly • Technology for information processing for quick answering on multidimensional analytical queries • Allows consolidation and analysis of data in a multidimensional space • Is not stand-alone! – but based on OLTP data
  • 15. OLAP Applications • Business reporting – Sales – Marketing – Management etc. • Financial Reporting • Budgeting • Forecasting • Planning • Business Process Management … in any business (any subject area)
  • 16. The Difference • OLTP: Operations – RDBMS • Large number of short transactions • 3NF, ER-model – ACID Atomicity, Consistency, Isolation, Durability – Business Process – Online, real-time info • OLAP: Information – Multidimensional • Complex queries involve aggregations • Sparse n-dim. spaces – Aggregates precalculated – Analytical Data Warehouse – Large historical data storage
  • 17. Transaction vs. Analytical Approach Transaction Systems Analytical Systems Technology OLTP OLAP Data visualization Grid (Table) Pivot Table End-user visual querying QBE Cube browsing (drill-down, slice & dice) Query language SQL MDX + XMLA
  • 18. OLTP vs. OLAP OLTP System – Online Transaction Processing (Operational System) OLAP System – Online Analytical Processing (Data Warehouse) Source of data Operational data; OLTPs are the original source of the data. Consolidation data; OLAP data comes from the various OLTP Databases Purpose of data To control and run fundamental business tasks To help with planning, problem solving, and decision support What the data Reveals a snapshot of ongoing business processes Multi-dimensional views of various kinds of business activities Inserts and Updates Short and fast inserts and updates initiated by end users Periodic long-running batch jobs refresh the data Queries Relatively standardized and simple queries Returning relatively few records Often complex queries involving aggregations Processing Speed Typically very fast Depends on the amount of data involved; batch data refreshes and complex queries may take many hours; query speed can be improved by creating indexes Space Require-ments Can be relatively small if historical data is archived Larger due to the existence of aggregation structures and history data; requires more indexes than OLTP Database Design Highly normalized with many tables Typically de-normalized with fewer tables; use of star and/or snowflake schemas Backup and Recovery Backup religiously; operational data is critical to run the business, data loss is likely to entail significant monetary loss and legal liability Instead of regular backups, some environments may consider simply reloading the OLTP data as a recovery method
  • 19. BI includes • ETL procedure (= Extract – Transform – Load) – often: Data Warehouse, via Data Marts • OLAP Multidimensional Storage & Engine – Ad-hoc questions & multi-purpose querying • Reporting – flexible, interactive, dynamic, effective, … • Data Mining – clustering, associations, trends (time analysis), predictions, …
  • 20. BI core is OLAP (engine & storage) • Multidimensional (hyper-)cube:
  • 21. OLAP Concepts • Flexible Information Synthesis • Multiple Data Dimensions / / Consolidation Paths (i.e. Multidimensional Conceptual View)
  • 22. Data Consolidation • Dimension hierarchy
  • 23. OLAP Characteristics • Dynamic Data Analysis • Four Enterprise Data Models – Categorical, Exegetical, Contemplative, and Formulaic Models (укр.: накопичення фактів – інтерпретація – аналіз – висновки, моделювання, прогнози, …) • Common Enterprise Data • Synergistic Implementation
  • 25. OLAP Product Evaluation Rules 1. Multidimensional Conceptual View 2. Transparency 3. Accessibility 4. Consistent Reporting Performance 5. Client-Server Architecture 6. Generic Dimensionality 7. Dynamic Sparse Matrix Handling 8. Multi-User Support 9. Unrestricted Cross-dimensional Operations 10. Intuitive Data Manipulation 11. Flexible Reporting 12. Unlimited Dimensions and Aggregation Levels Codd E.F., Codd S.B., and Salley C.T., “Providing OLAP to User-Analysts: An IT Mandate”,
  • 26. Main OLAP features • Drill-down – Drill-up – Drill-through • Slice and dice – Pivoting
  • 27. OLAP Multi-dimensional Modes • MOLAP = Multi-dimensional – Pure OLAP • ROLAP = Relational – OLAP requests -> relational backend • HOLAP = Hybrid – Aggregates – MOLAP – Basic facts – ROLAP – Inconsistences possible!
  • 32. OLAP drill-up  drill-down
  • 35. BI Solutions. Sales by territory
  • 37. BI core = OLAP engine & storage • Multidimensional (hyper-)cube:
  • 38. BI Introduction • Business Intelligence enhances business (or any other area) vision and understanding • OLAP is a core of BI • BI includes – ETL (OLTP  Data Warehouse with Data Marts  OLAP) – OLAP Multidimensional Storage & Engine – Reporting (multi-purpose, comprehensive) – Data Mining (clustering, associations, trends (time analysis), predictions, …)