SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Building Enterprise OLAP on Hadoop
for Financial Services Industry
Luke Han
luke@kyligence.io | @lukehq
Co-founder & CEO of Kyligence
Creator & VP of Apache Kylin
Microsoft Regional Director & MVP
About Kyligence
• Formed by creators of Apache Kylin in 2016
• Offers Enterprise and Cloud version of Apache Kylin
• Funding from Redpoint, Cisco, CBC and Shunwei
• Member of Microsoft Accelerator Shanghai 2017
• Dual HQ in Silicon Valley & Shanghai, China
Kyligence booth: #855
Transition to Big Data…
How about your traditional data warehouse?
How about your existing OLAP/BI application?
Data Warehouse/OLAP
in Financial Services Industry
o The biggest industry rely on DW/OLAP
application
o Thousands applications build on top of EDW
o Experienced analysts with decade expertise
…in data…but not in technologies
Presentation
Visualization
OLAP
Data Mart
Enterprise
Data
Warehouse
Data
Source
o Optimized for mission-critical
analytics
o Well modeling
o Best practices of industry
o Thriving ecosystem
o Trained experts everywhere
Enterprise Data Warehouse Architecture
But
you are asked to…
o Migrate or build existing OLAP/BI app to Big Data
o Better performance…just because you have Big Data now
o Train yourself to learn MR/Spark/ML…and AI
Presentation
Visualization
Data
Lake
Data
Source
o Too many options
o Low performance
o Long learning curve
o Compatibility issue
o Technology vs Data
OLAP: The Missing Part of Big Data
Hive Impala Spark
SQL
Drill
MapReduce …Spark
Presentation
Visualization
Data
Lake
Data
Source
o MOLAP on Hadoop
o Simplified Data Modeling
o Optimized for aggregation
query
o ANSI SQL
o Native on Hadoop
o On-Prem & In the Cloud
Apache Kylin: Bring OLAP back to Big Data
OLAP
Data Mart
Hive Impala Spark SQL Drill
MapReduce …Spark
Kylin vs Hive: Star-Schema Benchmark
0.17 0.17 0.18
142.42
161.66
189.17
0
20
40
60
80
100
120
140
160
180
200
2 10 20
ResponseTime(seconds)
Data Volume (Scale Factor)
Apache Kylin vs. Apache Hive
(lower is better)
KAP
Apache Hive
* Based on 4 Nodes, 16 Core CPU, 96 GB Memory per node
Apache Kylin
Global Users
FSI
• ABC
• CCB
• CMB
• CPIC
• Citic Bank
• China
Unionpay
• HUATAI
Securities
• GUOTAI
Securities
• Lufax
Telecom
• China Mobile
• China Telecom
• Chine Unicom
• AT & T
Internet
• eBay
• Yahoo! Japan
• Baidu
• Meituan
• NetEase
• Expedia
• JD.com
• VIP.com
• 360
• Toutiao
Others
• MachineZone
• Glispa
• Inovex
• Adobe
• iFLYTEC
500+ use cases in production global
Manufacturing
• SAIC
• HUAWEI
• Lenovo
• OPPO
• XIAOMI
• VIVO
Data collected from public information and kylin community
Enterprise OLAP on Hadoop
Kyligence: Enterprise OLAP on Hadoop
Kyligence Robot
Online Optimize &
Tuning Services
Kyligence Analytics Platform (KAP)
Kyligence Solutions
Apache Kylin
Open Source
OLAP On Hadoop
KyStorage
Columnar Storage
KyStudio
Model Designer
KyManager
Administrator Tool
KyAnalyzer
Agile BI
Security
Cell Level ACL
On-Demand
Deployment
On-Premises Hybrid In the Cloud
Kyligence: Enterprise OLAP on Hadoop
Hive
Spark
SQL
Impala
Kyligence
Analytics Platform (KAP)
Mission Critical AnalyticsData Exploration/Discovery
Intelligent Cubing
by KAP
Query Pushdown: minutes latency Cube Access: sub-second latency
Support
Data Exploration and Discovery
TPC-DS
0
50
100
150
200
250
1 4 7 101316192225283134374043464952555861646770737679828588919497
KAP: TPC-DS
• Hive: 33 queries can’t support
or run out of time
• KAP: all 99 queries supported
• Routine query between SQL
on Hadoop and Apache Kylin
Speed Up
Mission Critical Analytics
TPC-H Benchmark
0
10
20
30
40
50
60
Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 Q19 Q20 Q21 Q22
KAP vs SparkSQL 2.1 (lower is better)
SparkSQL 2.1 KAP 2.4
Kyligence Studio: Data Modeling Designer
o Drag & Drop
o Smart Data Modeling
o Intelligent Optimization
Integrate
with Business Intelligence tools
Seamless Integration with BI tools
o KyAnalyzer
o Tableau
o Power BI/Excel
o IBM Cognos
o MicroStrategy
o Superset
o Zeppenlin
o Saiku
o …
Enhanced
Security and Management
Cell Level ACL/SSO/LDAP/Kerberos…
Use Case: CPIC
CPIC: China Pacific Insurance (Group) Co., LTD
• Global Fortune 500 insurance company
• Top 2 insurance company in China
• $40+ billion revenue
• 8+ million customers
• 97,000+ employees
Challenges
• Legacy IBM Cognos + DB2 solution can’t support Big Data scenarios
• Long waiting time (minutes ~ hours for reporting)
• Low concurrency (100,000+ employees!)
• High cost
2016.12
~
2017.01
KAP POC: Performance Testing
• Query Latency
• Concurrency
KAP POC: Compatibility
• Cognos Connection
• Cognos Syntax
2017.01
~
2017.03
Development
• Fixed Reports
• Flexible Reports
2017.03
~
2017.05
Go alive
• All dataset aggregation and testing
• Fixed Reports released
2017.05
~
2017.06
Journey of Kyligence Analytics Platform
• No changes on
Hadoop side
• No additional
engineers required
• Most of work done by
analysts
KAP + Cognos: Deployment
Dynamic Report
JDBC
Fixed Report
ODBC
KAP Query Server
Reporting & Dashboard OLAP & Data Mart Big Data Platform
Benefits after Adopting Kyligence
• One-stop BI platform generates complicated reports
• Over 90% queries return within 3 seconds (including high-dimensional
queries)
• Seamless integration with IBM Cognos, no change at front-end
• 2 KAP cubes replaced 2000+ IBM Cognos cubes
• Cost reduced significantly by adopting open source technology
Customer Quote
“Kyligence enables us to find valuable insights faster
from every insurance policy within seconds. Kyligence’s
platform allows us to achieve more with less. Our lean
management system has improved significantly”
-- Minchen Wu, Depute GM of IT, CPIC
Fusion Big Data Platform
• Open: Connect to Teradata/Greenplum and IBM Cognos/Saiku…
• Flexible: Self-Services for end users
• Efficiency: Speed up PC and Mobile analytics experience
China Construction Bank (CCB):
2nd Largest Bank in the World
“Apache Kylin is last piece of puzzle to
serving data asserts management
between legacy DW and new Big Data.”
-- Zhi Zhu, Vice Senior Manager of Tech Dept, CCB
Enterprise OLAP on Hadoop
Speed Up Mission Critical Analytics
Booth #855
luke@kyligence.io
http://kyligence.io

Weitere ähnliche Inhalte

Was ist angesagt?

SQL + Hadoop: The High Performance Advantage�
SQL + Hadoop:  The High Performance Advantage�SQL + Hadoop:  The High Performance Advantage�
SQL + Hadoop: The High Performance Advantage�Actian Corporation
 
Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...
Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...
Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...Yahoo Developer Network
 
Analytics at the Speed of Thought: Actian Express Overview
Analytics at the Speed of Thought: Actian Express Overview Analytics at the Speed of Thought: Actian Express Overview
Analytics at the Speed of Thought: Actian Express Overview Actian Corporation
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...DataWorks Summit/Hadoop Summit
 
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...Spark Summit
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftSnapLogic
 
InfoTrack: Creating a single source of truth with the Elastic Stack
InfoTrack: Creating a single source of truth with the Elastic StackInfoTrack: Creating a single source of truth with the Elastic Stack
InfoTrack: Creating a single source of truth with the Elastic StackElasticsearch
 
Pivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical OverviewPivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical OverviewVMware Tanzu
 
Turning Your Data Lake into Measurable Business Value
Turning Your Data Lake into Measurable Business ValueTurning Your Data Lake into Measurable Business Value
Turning Your Data Lake into Measurable Business ValueActian Corporation
 
Northwestern Mutual Journey – Transform BI Space to Cloud
Northwestern Mutual Journey – Transform BI Space to CloudNorthwestern Mutual Journey – Transform BI Space to Cloud
Northwestern Mutual Journey – Transform BI Space to CloudDatabricks
 
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...SnapLogic
 
Cloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastCloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastDatabricks
 
Life is but a Stream
Life is but a StreamLife is but a Stream
Life is but a StreamDatabricks
 
Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...
Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...
Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...Databricks
 
Apache Spark in Scientific Applciations
Apache Spark in Scientific ApplciationsApache Spark in Scientific Applciations
Apache Spark in Scientific ApplciationsDr. Mirko Kämpf
 
How to Operationalise Real-Time Hadoop in the Cloud
How to Operationalise Real-Time Hadoop in the CloudHow to Operationalise Real-Time Hadoop in the Cloud
How to Operationalise Real-Time Hadoop in the CloudAttunity
 
Getting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the CloudGetting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the CloudDatavail
 
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...Deepak Chandramouli
 
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...Codemotion
 
Offload, Transform, and Present - The New World of Data Integration
Offload, Transform, and Present - The New World of Data IntegrationOffload, Transform, and Present - The New World of Data Integration
Offload, Transform, and Present - The New World of Data Integrationgluent.
 

Was ist angesagt? (20)

SQL + Hadoop: The High Performance Advantage�
SQL + Hadoop:  The High Performance Advantage�SQL + Hadoop:  The High Performance Advantage�
SQL + Hadoop: The High Performance Advantage�
 
Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...
Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...
Apache Hadoop India Summit 2011 talk "Data Integration on Hadoop" by Sanjay K...
 
Analytics at the Speed of Thought: Actian Express Overview
Analytics at the Speed of Thought: Actian Express Overview Analytics at the Speed of Thought: Actian Express Overview
Analytics at the Speed of Thought: Actian Express Overview
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
 
InfoTrack: Creating a single source of truth with the Elastic Stack
InfoTrack: Creating a single source of truth with the Elastic StackInfoTrack: Creating a single source of truth with the Elastic Stack
InfoTrack: Creating a single source of truth with the Elastic Stack
 
Pivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical OverviewPivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical Overview
 
Turning Your Data Lake into Measurable Business Value
Turning Your Data Lake into Measurable Business ValueTurning Your Data Lake into Measurable Business Value
Turning Your Data Lake into Measurable Business Value
 
Northwestern Mutual Journey – Transform BI Space to Cloud
Northwestern Mutual Journey – Transform BI Space to CloudNorthwestern Mutual Journey – Transform BI Space to Cloud
Northwestern Mutual Journey – Transform BI Space to Cloud
 
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...
Weathering the Data Storm – How SnapLogic and AWS Deliver Analytics in the Cl...
 
Cloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastCloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and Fast
 
Life is but a Stream
Life is but a StreamLife is but a Stream
Life is but a Stream
 
Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...
Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...
Generative Hyperloop Design: Managing Massively Scaled Simulations Focused on...
 
Apache Spark in Scientific Applciations
Apache Spark in Scientific ApplciationsApache Spark in Scientific Applciations
Apache Spark in Scientific Applciations
 
How to Operationalise Real-Time Hadoop in the Cloud
How to Operationalise Real-Time Hadoop in the CloudHow to Operationalise Real-Time Hadoop in the Cloud
How to Operationalise Real-Time Hadoop in the Cloud
 
Getting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the CloudGetting Into the Business Intelligence Game: Migrating OBIA to the Cloud
Getting Into the Business Intelligence Game: Migrating OBIA to the Cloud
 
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
PayPal datalake journey | teradata - edge of next | san diego | 2017 october ...
 
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
 
Offload, Transform, and Present - The New World of Data Integration
Offload, Transform, and Present - The New World of Data IntegrationOffload, Transform, and Present - The New World of Data Integration
Offload, Transform, and Present - The New World of Data Integration
 

Ähnlich wie Building Enterprise OLAP on Hadoop for FSI

Cloud-native Semantic Layer on Data Lake
Cloud-native Semantic Layer on Data LakeCloud-native Semantic Layer on Data Lake
Cloud-native Semantic Layer on Data LakeDatabricks
 
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Tyler Wishnoff
 
Take the Bias out of Big Data Insights With Augmented Analytics
Take the Bias out of Big Data Insights With Augmented AnalyticsTake the Bias out of Big Data Insights With Augmented Analytics
Take the Bias out of Big Data Insights With Augmented AnalyticsTyler Wishnoff
 
Simplify Data Analytics Over the Cloud
Simplify Data Analytics Over the CloudSimplify Data Analytics Over the Cloud
Simplify Data Analytics Over the CloudTyler Wishnoff
 
Accelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache KylinAccelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache KylinTyler Wishnoff
 
Architecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceArchitecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceSamanthaBerlant
 
PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)Stratebi
 
Building Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft AzureBuilding Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft AzureDmitry Anoshin
 
AWS Partner Webcast - Reporting and Analytics in the Cloud
AWS Partner Webcast - Reporting and Analytics in the CloudAWS Partner Webcast - Reporting and Analytics in the Cloud
AWS Partner Webcast - Reporting and Analytics in the CloudAmazon Web Services
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceKaran Sachdeva
 
Amazon Redshift with Full 360 Inc.
Amazon Redshift with Full 360 Inc.Amazon Redshift with Full 360 Inc.
Amazon Redshift with Full 360 Inc.Amazon Web Services
 
The Impact of SMACT on the Data Management Stack
The Impact of SMACT on the Data Management StackThe Impact of SMACT on the Data Management Stack
The Impact of SMACT on the Data Management StackSnapLogic
 
CDS Overview (May 2015)
CDS Overview (May 2015)CDS Overview (May 2015)
CDS Overview (May 2015)Karim Lalji
 
Apache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive DatasetsApache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive DatasetsTyler Wishnoff
 
Power b iwithaiandhybridbievent_techcloudtree
Power b iwithaiandhybridbievent_techcloudtreePower b iwithaiandhybridbievent_techcloudtree
Power b iwithaiandhybridbievent_techcloudtreeMohamed Ashiq Faleel
 
Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...
Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...
Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...DataWorks Summit
 
U of A Web Strategy and Sitecore
U of A Web Strategy and SitecoreU of A Web Strategy and Sitecore
U of A Web Strategy and SitecoreTim Schneider
 

Ähnlich wie Building Enterprise OLAP on Hadoop for FSI (20)

Cloud-native Semantic Layer on Data Lake
Cloud-native Semantic Layer on Data LakeCloud-native Semantic Layer on Data Lake
Cloud-native Semantic Layer on Data Lake
 
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
 
Take the Bias out of Big Data Insights With Augmented Analytics
Take the Bias out of Big Data Insights With Augmented AnalyticsTake the Bias out of Big Data Insights With Augmented Analytics
Take the Bias out of Big Data Insights With Augmented Analytics
 
Simplify Data Analytics Over the Cloud
Simplify Data Analytics Over the CloudSimplify Data Analytics Over the Cloud
Simplify Data Analytics Over the Cloud
 
Accelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache KylinAccelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache Kylin
 
Architecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceArchitecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High Performance
 
PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)
 
Building Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft AzureBuilding Modern Data Platform with Microsoft Azure
Building Modern Data Platform with Microsoft Azure
 
AWS Partner Webcast - Reporting and Analytics in the Cloud
AWS Partner Webcast - Reporting and Analytics in the CloudAWS Partner Webcast - Reporting and Analytics in the Cloud
AWS Partner Webcast - Reporting and Analytics in the Cloud
 
Accelerating Data Warehouse Modernization
Accelerating Data Warehouse ModernizationAccelerating Data Warehouse Modernization
Accelerating Data Warehouse Modernization
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
 
Amazon Redshift with Full 360 Inc.
Amazon Redshift with Full 360 Inc.Amazon Redshift with Full 360 Inc.
Amazon Redshift with Full 360 Inc.
 
The Impact of SMACT on the Data Management Stack
The Impact of SMACT on the Data Management StackThe Impact of SMACT on the Data Management Stack
The Impact of SMACT on the Data Management Stack
 
ESGYN Overview
ESGYN OverviewESGYN Overview
ESGYN Overview
 
CDS Overview (May 2015)
CDS Overview (May 2015)CDS Overview (May 2015)
CDS Overview (May 2015)
 
Apache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive DatasetsApache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive Datasets
 
Apache Kylin 101
Apache Kylin 101Apache Kylin 101
Apache Kylin 101
 
Power b iwithaiandhybridbievent_techcloudtree
Power b iwithaiandhybridbievent_techcloudtreePower b iwithaiandhybridbievent_techcloudtree
Power b iwithaiandhybridbievent_techcloudtree
 
Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...
Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...
Can you Re-Platform your Teradata, Oracle, Netezza and SQL Server Analytic Wo...
 
U of A Web Strategy and Sitecore
U of A Web Strategy and SitecoreU of A Web Strategy and Sitecore
U of A Web Strategy and Sitecore
 

Mehr von Luke Han

Augmented OLAP for Big Data
Augmented OLAP for Big DataAugmented OLAP for Big Data
Augmented OLAP for Big DataLuke Han
 
The Evolution of Apache Kylin by Luke Han
The Evolution of Apache Kylin by Luke HanThe Evolution of Apache Kylin by Luke Han
The Evolution of Apache Kylin by Luke HanLuke Han
 
3. Apache Tez Introducation - Apache Kylin Meetup @Shanghai
3. Apache Tez Introducation - Apache Kylin Meetup @Shanghai3. Apache Tez Introducation - Apache Kylin Meetup @Shanghai
3. Apache Tez Introducation - Apache Kylin Meetup @ShanghaiLuke Han
 
5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @Shanghai
5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @Shanghai5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @Shanghai
5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @ShanghaiLuke Han
 
6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @Shanghai
6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @Shanghai6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @Shanghai
6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @ShanghaiLuke Han
 
4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @Shanghai
4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @Shanghai4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @Shanghai
4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @ShanghaiLuke Han
 
1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...
1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...
1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...Luke Han
 
Apache Kylin Open Source Journey for QCon2015 Beijing
Apache Kylin Open Source Journey for QCon2015 BeijingApache Kylin Open Source Journey for QCon2015 Beijing
Apache Kylin Open Source Journey for QCon2015 BeijingLuke Han
 
ApacheKylin_HBaseCon2015
ApacheKylin_HBaseCon2015ApacheKylin_HBaseCon2015
ApacheKylin_HBaseCon2015Luke Han
 
Apache Kylin Extreme OLAP Engine for Big Data
Apache Kylin Extreme OLAP Engine for Big DataApache Kylin Extreme OLAP Engine for Big Data
Apache Kylin Extreme OLAP Engine for Big DataLuke Han
 
Apache Kylin Introduction
Apache Kylin IntroductionApache Kylin Introduction
Apache Kylin IntroductionLuke Han
 
Adding Spark support to Kylin at Bay Area Spark Meetup
Adding Spark support to Kylin at Bay Area Spark MeetupAdding Spark support to Kylin at Bay Area Spark Meetup
Adding Spark support to Kylin at Bay Area Spark MeetupLuke Han
 
Apache kylin - Big Data Technology Conference 2014 Beijing
Apache kylin - Big Data Technology Conference 2014 BeijingApache kylin - Big Data Technology Conference 2014 Beijing
Apache kylin - Big Data Technology Conference 2014 BeijingLuke Han
 
Kylin OLAP Engine Tour
Kylin OLAP Engine TourKylin OLAP Engine Tour
Kylin OLAP Engine TourLuke Han
 
Actuate presentation 2011
Actuate presentation   2011Actuate presentation   2011
Actuate presentation 2011Luke Han
 

Mehr von Luke Han (15)

Augmented OLAP for Big Data
Augmented OLAP for Big DataAugmented OLAP for Big Data
Augmented OLAP for Big Data
 
The Evolution of Apache Kylin by Luke Han
The Evolution of Apache Kylin by Luke HanThe Evolution of Apache Kylin by Luke Han
The Evolution of Apache Kylin by Luke Han
 
3. Apache Tez Introducation - Apache Kylin Meetup @Shanghai
3. Apache Tez Introducation - Apache Kylin Meetup @Shanghai3. Apache Tez Introducation - Apache Kylin Meetup @Shanghai
3. Apache Tez Introducation - Apache Kylin Meetup @Shanghai
 
5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @Shanghai
5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @Shanghai5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @Shanghai
5. Apache Kylin的金融大数据应用场景 - Apache Kylin Meetup @Shanghai
 
6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @Shanghai
6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @Shanghai6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @Shanghai
6. Apache Kylin Roadmap and Community - Apache Kylin Meetup @Shanghai
 
4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @Shanghai
4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @Shanghai4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @Shanghai
4.Building a Data Product using apache Zeppelin - Apache Kylin Meetup @Shanghai
 
1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...
1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...
1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin ...
 
Apache Kylin Open Source Journey for QCon2015 Beijing
Apache Kylin Open Source Journey for QCon2015 BeijingApache Kylin Open Source Journey for QCon2015 Beijing
Apache Kylin Open Source Journey for QCon2015 Beijing
 
ApacheKylin_HBaseCon2015
ApacheKylin_HBaseCon2015ApacheKylin_HBaseCon2015
ApacheKylin_HBaseCon2015
 
Apache Kylin Extreme OLAP Engine for Big Data
Apache Kylin Extreme OLAP Engine for Big DataApache Kylin Extreme OLAP Engine for Big Data
Apache Kylin Extreme OLAP Engine for Big Data
 
Apache Kylin Introduction
Apache Kylin IntroductionApache Kylin Introduction
Apache Kylin Introduction
 
Adding Spark support to Kylin at Bay Area Spark Meetup
Adding Spark support to Kylin at Bay Area Spark MeetupAdding Spark support to Kylin at Bay Area Spark Meetup
Adding Spark support to Kylin at Bay Area Spark Meetup
 
Apache kylin - Big Data Technology Conference 2014 Beijing
Apache kylin - Big Data Technology Conference 2014 BeijingApache kylin - Big Data Technology Conference 2014 Beijing
Apache kylin - Big Data Technology Conference 2014 Beijing
 
Kylin OLAP Engine Tour
Kylin OLAP Engine TourKylin OLAP Engine Tour
Kylin OLAP Engine Tour
 
Actuate presentation 2011
Actuate presentation   2011Actuate presentation   2011
Actuate presentation 2011
 

Kürzlich hochgeladen

cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptrcbcrtm
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Developmentvyaparkranti
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfkalichargn70th171
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 

Kürzlich hochgeladen (20)

cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.ppt
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Development
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Odoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting ServiceOdoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting Service
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 

Building Enterprise OLAP on Hadoop for FSI

  • 1. Building Enterprise OLAP on Hadoop for Financial Services Industry Luke Han luke@kyligence.io | @lukehq Co-founder & CEO of Kyligence Creator & VP of Apache Kylin Microsoft Regional Director & MVP
  • 2. About Kyligence • Formed by creators of Apache Kylin in 2016 • Offers Enterprise and Cloud version of Apache Kylin • Funding from Redpoint, Cisco, CBC and Shunwei • Member of Microsoft Accelerator Shanghai 2017 • Dual HQ in Silicon Valley & Shanghai, China Kyligence booth: #855
  • 3. Transition to Big Data… How about your traditional data warehouse? How about your existing OLAP/BI application?
  • 4. Data Warehouse/OLAP in Financial Services Industry o The biggest industry rely on DW/OLAP application o Thousands applications build on top of EDW o Experienced analysts with decade expertise …in data…but not in technologies
  • 5. Presentation Visualization OLAP Data Mart Enterprise Data Warehouse Data Source o Optimized for mission-critical analytics o Well modeling o Best practices of industry o Thriving ecosystem o Trained experts everywhere Enterprise Data Warehouse Architecture
  • 6. But you are asked to… o Migrate or build existing OLAP/BI app to Big Data o Better performance…just because you have Big Data now o Train yourself to learn MR/Spark/ML…and AI
  • 7. Presentation Visualization Data Lake Data Source o Too many options o Low performance o Long learning curve o Compatibility issue o Technology vs Data OLAP: The Missing Part of Big Data Hive Impala Spark SQL Drill MapReduce …Spark
  • 8. Presentation Visualization Data Lake Data Source o MOLAP on Hadoop o Simplified Data Modeling o Optimized for aggregation query o ANSI SQL o Native on Hadoop o On-Prem & In the Cloud Apache Kylin: Bring OLAP back to Big Data OLAP Data Mart Hive Impala Spark SQL Drill MapReduce …Spark
  • 9. Kylin vs Hive: Star-Schema Benchmark 0.17 0.17 0.18 142.42 161.66 189.17 0 20 40 60 80 100 120 140 160 180 200 2 10 20 ResponseTime(seconds) Data Volume (Scale Factor) Apache Kylin vs. Apache Hive (lower is better) KAP Apache Hive * Based on 4 Nodes, 16 Core CPU, 96 GB Memory per node Apache Kylin
  • 10. Global Users FSI • ABC • CCB • CMB • CPIC • Citic Bank • China Unionpay • HUATAI Securities • GUOTAI Securities • Lufax Telecom • China Mobile • China Telecom • Chine Unicom • AT & T Internet • eBay • Yahoo! Japan • Baidu • Meituan • NetEase • Expedia • JD.com • VIP.com • 360 • Toutiao Others • MachineZone • Glispa • Inovex • Adobe • iFLYTEC 500+ use cases in production global Manufacturing • SAIC • HUAWEI • Lenovo • OPPO • XIAOMI • VIVO Data collected from public information and kylin community
  • 12. Kyligence: Enterprise OLAP on Hadoop Kyligence Robot Online Optimize & Tuning Services Kyligence Analytics Platform (KAP) Kyligence Solutions Apache Kylin Open Source OLAP On Hadoop KyStorage Columnar Storage KyStudio Model Designer KyManager Administrator Tool KyAnalyzer Agile BI Security Cell Level ACL On-Demand Deployment On-Premises Hybrid In the Cloud
  • 13. Kyligence: Enterprise OLAP on Hadoop Hive Spark SQL Impala Kyligence Analytics Platform (KAP) Mission Critical AnalyticsData Exploration/Discovery Intelligent Cubing by KAP Query Pushdown: minutes latency Cube Access: sub-second latency
  • 15. TPC-DS 0 50 100 150 200 250 1 4 7 101316192225283134374043464952555861646770737679828588919497 KAP: TPC-DS • Hive: 33 queries can’t support or run out of time • KAP: all 99 queries supported • Routine query between SQL on Hadoop and Apache Kylin
  • 17. TPC-H Benchmark 0 10 20 30 40 50 60 Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 Q19 Q20 Q21 Q22 KAP vs SparkSQL 2.1 (lower is better) SparkSQL 2.1 KAP 2.4
  • 18. Kyligence Studio: Data Modeling Designer o Drag & Drop o Smart Data Modeling o Intelligent Optimization
  • 20. Seamless Integration with BI tools o KyAnalyzer o Tableau o Power BI/Excel o IBM Cognos o MicroStrategy o Superset o Zeppenlin o Saiku o …
  • 21. Enhanced Security and Management Cell Level ACL/SSO/LDAP/Kerberos…
  • 23. CPIC: China Pacific Insurance (Group) Co., LTD • Global Fortune 500 insurance company • Top 2 insurance company in China • $40+ billion revenue • 8+ million customers • 97,000+ employees
  • 24. Challenges • Legacy IBM Cognos + DB2 solution can’t support Big Data scenarios • Long waiting time (minutes ~ hours for reporting) • Low concurrency (100,000+ employees!) • High cost
  • 25. 2016.12 ~ 2017.01 KAP POC: Performance Testing • Query Latency • Concurrency KAP POC: Compatibility • Cognos Connection • Cognos Syntax 2017.01 ~ 2017.03 Development • Fixed Reports • Flexible Reports 2017.03 ~ 2017.05 Go alive • All dataset aggregation and testing • Fixed Reports released 2017.05 ~ 2017.06 Journey of Kyligence Analytics Platform • No changes on Hadoop side • No additional engineers required • Most of work done by analysts
  • 26. KAP + Cognos: Deployment Dynamic Report JDBC Fixed Report ODBC KAP Query Server Reporting & Dashboard OLAP & Data Mart Big Data Platform
  • 27. Benefits after Adopting Kyligence • One-stop BI platform generates complicated reports • Over 90% queries return within 3 seconds (including high-dimensional queries) • Seamless integration with IBM Cognos, no change at front-end • 2 KAP cubes replaced 2000+ IBM Cognos cubes • Cost reduced significantly by adopting open source technology
  • 28. Customer Quote “Kyligence enables us to find valuable insights faster from every insurance policy within seconds. Kyligence’s platform allows us to achieve more with less. Our lean management system has improved significantly” -- Minchen Wu, Depute GM of IT, CPIC
  • 29. Fusion Big Data Platform • Open: Connect to Teradata/Greenplum and IBM Cognos/Saiku… • Flexible: Self-Services for end users • Efficiency: Speed up PC and Mobile analytics experience China Construction Bank (CCB): 2nd Largest Bank in the World “Apache Kylin is last piece of puzzle to serving data asserts management between legacy DW and new Big Data.” -- Zhi Zhu, Vice Senior Manager of Tech Dept, CCB
  • 30. Enterprise OLAP on Hadoop Speed Up Mission Critical Analytics Booth #855 luke@kyligence.io http://kyligence.io

Hinweis der Redaktion

  1. 好处是获取维度数据不依赖Cognos,高效方便;设计人员可以自由定义查询的SQL语句