SlideShare ist ein Scribd-Unternehmen logo
1 von 9
1
Informatica’s Data Virtualization Solution
Sanjay Krishnamurthi
May 28, 2013
2
Problem Statement : Takes too long to
get business the data it needs
On average, how long does it take to add a new
source of data to your data warehouse?
On average, how long does it take to create a complex
report or dashboard with about 20 dimensions, 12
measures, and 6 user access roles?
2011
TDWI BI BENCHMARK REPORT
Organizational and Performance Metrics
for Business Intelligence Teams
1 week
2 weeks
3 weeks
1 month
2 months
3 months
4-6 months
6 months or more
11%
7%
7%
22%
20%
14%
12%
7%
On average, how long does it take to change a
hierarchy (e.g. a new way of classifying products
or organizing sales regions)?
1 week
2 weeks
3 weeks
1 month
2 months
3 months
4-6 months
6 months or more
25%
13%
8%
25%
10%
6%
8%
4%
1 week
2 weeks
3 weeks
1 month
2 months
3 months
4-6 months
6 months or more
15%
9%
13%
16%
16%
16%
9%
5%
3
Why Does it Take So Long?
It takes too long to explain
requirements
It takes months to change a
DW/add new critical data
It takes many iterations to get
the right data/reports
Changes break integrations &
impact applications
Directly accessing operational
systems is not possible / ideal
Typical Data Integration Process
IT Has
a Huge Backlog
a
b
c
d
1
2
3
4
5
6
Design
Change
Integrate
Unit Test
Validate
Deploy
Business is
Involved Too Late
As-Is Value Stream Map (LOT OF WAIT & WASTE)
e
4
What is Needed?
PROFILE AND CLEASE DATA SO
IT CAN BE READILY TRUSTED
DELIVER REUSABLE DATA
SERVICES TO CONSUMERS
CREATE A COMMON ACCESS
LAYER ACROSS DATA SOURCES
Enterprise
Data Sources
Data
Virtualization
(Built-On Lean
Principles)…PRODUCTCUSTOMER ORDER
Logical View of All Underlying Data
QUICKLY & DIRECTLY ACCESS
DATA WITHOUT MOVEMENT
00110101
00100101
01011010
10010110
PortalBI Composite Apps
Data
Consumers
5
Informatica Proprietary/Confidential. Informational Purposes Only. No Representation, Warranty or
Commitment regarding Future Functionality. Not to be Relied Upon in Making Purchasing Decision.
Business IT
TRANSFORM IN RT
Advanced Transformations,
Data Quality, Data Masking
4
Virtual Table
Replicated
CRM
Accounts
ACCESS & MERGE
2
Virtual Table
PROFILE IN RT
Business
Manager
Analyst,
Steward
Developer,
Architect
Common
Metadata
3
Virtual Table
MODEL
Customer
Name
Address
Category
Orders
1
Virtual Table
CRM
SCALE & PERFORM
Accounts
7
Optimizations
& Caching
Virtual Table
MOVE OR FEDERATE
AccountsCall Center
DW
6
Virtual Table
REUSE INSTANTLY
Batch Web Services
5
Query
Engine
WS
Server
Virtual Table
CRM
Agile Data Platform
6
Data Virtualization :
Piece of Agile Data Platform Puzzle
• Provides a semantic access layer atop variety of data sources
• Data needs to be clean, masked etc.
• Pre-built library of advanced data transformations, e.g. merge
• Integrated real-time, on-the-fly data profiling & data quality
DW
BI
Virtual View
Access
Merge
Deliver
DW
Prototype
First
Move to DW
or Instantly Reuse
as SQL/WS
Advanced
Transformations &
Data Quality
Analyze & Profile
Data & Logic
Anytime
Early Business
Involvement
7
Informatica Proprietary/Confidential. Informational Purposes Only. No Representation, Warranty or
Commitment regarding Future Functionality. Not to be Relied Upon in Making Purchasing Decision.
Key Considerations
1000s of
lines of code
TIME COST
Maintenance
Nightmare
Model & metadata-
driven environment
TIME COST
Sustain &
Maintain
Enabling Rapid
Development
v/s
Profile data AND
logic anywhere
TIME COST RISK
Get it Right
1st Time
Only source profiling,
need extra processing
Many Iterations
& Mistakes
TIME COST RISK
Analyzing &
Profiling
v/s Hand-coding can’t do
advanced transforms
TIME COST RISK
SQL
XQuery
Simple Cleansing
Web Service
Limited Rules,
No Data Quality
Leverage pre-built
logic including quality
TIME COST RISK
Virtual Table
Bake-in
Quality
Integrating
with Quality
v/s
Naturally extend
your infrastructure
TIME COST
Re-purpose
Logic & Skills
TIME COST
Re-work, re-deploy &
re-train every time
Re-invent the
Wheel
Leveraging
Investments
v/s
Scaling with
Flexibility
v/s
Virtualize or physically
materialize in 1 tool
TIME COST
Prototype First
& Then Scale
EII
Optimizations
TIME COST
Overburden Data
Virtualization
EII
X
RISK
Non-integrated
technologies
8
Gartner Magic Quadrant for
Data Integration Tools, 2011
“The ability to switch seamlessly and transparently
between delivery modes (bulk / batch vs. granular
real-time vs. federation) with minimal rework will be
key for IT organizations seeking to develop a
successful data integration strategy.”
Ted Friedman, VP Distinguished Analyst, Gartner
Leveraging the Power of the Platform
“With v9, Informatica advanced its capabilities with
on-the-fly data quality and profiling, a model-driven
approach to provisioning data services, performance
enhancements, cloud integration, common metadata,
and role-specific tools.”
The Forrester Wave: Data Virtualization, Q1 2012
Forrester Wave: Data
Virtualization, Q1 ‘12
Power of
The Platform
9
Sign-Up
Expert Roundtables
Customer Webinars
Whitepapers, Articles, Blogs…
Data Virtualization Corner
Learn More…
JOIN & DISCUSS
2500+ Strong
“Data Virtualization & Data
Services Architecture” Group
Industry’s
1st INDEPENDENT
Book

Weitere ähnliche Inhalte

Was ist angesagt?

Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics
Pentaho
 

Was ist angesagt? (20)

Big Data in healthcare - opportunities and issues
Big Data in healthcare - opportunities and issuesBig Data in healthcare - opportunities and issues
Big Data in healthcare - opportunities and issues
 
Big data analytics in healthcare industry
Big data analytics in healthcare industryBig data analytics in healthcare industry
Big data analytics in healthcare industry
 
eBook - Data Analytics in Healthcare
eBook - Data Analytics in HealthcareeBook - Data Analytics in Healthcare
eBook - Data Analytics in Healthcare
 
Healthcare and Big Data - May 2017
Healthcare and Big Data -  May 2017Healthcare and Big Data -  May 2017
Healthcare and Big Data - May 2017
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in Healthcare
 
Baptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big DataBaptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big Data
 
HP & Sogeti Healthcare Big Data Presentation for Discover 2015
HP & Sogeti Healthcare Big Data Presentation for Discover 2015HP & Sogeti Healthcare Big Data Presentation for Discover 2015
HP & Sogeti Healthcare Big Data Presentation for Discover 2015
 
HealthCare and Big Data with Hadoop
HealthCare and Big Data with HadoopHealthCare and Big Data with Hadoop
HealthCare and Big Data with Hadoop
 
BioStorage Technologies Case Study: How to build an informatics platform usin...
BioStorage Technologies Case Study: How to build an informatics platform usin...BioStorage Technologies Case Study: How to build an informatics platform usin...
BioStorage Technologies Case Study: How to build an informatics platform usin...
 
Is Big Data a Big Deal... or Not?
Is Big Data a Big Deal... or Not?Is Big Data a Big Deal... or Not?
Is Big Data a Big Deal... or Not?
 
Healthcare Best Practices in Data Warehousing & Analytics
Healthcare Best Practices in Data Warehousing & AnalyticsHealthcare Best Practices in Data Warehousing & Analytics
Healthcare Best Practices in Data Warehousing & Analytics
 
Seeing Is Believing: How Clinical Trial Data Transparency is Changing How an...
Seeing Is Believing:  How Clinical Trial Data Transparency is Changing How an...Seeing Is Believing:  How Clinical Trial Data Transparency is Changing How an...
Seeing Is Believing: How Clinical Trial Data Transparency is Changing How an...
 
Deploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in HealthcareDeploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in Healthcare
 
Hadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHA
 
Big data in Healthcare & Life Sciences
Big data in Healthcare & Life SciencesBig data in Healthcare & Life Sciences
Big data in Healthcare & Life Sciences
 
Mergers, acquisitions, and partnerships dramatically reducing it consolidati...
Mergers, acquisitions, and partnerships  dramatically reducing it consolidati...Mergers, acquisitions, and partnerships  dramatically reducing it consolidati...
Mergers, acquisitions, and partnerships dramatically reducing it consolidati...
 
20 Years in Healthcare Analytics & Data Warehousing: What did we learn? What'...
20 Years in Healthcare Analytics & Data Warehousing: What did we learn? What'...20 Years in Healthcare Analytics & Data Warehousing: What did we learn? What'...
20 Years in Healthcare Analytics & Data Warehousing: What did we learn? What'...
 
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s GoingBig Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcare
 
Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics
 

Andere mochten auch

a2c Boston Big Data Meet-up: Agile Data Warehouse Design
a2c Boston Big Data Meet-up:  Agile Data Warehouse Designa2c Boston Big Data Meet-up:  Agile Data Warehouse Design
a2c Boston Big Data Meet-up: Agile Data Warehouse Design
a2c
 
Expt panel hive_data_rp_20130320_final-1
Expt panel hive_data_rp_20130320_final-1Expt panel hive_data_rp_20130320_final-1
Expt panel hive_data_rp_20130320_final-1
The Hive
 
Tomer Shiran, MapR_Hadoop&SQL
Tomer Shiran, MapR_Hadoop&SQLTomer Shiran, MapR_Hadoop&SQL
Tomer Shiran, MapR_Hadoop&SQL
The Hive
 
Leanplum_Controlled Experimentation_Panel_The Hive
Leanplum_Controlled Experimentation_Panel_The HiveLeanplum_Controlled Experimentation_Panel_The Hive
Leanplum_Controlled Experimentation_Panel_The Hive
The Hive
 
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
The Hive
 
1.nigam shah stanford_meetup
1.nigam shah stanford_meetup1.nigam shah stanford_meetup
1.nigam shah stanford_meetup
The Hive
 
Redbook
RedbookRedbook
Redbook
ens007
 
Very beautiful
Very beautifulVery beautiful
Very beautiful
asmaeazed
 

Andere mochten auch (20)

DW 101
DW 101DW 101
DW 101
 
SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)
SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)
SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)
 
a2c Boston Big Data Meet-up: Agile Data Warehouse Design
a2c Boston Big Data Meet-up:  Agile Data Warehouse Designa2c Boston Big Data Meet-up:  Agile Data Warehouse Design
a2c Boston Big Data Meet-up: Agile Data Warehouse Design
 
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
High Performance Big Data Loading for AWS: Deep Dive and Best Practices from ...
 
Informatica Cloud: Enterprise-Class Data Integration as a Service
 Informatica Cloud: Enterprise-Class Data Integration as a Service Informatica Cloud: Enterprise-Class Data Integration as a Service
Informatica Cloud: Enterprise-Class Data Integration as a Service
 
Expt panel hive_data_rp_20130320_final-1
Expt panel hive_data_rp_20130320_final-1Expt panel hive_data_rp_20130320_final-1
Expt panel hive_data_rp_20130320_final-1
 
Tomer Shiran, MapR_Hadoop&SQL
Tomer Shiran, MapR_Hadoop&SQLTomer Shiran, MapR_Hadoop&SQL
Tomer Shiran, MapR_Hadoop&SQL
 
Mumhsocialpdf
MumhsocialpdfMumhsocialpdf
Mumhsocialpdf
 
Leanplum_Controlled Experimentation_Panel_The Hive
Leanplum_Controlled Experimentation_Panel_The HiveLeanplum_Controlled Experimentation_Panel_The Hive
Leanplum_Controlled Experimentation_Panel_The Hive
 
San martin 2013 2014
San martin 2013 2014San martin 2013 2014
San martin 2013 2014
 
La musica
La musicaLa musica
La musica
 
Pre production planning
Pre production planningPre production planning
Pre production planning
 
Notes from the (greasy) field by Ranjit Nair - Co-founder and CTO, Altizon
Notes from the (greasy) field by Ranjit Nair - Co-founder and CTO, AltizonNotes from the (greasy) field by Ranjit Nair - Co-founder and CTO, Altizon
Notes from the (greasy) field by Ranjit Nair - Co-founder and CTO, Altizon
 
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
 
1.nigam shah stanford_meetup
1.nigam shah stanford_meetup1.nigam shah stanford_meetup
1.nigam shah stanford_meetup
 
Redbook
RedbookRedbook
Redbook
 
Very beautiful
Very beautifulVery beautiful
Very beautiful
 
Startup Series: Lean Analytics, Innovation, and Tilting at Windmills
Startup Series: Lean Analytics, Innovation, and Tilting at WindmillsStartup Series: Lean Analytics, Innovation, and Tilting at Windmills
Startup Series: Lean Analytics, Innovation, and Tilting at Windmills
 
Redefine healthcare with IT by Niranjan Thirumale
Redefine healthcare with IT by Niranjan ThirumaleRedefine healthcare with IT by Niranjan Thirumale
Redefine healthcare with IT by Niranjan Thirumale
 
Bizitzaren historia
Bizitzaren  historiaBizitzaren  historia
Bizitzaren historia
 

Ähnlich wie The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Architect, Informatica

Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Denodo
 
NetApp Tableau Presentation Final
NetApp Tableau Presentation FinalNetApp Tableau Presentation Final
NetApp Tableau Presentation Final
Mark Wu
 

Ähnlich wie The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Architect, Informatica (20)

Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time Analytics
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time Analytics
 
Introduction To SQL Server 2014
Introduction To SQL Server 2014Introduction To SQL Server 2014
Introduction To SQL Server 2014
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
NetApp Tableau Presentation Final
NetApp Tableau Presentation FinalNetApp Tableau Presentation Final
NetApp Tableau Presentation Final
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the Enterprise
 
Defining and Applying Data Governance in Today’s Business Environment
Defining and Applying Data Governance in Today’s Business EnvironmentDefining and Applying Data Governance in Today’s Business Environment
Defining and Applying Data Governance in Today’s Business Environment
 
How Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom LineHow Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom Line
 
Process Automation Trends in SAP® Supply Chain for 2023
Process Automation Trends in SAP® Supply Chain for 2023Process Automation Trends in SAP® Supply Chain for 2023
Process Automation Trends in SAP® Supply Chain for 2023
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on Hadoop
 
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with Cloudera
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 

Mehr von The Hive

The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive
 

Mehr von The Hive (20)

"Responsible AI", by Charlie Muirhead
"Responsible AI", by Charlie Muirhead"Responsible AI", by Charlie Muirhead
"Responsible AI", by Charlie Muirhead
 
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
 
Digital Transformation; Digital Twins for Delivering Business Value in IIoT
Digital Transformation; Digital Twins for Delivering Business Value in IIoTDigital Transformation; Digital Twins for Delivering Business Value in IIoT
Digital Transformation; Digital Twins for Delivering Business Value in IIoT
 
Quantum Computing (IBM Q) - Hive Think Tank Event w/ Dr. Bob Sutor - 02.22.18
Quantum Computing (IBM Q) - Hive Think Tank Event w/ Dr. Bob Sutor - 02.22.18Quantum Computing (IBM Q) - Hive Think Tank Event w/ Dr. Bob Sutor - 02.22.18
Quantum Computing (IBM Q) - Hive Think Tank Event w/ Dr. Bob Sutor - 02.22.18
 
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
 
Data Science in the Enterprise
Data Science in the EnterpriseData Science in the Enterprise
Data Science in the Enterprise
 
AI in Software for Augmenting Intelligence Across the Enterprise
AI in Software for Augmenting Intelligence Across the EnterpriseAI in Software for Augmenting Intelligence Across the Enterprise
AI in Software for Augmenting Intelligence Across the Enterprise
 
“ High Precision Analytics for Healthcare: Promises and Challenges” by Sriram...
“ High Precision Analytics for Healthcare: Promises and Challenges” by Sriram...“ High Precision Analytics for Healthcare: Promises and Challenges” by Sriram...
“ High Precision Analytics for Healthcare: Promises and Challenges” by Sriram...
 
"The Future of Manufacturing" by Sujeet Chand, SVP&CTO, Rockwell Automation
"The Future of Manufacturing" by Sujeet Chand, SVP&CTO, Rockwell Automation"The Future of Manufacturing" by Sujeet Chand, SVP&CTO, Rockwell Automation
"The Future of Manufacturing" by Sujeet Chand, SVP&CTO, Rockwell Automation
 
Social Impact & Ethics of AI by Steve Omohundro
Social Impact & Ethics of AI by Steve OmohundroSocial Impact & Ethics of AI by Steve Omohundro
Social Impact & Ethics of AI by Steve Omohundro
 
The Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
The Hive Think Tank: AI in The Enterprise by Venkat SrinivasanThe Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
The Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
 
The Hive Think Tank: Machine Learning Applications in Genomics by Prof. Jian ...
The Hive Think Tank: Machine Learning Applications in Genomics by Prof. Jian ...The Hive Think Tank: Machine Learning Applications in Genomics by Prof. Jian ...
The Hive Think Tank: Machine Learning Applications in Genomics by Prof. Jian ...
 
The Hive Think Tank: The Future Of Customer Support - AI Driven Automation
The Hive Think Tank: The Future Of Customer Support - AI Driven AutomationThe Hive Think Tank: The Future Of Customer Support - AI Driven Automation
The Hive Think Tank: The Future Of Customer Support - AI Driven Automation
 
The Hive Think Tank: Talk by Mohandas Pai - India at 2030, How Tech Entrepren...
The Hive Think Tank: Talk by Mohandas Pai - India at 2030, How Tech Entrepren...The Hive Think Tank: Talk by Mohandas Pai - India at 2030, How Tech Entrepren...
The Hive Think Tank: Talk by Mohandas Pai - India at 2030, How Tech Entrepren...
 
The Hive Think Tank: The Content Trap - Strategist's Guide to Digital Change
The Hive Think Tank: The Content Trap - Strategist's Guide to Digital ChangeThe Hive Think Tank: The Content Trap - Strategist's Guide to Digital Change
The Hive Think Tank: The Content Trap - Strategist's Guide to Digital Change
 
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra MalikDeep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
 
The Hive Think Tank: Heron at Twitter
The Hive Think Tank: Heron at TwitterThe Hive Think Tank: Heron at Twitter
The Hive Think Tank: Heron at Twitter
 
The Hive Think Tank: Unpacking AI for Healthcare
The Hive Think Tank: Unpacking AI for Healthcare The Hive Think Tank: Unpacking AI for Healthcare
The Hive Think Tank: Unpacking AI for Healthcare
 
The Hive Think Tank: Translating IoT into Innovation at Every Level by Prith ...
The Hive Think Tank: Translating IoT into Innovation at Every Level by Prith ...The Hive Think Tank: Translating IoT into Innovation at Every Level by Prith ...
The Hive Think Tank: Translating IoT into Innovation at Every Level by Prith ...
 
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Architect, Informatica

  • 1. 1 Informatica’s Data Virtualization Solution Sanjay Krishnamurthi May 28, 2013
  • 2. 2 Problem Statement : Takes too long to get business the data it needs On average, how long does it take to add a new source of data to your data warehouse? On average, how long does it take to create a complex report or dashboard with about 20 dimensions, 12 measures, and 6 user access roles? 2011 TDWI BI BENCHMARK REPORT Organizational and Performance Metrics for Business Intelligence Teams 1 week 2 weeks 3 weeks 1 month 2 months 3 months 4-6 months 6 months or more 11% 7% 7% 22% 20% 14% 12% 7% On average, how long does it take to change a hierarchy (e.g. a new way of classifying products or organizing sales regions)? 1 week 2 weeks 3 weeks 1 month 2 months 3 months 4-6 months 6 months or more 25% 13% 8% 25% 10% 6% 8% 4% 1 week 2 weeks 3 weeks 1 month 2 months 3 months 4-6 months 6 months or more 15% 9% 13% 16% 16% 16% 9% 5%
  • 3. 3 Why Does it Take So Long? It takes too long to explain requirements It takes months to change a DW/add new critical data It takes many iterations to get the right data/reports Changes break integrations & impact applications Directly accessing operational systems is not possible / ideal Typical Data Integration Process IT Has a Huge Backlog a b c d 1 2 3 4 5 6 Design Change Integrate Unit Test Validate Deploy Business is Involved Too Late As-Is Value Stream Map (LOT OF WAIT & WASTE) e
  • 4. 4 What is Needed? PROFILE AND CLEASE DATA SO IT CAN BE READILY TRUSTED DELIVER REUSABLE DATA SERVICES TO CONSUMERS CREATE A COMMON ACCESS LAYER ACROSS DATA SOURCES Enterprise Data Sources Data Virtualization (Built-On Lean Principles)…PRODUCTCUSTOMER ORDER Logical View of All Underlying Data QUICKLY & DIRECTLY ACCESS DATA WITHOUT MOVEMENT 00110101 00100101 01011010 10010110 PortalBI Composite Apps Data Consumers
  • 5. 5 Informatica Proprietary/Confidential. Informational Purposes Only. No Representation, Warranty or Commitment regarding Future Functionality. Not to be Relied Upon in Making Purchasing Decision. Business IT TRANSFORM IN RT Advanced Transformations, Data Quality, Data Masking 4 Virtual Table Replicated CRM Accounts ACCESS & MERGE 2 Virtual Table PROFILE IN RT Business Manager Analyst, Steward Developer, Architect Common Metadata 3 Virtual Table MODEL Customer Name Address Category Orders 1 Virtual Table CRM SCALE & PERFORM Accounts 7 Optimizations & Caching Virtual Table MOVE OR FEDERATE AccountsCall Center DW 6 Virtual Table REUSE INSTANTLY Batch Web Services 5 Query Engine WS Server Virtual Table CRM Agile Data Platform
  • 6. 6 Data Virtualization : Piece of Agile Data Platform Puzzle • Provides a semantic access layer atop variety of data sources • Data needs to be clean, masked etc. • Pre-built library of advanced data transformations, e.g. merge • Integrated real-time, on-the-fly data profiling & data quality DW BI Virtual View Access Merge Deliver DW Prototype First Move to DW or Instantly Reuse as SQL/WS Advanced Transformations & Data Quality Analyze & Profile Data & Logic Anytime Early Business Involvement
  • 7. 7 Informatica Proprietary/Confidential. Informational Purposes Only. No Representation, Warranty or Commitment regarding Future Functionality. Not to be Relied Upon in Making Purchasing Decision. Key Considerations 1000s of lines of code TIME COST Maintenance Nightmare Model & metadata- driven environment TIME COST Sustain & Maintain Enabling Rapid Development v/s Profile data AND logic anywhere TIME COST RISK Get it Right 1st Time Only source profiling, need extra processing Many Iterations & Mistakes TIME COST RISK Analyzing & Profiling v/s Hand-coding can’t do advanced transforms TIME COST RISK SQL XQuery Simple Cleansing Web Service Limited Rules, No Data Quality Leverage pre-built logic including quality TIME COST RISK Virtual Table Bake-in Quality Integrating with Quality v/s Naturally extend your infrastructure TIME COST Re-purpose Logic & Skills TIME COST Re-work, re-deploy & re-train every time Re-invent the Wheel Leveraging Investments v/s Scaling with Flexibility v/s Virtualize or physically materialize in 1 tool TIME COST Prototype First & Then Scale EII Optimizations TIME COST Overburden Data Virtualization EII X RISK Non-integrated technologies
  • 8. 8 Gartner Magic Quadrant for Data Integration Tools, 2011 “The ability to switch seamlessly and transparently between delivery modes (bulk / batch vs. granular real-time vs. federation) with minimal rework will be key for IT organizations seeking to develop a successful data integration strategy.” Ted Friedman, VP Distinguished Analyst, Gartner Leveraging the Power of the Platform “With v9, Informatica advanced its capabilities with on-the-fly data quality and profiling, a model-driven approach to provisioning data services, performance enhancements, cloud integration, common metadata, and role-specific tools.” The Forrester Wave: Data Virtualization, Q1 2012 Forrester Wave: Data Virtualization, Q1 ‘12 Power of The Platform
  • 9. 9 Sign-Up Expert Roundtables Customer Webinars Whitepapers, Articles, Blogs… Data Virtualization Corner Learn More… JOIN & DISCUSS 2500+ Strong “Data Virtualization & Data Services Architecture” Group Industry’s 1st INDEPENDENT Book

Hinweis der Redaktion

  1. The problem is that today it takes too long to deliver new critical data or reports to the business…You can see this from the results of the 2011 TDWI BI Benchmark Report, on an average, it takes months to add a new source of data to a data warehouse.
  2. Let’s see why it takes so long.As a first step, it is important to take into account a typical data integration process, which is by nature, multi-step and involves the business when it is too late. At this point, if the business wants changes or needs other data or identifies inaccuracies, getting IT’s help means going back into a queue and waiting for IT as they work through their backlog of requests.The reasons for this delay in delivering new data and reports are manifoldIt takes too long for the Business to explain requirements to ITIt takes months for IT to change a DW / add new critical dataIt takes many iterations between Business and IT to get the right data / reportsAny changes in the underlying data sources break integrations and impact consuming applicationsDirectly accessing operational systems is not possible / ideal
  3. Finally, Informatica enables business and IT to deliver a current, complete, and trusted view of the business – within days vs. months. It does this by:Creating a common logical data access layer across all data sources - a point to remember here is that if it is not possible or desirable to directly hit an operational system, data replication can be used to create a replica and then use that replica as a source – this step can be done by the Analyst, without waiting for IT’s helpAccessing and merging diverse data into a virtual view without physically moving the data – this step can also be done by the Analyst, without waiting for IT’s helpInvolving the Analyst to analyze and profile the federated data or the virtual view – which means no staging or no further processingApplying advanced transformations including data quality in real-time to the federated data or virtual view And then, delivering data services or virtual views that can be instantly reused across projectsAll these capabilities are available as a single package called PowerCenter Data Virtualization Edition. Enterprises can reuse existing Informatica skills and data integration logic to deliver BI projects up to five times faster and at a third of the cost.
  4. The ability to switch seamlessly and transparently between delivery modes (bulk/batch vs. granular real-time vs. federation) with minimalrework will be key for IT organizations seeking to develop a successful data integration strategy.Ted Friedman, VP Distinguished Analyst, Gartner