This document discusses how data virtualization can help address the challenges posed by data being a constraint in IT. It describes how data flooding infrastructure strains resources and costs companies huge amounts of money. Typical database architectures result in a "triple data tax". The document proposes data virtualization as a solution, allowing for unlimited clones of database environments using thin cloning. This enables faster development and testing, quicker disaster recovery, and real-time business intelligence. Case studies show benefits like doubling development throughput, 9x faster analytics refreshes, and cutting project timelines in half.
9. Theory of Constraints
work for IT ?
• Goals Clarify
• Metrics Define
• Constraints Identify
• Priorities Set
• Iterations Fast
• CI
• Cloud
• Agile
• Kanban
“IT is the factory floor of this century”
10. The Phoenix Project
What is the
constraint
in IT ?
“One of the most powerful things that organizations
can do is to enable development and testing to get
environment they need when they need it“
11. What is the constraint in IT
If you can’t satisfy
the business demands
then your process is broken.
12. Data is the constraint
60% Projects Over Schedule
85% delayed waiting for data
Data is the Constraint
CIO Magazine Survey:
only getting worse
13. • Data Constraint
I. strains IT
II. price is huge
III. companies unaware
• Solution
• Use Cases
In this presentation :
14. • Data Constraint
I. strains IT
II. price is huge
III. companies unaware
• Solution
• Use Cases
In this presentation :
15. – Storage & Systems
– Personnel
– Time
I. Data Constraint :
moving data is hard
21. I. Data constraint:
Data floods infrastructure
92% of the cost of business,
in financial services business , is “data”
www.wsta.org/resources/industry-articles
Most companies have
2-9% IT spending , ½ on “data”
http://uclue.com/?xq=1133
Gartner: Data Doomsday
22. • Data Constraint
I. strains IT
II. price is huge
III. companies unaware
• Solution
• Use Cases
In this presentation :
23. • Four Areas data tax hits
1. IT Capital resources $
2. IT Operations personnel $
3. Application Development $$$
4. Business $$$$$$$
II. Data constraint:
price is Huge
24. • Four Areas data tax hits
1. IT Capital resources $
2. IT Operations personnel $
3. Application Development $$$
4. Business $$$$$$$
II. Data constraint:
price is Huge
26. • Four Areas data tax hits
1. IT Capital resources $
2. IT Operations personnel $
3. Application Development $$$
4. Business $$$$$$$
II. Data constraint:
price is Huge
27. • People
– DBAs
– SYS Admin
– Storage Admin
– Backup Admin
– Network Admin
• Hours : 1000s just for DBAs
• $100s Millions for data center modernizations
II. Data constraint price is huge:
IT Operations
28. • Four Areas data tax hits
1. IT Capital resources $
2. IT Operations personnel $
3. Application Development $$$
4. Business $$$$$$$
II. Data constraint:
price is Huge
“One of the most powerful things that IT can do is get
environments to development and QA when they need it”
- Gene Kim author of The Phoenix Project
29. • Inefficient QA: Higher costs of QA
• QA Delays : Greater re-work of code
• Sharing DB Environments : Bottlenecks
• Using DB Subsets: More bugs in Prod
• Slow Environment Builds: Delays
II. Data constraint price is Huge :
Application Development
30. • Four Areas data tax hits
1. IT Capital resources $
2. IT Operations personnel $
3. Application Development $$$
4. Business $$$$$$$
Part II. Data constraint:
price is Huge
31. Ability to capture revenue
• Business Applications
– Delays cause lost revenue
• Business Intelligence
– Old data = less intelligence
II. Data constraint price is Huge :
Business
32. • Data Constraint
I. strains IT
II. price is huge
III. companies unaware
• Solution
• Use Cases
In this presentation :
44. 1. Technical Challenge
Copy
Time Flow
Purge
Production
File System Instance
DevelopmentStorage
21 3
Clone (snapshot)
Compress
Share Cache
Provision
Mount, recover, rename
Self Service, Roles & Security
Instance
45. 2. Bureaucracy Challenge
Developer Asks for DB Get Access
Manager approves
DBA Request
system
Setup DB
System
Admin
Request
storage
Setup
machine
Storage
Admin
Allocate
storage
(take snapshot)
46. Data Virtualization
How to get a Data Virtualization?
– EMC + SRDF + scripting
– Netapp + SMO + scripting
– Oracle EM 12c DBaaS + data guard + Netapp /ZFS + scripting
– Delphix
2 31
Production DevelopmentStorage
21 3
2 31
23 1
2 31
47. Data Supply Chain
6/18/2014 47
Data Supply Chain
• Security
• Masking
• Chain of custody
• Self Service
• Roles
• Restrictions
• Developer
• Data Versioning
• Refresh, Rollback
• Audit:
• Live Archive
Snap Shots
Thin Cloning
Data Virtualization
Data Supply Chain
62. Before Virtual Data
Production Dev, QA, UAT
Instance
Reporting Backup
File system
Database
Instance
File system
Database
File system
Database
File system
Database
Instance
Instance
Instance
File system
Database
File system
Database
“triple data
tax”
63. With Virtual Data
Production
Instance
Database
Dev & QA
Instance
Database
Reporting
Instance
Database
Backup
Instance Instance Instance
Database
InstanceInstance
Database
InstanceInstance
File system
Database
Data
Virtualization
Appliance
64. • Problem in the Industry
• Solution
• Use Cases
In this presentation :
70. Development : slow
Developer Asks for DB Get Access
Manager approves
DBA Request
system
Setup DB
System
Admin
Request
storage
Setup
machine
Storage
Admin
Allocate
storage
(take snapshot)
Weeks to Months to Deliver Data
80. QA : Virtual Data
• Fast
• Parallel
• Rollback
• A/B testing
QA
Virtual Data
81. QA : Long Build times
96% of QA time was building environment
$.04/$1.00 actual testing vs. setup
QA Build QA
QA Build QA
QA before virtual : resource expensive
83. QA Virtual Data : Fast
83
Dev
QA
Instance
Prod
DVA
Time Flow
1% of QA time was building environment
$.99/$1.00 actual testing vs. setup
QA
Build QA
QA
Build QA
QA Virtual Data : Fast
84. QA Virtual Data : Fast
Dev
QA
Instance
Prod
DVA
Time Flow
Bugs found quickly QA
Build QA
Sprint 1 Sprint 2
QA
Build QA
X
QA Virtual Data : Fast with Branching
85. QA with Virtual Data:
Rewind
Instance
Instance
Development
Prod
86. QA with Virtual Data: A/B
Instance
Instance
Instance
Index 1
Index 2
99. Business Intelligence:
ETL and DW Refreshes
Instance
Prod
Instance
DW & BI
Before Virtual: limited, slow ETL and DW refreshes
100. • Collect only Changes
• Refresh in minutes
Virtual Data:
Fast Refreshes
Instance Instance
Prod BI and DW
ETL
24x7
DVA Instance
Virtual Data: Fast Refreshes
111. How expensive is the
Data Constraint?
DVA at Fortune 500 :
Dev throughput increase by 2x
112. • 10 x Faster Financial Close
• 9x Faster BI refreshes
• 8x Faster surgical recovery
• 3x Project tracks
• 2x Faster Projects
How expensive is the
Data Constraint?
113. • Projects “12 months to 6 months.”
– New York Life
• Insurance product “about 50 days ... to about 23 days”
– Presbyterian Health
• “Can't imagine working without it”
– State of California
Virtual Data Quotes
114.
115. • Problem: Data is the constraint
• Solution: Virtual Data
• Results:
– Half the time for projects
– Higher quality
– Increase revenue
Summary
130. NetApp Filer - DevelopmentNetApp Filer - Production
Database
Luns
Snap
mirror
Snapshot Manager
for Oracle
Flexclone
Repository
Database
Snap
Drive
Protection
Manage
Production
Development
1 Netapp
Target A
Target B
Target C
InstanceInstance
InstanceInstance
InstanceInstance
Instance
131. Where we want to be
Database
File system
Production
Instance
Database
Development
Instance
Database
QA
Instance
Database
UAT
Instance
Snapshots
Instance Instance Instance Instance
132. EM 12c: Snap Clone
Production Development
Flexclone Flexclone
Netapp
Snap Manager for Oracle
137. III. Data Constraint
companies unaware
Why do I need an iPhone ?
Don’t we already do that ?
SQL scripts
Alter database begin backup
Back up datafiles
Redo
Archive
Alter database end backup
RMAN