This presentation was delivered during the February and March Discovery Series events titled “Protecting Data in an Era of Content Creation” sponsored by EMC.
Storage budgets are only growing 3-4% per year, but data is growing 30-40% per year – so in five years you will have to manage roughly five times as much data, with roughly the same budget. This data is also being accessed by more people, from more places and in new ways. The end result is that some of the old-standby approaches to protecting data are no longer cost effective or manageable. This presentation looks at the strategies IT can employ to provide adequate levels of protection and availability in a very different data management environment.
If you have any questions about the content or the event, please contact @scTechEvents.
3. Ferrol Macon – @ferrolmacon
• Director, Storage and Data Solutions
• Started career immersed in “Mission Critical”
• Started in Business Applications, transitioned
Into Storage and Data Protection
4. Mark Gowland – @MarkGowland
• Mark to provide info here.
• Building credibility is key – take time to explain who
you are and how you’ve build expertise in this area
– How many customers have you helped?
– What experiences in this field have you had? Projects
worked on…challenges you’ve faced…
• People like to feel connected to their presenter, so
share some fun or personal stories and let them get to
know you
• You want to start your presentation by building that
connection
5. The Story We Find Ourselves In
• We are entering a new era of Content Creation
• Today’s needs are stretching old approaches
• Tomorrow's needs are demanding new
approaches
6.
7. Technology Centric Model Focuses On Data
• Access to Data
• Data Processing Performance
• Protecting Data
12. In the new world of Information…
• Information meanings are variable and that’s OK.
• Audio & video are going to be just as important
as text.
• Unstructured data will be as searchable and
useable as structured data is today.
14. Three Phase Journey To The Private Cloud
IT PRODUCTION BUSINESS PRODUCTION IT AS A SERVICE
Lower Costs Improve Quality Of Service Improve Agility
Percent
virtualized
Platinum Gold
Phase 1 Phase 2 Phase 3
IT-Owned Applications Mission-Critical Applications Run IT As a business
15. Obstacles to Virtualization
• Source: Storage Choices For Virtual Server Environments, Q1 2011, Forrester Research, Inc., March 2011
16. Why Virtualization Transformation Stalls
Traditional Traditional Slow
Storage Backup Apps and virtual server
backups slow down
Complex
Neither built to manage
a virtual environment
Costly
Over-provisioned storage
and more backup data
18. The New Storage Portfolio
Scale-Out Block Scale-Out File & Object
Unified Storage
Backup And Recovery
19. Flash And EMC FAST Automation
MANUAL PROCESS 100% AUTOMATED
Oracle Exchange SQL
server server server
Oracle Exchange SQL OracleI ExchangeI SQLI
FAST Cache & FAST VP
SSD SAS NL-SAS
Static Association Of Dynamic, Self-Optimizing
Disk Groups To Applications Storage Pools
Based On Best Guess Based On Actual Data Activity
20. A Little Flash Goes A Long Way…
EMC UNIFIED STORAGE
• VMware, SharePoint, SQL, Oracle
• 108 TBs Student Data
• Added 24 SSDs + FAST Suite
• 1.6% FLASH Serves 78% Of All IO
Less Than 2% Flash Does ~ 80% Of The Work
21. Virtualization Management
• Integrated point of control to • One unified storage tool for all
simplify and speed VMware VNX, VNXe, Symmetrix,
storage management tasks CLARiiON, and Celerra
EMC Virtual
VMware vSphere Storage
Integrator
Unified storage
22. Data Protection Service Levels
Daily Backup: Recovery point is every 24 hours Optimize Protection
EMC Next Generation Backup
• Align service levels with
recovery objectives
Snaps: Recovery point is once every 4 hours
• Recover with confidence
• Utilize EMC Next Generation
Backup and EMC VNX Total
Protection Pack together to
EMC VNX Total Protection Pack
Replication: Recovery point is latest replicated image provide complete data
protection coverage
Continuous Protection: Recovery to any point in time
Time
Check- Pre- Patch Post- Cache Hot Check-
point Patch Patch Flush Backup point
23. VMware Backup Performance
Avamar backup versus traditional backup
Full Backup
• Single ESX Server
– 12 Virtual Machines
• 100 GB per machine
CPU Duration of Backup • 5% Change Rate
With Avamar – Turn-on CPU throttling and
CPU
save ESX CPU
Avamar With and Without Throttle
Network
• Minimize CPU utilized
56 Min. Saved by ESX
Duration of Backup 12 Min
Disk From Traditional
36 Min.
Traditional vs. Avamar Backup No I/O20 Min Saved
1Hr Added
24.
25.
26. Big Data Has Arrived
Electronic
Payments Video Rendering
Video
Mobile Sensors Social Media Surveillance
Medical Imaging
Gene
Sequencing
Geophysical
Smart Grids Exploration
34. Connect with me
• How can a customer connect and engage with
you post event?
– Twitter, LinkedIn etc.
• Where can they go for more information
– www.softchoice.com/optimizedatacenter
• Be explicit with your call to action
– Tweet, Share, Email, etc
Hinweis der Redaktion
Would like the T to just grow.Point – for the past few decades the focus of IT has been on the “T”From the time of the very first computers, their use has been focused on repetitive and structured tasks because of their inability to understand our world. This is why most business applications are focused on structuring data into databases and applications.
Would like the T to just grow.Point – for the past few decades the focus of IT has been on the “T”From the time of the very first computers, their use has been focused on repetitive and structured tasks because of their inability to understand our world. This is why most business applications are focused on structuring data into databases and applications.
Humans Collect and ContextualizeDataData Processed By ApplicationsHumans Receive and Interpret Data
Would like a better representation. This is here for concept only. Would rather have this be a build slide of individual objects – hairspray, hair gel, hair cream, etc.Want to make it a little funny or find a funny clip to get the concept that those in the industry call all of this one thing – PRODUCT.
Would like the T to just grow.Point – for the past few decades the focus of IT has been on the “T”From the time of the very first computers, their use has been focused on repetitive and structured tasks because of their inability to understand our world. This is why most business applications are focused on structuring data into databases and applications.
Google #1 search engineYoutube #2 search engine
Present this concept however we want…I want the last part of the presentation to be about tomorrow “Big Data” and “Content”. Just want to get the idea across that Tomorrow is not an option if we do not do today well. However, we need to make decisions today that set us up well for tomorrow.I like the first phrase above because it is more concise and sets the framework for both messages.
Virtualization delivers big benefits – in efficiency, productivity and agility. There are 3 simple phases in journey.Phase 1 is about lowering costs, focusing on IT applications that have no cost of downtime.With the savings and confidence you built in phase 1, you can start virtualizing mission critical applications. Phase 2 is much more about improving service levels rather than cost savings. There are still savings to be had, but focus is really on reducing downtime and improving agility.In Phase 3 you’re able to run IT as a service. Complete the transformation by building a service catalog, delivering portals that allow faster time to provision infrastructure and are heavily automated and aligns technology to business value. In phase 3, IT is able to drive greater business agility, investing their time in understanding the business and delivering new products / projects faster.
But there are obstacles (specifically storage and backup related) you will face as you move to greater virtualization – especially as you move through Phase 1 to Phase 2 – that can slow you down.According to Forrester Research, customers have identified:Performance and Completing backups on time;Efficiently Managing capacity;And Controlling costs;as top challenges related to virtualization. Source: Forrester Research (Reichman, Forrester, May 2011)
Traditional storage and backup solutions are the reasons those challenges were listed. Why?Traditional storage and backup was built for physical infrastructure – and can derail your virtualization efforts. Traditional storage can’t handle dynamic, flexible virtual environments—so applications slow down.Traditional backup treats Virtual Machines like physical machines – so backups are reduced to a crawl having to move redundant data. Neither provides simple management for a virtual environment – lacking the means to discover and manage this new world. It’s costly as well – over-provisioning storage to speed applications and backing up and storing more data in virtual environments. The ultimate result is stalled virtualization efforts - and further cost increases.
Roman/Greek – perfected Concrete and Arch/DomesSteam engine first built by Richard Trevithick in early 1800’s. Dominated transportation, West, Cities, IndustrializationElisha Otis – perfected Elevator for safety enabling high rise, urban developmentAs Look to the Cloud – confluence of 3 technologies
Flash delivers performance levels that can be 1,000,000 better than hard disk drives.There is a relatively small amount of hot data, that goes on Flash where the performance great. The cold data goes on spinning drives that can be compressed, deduped, spun down and otherwise made to be highly efficient.The challenge has been deciding when to use Flash and doing that in a manual way is next to impossible. We have found that we can optimized almost all workloads more effectively using an automated tiering method that we share across our entire portfolio. We call this FAST or Fully Automated Storage Tiering and it’s in our VNX family and our VMAX family.
Here is a FENOMONAL example.VANDERBILT UNVERSITY. This was using VNX.The university has a unique mix of student and faculty data to keep track on.And as YOU CAN IMAGINE: MUCH OF THIS DATA WAS OLDER – but still needed to be on-lineONLY A SMALL PORTION of data was HIGH ACTIVE at any point in time.VANDERBILT WAS THE PERFECT CANDIDATE for a FLASH 1st STRATEGY.They already had EMC Unified storage deployed and now added (8) 100GB EFD for FAST Cache and and (16) 100GB EFD for FAST VP which was only 1.6% of the on-line dataThat SMALL AMOUT of FLASH now serves 78% of ALL IO.The STUDENTS are HAPPY. The FACULTY is HAPPY. AND IT are THE HERO’s of VANDERBILT!Let’s give them a BIG HAND <APPLAUSE>Yes, FLASH FIRST is a BIG DEAL and it is ROLLING OUT EVERYWHERE<CLIC>
Again, let’s talk about some of the real-world results. What we’re looking at is a full backup. So these numbers are in comparison to doing a daily full backup. Avamar’s savings are dramatic here. Compared to a traditional backup agent, Avamar:Significantly less CPU impactGive CPU cycles back to the applicationsSignificantly less Network utilizationDon’t be a backup scheduling masterSignificantly less Disk I/OGive back to the applicationAt the end of the day your production applications should control the size of your Virtual infrastructureBackup is not ‘strategic’, it is a necessity – don’t let it control your Virtual infrastructure Finally, the ability to control the amount of resources available the Avamar agent (via a throttle) ensures consistent performance for applications during backup.Note to Presenter: Lab layout for full backup comparison, ESX Server:Dell 6850: 8 CPU, 64 GB RAM, 4 virtual machinesVirtual Machines: traditional backup client installed, Avamar client installed, Windows 2003 SP2Avamar: Avamar Virtual Edition for VMware 3.7.2, RAID 5 CX storage (single 800 GB LUN)Backup software: traditional backup software, backup target is RAID 5 CX storage
Need to edit this clip to include the following – First 6 second intro…then go to question at 0:48 in the presentation and play through the rest of the slide.
More examples of unstructured data and large scale structured data that will need to be correlated in an Information Centric age.The sources of information are expanding. Many new sources are machine generated. It’s also big files (siesmic scans can be 5TB per file) and massive numbers of small files (email, social media).Not all businesses will deal with some of there, but these are just the applications that we can think of today. There will be many more things to come that we can’t think of today.
It is no longer about the data, it is about meaning.Where we are going with technology is that we will see applications and systems on a new scale where IT will be more about people and less about machines.People do not live their lives in rows and tables. Unstructured data is data in human friendly form an that is where the interesting stuff happens.
The idea with Big Data Analytic platforms is that like Google, you can search and access all of your structure and unstructured data sources, analyze this data in real-time, on systems that offer scale out and co-processing of both structured and unstructured data.In essence this is what EMC Greenplum with its database for structured data and Hadoop platform for unstructured data.
It would be great if there was an image here to convey this message.Key thoughts: We will continue to do the same business / structured functions we do today. Some will be able to use information from this new world as appropriate.It will be an incremental step in getting there. Function will come before scale – this is why the push to unified and virtualized storage is so important.Some systems – Data Protection, for example, will be leveraged by both worlds.