This document discusses backup and archiving in the AWS cloud. It begins with an overview of why AWS is suitable for backup and archive needs due to its pay-as-you-go model and global infrastructure. Various cloud integrated backup and archive gateways are presented, along with data ingestion options and AWS storage and archive services like S3, EBS, and Glacier. Methods for retrieving and restoring data from the cloud are also covered.
2. Agenda
• Why AWS for Backup and Archive?
• AWS Global Infrastructure
• Backup and Archive Concepts
• Traditional vs. Cloud Approach
• Cloud Integrated Backup and Archive Gateways
• Data Ingestion Options
• AWS Storage and Archive Options
• Retrieve / Restore
• TCO
3. Why AWS for Backup and Archive?
• Metered usage: pay as you go
– No capital investment
– No commitment
– No risky capacity planning
• Avoid the opex and risks of physical media handling
• Control your geographic locality for performance and compliance
5. AWS Regions and Availability Zones
Customer decides where applications and data reside
[Map: AWS regions, each containing multiple Availability Zones]
• US East (N. Virginia)
• US West (Oregon)
• US West (N. California)
• AWS GovCloud (US)
• South America (Sao Paulo)
• EU (Ireland)
• Asia Pacific (Tokyo)
• Asia Pacific (Singapore)
• Asia Pacific (Sydney)
• China (Beijing – Limited Preview)
6. Backup and Archive defined
• Backup: Data retained to support near-term business continuity
• Archive: Data retained for the long term, for compliance or research
8. Traditional Backup and Archive
• Time: Long/slow recovery time (days or weeks)
• Money: Capital intensive with ongoing upgrades
• Effort: Complex to manage
• Quality: Low durability, error prone
9. Traditional Backup and Archive
• Backup Software
• Edge to core topologies
• Dedicated people
• Tape Silos / Tape Libraries
• Tape Drives (LTO-X / DLT / etc)
• Virtual Tape Libraries (VTLs)
• Tape out / Vaulting
• Unified Backup Solutions
• Deduplication
• Compression
• Encryption
13. Cloud Backup and Archive
• Backup Software
• Edge to core topologies
• Single pane of glass
• Tape Silos / Tape Libraries
• Tape Drives (LTO-X / DLT / etc)
• Virtual Tape Libraries (VTLs)
• Tape out
• Unified Backup Solutions
• Deduplication
• Compression
• Encryption
14. Cloud Backup and Archive Topologies
1. Branch office backup to cloud
2. Core data center backup to cloud
3. Cloud backup to cloud
4. Hybrid cloud backup
15. Branch office backup to cloud
Considerations:
- Backup Software
- Storage / Caching Gateway
- WAN or Internet
- Deduplication
- Compression
- Encryption
- WAN Acceleration
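The deduplication and compression considerations above can be sketched in a few lines. This is a minimal illustration, assuming fixed-size chunks addressed by SHA-256 digest; commercial backup software and gateways typically use variable-size chunking and their own formats. The names `dedupe_and_compress`, `restore`, `CHUNK_SIZE`, and the in-memory `store` are hypothetical, and encryption is left out for brevity.

```python
import hashlib
import zlib

CHUNK_SIZE = 4 * 1024 * 1024  # hypothetical fixed chunk size (4 MiB)


def dedupe_and_compress(data: bytes, store: dict, chunk_size: int = CHUNK_SIZE) -> list:
    """Split data into chunks, keep each unique chunk once (compressed),
    and return the ordered list of chunk digests needed to rebuild the data."""
    recipe = []
    for offset in range(0, len(data), chunk_size):
        chunk = data[offset:offset + chunk_size]
        digest = hashlib.sha256(chunk).hexdigest()
        if digest not in store:
            # Only chunks not seen before would need to cross the WAN.
            store[digest] = zlib.compress(chunk)
        recipe.append(digest)
    return recipe


def restore(recipe: list, store: dict) -> bytes:
    """Rebuild the original bytes from a recipe of chunk digests."""
    return b"".join(zlib.decompress(store[d]) for d in recipe)
```

Because repeated chunks are stored and sent only once, a second backup of mostly unchanged data adds very little to `store`, which is the effect that makes backup over a branch-office link practical.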
16. Core data center backup to cloud
Considerations:
- Backup Software
- Storage / Caching gateway
- Direct Connect or Internet
- Telco Burst Models
- Time Warner Telecom
- Deduplication
- Compression
- Encryption
- WAN Acceleration
17. Cloud backup to Cloud
Applications running on EC2 backing up to S3 / Glacier
Considerations:
- Backup software
- Encryption
- Deduplication
- Compression
- Native S3 and Glacier integration
- AMI Backup Appliance?
- AMI = Golden Master
- EBS Snaps / Scripting
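As an illustration of the "EBS Snaps / Scripting" item, the retention half of a snapshot script might look like the sketch below. It is an assumption-laden example, not a prescribed method: the function name `snapshots_to_delete`, the 30-day default, and the `keep_min` safeguard are hypothetical, and the snapshot records are plain dictionaries with `SnapshotId` and `StartTime` keys that a real script would obtain from the EC2 API.

```python
from datetime import datetime, timedelta, timezone


def snapshots_to_delete(snapshots, now, retention_days=30, keep_min=1):
    """Return IDs of snapshots older than the retention window,
    always keeping the `keep_min` most recent snapshots."""
    cutoff = now - timedelta(days=retention_days)
    newest_first = sorted(snapshots, key=lambda s: s["StartTime"], reverse=True)
    candidates = newest_first[keep_min:]  # never touch the newest ones
    return [s["SnapshotId"] for s in candidates if s["StartTime"] < cutoff]
```

Keeping the selection logic separate from the API calls makes it easy to dry-run a retention policy before any snapshot is actually deleted.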
20. AWS Storage Gateway
• On-premises, virtual iSCSI storage appliance
• $125 / Month*
• Local cache enables low latency access to data
• Server Side Encryption (SSE)
• 5 TB of throughput per day
• Recover to Amazon EBS
21. AWS Storage Gateway-VTL (Virtual Tape Library)
• On-premises, virtual tape library storage appliance
• $125 / Month
• 10 virtual tape drives / 1500 virtual tape slots
• 150 TB local cache
– VTL – virtual tape library
• Restore in seconds from VTL
– VTS – virtual tape shelf
• Next Generation Offsite Vault
• 24 hour retrieval from VTS
• Encryption in transit and at rest
• Gateway VTL-AMI
23. Commvault
• Unified platform integrates Backup, Archive, Replication, Analysis and Search, Alerting, Reporting, and Tracking of all data via a single common code base
• Integrated with Amazon S3 and Amazon Glacier with deduplication & encryption support
• Single console management
24. Riverbed SteelStore
• Local caching appliance
• Presents NAS protocols
– CIFS / NFS
• Up to 30x deduplication
• S3 and Glacier support
• AMI Available
25. CA Cloud Storage for System z (Mainframe backup)
• CA provides the System z Connector
• De-duplication, compression & encryption
• Encrypted in-flight
• Reduces dependency on storage hardware by up to 60%
• Reduces data protection costs by up to 80%
• Slashes data center storage costs to pennies per GB
• Public cloud gateway to Amazon S3 and Amazon Glacier on AWS
26. CTERA
• CTERA Multi-Tenant Services Delivery Platform: Data Management, Provisioning, Monitoring and Metering + APIs
• Versatile Cloud Storage Services: Edge Gateways, Endpoint and In-Cloud Clients
• Featuring Endpoint Encryption & Source-Based Data Optimization
• Virtual Private Cloud: CTERA Portal in EC2, files in S3, metadata in EBS
• Clients: PCs / Laptops, Mobile Devices, Servers, Gateway
27. Maginatics
• Create a globally accessible NAS archive based on SW clients
• NO HARDWARE REQUIRED
• High throughput migration
• Global Namespace, Deduplication, Locking
• Purpose built cloud file system
• POSIX / SMB compliant
• Windows / Mac / Linux / iOS / Android
• Storage target: Amazon S3
29. Data Ingestion Options
• AWS Direct Connect: Dedicated bandwidth between your site and AWS
• Internet: Transfer data in a secure SSL tunnel over the public Internet
• AWS Import/Export: Physical transfer of media into and out of AWS
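Choosing between these options usually starts with a rough transfer-time estimate. The sketch below is simple arithmetic under stated assumptions: decimal terabytes, a steady link, and a hypothetical default `utilization` of 0.8 to account for protocol overhead and competing traffic. The function name `transfer_days` is also hypothetical.

```python
def transfer_days(data_tb: float, link_mbps: float, utilization: float = 0.8) -> float:
    """Days needed to move `data_tb` terabytes over a link of `link_mbps` megabits per second."""
    bits = data_tb * 1e12 * 8  # decimal terabytes to bits
    seconds = bits / (link_mbps * 1e6 * utilization)
    return seconds / 86400  # seconds per day
```

For example, `transfer_days(100, 1000, 1.0)` shows that 100 TB over a fully used 1 Gbps link takes a little over nine days, which is why physical media shipment can be attractive for an initial bulk migration.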
30. AWS Ingest Options
Internet / One Common Theme: Parallel Uploads
1. Multipart upload
2. Request rate optimization
3. TCP window scaling
4. TCP selective acknowledgement
AWS has customers that ingest roughly 1 PB per day
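The "parallel uploads" theme behind multipart upload can be sketched as follows. This is an illustration only: it plans the part boundaries and fans the parts out to worker threads, while the actual transfer is delegated to a caller-supplied `upload_part` callback, which is a hypothetical placeholder rather than a real SDK call. The 64 MiB default part size and the worker count are assumptions; the 5 MiB minimum part size and 10,000-part maximum are the published Amazon S3 multipart limits.

```python
from concurrent.futures import ThreadPoolExecutor

MIN_PART_SIZE = 5 * 1024 * 1024  # S3 minimum part size (the last part may be smaller)
MAX_PARTS = 10_000               # S3 maximum number of parts per upload


def plan_parts(total_size: int, part_size: int = 64 * 1024 * 1024):
    """Return (part_number, offset, length) tuples covering `total_size` bytes."""
    if part_size < MIN_PART_SIZE:
        raise ValueError("part_size is below the S3 minimum of 5 MiB")
    # Grow the part size if the object would otherwise need too many parts.
    while (total_size + part_size - 1) // part_size > MAX_PARTS:
        part_size *= 2
    parts = []
    offset = 0
    number = 1
    while offset < total_size:
        length = min(part_size, total_size - offset)
        parts.append((number, offset, length))
        offset += length
        number += 1
    return parts


def upload_in_parallel(parts, upload_part, workers: int = 8):
    """Call upload_part(part_number, offset, length) concurrently.
    Results are returned in part order, as needed to complete the upload."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda p: upload_part(*p), parts))
```

Each part travels over its own connection, so aggregate throughput is no longer bounded by a single TCP stream, and a failed part can be retried without resending the whole object.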
31. AWS Ingest Options
AWS Direct Connect
• Private connectivity to AWS
– Physical connection – 1 Gbps or 10 Gbps port
• Consistent network performance
• Consider burst models on ingest
• Reduces costs for bandwidth-heavy outbound workloads
Locations
• CoreSite 32 Avenue of the Americas, NY
• CoreSite One Wilshire & 900 North Alameda, LA
• Equinix DC1 – DC6 & DC10 - DC11, Ashburn, VA
• Equinix SV1 & SV5, San Jose, CA
• Equinix SE2 & SE3, Seattle, WA
• Equinix SG2, Singapore
• Equinix SY3, Sydney
• Equinix TY2, Tokyo
• Eircom, Clonshaugh
• TelecityGroup Docklands, London
• Terremark NAP do Brasil, Sao Paulo
32. AWS Ingest Options
AWS Import/Export
• Rapidly move data into and out of AWS
• Portable storage device shipment to AWS
• Supports
– Amazon EBS
– Amazon S3
– Amazon Glacier
• Use cases
– Initial data migration
– Content distribution via portable devices
– Disaster recovery
34. AWS Storage and Archive Options
• Amazon Simple Storage Service (S3)
– Highly scalable object storage
– 1 byte to 5 TB in size
– 99.999999999% durability
• Amazon Elastic Block Store (EBS)
– High-performance block storage device
– 1 GB to 1 TB in size
– Mount as drives to instances with snapshot/cloning functionalities
• Amazon Glacier
– Long-term object archive
– Extremely low cost per gigabyte
– 99.999999999% durability
35. AWS Storage and Archive Options
Amazon Elastic Block Store (EBS)
• High I/O block storage for Amazon EC2
• Point-in-time snapshots to Amazon S3
• 99.999999999% Durability
• Snapshot software is FREE
• Point-in-time snapshots across regions
36. AWS Storage and Archive Options
Amazon S3: Trillions of objects under management
• New!: SSE-C
• Time: Instant access, any time, any where
• Money: Pay for what you store / 51% price reduction
• Effort: Scales as you grow
• Quality: 99.999999999% durability
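To put the 99.999999999% durability figure in perspective, the arithmetic can be worked through directly. The sketch below is a simplification: it treats the design durability as an independent annual loss probability per object, and the function names are hypothetical. Exact fractions are used to avoid floating-point rounding at eleven nines.

```python
from fractions import Fraction

ANNUAL_DURABILITY = Fraction(99_999_999_999, 100_000_000_000)  # 99.999999999%


def expected_annual_losses(object_count: int) -> Fraction:
    """Expected number of objects lost per year at the design durability."""
    return object_count * (1 - ANNUAL_DURABILITY)


def years_per_lost_object(object_count: int) -> Fraction:
    """Average number of years between single-object losses."""
    return 1 / expected_annual_losses(object_count)
```

Under this model, storing 10,000,000 objects gives an expected loss of one object roughly every 10,000 years (`years_per_lost_object(10_000_000)`).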
37. AWS Storage and Archive Options
Amazon Glacier
• $0.01 per GB/mo, $120 per TB/yr
• 3-5 hour data retrieval latency
• Archives: single file or zipped files
• Vaults: collection of archives
• Infinite archival storage
• 99.999999999% durability
• Immutable, encrypted by default
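The relationship between the two prices quoted above is straightforward arithmetic, sketched here using the per-GB price from the slide. The function name is hypothetical, decimal terabytes (1 TB = 1000 GB) are assumed by default, and retrieval and request charges are deliberately not modeled.

```python
from decimal import Decimal

GLACIER_PRICE_PER_GB_MONTH = Decimal("0.01")  # price quoted on the slide


def annual_storage_cost(terabytes: int, gb_per_tb: int = 1000) -> Decimal:
    """Yearly storage cost in USD for `terabytes` of archived data."""
    gigabytes = Decimal(terabytes) * gb_per_tb
    return gigabytes * GLACIER_PRICE_PER_GB_MONTH * 12
```

`annual_storage_cost(1)` gives 120 USD, matching the $120 per TB/yr figure; with binary terabytes (`gb_per_tb=1024`) the result is 122.88 USD.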
38. AWS Storage and Archive Options
Object Lifecycle Management: Amazon S3 → Amazon Glacier
• Seamlessly move data from Amazon S3 → Amazon Glacier
• 3-5 hour asynchronous retrieval
• Data lifecycle policies
• Amazon Glacier storage costs $0.01 per GB per month
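A data lifecycle policy of this kind can be expressed as a small configuration document. The sketch below builds one as a Python dictionary in the shape accepted by boto3's `put_bucket_lifecycle_configuration` today, which may differ from the tooling available when these slides were written. The function name, the rule ID scheme, the `backups/` prefix, the 30-day threshold, and the bucket name are all hypothetical.

```python
def glacier_lifecycle_configuration(prefix: str, days_until_archive: int) -> dict:
    """Build a lifecycle configuration that transitions objects under
    `prefix` to the GLACIER storage class after `days_until_archive` days."""
    return {
        "Rules": [
            {
                "ID": f"archive-{prefix.strip('/') or 'all'}",
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [
                    {"Days": days_until_archive, "StorageClass": "GLACIER"}
                ],
            }
        ]
    }


# Applying the policy needs boto3 and AWS credentials, so it is left commented out:
# import boto3
# boto3.client("s3").put_bucket_lifecycle_configuration(
#     Bucket="example-backup-bucket",  # hypothetical bucket name
#     LifecycleConfiguration=glacier_lifecycle_configuration("backups/", 30),
# )
```

Once such a rule is in place, objects move to Glacier without any application changes; restoring them remains an asynchronous request with the multi-hour latency noted above.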
45. TCO: On-Premises Cost Considerations
1. Primary storage hardware (primary / remote site)
2. DR / Remote site storage hardware
3. Raw to utilized storage (both primary and DR)
4. Storage growth (cost of upgrades)
5. Storage management software and 3rd party tools
6. Professional services
7. Hardware maintenance
8. Software maintenance
9. Backup software
10. Backup hardware (primary / remote site)
11. Offsite tape storage / vault
12. Archive software
13. Archive hardware
14. Power
15. Cooling
16. Space
17. Labor
18. Cost of capital
19. Training
20. Asset depreciation
21. Migration
22. Decommission / remove
23. Recycle
46. Customer Stories
AWS Storage Gateway is used in a variety of ways
Jollibee (JFC) is using the AWS Storage Gateway to back up and mirror their Oracle SQL server database from their on-premises data center to AWS. JFC is the largest fast food chain in the Philippines, with revenues well over 2 billion USD. The Storage Gateway also provides us access to the same database snapshots for use in Amazon EC2, providing a cost-effective in-cloud DR solution.
AWS Storage Gateway provided us the most cost-effective way to back up our SAP workloads to AWS. It has helped us perform SAP system ‘refresh’ much faster and in a more convenient way. Backing up to S3 has also helped us prepare for DR and run SAP Dev/QA restores easily on EC2.
“Amazon Web Services and AWS Storage Gateway are great assets that help us scale fast, store data in an ultra-secure environment, spend more time on product development (rather than disaster recovery & backup) …By using AWS Storage Gateway, we went to just hours instead of days to restore from backup.”
A large Japanese retail chain uses AWS Storage Gateway to share & store files in S3 and has drastically cut down its spend on its on-premises NAS footprint.
47. AWS Allows What’s Up Interactive to Save Over $1M
• Digital marketing agency What’s Up Interactive offers managed hosting for high-availability websites
• What’s Up Interactive wanted to expand this service to include affordable backup and disaster recovery (DR)
• What’s Up Interactive’s backup and DR solution is anchored by Amazon S3, which is designed to provide 99.999999999% durability and 99.99% availability of objects over a given year
48. AWS – Your Global Data Center for Backup and Archive
• Choose the region that fits your business and compliance needs
• 10 regions worldwide – set up with a few clicks
• Broad range of backup/archive tools that are AWS integrated
• Low cost, reliable AWS Transport and Storage options
• Enhance Security Posture
• Increase Scalability
• Significantly Higher Data Durability
• All at a lower TCO