2. #ibmedge
Abstract
This session will cover private and public
cloud storage options, including Flash,
Disk and Tape, to address the different
types of cloud storage requirements.
The differences between block, file
and object stores, and where each is
best used for different workloads, will be
explained.
3. #ibmedge
This week with Tony Pearson
Day Time Topic
Monday
2:30pm
All Flash is Not Created Equal:
Tony Pearson Contrasts IBM FlashSystem and SSD
Grand Garden Arena, Lower Level, MGM Grand - Studio A
Wednesday
11:00am
All Flash is Not Created Equal:
Tony Pearson Contrasts IBM FlashSystem and SSD
Grand Garden Arena, Lower Level, MGM Grand - Studio 2
1:15pm
Tony Pearson Presents
IBM Cloud Object Storage System and Its Applications
MGM Grand - Room 114
2:30pm
The Pendulum Swings Back: Tony Pearson Explains Converged and
Hyperconverged Environments
MGM Grand - Room 113
Thursday
09:45am
Tony Pearson Presents
IBM's Cloud Storage Options
MGM Grand - Room 116
4. #ibmedge
The Technology and the Business Side of Cloud
Cloud computing is a model for enabling
ubiquitous, convenient, on-demand network
access to a shared pool of configurable
computing resources (e.g., networks, servers,
storage, applications, and services) that can be
rapidly provisioned and released with minimal
management effort or service provider
interaction.
This cloud model is composed of five essential
characteristics:
• On-demand self-service
• Broad network access
• Resource pooling
• Rapid elasticity.
• Measured service. *
* Source: U.S. National Institute of Standards and Technology (nist.gov)
Technology Side… … Business Side
5. #ibmedge
Clients are Moving to Hybrid Cloud Deployments
Infrastructure
Dedicated
(Private)
Public
Local
(Private)
On-premises
Off-premises
Traditional IT
The IT department
is in direct
competition with
alternatives off
premises for Lines
of Business
funding
On-Premises
Some workloads require a level of
security, availability or government
compliance
Off-Premises
Some workloads might prefer more
agile, flexible and scalable
environment
6. #ibmedge
OpenStack IBM Cloud Proprietary
IBM is a platinum sponsor of OpenStack
Foundation.
OpenStack open source code can
manage IBM compute, network and
storage resources. OpenStack supports
x86, POWER Systems and z System
mainframe.
IBM offers OpenStack Cinder interfaces on
most of its major storage products and
OpenStack Swift interfaces for object
access.
IBM offerings are based on OpenStack
open source code with value-added
proprietary features from IBM
IBM Cloud Orchestrator supports a
variety of server hypervisors. IBM
BlueBox and IBM PowerVC provide
additional features.
IBM Spectrum Control™ provides
reporting, provisioning, troubleshooting
and chargeback capabilities for storage.
VMware and Microsoft are entirely
proprietary, but have a large market
share for x86-based server infrastructure.
IBM was VMware’s first partner, and the partnership dates
back to 1998. IBM has expanded its partnership
with VMware, offering new cloud tools to
make it even easier for enterprises.
IBM and Microsoft have an agreement to offer SQL
Server and .NET on IBM Cloud and IBM
software on Microsoft Azure.
“Some assembly required” Enterprise-ready out of the box
x86, POWER systems and z System mainframe Concerns about vendor lock-in
IBM is Focused on these Software Defined Environments
7. #ibmedge
Deliver Management Solutions
Perform Optimizations
Contribute Platform Support
IBM Cloud Orchestrator
IBM BlueBox
IBM PowerVC
IBM Spectrum Control
Live upgrades
Security and authentication
OVF Images
Membership services
Globalization translation integration
QA enhancements
PowerKVM, KVM, z/VM
IBM DS8000, Storwize, SVC, XIV
IBM Spectrum Storage
IBM FlashSystem
IBM SDN for VE, OpenFlow Switches
IBM OpenStack Platform
IBM Contributions
HEAT Orchestration
OpenStack IaaS APIs
TOSCA
Nova Cinder Neutron
IBM Unique Value
Swift
Drivers Drivers Drivers Drivers Drivers
Manila
IBM is committed to OpenStack
with contributions and added value
8. #ibmedge
Reference Storage
• Archives
• Images/Video
• WORM/NENR
Ephemeral Storage
• Typically boot volumes,
page files and temporary data
• Goes away when VM
is shut down
Persistent Storage
• Persists across
VM reboots
• Can be shared
between VMs
• Transactional
• High Performance
Storage as the Storage Cloud
Storage for Compute Cloud
Hosted Storage
• File and Object access
• Enterprise File Sync & Share
• Backup/Disaster Recovery
Cloud Storage Taxonomy
9. #ibmedge
Cloud Storage Overview
Block
File
Object
Archival
Online
Ephemeral
Persistent
• Block storage offerings are differentiated by speed/throughput (as measured in
IOPS) and segmented by the lifecycle of the disk. Device location does not matter.
– Ephemeral storage is tied to the lifecycle of a single VM (i.e., it is provisioned
when the VM is provisioned and destroyed when the VM is destroyed)
– Persistent storage has a lifecycle independent of any single VM and can be
provisioned/destroyed at any time and attached to and detached from many VMs during its life
• File-based offerings are uncommon among providers, especially among those
targeting cloud native applications
– Primarily targeted at cloud enabled workloads
– Usage is being replaced in new application development with online object
storage
• Object storage offerings are differentiated by the durability of the data (i.e., odds of
irrecoverable loss) and segmented by the availability of the data (i.e., time
required to retrieve)
– An object in online storage is immediately accessible
– An object in archival storage may require minutes to hours to be accessible
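The ephemeral versus persistent distinction above can be sketched as a toy model. This is only an illustration of the two lifecycles; the `Cloud`, `VM` and `Volume` names are hypothetical and do not correspond to any provider API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Volume:
    name: str
    attached_to: Optional[str] = None  # name of the VM currently using the volume


@dataclass
class VM:
    name: str
    ephemeral: List[Volume] = field(default_factory=list)
    persistent: List[Volume] = field(default_factory=list)


class Cloud:
    """Toy model that only tracks which volumes exist (hypothetical, not a real API)."""

    def __init__(self) -> None:
        self.volumes: Dict[str, Volume] = {}

    def create_volume(self, name: str) -> Volume:
        # Persistent volumes are provisioned on their own, with no VM involved.
        vol = Volume(name)
        self.volumes[name] = vol
        return vol

    def create_vm(self, name: str) -> VM:
        # The ephemeral boot disk is provisioned together with the VM.
        boot = Volume(f"{name}-boot", attached_to=name)
        self.volumes[boot.name] = boot
        return VM(name, ephemeral=[boot])

    def attach(self, vm: VM, vol: Volume) -> None:
        vol.attached_to = vm.name
        vm.persistent.append(vol)

    def destroy_vm(self, vm: VM) -> None:
        # Ephemeral storage is destroyed along with the VM ...
        for vol in vm.ephemeral:
            del self.volumes[vol.name]
        # ... while persistent volumes are only detached and keep their data.
        for vol in vm.persistent:
            vol.attached_to = None
        vm.ephemeral.clear()
        vm.persistent.clear()
```

Destroying a VM in this model removes its boot volume but leaves any attached persistent volume available for the next VM.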
10. #ibmedge
Software Defined Storage – Block level Access
XIV
FICON
z/OS
z/TPF
z/VSE
z/VM
Linux on z
FCP
Linux
Windows
UNIX
VMware
…
iSCSI
Linux
Windows
UNIX
VMware
…
LAN
InfiniBand
DS8000
FlashSystem
900
Software Defined Environment
OpenStack IBM Cloud Proprietary
IBM
Spectrum
Accelerate™
SAN
FlashSystem
A9000 and A9000R
11. #ibmedge
Radical Simplicity but with Enterprise-Class features
XIV and the FlashSystem A9000 and A9000R
Radical simplicity
• Breakthrough GUI, CLI and Mobile App
• OpenStack, IBM Cloud, VMware,
Microsoft and RESTful APIs
Low-touch management
• Innovative distributed RAID
• Extreme ease of use across all
functionality
• Create volume, resize volume, take
snapshot, mirror volume
Self Tuning / Self Healing
• Ultra fast rebuild times
• No manual performance optimization
• No hotspots, no tuning
• Not even when adding capacity!
Enterprise-class Software
features
• QoS performance classes
• Thin provisioning and space
reclamation
• IBM Hyper-Scale consistency and
mobility
• Advanced reporting
• Data at rest encryption
• Performance acceleration with Flash
cache
• Snapshots and remote mirroring
• Data migration from other disk
systems
13. #ibmedge
VM
2
IBM Spectrum Accelerate for Block-Level Hyperconvergence
Enables the IT administrator to single-
handedly manage the entire data center
stack
Allows hardware standardization of network,
compute, storage, power and environmentals
Leverages existing Data Center services
and maintenance contracts
Simplifies the architecture when lacking
specialized, domain-specific skill sets
iSCSI volumes can also be used by bare
metal servers and other hypervisors
Available as software-only or Supermicro®
Hyperconverged Appliance pre-built
system
Ethernet
Interconnect
Hypervisor
IBM
Spectrum
Accelerate
IBM
Spectrum
Accelerate
IBM
Spectrum
Accelerate
Hypervisor
iSCSI
Hypervisor
VM
1
VM
4
VM
6
iSCSI
iSCSI
VM
3
VM
5
iSCSI
14. #ibmedge
IBM Spectrum Accelerate – as a Service!
• Single order for Accelerate on IBM SoftLayer
• Operating Expense (OPEX) - no capital required
• Ordered:
• Base of 50 TiB
• Increments of 20 TiB
• Two configurations are offered:
• Capacity oriented (for archive type of applications)
• Performance oriented (for real time processing applications)
• Each package includes all features and unlimited traffic
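The ordering rule above (a 50 TiB base plus 20 TiB increments) can be worked through with a small calculation. This sketch assumes that increments are added on top of the base, which is one reading of the slide; the function name is hypothetical.

```python
import math

BASE_TIB = 50        # minimum order, from the slide
INCREMENT_TIB = 20   # step size above the base, from the slide


def orderable_capacity_tib(required_tib: float) -> int:
    """Smallest orderable capacity in TiB that covers required_tib.

    Assumes increments are stacked on top of the base (an interpretation).
    """
    if required_tib <= BASE_TIB:
        return BASE_TIB
    steps = math.ceil((required_tib - BASE_TIB) / INCREMENT_TIB)
    return BASE_TIB + steps * INCREMENT_TIB
```

Under that assumption, a requirement of 95 TiB would round up to the base plus three increments.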
Capacity oriented servers
Dual CPU 6 cores
32 GB RAM
11 x 4TB SATA drives
10GbE dual private links
Performance oriented servers
Dual CPU 8 cores
64 GB RAM
11 x 4TB SATA drives
800GB SSD
10GbE dual private links
…
15. #ibmedge
…
…
IBM Spectrum Accelerate
on customer-choice HW
or Supermicro®
Hyperconverged Appliance
IBM
XIV
Gen 3
IBM Spectrum Accelerate
running off-premises
Unified Management Experience
• Remote Office, Branch Office. Provide high
availability storage to smaller or distributed
organizations such as banks and schools through
disaster recovery and other capabilities, using less
expensive hardware.
• Flexible deployment. Deploy test/dev or ad-hoc
project environments and later repurpose hardware for
other needs. Mix online transaction processing and
new applications like analytics.
• Hybrid Cloud. Seamlessly move workloads between
on-premises and off-premises deployments.
• Disaster recovery. Mirror to the cloud or to a data
center server running IBM Spectrum Accelerate. Use
powerful XIV snapshots to execute backup to
on-premises servers.
IBM
FlashSystem
A9000
A9000R
16. #ibmedge
Software Defined Storage - Virtualization
XIV
DS8000 FlashSystem 900
Other
IBM and
non-IBM
Flash/Disk
systems
IBM Spectrum Virtualize™
SVC, Storwize, FlashSystem V9000, VersaStack
FCP
z/VSE
z/VM
Linux
Windows
UNIX
VMware
…
iSCSI
Linux
Windows
UNIX
VMware
…
SAN
FCoE
Linux
Windows
UNIX
VMware
…
Software Defined Environment
OpenStack IBM Cloud x86-based
TCP/IP
17. #ibmedge
IBM Storage Portfolio – IBM Spectrum Virtualize
IBM Spectrum Scale
Elastic Storage Server
IBM Spectrum Virtualize
FlashSystem
IBM
Spectrum
Accelerate
XIV
DS8000
All-Flash
Flash/Disk
Hybrid
IBM FlashSystem V9000
Storwize V7000
Unified
SAN Volume
Controller
Storwize V7000
Storwize V7000F
Storwize V5000
Storwize V5000F
Common Advanced Virtualization Code Base
Many Different Packages
18. #ibmedge
Global Namespace across Flash, Disk and Tape
XIV
DS8000 FlashSystem
Internal,
Direct
Attach
JBOF/
JBOD
FPO,
RAID
Elastic Storage
POSIX
Windows
Linux
AIX
NFS
SMB
FTP
SCP
HTTPS
Hadoop
IBM
Spectrum
Archive™
Single Drive
Library
IBM Spectrum Scale™,
Elastic Storage Server,
Storwize V7000 Unified
Other
IBM and
non-IBM
Flash/Disk
systems
Software Defined Environment
OpenStack IBM Cloud x86-based
Enterprise
LTFS
Object
• OpenStack
Swift
• Amazon S3
IBM Spectrum Virtualize™
SVC, FlashSystem V9000,
Storwize, and VersaStack
19. #ibmedge
IBM Spectrum Scale™ –
Flexible File and Object Storage
FS1 FS256. . .
Exabyte-Scale, Global Namespace,
one big file system, or up to 256 file systems
Each file system
can be further
divided into fileset
containers
Metadata can be
separated to its own Pool
or intermixed with data
Files and objects
can be migrated
to Tape, Object
store, or Cloud
ROBO
Other
Datacenters
20. #ibmedge
IBM Spectrum Scale™ – More than just a file system!
ROBO
Other NAS
Other
Datacenters
Scale
Active File
Management (AFM)
caches data to where it
is needed, can be used
to migrate from other
NAS devices
Hierarchical Storage
Management (HSM) migrates
infrequently accessed files to
tape or object-based cloud,
automatically recalls back when
accessed
Local Read-Only Cache
(LROC) and Highly Available
Write Cache (HAWC) caches the
busiest blocks of files on local flash
Disaster Recovery
(DR) remotely mirrors
data to remote
locations
Migrate/Recall
Tape
NSD Client
Information Lifecycle
Management (ILM) moves
data across tiers of flash and
disk
Object Cloud
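The ILM and HSM behavior described on this slide (moving data across flash and disk, migrating infrequently accessed files to tape, recalling them on access) can be illustrated with a generic age-based placement rule. This is a minimal sketch, not IBM Spectrum Scale's policy language; the thresholds, tier names and `FileInfo` type are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class FileInfo:
    path: str
    days_since_access: int
    tier: str  # "flash", "disk", or "tape"


def plan_moves(
    files: List[FileInfo],
    disk_after_days: int = 30,    # hypothetical threshold
    tape_after_days: int = 180,   # hypothetical threshold
) -> List[Tuple[str, str, str]]:
    """Return (path, from_tier, to_tier) for every file whose tier should change."""
    moves = []
    for f in files:
        if f.days_since_access >= tape_after_days:
            target = "tape"       # migrate cold files to the cheapest tier
        elif f.days_since_access >= disk_after_days:
            target = "disk"
        else:
            target = "flash"      # recently used files belong on the fastest tier
        if target != f.tier:
            moves.append((f.path, f.tier, target))
    return moves
```

A file that sits on tape but was accessed today comes back as a tape-to-flash move, which corresponds to a recall in this simplified model.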
21. #ibmedge
Native Encryption and Secure Erase
Application
IBM Security Key
Lifecycle Manager (SKLM)
TCP/IP or
RDMA
Node-to-Node Encryption
Complies with NIST SP 800-131A
Data-at-Rest Encryption
• Files are encrypted by application node
• Each file assigned random File-key
• Master-key granularity by file or fileset,
determined by policies
• IBM SKLM stores Master-keys, and nodes
must have appropriate credentials
• Data is encrypted from application node all the
way to NSD (flash or disk) media
• FIPS 140-2 certified
Secure Erase
• Files are cryptographically erased by deleting
their Master-key
• Files that “stay” are re-Mastered to new key
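The key hierarchy above (a random file key per file, wrapped by a master key held in an external key manager) and the resulting cryptographic erase can be sketched as follows. This is a toy model only: the XOR "wrap" is a stand-in for a real key-wrapping algorithm and is not secure, and the `KeyServer` and `EncryptedFile` classes are hypothetical, not the SKLM or Spectrum Scale interfaces.

```python
import secrets


def xor(a: bytes, b: bytes) -> bytes:
    """Toy stand-in for key wrapping. NOT secure; for illustration only."""
    return bytes(x ^ y for x, y in zip(a, b))


class KeyServer:
    """Hypothetical stand-in for an external key manager holding master keys."""

    def __init__(self) -> None:
        self._masters = {}

    def create(self, key_id: str) -> None:
        self._masters[key_id] = secrets.token_bytes(32)

    def get(self, key_id: str) -> bytes:
        return self._masters[key_id]  # raises KeyError once the key is deleted

    def delete(self, key_id: str) -> None:
        del self._masters[key_id]


class EncryptedFile:
    def __init__(self, server: KeyServer, master_id: str) -> None:
        file_key = secrets.token_bytes(32)  # random per-file key
        self.master_id = master_id
        # Only the wrapped file key is kept with the file.
        self.wrapped_key = xor(file_key, server.get(master_id))

    def file_key(self, server: KeyServer) -> bytes:
        return xor(self.wrapped_key, server.get(self.master_id))

    def rewrap(self, server: KeyServer, new_master_id: str) -> None:
        # Files that stay are re-wrapped under a new master key.
        key = self.file_key(server)
        self.wrapped_key = xor(key, server.get(new_master_id))
        self.master_id = new_master_id
```

In this model, re-wrapping the files to keep and then deleting the old master key leaves every other file's key unrecoverable, without touching the file data itself.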
22. #ibmedge
IBM Spectrum Scale supports OpenStack environments
Glance
• VM images
• Disk images
Cinder
• Volumes
Swift
• Objects
Manila
• File systems
Global Name Space
Volume-on-file Object-on-file
• Create, Delete and Extend volumes
• Take snapshots (FlashCopy) and clones
• Volumes to Images, Images to Volumes
• Attach and Detach to/from VM instances
• Create and Delete containers in account
• Upload, Download and Delete objects
• List containers or objects in a container
• Display and update metadata
Keystone
• Access control
23. #ibmedge
IBM Spectrum Scale™ solves “Dropbox problem”
for Enterprise File Sync-and-Share (EFSS)
SAN
Internal,
Direct-Attach
No IT Control:
• Servers and storage
• Security
• Access control
• User provisioning
• Sensitive data
TCP/IP or RDMA network
Twin-tailed
24. #ibmedge
IBM Spectrum Scale on any Cloud
Active File
Management
Private VLAN
NSD Clients and
Servers can be deployed
within a Private VLAN on
any Cloud
25. #ibmedge
IBM Spectrum Archive™ Overview
IBM Spectrum Archive enables IBM tape libraries to read and write
LTFS-format tapes as part of an IBM Spectrum Scale™ global
namespace
• Based on the integration of IBM Spectrum Scale™
and LTFS technology
• Supports IBM Spectrum-enabled devices
–TS1140 (or higher) Enterprise Drive
–LTO5 (or higher) Ultrium drive
–IBM Libraries TS4500, TS3500, TS3310, etc.
• Integrated functionality with
IBM Spectrum Scale
–Supports Policy based migrations
–Seamless DMAPI usage
–Data replication to multiple pools
• Supports scale-out for capacity and I/O
–Seamless cache controls between
IBM Spectrum Archive Nodes
–Tape drive performance balancing
–Multiple node performance balancing
Tokyo Las Vegas London
Clients
Wide Area Network (WAN)
Global Namespace
LTFS LTFS LTFS LTFS
26. #ibmedge
For large archives with occasional retrieval at very low
cost with predictable retrieval costs
IBM Spectrum Archive – as a Service!
27. #ibmedge
Global Namespace for Files and Objects
XIV
DS8000 FlashSystem
IBM Spectrum Scale™
POSIX
Windows
Linux
AIX
NFS
SMB
FTP
SCP
HTTPS
Object
• OpenStack
Swift
• Amazon S3
Hadoop
Spark
IBM Spectrum Scale™,
Elastic Storage Server,
Storwize V7000 Unified
Other
IBM and
non-IBM
Flash/Disk
systems
Software Defined Environment
OpenStack IBM Cloud x86-based
Internal and
Direct Attach
Storage
JBOF/JBOD
FPO,
RAID
IBM Spectrum Virtualize™
SVC, FlashSystem V9000,
Storwize, and VersaStack
IBM Cloud Object
Storage System
28. #ibmedge
Storage Positioning – Filling a Gap
Unified file and
object storage.
Optimized for high
performance, across
flash and disk
Flash 15K
Unified file and object
storage on tape
Information Lifecycle
Management (ILM) across tiers
Highest Performance
Lowest cost
Tape
IBM was looking to offer easy to
manage, scalable disk-based
object storage for unstructured
data
• Moderate performance
• Moderate cost
10K 7200 rpm
29. #ibmedge
How is Object Storage Different?
Block and File Storage
• Decide where to put it
– For block, which array/volume/LUN
– For file, which filer/subdirectory
• Remember where it is to get it back
• Read/Write records, append data
• Limits on LUN size, number of files
Object Storage
• Hand data over to the object store
– Get “claim stub” reference locator
• Use or share “claim stub” to access data
HTTP, OpenStack Swift, Amazon S3
• Get/Put/Delete object in its entirety
• Effectively “unlimited” scalability
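The "claim stub" idea can be illustrated with a tiny in-memory store: the caller hands over the data, receives a locator, and later gets or deletes the whole object by that locator. This is a sketch of the slide's metaphor only; the `ObjectStore` class is hypothetical, and in OpenStack Swift and Amazon S3 the caller normally chooses the container and object name rather than receiving a generated ID.

```python
import uuid


class ObjectStore:
    """In-memory toy object store: whole-object put, get and delete."""

    def __init__(self) -> None:
        self._objects = {}

    def put(self, data: bytes) -> str:
        # The store decides placement; the caller only keeps the locator.
        object_id = uuid.uuid4().hex
        self._objects[object_id] = bytes(data)
        return object_id

    def get(self, object_id: str) -> bytes:
        # Objects are returned in their entirety, never updated in place.
        return self._objects[object_id]

    def delete(self, object_id: str) -> None:
        del self._objects[object_id]
```

There is no volume, LUN or directory to choose in this model, which is the contrast the slide draws with block and file storage.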
30. #ibmedge
Object Store for Unstructured data
Hot Data
High-IOPS and Low-Latency
All-Flash and Hybrid Flash/Disk
Information Lifecycle Management (ILM)
Structured data / Random-Access
Virtual Machines and VDI
Single-Tenancy
Static and Stable data
Backups, Files, Archives
Seismic, Research, Telemetry, HPC
Video, Animation, Body Cams
Photos, Images, CAD/CAM, GIS
Music, Audio
Genomic, Medical Images
Multi-tenancy
Object Store provides a
• Secure
• Reliable
• Scalable
• Cost Effective
Platform For Unstructured data
Object Store
is not designed for
• High IOPS workflows
• Transaction
Processing
• Inherent ILM
31. #ibmedge
Object storage is dominant in cloud
[Chart: IDC, Worldwide file- and object-based storage, 2014–2018 Forecast (EB).
Vertical axis: 0 to 250 EB. Horizontal axis: 2012 to 2018.
Series: File-based software, appliances, and gateways;
Object-based software and appliances (includes self-built/SaaS/cloud NAS)]
32. #ibmedge
Data Protection Schemes
Scheme              Tolerates           Pieces   Slices       Overhead
RAID-1 / RAID-10    1 drive failure     K        2 x K        2.0X
RAID-5              1 drive failure     K        K + 1        1.2X
Triplication        2 drive failures    K        3 x K        3.0X
RAID-6              2 drive failures    K        K + 2        1.5X
Erasure Coding      “M” failures        K        K + M = N    1.7X
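The overhead figures on this slide follow from dividing the number of slices stored (N) by the number of data pieces (K). The sketch below works through that arithmetic. The specific K and M values are assumptions chosen to land on the slide's rounded figures; the slide itself does not state them. Mirroring and triplication are simply 2.0X and 3.0X.

```python
def overhead(k: int, n: int) -> float:
    """Raw-to-usable expansion factor when K pieces are stored as N slices."""
    return n / k


def tolerated_failures(k: int, n: int) -> int:
    """Slices that may be lost when any K of the N slices can rebuild the data."""
    return n - k


# (K, N) pairs; the K values are illustrative assumptions, not from the slide.
schemes = {
    "RAID-5 (5+1)": (5, 6),
    "RAID-6 (4+2)": (4, 6),
    "Erasure coding (7+5)": (7, 12),
}

for name, (k, n) in schemes.items():
    print(f"{name}: {overhead(k, n):.1f}X, tolerates {tolerated_failures(k, n)} failures")
```

The point of the comparison is that erasure coding can tolerate more failures than triplication while storing considerably less raw data.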
33. #ibmedge
Data Growth at Petabyte (PB) Scale
PB of data
3 to 5x
Data Protection
RAID, Mirrors,
Replication, Tape
Data Protection
High Availability & Disaster Recovery
Geo-Distribution & Erasure Coding
Infrastructure
Proprietary, specialized
hardware, multiple systems
Operations
More than 1 FTE per PB
Maintenance outages
Infrastructure
Software Defined,
Commodity Hardware,
Single System
Operations
Less than 1 FTE per 6 PB
Single system, Secure
Self-healing
1.7 x
60% Less
Hardware &
Rack space
Traditional
Approach
IBM Cloud
Object Storage
Approach
70%
Lower
TCO
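The expansion factors on this slide (3 to 5x for the traditional approach, 1.7x for the object storage approach) can be turned into a rough raw-capacity comparison. This sketch only relates those two numbers; it is an assumption that the slide's hardware figure is derived this way, and the function names are hypothetical.

```python
def raw_capacity_pb(usable_pb: float, expansion: float) -> float:
    """Raw capacity needed to hold usable_pb at a given expansion factor."""
    return usable_pb * expansion


def hardware_saving(traditional_expansion: float, ec_expansion: float = 1.7) -> float:
    """Fraction of raw capacity saved relative to the traditional approach."""
    return 1 - ec_expansion / traditional_expansion


for factor in (3.0, 4.25, 5.0):
    print(f"{factor}x traditional: {hardware_saving(factor):.0%} less raw capacity")
```

On this arithmetic alone, the saving ranges from roughly 43% at 3x to 66% at 5x, and a 60% saving corresponds to a traditional expansion of about 4.25x.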
34. #ibmedge
IBM Spectrum Storage and IBM Cloud Object Storage
Unified file and object
storage. Optimized for
high performance, across
flash, disk and object
store
Flash
Object
Store
Disk
IBM Cloud Object Storage System
( File, backup and archive interfaces
available through variety of options )
IBM SoftLayer
OpenStack Swift
Amazon Web Services S3
Swift S3 emulation
Unified file and object
storage on tape
Transparent Cloud Tiering
Information Lifecycle
Management (ILM) across tiers
Highest Performance
Lowest cost
Tape
35. #ibmedge
Cloud Container Pools for IBM Spectrum Protect
Client nodes
• OpenStack Swift
• IBM Cloud Object Storage System
Off-premises:
• IBM SoftLayer
• Amazon AWS
On-premises
IBM Spectrum
Protect Server
IBM Spectrum
Protect Server
36. #ibmedge
IBM SoftLayer Storage Services
Object Storage (OpenStack Swift,
IBM Cloud Object Storage System)
Consistent Endurance & Performance (NFS)
Local
SAN
Consistent Endurance & Performance
Portable Storage Volumes
Block
File
Object
Archival
Online
Ephemeral
Persistent
• Endurance has tiers: 0.25, 2, and 4 IOPS per GB
• Performance gives knobs: you can select size and
performance independently.
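Because Endurance tiers are expressed in IOPS per GB, the IOPS available to a volume scale with its size. A minimal sketch of that arithmetic follows; the function name is hypothetical, and it assumes the tier value is applied linearly to the provisioned capacity.

```python
ENDURANCE_TIERS = (0.25, 2, 4)  # IOPS per GB, from the slide


def endurance_iops(size_gb: float, iops_per_gb: float) -> float:
    """IOPS for an Endurance volume, assuming linear scaling with size."""
    if iops_per_gb not in ENDURANCE_TIERS:
        raise ValueError(f"unknown Endurance tier: {iops_per_gb}")
    return size_gb * iops_per_gb
```

With Performance volumes, by contrast, size and IOPS are chosen independently, so no such formula ties them together.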
Big Storage
(IBM Spectrum Archive as a Service)
37. #ibmedge
Cloud Storage Taxonomy
Storage as the Storage Cloud
Storage for the Compute Cloud
Persistent
Storage
• Persists across
VM reboots
• Can be shared
between VMs
• Transactional
• High
Performance
Reference
Storage
• Archives
• Images
Video
• NENR and
WORM
Ephemeral
Storage
• Typically boot
volumes,
page files and
temporary data
• Goes away when
VM is shut down
Hosted Storage
• File Storage
• Object Storage
• Backup
• Disaster
Recovery
IBM Spectrum Accelerate, XIV
FlashSystem, DS8000, SVC
IBM Cloud Object Storage System,
IBM Spectrum Archive
IBM Spectrum Scale, Elastic Storage Server
Transactional
Performance
Universal Access
Lowest
TCO
39. #ibmedge
IBM Redbooks on IBM Spectrum Scale
IBM Spectrum
Scale (formerly
GPFS)
Implementing IBM
Spectrum Scale
IBM Spectrum
Scale in an
OpenStack
Environment
IBM Spectrum
Scale – Big Data
and Analytics
Solution
IBM Spectrum
Scale and ECM
FileNet Content
Manager
www.redbooks.ibm.com
40. #ibmedge
IBM Tucson Executive Briefing Center
• Tucson, Arizona is home for storage
hardware and software design and
development
• IBM Tucson Executive Briefing Center
offers:
• Technology briefings
• Product demonstrations
• Solution workshops
• Take a video tour!
• http://youtu.be/CXrpoCZAazg
41. #ibmedge
About the Speaker
Tony Pearson is a Master Inventor and Senior Software Engineer for the IBM Storage product line. Tony joined IBM
Corporation in 1986 in Tucson, Arizona, USA, and has lived there ever since. In his current role, Tony presents briefings on
storage topics covering the entire IBM Storage product line, IBM Spectrum Storage software products, and topics related to
Cloud Computing, Analytics and Cognitive Solutions. He interacts with clients, speaks at conferences and events, and leads
client workshops to help clients with strategic planning for IBM’s integrated set of storage management software, hardware, and
virtualization products.
Tony writes the “Inside System Storage” blog, which is read by hundreds of clients, IBM sales reps and IBM Business Partners
every week. This blog was rated one of the top 10 blogs for the IT storage industry by “Networking World” magazine, and #1
most read IBM blog on IBM’s developerWorks. The blog has been published in a series of books, Inside System Storage: Volume
I through V.
Over the past years, Tony has worked in development, marketing and customer care positions for various storage hardware and
software products. Tony has a Bachelor of Science degree in Software Engineering, and a Master of Science degree in
Electrical Engineering, both from the University of Arizona. Tony holds 19 patents for inventions on storage hardware and
software products.
9000 S. Rita Road
Bldg 9032 Floor 1
Tucson, AZ 85744
+1 520-799-4309 (Office)
tpearson@us.ibm.com
Tony Pearson
Master Inventor
Senior Software
Engineer
IBM Storage
43. #ibmedge
Please Note: Edge 2016 Disclaimers
• IBM’s statements regarding its plans, directions, and intent are subject to change or
withdrawal without notice and at IBM’s sole discretion.
• Information regarding potential future products is intended to outline our general product
direction and it should not be relied on in making a purchasing decision.
• The information mentioned regarding potential future products is not a commitment, promise,
or legal obligation to deliver any material, code or functionality. Information about potential
future products may not be incorporated into any contract.
• The development, release, and timing of any future features or functionality described for our
products remains at our sole discretion.
• Performance is based on measurements and projections using standard IBM benchmarks in
a controlled environment. The actual throughput or performance that any user will
experience will vary depending upon many factors, including considerations such as the
amount of multiprogramming in the user’s job stream, the I/O configuration, the storage
configuration, and the workload processed. Therefore, no assurance can be given that an
individual user will achieve results similar to those stated here.