SlideShare ist ein Scribd-Unternehmen logo
1 von 46
Downloaden Sie, um offline zu lesen
Extreme Performance Series:
Storage in a Flash
Sankaran Sivathanu, VMware
Mark Achtemichuk, VMware
VSVC5603
#VSVC5603
2
Flash Storage
 Flash is everywhere
 Used extensively in smartphones, tablets, laptop computers,
storage arrays, etc.
 Adopting flash in enterprise servers?
• Presents an economical alternative to having a storage array
 How does VMware embrace flash technology in vSphere 5.5?
• Native support for provisioning of flash resources
• Flash caching support in ESXi storage stack
• vSAN leverages flash storage for high performance
 Focus:
Application performance on vSphere 5.5 when leveraging flash
3
Agenda
vSphere Flash
Read Cache
(vFRC)
Virtual SAN
(vSAN)
Configurations
and Troubleshooting
4
vFRC Overview
5
vFRC – Overview
 vSphere 5.5 introduces vSphere Flash Infrastructure layer
• Aggregates flash storage devices into a unified flash resource
• Supports locally connected flash devices (PCIe, SAS/SATA drives, etc.,)
 Flash resource can be used for read caching of VM I/Os
• vSphere Flash Read Cache (vFRC)
 Write Policy
• Write-through : write I/Os are written to persistent storage and vFRC
simultaneously
• Large writes are filtered – avoids cache pollution with log/streaming data
 Caches configured on per-VMDK basis
• Can be custom configured based on workload
6
vFRC - Overview
Flash Read Cache
VMDKs
VM Layer
ESX Layer
Storage Layer
7
vFRC Tunables
8
Performance Tunables in vFRC
 What workloads can benefit from vFRC ?
• Read-dominated I/O pattern
• High repeated access of data (E.g. 20% of working set accessed 80% of time)
• Sufficient flash capacity to hold data that is accessed repeatedly
 What impacts vFRC performance ?
• Cache Size – should be big enough to hold active working set of workload
• Cache Block Size – should match the dominant I/O size of workload
• Flash Device Types – PCIe flash cards vs. SSD drives
9
a. Cache Size
 Cache sizes are specified manually when enabling vFRC for
a VMDK
• Depends on working set of the application
• Should be sized to hold active working set
 Inadequate cache sizes lead to increased cache miss rate
 Over-sized cache leads to wastage of flash resources and sub-
optimal performance during vMotion
• By default cache is migrated during vMotion
• Over-sized cache increases vMotion time
 How to determine the right working set size?
• vscsiStats workload tracing
 Cache size can be modified at run-time if necessary
10
b. Cache Block Size
 Basic unit of cache fill and cache eviction operation
 Affects effective utilization of cache capacity
 Bigger cache blocks lead to internal fragmentation, but consumes
less memory
 Smaller cache blocks consumes more memory (upto 2% memory
space overhead)
 Default cache block size is 8KB
0
0.5
1
1.5
2
2.5
4 8 16 32 64 128 256 512
MemoryConsumed/Cachesize(%)
Cache Block Size (KB)
Memory Overhead wrt. vFRC block size
0
300
600
900
1200
1500
4KB 8 KB 16KB 32KB 64KB 128KB 256KB 512KB Baseline (no
vFRC)
LatencyinMicroseconds
Cache Block Size
Performance Impact of Cache Block Size
11
b. Cache Block Size
 Larger Cache Block Size (Example: 512KB cache block size for
workload I/O Size of 8KB) – Internal Fragmentation
vFRC
Cache Blocks
Valid Cached Data
12
c. Flash Device Type
30k – 40k Random Read IOPS
200 – 270 MB/s Read Bandwidth
Read Latency – 75 microseconds
Write Latency – 90 microseconds
Upto 750k Random Read IOPS
Upto 3 GB/s Read Bandwidth
Read Latency – 75 microseconds
Write Latency – 15 microseconds
High Performance Low Cost
VENDOR SPECIFICATIONS
13
vFRC Performance
14
vFRC Performance – Applications
 What workloads can benefit from vFRC?
• Read-dominated I/O pattern
• High repeated access of data (E.g. 20% of working set accessed 80% of time)
• Sufficient flash capacity to hold data that is accessed repeatedly
 Applications Considered:
• Data Warehousing (Swingbench DSS)
• Database Transactions (DVDstore)
• Real-World Enterprise Server Workloads (Publicly available I/O Traces)
15
1. Data Warehousing Application
 Decision Support System [TPC-H]
 Benchmark : Swingbench 2.4 using ‘Sales History’ Schema on
Oracle 11g R2 database
SWINGBENCH DSS
BENCHMARK ON RHEL
6.4 VM
QUERIES >>
<< RESULTS
ORACLE 11G R2 ON
WINDOWS 2008
SERVER VM
vFRC
EMC VNX 5700
1TB LUN, RAID5 OVER 5 FC
15k RPM HDDs
16
1. Data Warehousing Application
 Workload: Read dominated, High re-access rate
 vFRC Configuration: 8GB Cache Size and 8KB Cache block size
0
2000
4000
6000
8000
10000
12000
SRMC SCMC PSCR SMA PPSC TSQ SQC
#oftransacons
Transac on Type
Transac on Count
Baseline VFRC
17
1. Data Warehousing Application
 Workload: Read dominated, High re-access rate
 vFRC Configuration: 8GB Cache Size and 8KB Cache block size
 Up to 84% improvement in average throughput
 Up to 2X reduction in latency
61.7
112.9
0
20
40
60
80
100
120
Baseline VFRC
TPM
Transac ons Per Minute
20.389
10.859
0
5
10
15
20
25
Baseline VFRC
ResponseTIme(s)
Average Response Time
18
2. Database Transaction Application
 Benchmark : DVDStore
• Simulates online e-commerce site operations
• Database : MS SQL Server 2008
• Database Size : 15 GB
 Workload Characteristics
• 60% reads
• Mostly random I/Os
• Predominant I/O size : 8KB
 VM Configuration
• 8 vCPUs, 8GB Memory
• 25GB Database disk, 10GB Log disk
 Storage Array
• VNX 5700, 1TB LUN – RAID5 over 5 FC 15k RPM disk drives
19
2. Database Transaction Application
8802 8937
12319
0
2000
4000
6000
8000
10000
12000
14000
Baseline vFRC - 10GB vFRC - 15GB
Orders Per Minute
Up to 39% improvement in application throughput
20
3. Enterprise Server I/O Traces
1.23
0.321
0
0.2
0.4
0.6
0.8
1
1.2
1.4
Baseline vFRC
AverageLatency(ms)
 a. Hardware Monitoring
Server Workload
• Trace from servers that logs
data from multiple hardware
monitoring programs across a
datacenter
• Collected at Microsoft
Research, Cambridge*
• Trace replayed using
IOAnalyzer
 95% reads
 vFRC size – 4GB
 vFRC block size – 4KB
 vFRC hit percentage – 85%
* Dushyanth Narayanan, Austin Donnelly, and Antony Rowstron. 2008. Write off-loading: Practical power management for enterprise storage.
Trans. Storage 4, 3, Article 10 (November 2008).
21
3. Enterprise Server I/O Traces
 b. Proxy Server Workload
• Trace from a web proxy
server
• Collected at Microsoft
Research, Cambridge*
• Trace replayed using
IOAnalyzer
 67% reads
 vFRC Size : 16GB
 vFRC block Size : 4KB
 vFRC hit percentage :
83%
* Dushyanth Narayanan, Austin Donnelly, and Antony Rowstron. 2008. Write off-loading: Practical power management for enterprise storage.
Trans. Storage 4, 3, Article 10 (November 2008).
1.357
0.612
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
Baseline vFRC
AverageLatency(ms)
22
vMotion Performance with vFRC
 vFRC is fully supported by vMotion and other vSphere features
 vMotion behavior of vFRC-enabled VM
• VM caches are migrated by default
• Option to drop cache during vMotion
 Migrating cache preserves application performance gains
• Consumes more network bandwidth
• Increased vMotion time
 Dropping cache during vMotion leads to temporary dip in
application performance gains
• No extra overhead in vMotion
• Re-warms up the cache at destination
23
vFRC Performance Best Practices
24
vFRC Configuration Guidelines
 Cache size may be configured based on working set of workload
• Start with about 20% of VMDK size, and monitor vFRC stats to re-configure it
 Cache block size must match dominant I/O size of workload
• Workload I/O size is not equal to VM I/O Size!
 vFRC performs better with PCIe flash devices
 Decide on cache migration behavior during vMotion based on:
• Criticality of application performance
• Time taken for vMotion
• Network bandwidth availability
25
Making sense of vscsiStats and vFRC Stats
 vscsiStats can be used to know more about the workload
• IO Length Histogram
• Read Write Ratio
• I/O trace to compute working set size
 vFRC stats provide information about cache effectiveness
• numBlocks – total number of cache blocks for a VMDK
• numBlocksCurrentlyCached – number of cache blocks that actually contains
data
• Evict:avgNumBlocksPerOp – average number of evictions
• avgCacheLatency – average device latency of flash resource
• maxCacheLatency – maximum device latency of flash resource
• cacheHitPercentage – percentage of read cache hits
26
vFRC Sizing Decisions based on vFRC Stats
 Using vFRC stats to make sizing decisions
• numBlocksCurrentlyCached < numBlocks : cache size may be reduced
• numBlocksCurrentlyCached = numBlocks and Evict:avgNumBlocksPerOp is
high : cache size may be inadequate
• maxCacheLatency is very high : may be because of spike in device latency,
which may mean device has worn out
• cacheHitPercentage is high and Evict:avgNumBlocksPerOp is low : means
cache is correctly configured
 For more detailed information, please refer to our performance
whitepaper http://www.vmware.com/files/pdf/techpaper/vfrc-perf-
vsphere55.pdf
27
Agenda
vSphere Flash
Read Cache
(vFRC)
Virtual SAN
(vSAN)
Configurations
and Troubleshooting
28
Virtual SAN - Architecture
 Each ESXi host contributes:
• Flash storage to absorb IOPS
• Hard disk drives to provide capacity
 Virtual SAN aggregates these
resources from multiple servers
in a vSphere cluster
• Provides a global datastore for VMs
in the cluster
 HA/DRS ensures that the VM
restarts on a host crash
 Virtual SAN objects can be split
into multiple components for
performance and data protection
• Governed by storage policies
ESX
VSAN cluster
ESX ESX
VM
virtual disk
VSAN object
replica-1 replica-2
Witness
29
Experiment Setup
 Hardware
• 16 core 2.9 GHz Dell R720 machines
• 2 x Intel PCIe R910 SSD – 200GB (1 PCIe slot)
• 12 x 10K RPM Seagate SAS disks
• 10G vSAN dedicated network, 1G for VM network
 VSAN Configuration
• 2 x Disk groups per machine, 6 x Disks per disk group
• hostFailuresToTolerate 1, stripeWidth 1
 Workload Characteristics
• ViewPlanner 3.0 Standard benchmark with 2 sec think-time (heavy user)
• ViewPlanner Group A : CPU intensive & Group B: I/O intensive apps.
• 1900 x 1200 resolution, PCoIP
• Windows 7 desktop’s and Winxp Clients.
• VDI workload is known to be CPU intensive but sensitive to I/O latency.
30
Virtual SAN Delivers IOPS Required by VDI
• Virtual SAN can meet the IOPS required by VDI workload
31
Virtual SAN Scale..
275
460
667
0
100
200
300
400
500
600
700
800
3 node 5 node 7 node
NumberofHeavyVDIUsers
Virtual SAN scale
VSAN Linear (VSAN)
32
Group A Score Comparison
0
0.2
0.4
0.6
0.8
1
1.2
Avg
Application
Latency
Group A
VSAN
SAN
• Impact to Group A application latencies is marginal
• Virtual SAN uses very few cycles of Host CPU.
33
Group B Score Comparison
0
1
2
3
4
5
6
7
Avg
Application
Latency
Group B
VSAN
All-Flash-SAN
• Group B application latencies are close to All-Flash-SAN
• Virtual SAN can meet the IOPS required by VDI workload
34
VSAN VDI Consolidation compared to physical SAN array
 VSAN performs better than a typical mid-range FC storage array
• vSAN benefits from local flash storage that provide high performance
 Impact of VSAN CPU consumption on application performance is low
 Physical SAN array is not required to run VDI workload
0
100
200
300
400
500
600
700
800
3 5 7
NumberofVMs
Number of Nodes (Servers)
VDI Consolida on Ra o
Mid-range FC Array
vSAN
All-Flash FC Array
35
Agenda
vSphere Flash
Read Cache
(vFRC)
Virtual SAN
(vSAN)
Configurations
and Troubleshooting
36
Define the Performance Issue
 Understand Application Function & Architecture
• At a minimum know what your application does and what it’s dependent on
 Select Application KPIs
• Application performance must be measured using an application counter (tps,
response time, etc.) and not virtual resource consumption
 Define Success Criteria
• With your app owner, define at what level the application KPI’s must be to
consider it performant
 Comparisons must be Apples-to-Apples
• Any changes to infrastructure (physical or virtual) create
comparison challenges
 Now the Gap is Identified, Begin Troubleshooting
• With an understanding of the requirements and current deficiency, you can
now begin to investigate and/or tune
37
Disk I/O Latencies
Application
Guest OS
ESX Storage
Stack
VMM
Driver
KAVG
DAVG
GAVG
QAVG
* KAVG = GAVG – DAVG
Fabric
vSCSI
HBA
Time spent in ESX
storage stack is minimal,
for all practical purposes
KAVG ~= QAVG
In a well configured
system QAVG should
be zero
Array SP
38
Key Indicators – Investigative Thresholds
 Kernel Latency Average (KAVG)
• This counter tracks the latencies of IO passing thru the Kernel
• Investigation Threshold: 1ms
 Device Latency Average (DAVG)
• This is the latency seen at the device driver level. It includes the roundtrip time
between the HBA and the storage
• Investigation Threshold: 15-20ms, lower is better, some spikes okay
 Aborts (ABRT/s)
• The number of commands aborted per second
• Investigation Threshold: 1
39
Disk I/O Queues
GQLEN – Guest Queue
AQLEN – Adapter Queue
WQLEN – World Queue
DQLEN – Device / LUN
Queue
SQLEN – Array SP Queue
DQLEN
WQLEN
SQLEN
GQLEN
DQLEN can change
dynamically when SIOC
is enabled
Reported in esxtop
AQLEN
Application
Guest OS
ESX Storage
Stack
VMM
Driver
Fabric
vSCSI
HBA
Array SP
40
Performance Technical Resources
Performance Technical Papers
• http://www.vmware.com/files/pdf/techpaper/vfrc-perf-vsphere55.pdf
• http://www.vmware.com/resources/techresources/cat/91,96
Performance Best Practices
• http://www.vmware.com/pdf/Perf_Best_Practices_vSphere4.0.pdf
• http://www.vmware.com/pdf/Perf_Best_Practices_vSphere4.1.pdf
• http://www.vmware.com/pdf/Perf_Best_Practices_vSphere5.0.pdf
• http://www.vmware.com/pdf/Perf_Best_Practices_vSphere5.1.pdf
 Troubleshooting Performance Related Problems in vSphere Environments
• http://communities.vmware.com/docs/DOC-14905 (vSphere 4.1)
• http://communities.vmware.com/docs/DOC-19166 (vSphere 5)
• http://communities.vmware.com/docs/DOC-23094 (vSphere 5.x with vCOps)
41
Performance Community Resources
Performance Technology Pages
• http://www.vmware.com/technical-resources/performance/resources.html
Technical Marketing Blog
• http://blogs.vmware.com/vsphere/performance/
Performance Engineering Blog VROOM!
• http://blogs.vmware.com/performance
Performance Community Forum
• http://communities.vmware.com/community/vmtn/general/performance
Virtualizing Business Critical Applications
• http://www.vmware.com/solutions/business-critical-apps/
42
Extreme Performance Series Sessions
Extreme Performance Series:
vCenter of the Universe – Session #VSVC5234
Monster Virtual Machines – Session # VSVC4811
Network Speed Ahead – Session # VSVC5596
Storage in a Flash – Session # VSVC5603
Big Data:
Virtualized SAP HANA Performance, Scalability and Practices – Session
# VAPP5591
Hands on Labs:
HOL-SDC-1304 – Optimize vSphere Performance  includes vFRC
43
Other VMware Activities Related to This Session
 HOL:
HOL-SDC-1308
Virtual Storage Solutions
 Group Discussions:
VSVC1001-GD
Performance with Mark Achtemichuk
THANK YOU
Extreme Performance Series:
Storage in a Flash
Sankaran Sivathanu, VMware
Mark Achtemichuk, VMware
VSVC5603
#VSVC5603

Weitere ähnliche Inhalte

Was ist angesagt?

Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache TieringCeph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Community
 
Best Practices of HA and Replication of PostgreSQL in Virtualized Environments
Best Practices of HA and Replication of PostgreSQL in Virtualized EnvironmentsBest Practices of HA and Replication of PostgreSQL in Virtualized Environments
Best Practices of HA and Replication of PostgreSQL in Virtualized Environments
Jignesh Shah
 

Was ist angesagt? (20)

Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
 
Storage spaces direct webinar
Storage spaces direct webinarStorage spaces direct webinar
Storage spaces direct webinar
 
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
IMCSummit 2015 - Day 1 Developer Track - Evolution of non-volatile memory exp...
 
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
 
Stabilizing Ceph
Stabilizing CephStabilizing Ceph
Stabilizing Ceph
 
TDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 ENTDS-16489U-R2 0215 EN
TDS-16489U-R2 0215 EN
 
Simplifying Ceph Management with Virtual Storage Manager (VSM)
Simplifying Ceph Management with Virtual Storage Manager (VSM)Simplifying Ceph Management with Virtual Storage Manager (VSM)
Simplifying Ceph Management with Virtual Storage Manager (VSM)
 
Redis on NVMe SSD - Zvika Guz, Samsung
 Redis on NVMe SSD - Zvika Guz, Samsung Redis on NVMe SSD - Zvika Guz, Samsung
Redis on NVMe SSD - Zvika Guz, Samsung
 
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache TieringCeph Day Shanghai - Recovery Erasure Coding and Cache Tiering
Ceph Day Shanghai - Recovery Erasure Coding and Cache Tiering
 
Best Practices of HA and Replication of PostgreSQL in Virtualized Environments
Best Practices of HA and Replication of PostgreSQL in Virtualized EnvironmentsBest Practices of HA and Replication of PostgreSQL in Virtualized Environments
Best Practices of HA and Replication of PostgreSQL in Virtualized Environments
 
Ceph Day Melbourne - Troubleshooting Ceph
Ceph Day Melbourne - Troubleshooting Ceph Ceph Day Melbourne - Troubleshooting Ceph
Ceph Day Melbourne - Troubleshooting Ceph
 
Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data
 
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
 
Ceph Performance Profiling and Reporting
Ceph Performance Profiling and ReportingCeph Performance Profiling and Reporting
Ceph Performance Profiling and Reporting
 
iSCSI Target Support for Ceph
iSCSI Target Support for Ceph iSCSI Target Support for Ceph
iSCSI Target Support for Ceph
 
Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce
 
Ceph on All Flash Storage -- Breaking Performance Barriers
Ceph on All Flash Storage -- Breaking Performance BarriersCeph on All Flash Storage -- Breaking Performance Barriers
Ceph on All Flash Storage -- Breaking Performance Barriers
 
IMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
IMCSummit 2015 - Day 2 Developer Track - The NVM RevolutionIMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
IMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
 
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash TechnologyCeph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
 
Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage
 

Ähnlich wie VMworld 2013: Extreme Performance Series: Storage in a Flash

Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan ShettyTrack 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
EMC Forum India
 
Presentacion NetApp Funcionalidades.pptx
Presentacion NetApp Funcionalidades.pptxPresentacion NetApp Funcionalidades.pptx
Presentacion NetApp Funcionalidades.pptx
ssuser97bdaf
 

Ähnlich wie VMworld 2013: Extreme Performance Series: Storage in a Flash (20)

VMworld 2013: vSphere Flash Read Cache Technical Overview
VMworld 2013: vSphere Flash Read Cache Technical Overview VMworld 2013: vSphere Flash Read Cache Technical Overview
VMworld 2013: vSphere Flash Read Cache Technical Overview
 
VMworld 2013: Successfully Virtualize Microsoft Exchange Server
VMworld 2013: Successfully Virtualize Microsoft Exchange Server VMworld 2013: Successfully Virtualize Microsoft Exchange Server
VMworld 2013: Successfully Virtualize Microsoft Exchange Server
 
Developing High Performance and Scalable ColdFusion Application Using Terraco...
Developing High Performance and Scalable ColdFusion Application Using Terraco...Developing High Performance and Scalable ColdFusion Application Using Terraco...
Developing High Performance and Scalable ColdFusion Application Using Terraco...
 
Developing High Performance and Scalable ColdFusion Applications Using Terrac...
Developing High Performance and Scalable ColdFusion Applications Using Terrac...Developing High Performance and Scalable ColdFusion Applications Using Terrac...
Developing High Performance and Scalable ColdFusion Applications Using Terrac...
 
VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data CenterVMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
 
VMworld - sto7650 -Software defined storage @VMmware primer
VMworld - sto7650 -Software defined storage  @VMmware primerVMworld - sto7650 -Software defined storage  @VMmware primer
VMworld - sto7650 -Software defined storage @VMmware primer
 
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best PracticesVMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
VMworld 2014: Advanced SQL Server on vSphere Techniques and Best Practices
 
Open vStorage Road show 2015 Q1
Open vStorage Road show 2015 Q1Open vStorage Road show 2015 Q1
Open vStorage Road show 2015 Q1
 
VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld 2013: How SRP Delivers More Than Power to Their Customers VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld 2013: How SRP Delivers More Than Power to Their Customers
 
VMworld 2013: Storage IO Control: Concepts, Configuration and Best Practices ...
VMworld 2013: Storage IO Control: Concepts, Configuration and Best Practices ...VMworld 2013: Storage IO Control: Concepts, Configuration and Best Practices ...
VMworld 2013: Storage IO Control: Concepts, Configuration and Best Practices ...
 
What is coming for VMware vSphere?
What is coming for VMware vSphere?What is coming for VMware vSphere?
What is coming for VMware vSphere?
 
Road show 2015 triangle meetup
Road show 2015 triangle meetupRoad show 2015 triangle meetup
Road show 2015 triangle meetup
 
Ceph Day Beijing - Ceph all-flash array design based on NUMA architecture
Ceph Day Beijing - Ceph all-flash array design based on NUMA architectureCeph Day Beijing - Ceph all-flash array design based on NUMA architecture
Ceph Day Beijing - Ceph all-flash array design based on NUMA architecture
 
Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan ShettyTrack 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
 
Mini-Training: To cache or not to cache
Mini-Training: To cache or not to cacheMini-Training: To cache or not to cache
Mini-Training: To cache or not to cache
 
Inter connect2016 yss1841-cloud-storage-options-v4
Inter connect2016 yss1841-cloud-storage-options-v4Inter connect2016 yss1841-cloud-storage-options-v4
Inter connect2016 yss1841-cloud-storage-options-v4
 
Presentazione VMware @ VMUGIT UserCon 2015
Presentazione VMware @ VMUGIT UserCon 2015Presentazione VMware @ VMUGIT UserCon 2015
Presentazione VMware @ VMUGIT UserCon 2015
 
Presentacion NetApp Funcionalidades.pptx
Presentacion NetApp Funcionalidades.pptxPresentacion NetApp Funcionalidades.pptx
Presentacion NetApp Funcionalidades.pptx
 
cache concepts and varnish-cache
cache concepts and varnish-cachecache concepts and varnish-cache
cache concepts and varnish-cache
 
vSphere
vSpherevSphere
vSphere
 

Mehr von VMworld

Mehr von VMworld (20)

VMworld 2016: vSphere 6.x Host Resource Deep Dive
VMworld 2016: vSphere 6.x Host Resource Deep DiveVMworld 2016: vSphere 6.x Host Resource Deep Dive
VMworld 2016: vSphere 6.x Host Resource Deep Dive
 
VMworld 2016: Troubleshooting 101 for Horizon
VMworld 2016: Troubleshooting 101 for HorizonVMworld 2016: Troubleshooting 101 for Horizon
VMworld 2016: Troubleshooting 101 for Horizon
 
VMworld 2016: Advanced Network Services with NSX
VMworld 2016: Advanced Network Services with NSXVMworld 2016: Advanced Network Services with NSX
VMworld 2016: Advanced Network Services with NSX
 
VMworld 2016: How to Deploy VMware NSX with Cisco Infrastructure
VMworld 2016: How to Deploy VMware NSX with Cisco InfrastructureVMworld 2016: How to Deploy VMware NSX with Cisco Infrastructure
VMworld 2016: How to Deploy VMware NSX with Cisco Infrastructure
 
VMworld 2016: Enforcing a vSphere Cluster Design with PowerCLI Automation
VMworld 2016: Enforcing a vSphere Cluster Design with PowerCLI AutomationVMworld 2016: Enforcing a vSphere Cluster Design with PowerCLI Automation
VMworld 2016: Enforcing a vSphere Cluster Design with PowerCLI Automation
 
VMworld 2016: What's New with Horizon 7
VMworld 2016: What's New with Horizon 7VMworld 2016: What's New with Horizon 7
VMworld 2016: What's New with Horizon 7
 
VMworld 2016: Virtual Volumes Technical Deep Dive
VMworld 2016: Virtual Volumes Technical Deep DiveVMworld 2016: Virtual Volumes Technical Deep Dive
VMworld 2016: Virtual Volumes Technical Deep Dive
 
VMworld 2016: Advances in Remote Display Protocol Technology with VMware Blas...
VMworld 2016: Advances in Remote Display Protocol Technology with VMware Blas...VMworld 2016: Advances in Remote Display Protocol Technology with VMware Blas...
VMworld 2016: Advances in Remote Display Protocol Technology with VMware Blas...
 
VMworld 2016: The KISS of vRealize Operations!
VMworld 2016: The KISS of vRealize Operations! VMworld 2016: The KISS of vRealize Operations!
VMworld 2016: The KISS of vRealize Operations!
 
VMworld 2016: Getting Started with PowerShell and PowerCLI for Your VMware En...
VMworld 2016: Getting Started with PowerShell and PowerCLI for Your VMware En...VMworld 2016: Getting Started with PowerShell and PowerCLI for Your VMware En...
VMworld 2016: Getting Started with PowerShell and PowerCLI for Your VMware En...
 
VMworld 2016: Ask the vCenter Server Exerts Panel
VMworld 2016: Ask the vCenter Server Exerts PanelVMworld 2016: Ask the vCenter Server Exerts Panel
VMworld 2016: Ask the vCenter Server Exerts Panel
 
VMworld 2016: Virtualize Active Directory, the Right Way!
VMworld 2016: Virtualize Active Directory, the Right Way! VMworld 2016: Virtualize Active Directory, the Right Way!
VMworld 2016: Virtualize Active Directory, the Right Way!
 
VMworld 2016: Migrating from a hardware based firewall to NSX to improve perf...
VMworld 2016: Migrating from a hardware based firewall to NSX to improve perf...VMworld 2016: Migrating from a hardware based firewall to NSX to improve perf...
VMworld 2016: Migrating from a hardware based firewall to NSX to improve perf...
 
VMworld 2015: Troubleshooting for vSphere 6
VMworld 2015: Troubleshooting for vSphere 6VMworld 2015: Troubleshooting for vSphere 6
VMworld 2015: Troubleshooting for vSphere 6
 
VMworld 2015: Monitoring and Managing Applications with vRealize Operations 6...
VMworld 2015: Monitoring and Managing Applications with vRealize Operations 6...VMworld 2015: Monitoring and Managing Applications with vRealize Operations 6...
VMworld 2015: Monitoring and Managing Applications with vRealize Operations 6...
 
VMworld 2015: Advanced SQL Server on vSphere
VMworld 2015: Advanced SQL Server on vSphereVMworld 2015: Advanced SQL Server on vSphere
VMworld 2015: Advanced SQL Server on vSphere
 
VMworld 2015: Virtualize Active Directory, the Right Way!
VMworld 2015: Virtualize Active Directory, the Right Way!VMworld 2015: Virtualize Active Directory, the Right Way!
VMworld 2015: Virtualize Active Directory, the Right Way!
 
VMworld 2015: Site Recovery Manager and Policy Based DR Deep Dive with Engine...
VMworld 2015: Site Recovery Manager and Policy Based DR Deep Dive with Engine...VMworld 2015: Site Recovery Manager and Policy Based DR Deep Dive with Engine...
VMworld 2015: Site Recovery Manager and Policy Based DR Deep Dive with Engine...
 
VMworld 2015: Building a Business Case for Virtual SAN
VMworld 2015: Building a Business Case for Virtual SANVMworld 2015: Building a Business Case for Virtual SAN
VMworld 2015: Building a Business Case for Virtual SAN
 
VMworld 2015: Explaining Advanced Virtual Volumes Configurations
VMworld 2015: Explaining Advanced Virtual Volumes ConfigurationsVMworld 2015: Explaining Advanced Virtual Volumes Configurations
VMworld 2015: Explaining Advanced Virtual Volumes Configurations
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

VMworld 2013: Extreme Performance Series: Storage in a Flash

  • 1. Extreme Performance Series: Storage in a Flash Sankaran Sivathanu, VMware Mark Achtemichuk, VMware VSVC5603 #VSVC5603
  • 2. 2 Flash Storage  Flash is everywhere  Used extensively in smartphones, tablets, laptop computers, storage arrays, etc.  Adopting flash in enterprise servers? • Presents an economical alternative to having a storage array  How does VMware embrace flash technology in vSphere 5.5? • Native support for provisioning of flash resources • Flash caching support in ESXi storage stack • vSAN leverages flash storage for high performance  Focus: Application performance on vSphere 5.5 when leveraging flash
  • 3. 3 Agenda vSphere Flash Read Cache (vFRC) Virtual SAN (vSAN) Configurations and Troubleshooting
  • 5. 5 vFRC – Overview  vSphere 5.5 introduces vSphere Flash Infrastructure layer • Aggregates flash storage devices into a unified flash resource • Supports locally connected flash devices (PCIe, SAS/SATA drives, etc.,)  Flash resource can be used for read caching of VM I/Os • vSphere Flash Read Cache (vFRC)  Write Policy • Write-through : write I/Os are written to persistent storage and vFRC simultaneously • Large writes are filtered – avoids cache pollution with log/streaming data  Caches configured on per-VMDK basis • Can be custom configured based on workload
  • 6. 6 vFRC - Overview Flash Read Cache VMDKs VM Layer ESX Layer Storage Layer
  • 8. 8 Performance Tunables in vFRC  What workloads can benefit from vFRC ? • Read-dominated I/O pattern • High repeated access of data (E.g. 20% of working set accessed 80% of time) • Sufficient flash capacity to hold data that is accessed repeatedly  What impacts vFRC performance ? • Cache Size – should be big enough to hold active working set of workload • Cache Block Size – should match the dominant I/O size of workload • Flash Device Types – PCIe flash cards vs. SSD drives
  • 9. 9 a. Cache Size  Cache sizes are specified manually when enabling vFRC for a VMDK • Depends on working set of the application • Should be sized to hold active working set  Inadequate cache sizes lead to increased cache miss rate  Over-sized cache leads to wastage of flash resources and sub- optimal performance during vMotion • By default cache is migrated during vMotion • Over-sized cache increases vMotion time  How to determine the right working set size? • vscsiStats workload tracing  Cache size can be modified at run-time if necessary
  • 10. 10 b. Cache Block Size  Basic unit of cache fill and cache eviction operation  Affects effective utilization of cache capacity  Bigger cache blocks lead to internal fragmentation, but consumes less memory  Smaller cache blocks consumes more memory (upto 2% memory space overhead)  Default cache block size is 8KB 0 0.5 1 1.5 2 2.5 4 8 16 32 64 128 256 512 MemoryConsumed/Cachesize(%) Cache Block Size (KB) Memory Overhead wrt. vFRC block size 0 300 600 900 1200 1500 4KB 8 KB 16KB 32KB 64KB 128KB 256KB 512KB Baseline (no vFRC) LatencyinMicroseconds Cache Block Size Performance Impact of Cache Block Size
  • 11. 11 b. Cache Block Size  Larger Cache Block Size (Example: 512KB cache block size for workload I/O Size of 8KB) – Internal Fragmentation vFRC Cache Blocks Valid Cached Data
  • 12. 12 c. Flash Device Type 30k – 40k Random Read IOPS 200 – 270 MB/s Read Bandwidth Read Latency – 75 microseconds Write Latency – 90 microseconds Upto 750k Random Read IOPS Upto 3 GB/s Read Bandwidth Read Latency – 75 microseconds Write Latency – 15 microseconds High Performance Low Cost VENDOR SPECIFICATIONS
  • 14. 14 vFRC Performance – Applications  What workloads can benefit from vFRC? • Read-dominated I/O pattern • High repeated access of data (E.g. 20% of working set accessed 80% of time) • Sufficient flash capacity to hold data that is accessed repeatedly  Applications Considered: • Data Warehousing (Swingbench DSS) • Database Transactions (DVDstore) • Real-World Enterprise Server Workloads (Publicly available I/O Traces)
  • 15. 15 1. Data Warehousing Application  Decision Support System [TPC-H]  Benchmark : Swingbench 2.4 using ‘Sales History’ Schema on Oracle 11g R2 database SWINGBENCH DSS BENCHMARK ON RHEL 6.4 VM QUERIES >> << RESULTS ORACLE 11G R2 ON WINDOWS 2008 SERVER VM vFRC EMC VNX 5700 1TB LUN, RAID5 OVER 5 FC 15k RPM HDDs
  • 16. 16 1. Data Warehousing Application  Workload: Read dominated, High re-access rate  vFRC Configuration: 8GB Cache Size and 8KB Cache block size 0 2000 4000 6000 8000 10000 12000 SRMC SCMC PSCR SMA PPSC TSQ SQC #oftransacons Transac on Type Transac on Count Baseline VFRC
  • 17. 17 1. Data Warehousing Application  Workload: Read dominated, High re-access rate  vFRC Configuration: 8GB Cache Size and 8KB Cache block size  Up to 84% improvement in average throughput  Up to 2X reduction in latency 61.7 112.9 0 20 40 60 80 100 120 Baseline VFRC TPM Transac ons Per Minute 20.389 10.859 0 5 10 15 20 25 Baseline VFRC ResponseTIme(s) Average Response Time
  • 18. 18 2. Database Transaction Application  Benchmark : DVDStore • Simulates online e-commerce site operations • Database : MS SQL Server 2008 • Database Size : 15 GB  Workload Characteristics • 60% reads • Mostly random I/Os • Predominant I/O size : 8KB  VM Configuration • 8 vCPUs, 8GB Memory • 25GB Database disk, 10GB Log disk  Storage Array • VNX 5700, 1TB LUN – RAID5 over 5 FC 15k RPM disk drives
  • 19. 19 2. Database Transaction Application 8802 8937 12319 0 2000 4000 6000 8000 10000 12000 14000 Baseline vFRC - 10GB vFRC - 15GB Orders Per Minute Up to 39% improvement in application throughput
  • 20. 20 3. Enterprise Server I/O Traces 1.23 0.321 0 0.2 0.4 0.6 0.8 1 1.2 1.4 Baseline vFRC AverageLatency(ms)  a. Hardware Monitoring Server Workload • Trace from servers that logs data from multiple hardware monitoring programs across a datacenter • Collected at Microsoft Research, Cambridge* • Trace replayed using IOAnalyzer  95% reads  vFRC size – 4GB  vFRC block size – 4KB  vFRC hit percentage – 85% * Dushyanth Narayanan, Austin Donnelly, and Antony Rowstron. 2008. Write off-loading: Practical power management for enterprise storage. Trans. Storage 4, 3, Article 10 (November 2008).
  • 21. 21 3. Enterprise Server I/O Traces  b. Proxy Server Workload • Trace from a web proxy server • Collected at Microsoft Research, Cambridge* • Trace replayed using IOAnalyzer  67% reads  vFRC Size : 16GB  vFRC block Size : 4KB  vFRC hit percentage : 83% * Dushyanth Narayanan, Austin Donnelly, and Antony Rowstron. 2008. Write off-loading: Practical power management for enterprise storage. Trans. Storage 4, 3, Article 10 (November 2008). 1.357 0.612 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 Baseline vFRC AverageLatency(ms)
  • 22. 22 vMotion Performance with vFRC  vFRC is fully supported by vMotion and other vSphere features  vMotion behavior of vFRC-enabled VM • VM caches are migrated by default • Option to drop cache during vMotion  Migrating cache preserves application performance gains • Consumes more network bandwidth • Increased vMotion time  Dropping cache during vMotion leads to temporary dip in application performance gains • No extra overhead in vMotion • Re-warms up the cache at destination
  • 24. 24 vFRC Configuration Guidelines  Cache size may be configured based on working set of workload • Start with about 20% of VMDK size, and monitor vFRC stats to re-configure it  Cache block size must match dominant I/O size of workload • Workload I/O size is not equal to VM I/O Size!  vFRC performs better with PCIe flash devices  Decide on cache migration behavior during vMotion based on: • Criticality of application performance • Time taken for vMotion • Network bandwidth availability
  • 25. 25 Making sense of vscsiStats and vFRC Stats  vscsiStats can be used to know more about the workload • IO Length Histogram • Read Write Ratio • I/O trace to compute working set size  vFRC stats provide information about cache effectiveness • numBlocks – total number of cache blocks for a VMDK • numBlocksCurrentlyCached – number of cache blocks that actually contains data • Evict:avgNumBlocksPerOp – average number of evictions • avgCacheLatency – average device latency of flash resource • maxCacheLatency – maximum device latency of flash resource • cacheHitPercentage – percentage of read cache hits
  • 26. 26 vFRC Sizing Decisions based on vFRC Stats  Using vFRC stats to make sizing decisions • numBlocksCurrentlyCached < numBlocks : cache size may be reduced • numBlocksCurrentlyCached = numBlocks and Evict:avgNumBlocksPerOp is high : cache size may be inadequate • maxCacheLatency is very high : may be because of spike in device latency, which may mean device has worn out • cacheHitPercentage is high and Evict:avgNumBlocksPerOp is low : means cache is correctly configured  For more detailed information, please refer to our performance whitepaper http://www.vmware.com/files/pdf/techpaper/vfrc-perf- vsphere55.pdf
  • 27. 27 Agenda vSphere Flash Read Cache (vFRC) Virtual SAN (vSAN) Configurations and Troubleshooting
  • 28. 28 Virtual SAN - Architecture  Each ESXi host contributes: • Flash storage to absorb IOPS • Hard disk drives to provide capacity  Virtual SAN aggregates these resources from multiple servers in a vSphere cluster • Provides a global datastore for VMs in the cluster  HA/DRS ensures that the VM restarts on a host crash  Virtual SAN objects can be split into multiple components for performance and data protection • Governed by storage policies ESX VSAN cluster ESX ESX VM virtual disk VSAN object replica-1 replica-2 Witness
  • 29. 29 Experiment Setup  Hardware • 16 core 2.9 GHz Dell R720 machines • 2 x Intel PCIe R910 SSD – 200GB (1 PCIe slot) • 12 x 10K RPM Seagate SAS disks • 10G vSAN dedicated network, 1G for VM network  VSAN Configuration • 2 x Disk groups per machine, 6 x Disks per disk group • hostFailuresToTolerate 1, stripeWidth 1  Workload Characteristics • ViewPlanner 3.0 Standard benchmark with 2 sec think-time (heavy user) • ViewPlanner Group A : CPU intensive & Group B: I/O intensive apps. • 1900 x 1200 resolution, PCoIP • Windows 7 desktop’s and Winxp Clients. • VDI workload is known to be CPU intensive but sensitive to I/O latency.
  • 30. 30 Virtual SAN Delivers IOPS Required by VDI • Virtual SAN can meet the IOPS required by VDI workload
  • 31. 31 Virtual SAN Scale.. 275 460 667 0 100 200 300 400 500 600 700 800 3 node 5 node 7 node NumberofHeavyVDIUsers Virtual SAN scale VSAN Linear (VSAN)
  • 32. 32 Group A Score Comparison 0 0.2 0.4 0.6 0.8 1 1.2 Avg Application Latency Group A VSAN SAN • Impact to Group A application latencies is marginal • Virtual SAN uses very few cycles of Host CPU.
  • 33. 33 Group B Score Comparison 0 1 2 3 4 5 6 7 Avg Application Latency Group B VSAN All-Flash-SAN • Group B application latencies are close to All-Flash-SAN • Virtual SAN can meet the IOPS required by VDI workload
  • 34. 34 VSAN VDI Consolidation compared to physical SAN array  VSAN performs better than a typical mid-range FC storage array • vSAN benefits from local flash storage that provide high performance  Impact of VSAN CPU consumption on application performance is low  Physical SAN array is not required to run VDI workload 0 100 200 300 400 500 600 700 800 3 5 7 NumberofVMs Number of Nodes (Servers) VDI Consolida on Ra o Mid-range FC Array vSAN All-Flash FC Array
  • 35. 35 Agenda vSphere Flash Read Cache (vFRC) Virtual SAN (vSAN) Configurations and Troubleshooting
  • 36. 36 Define the Performance Issue  Understand Application Function & Architecture • At a minimum know what your application does and what it’s dependent on  Select Application KPIs • Application performance must be measured using an application counter (tps, response time, etc.) and not virtual resource consumption  Define Success Criteria • With your app owner, define at what level the application KPI’s must be to consider it performant  Comparisons must be Apples-to-Apples • Any changes to infrastructure (physical or virtual) create comparison challenges  Now the Gap is Identified, Begin Troubleshooting • With an understanding of the requirements and current deficiency, you can now begin to investigate and/or tune
  • 37. 37 Disk I/O Latencies Application Guest OS ESX Storage Stack VMM Driver KAVG DAVG GAVG QAVG * KAVG = GAVG – DAVG Fabric vSCSI HBA Time spent in ESX storage stack is minimal, for all practical purposes KAVG ~= QAVG In a well configured system QAVG should be zero Array SP
  • 38. 38 Key Indicators – Investigative Thresholds  Kernel Latency Average (KAVG) • This counter tracks the latencies of IO passing thru the Kernel • Investigation Threshold: 1ms  Device Latency Average (DAVG) • This is the latency seen at the device driver level. It includes the roundtrip time between the HBA and the storage • Investigation Threshold: 15-20ms, lower is better, some spikes okay  Aborts (ABRT/s) • The number of commands aborted per second • Investigation Threshold: 1
  • 39. 39 Disk I/O Queues GQLEN – Guest Queue AQLEN – Adapter Queue WQLEN – World Queue DQLEN – Device / LUN Queue SQLEN – Array SP Queue DQLEN WQLEN SQLEN GQLEN DQLEN can change dynamically when SIOC is enabled Reported in esxtop AQLEN Application Guest OS ESX Storage Stack VMM Driver Fabric vSCSI HBA Array SP
  • 40. 40 Performance Technical Resources Performance Technical Papers • http://www.vmware.com/files/pdf/techpaper/vfrc-perf-vsphere55.pdf • http://www.vmware.com/resources/techresources/cat/91,96 Performance Best Practices • http://www.vmware.com/pdf/Perf_Best_Practices_vSphere4.0.pdf • http://www.vmware.com/pdf/Perf_Best_Practices_vSphere4.1.pdf • http://www.vmware.com/pdf/Perf_Best_Practices_vSphere5.0.pdf • http://www.vmware.com/pdf/Perf_Best_Practices_vSphere5.1.pdf  Troubleshooting Performance Related Problems in vSphere Environments • http://communities.vmware.com/docs/DOC-14905 (vSphere 4.1) • http://communities.vmware.com/docs/DOC-19166 (vSphere 5) • http://communities.vmware.com/docs/DOC-23094 (vSphere 5.x with vCOps)
  • 41. 41 Performance Community Resources Performance Technology Pages • http://www.vmware.com/technical-resources/performance/resources.html Technical Marketing Blog • http://blogs.vmware.com/vsphere/performance/ Performance Engineering Blog VROOM! • http://blogs.vmware.com/performance Performance Community Forum • http://communities.vmware.com/community/vmtn/general/performance Virtualizing Business Critical Applications • http://www.vmware.com/solutions/business-critical-apps/
  • 42. 42 Extreme Performance Series Sessions Extreme Performance Series: vCenter of the Universe – Session #VSVC5234 Monster Virtual Machines – Session # VSVC4811 Network Speed Ahead – Session # VSVC5596 Storage in a Flash – Session # VSVC5603 Big Data: Virtualized SAP HANA Performance, Scalability and Practices – Session # VAPP5591 Hands on Labs: HOL-SDC-1304 – Optimize vSphere Performance  includes vFRC
  • 43. 43 Other VMware Activities Related to This Session  HOL: HOL-SDC-1308 Virtual Storage Solutions  Group Discussions: VSVC1001-GD Performance with Mark Achtemichuk
  • 45.
  • 46. Extreme Performance Series: Storage in a Flash Sankaran Sivathanu, VMware Mark Achtemichuk, VMware VSVC5603 #VSVC5603