16. SSD to the rescue?
SSD DDR-RAM
15
SSD PCI flash
25
SSD SATA Flash
80
Magnetic Disk
4,000
0 500 1,000 1,500 2,000 2,500 3,000 3,500 4,000 4,500
Seek time (us)
17. Economics of SSD
$/GB
0.00 10.00 20.00 30.00 40.00 50.00 60.00
0.06
FusionIO PCI SLC SSD 53.44
0.06
FusionIO PCI MLC Duo SSD 24.92
0.05
Intel SLC SATA SSD 21.88
$/IOP
0.05 $/GB
Intel MLC SATA SSD 6.88
Seagate SAS HDD 1.00 1.53
Seagate SATA HDD 0.09 2.38
0.00 0.50 1.00 1.50 2.00 2.50
$/IOP
18. Tiered storage management
Main Memory
DDR SSD
Flash SSD
$/IOP
$/GB
Fast Disk (SAS, RAID 0+1)
Slow Disk (SATA, RAID 5)
Tape, Flat Files, Hadoop
19. 12c Automatic Data Placement
• Segment on SSD tablespace ALTER TABLE …. ADD ILM
Active POLICY
• Segment on SAS tablespace TIER TO …. Tablespace AFTER 6
Frequent
Access
months of no access
• OLTP compression on SATA
Occassional
Access
tablespace COMPRESS FOR QUERY LOW
AFTER 12 months of no access
• Archive Compressed on RAID5
Dormant SATA
21. Flavours of Flash SSD
DDR RAM Drive
SATA flash drive
PCI flash drive
SSD storage Server
22. PCI SSD vs SATA SSD
PCI vs SATA
− SATA was designed for traditional disk drives with high latencies
− PCI is designed for high speed devices
− PCI SSD has latency ~ 1/3rd of SATA
24. Dell Express Flash
Higher
performance, durabili
ty, flexibility
Up to 1000x more IOPs than traditional HDD
Front loading, hot swappable
Maximum read and write lifespan
Improves workload processing
Enhances virtual environments
Express Flash PCIe-SSD Drives
The power to do more
25. PCIe SSD Solution Framework
PCIe SSS 0
PCIe SSS 1
PCIe SSS 2
PCIe SSS 3
SAS 0
SAS 1
SAS 2
SAS 3
SAS 4
SAS 5
SAS 6
SAS 7
Media Bay & Control
Panel
Extender Card (Adapter FF)
− Frees Up Valuable PCIe Slot
Real Estate
− 1X16 Slot Supports 4X4 PCIe
SSD Backplane
− Modular, scalable
4x Drives w/ BP
− Storage Device
− HDD form factor – fits in 2.5” carrier
− SLC: 175G & 350G
28. Flash Disk write degradation
Empty
Partially Full
All Blocks empty:
Write time=250 us
25% part full:
− Write time= ( ¾ * 250 us + 1/4 * 2000 us) = 687 us
75% part full
− Write time = ( ¼ * 250 us + ¾ * 2000 us ) = 1562 us
29. Data Insert Free Block Pool
Insert
SSD Controller
Used Block Pool
Empty Data Page
Valid Data Page
InValid Data Page
30. Free Block Pool
Data Update
Update
SSD Controller
Used Block Pool
Empty Data Page
Valid Data Page
Invalid Data Page
31. Free Block Pool
Garbage Collection
SSD Controller
Used Block Pool
Empty Data Page
Valid Data Page
Invalid Data Page
35. Buffer cache and Free buffer waits
Buffer
Read from buffer cache Free cache
Oracle process Buffer
Write to buffer cache
Waits
Free buffer waits often occur when DBWR
reads are much faster than writes....
Read from disk
Database
files Write dirty blocks to disk
36. Flash Cache
Buffer
Read from buffer cache cache
Oracle process
Write to buffer cache
Read from flash
cache Flash Cache
DBWR
Write clean
blocks (time
permitting)
DB Flash cache architecture is designed to accelerate
buffered reads
Read from disk Write dirty blocks to disk
Database
files
50. Sorting – what we expect
Multi-pass
Disk Sort
Time
Single Pass
Disk Sort
Memory Sort
PGA Memory available (MB)
Table/Index IO CPU Time Temp Segment IO 50 50
51. Disk Sorts – temporary tablespace
4000
3500
3000
Multi-pass
Disk Sort 2500
Elapsed time (s)
2000
1500
Single Pass 1000
Disk Sort
500
0
300 250 200 150 100 50 0
Sort Area Size
SAS based TTS SSD based TTS
51
52. Redo performance – Fusion IO
Flash based redo log 291.93
CPU
Log IO
SAS based redo log 292.39
0 50 100 150 200 250 300 350
Elapsed time (s)
53. Concurrent redo workload (x10)
Flash based redo log 1,637 331 1,681
CPU
Other
Log File IO
SAS based redo log 1,605 397 1,944
0 500 1,000 1,500 2,000 2,500 3,000 3,500 4,000 4,500
Elapsed time (s) 53 53
57. OS level direct cache
File System/ Raw File System/ Raw
Devices/ ASM Devices/ ASM
•Temp
Tablespace Caching Block Device
• Hot Segments Regular Block Device Read-
• Hot Partitions directCache intensive, pot
• DB Flash entially
Cache ioMemory VSL ioMemory VSL massive
tablespaces
(limited to the
size of the SSD)
LUN