The Death of Disk
Panel Session
Erik Riedel, EMC
HEC FSIO Workshop
August 2011
top picture “floppy disks for breakfast” by Blude via flickr/cc
right picture by Austin Marshall via flickr/cc
  
Conclusion
•  About 80% of stored data will never be accessed again
•  About 80% of the rest will be accessed predictably
•  That leaves (maybe) 4% of stored data that potentially requires “quick” random access
•  => Buy as much flash as you can afford, use disks for the rest
  	
  
Most Data Is Idle
•  About 80% of stored data will never be accessed again
•  Disk drives have long been designed around this key fact of the digital world
•  Amortize a relatively small amount of expensive read/write electronics and fancy material science over a large and cheap magnetic media
David Anderson, James Dykes, Erik Riedel “SCSI vs. ATA - More than an interface” 2nd Conference on File and Storage Technology (FAST). San Francisco, CA. April 2003. www.cs.cmu.edu/~riedel
  
Consumer Example (At My House)
Dinosaur Train
Sid The Science Kid
Super Why!
Steelers Games
Meet the Press
Nova
Baby Einstein
  
Most Data Access Is Predictable
•  Caching
•  Prefetching
•  Tiering
•  Staging
•  Hierarchical Storage Mgmt
•  all these tools have been known for years
•  just need to open our toolbox, sharpen some of them to apply to today’s infrastructure/apps
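
As a rough illustration of the first tool on the list, here is a minimal sketch of a read cache sitting in front of a slow disk. The LRU eviction policy, the `LRUCache` class, and the `fetch_from_disk` callback are assumptions made for this example; the slides do not specify any particular policy.

```python
from collections import OrderedDict


class LRUCache:
    """Tiny read cache: keeps the most recently used blocks in fast storage."""

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self._items = OrderedDict()  # oldest entry first, newest last
        self.hits = 0
        self.misses = 0

    def read(self, key, fetch_from_disk):
        """Return the block for key, calling fetch_from_disk(key) on a miss."""
        if key in self._items:
            self._items.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self._items[key]
        self.misses += 1
        value = fetch_from_disk(key)  # hypothetical slow path
        self._items[key] = value
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)  # evict the least recently used
        return value


def slow_disk_read(block_id):
    # Stand-in for a real disk read; returns fake block contents.
    return f"data-{block_id}"


cache = LRUCache(capacity=2)
for block in ["a", "b", "a", "c", "b"]:
    cache.read(block, slow_disk_read)
print(cache.hits, cache.misses)
```

The more predictable the access pattern, the higher the hit count for a given cache size, which is the point of the slide.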
  
New Tools In the “Cloud”
Marketing buzz – IaaS – Infrastructure as a Service

New Tools In the “Cloud” (2)
Marketing buzz – PaaS – Platform as a Service
  
New Tools in the “Cloud” (3)
•  Key takeaways
  – both IaaS and PaaS are “closed loop” infrastructures
  – apps cannot be deployed except at the “direction” of the system
  – logging and monitoring are constant
    •  need to get high utilization rates ($$)
    •  need to send out bills ($$)
    •  want high rates of “multi-tenancy” to be efficient ($$)
  – this leads to a significant level of “predictability”
  
Get Predictability Into Storage
•  Key challenge is how to translate what “the system” knows about apps and behaviors and “SLAs” into guidance for our system-level tools (caching, prefetching, tiering, etc.)
•  Secondary challenge is avoiding “surprises”
  – where performance or availability or durability don’t meet the SLAs (“quality of service”)
•  Good news is that the new infrastructures have some powerful new ways to help us
  
One Example New Tool – Stunning
•  “The amount of time the virtual machine is stunned is dependent on the amount of memory to be written to disk for such an operation, and the speed and responsiveness of the datastore's backing storage.” – VMware KnowledgeBase
http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=1013163
picture by Yamashita Yohei via flickr/cc
  	
  
What About Tape?
pictures by Gill Wildman via flickr/cc
  
What About Tape?
•  Tapes are not a commodity technology
•  2011 total worldwide market for tape cartridges is about 8m units (just under $1b annual revenue)
•  Compare to the HDD business at 650m units in 2010 (close to $40b annual revenue)
•  80 disk drives are manufactured for each tape cartridge; robots are complicated
•  Fits particular application segments very well, but is not a general-purpose solution
http://www.storagenewsletter.com/news/tapes/sccg-ww-tape-market-lto-1q11
http://techreport.com/discussions.x/20890
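
The “80 disk drives for each tape cartridge” figure follows from the two unit counts on this slide. A quick check, using the rounded numbers as quoted (the variable names are illustrative only):

```python
# Rounded unit counts quoted on the slide.
tape_cartridges_2011 = 8_000_000   # about 8m tape cartridges
hdd_units_2010 = 650_000_000       # about 650m disk drives

drives_per_cartridge = hdd_units_2010 / tape_cartridges_2011
print(f"about {drives_per_cartridge:.0f} disk drives per tape cartridge")
```

The inputs are themselves approximations from two different years, so the ratio is only good to about one significant figure.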
  
Conclusion
•  About 80% of stored data will never be accessed again
•  About 80% of the rest will be accessed predictably
•  That leaves (maybe) 4% of stored data that potentially requires “quick” random access
  
Pragmatic Issues
•  Power
  – if the data is predictably idle, then don’t spin it
•  Wearout
  – look at the data occasionally (once/month, once/yr); such access for “scrubbing” is very predictable
•  Backup
  – backup 1) is not an “application” and 2) is predictable
•  Replication
  – estimates run to 75% of stored data is copies/replicas, only 25% unique bytes; replication is predictable
www.zdnet.com/blog/service-oriented/size-of-the-data-universe-12-zettabytes-and-growing-fast/4750
  
Summary – How Much Data
•  1.2 million PB estimated in 2010
•  25% unique => leaves 300,000 PB
•  80% idle => leaves 60,000 PB
•  80% predictable => leaves 12,000 PB
•  at $1/GB for flash, that requires $12b
•  is that affordable?
•  (remember the world bought ~$40b of HDD in 2010)
www.zdnet.com/blog/service-oriented/size-of-the-data-universe-12-zettabytes-and-growing-fast/4750
www.computerworld.com/s/article/9180943/NAND_flash_memory_pricing_to_plummet_to_1_per_GB
recently – “the price of flash has not been dropping as fast as the suppliers predicted”, August 2011
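
The arithmetic on this slide can be reproduced in a few lines, which makes it easy to plug in numbers for your own environment. This is only a sketch: the function name and parameters are made up for illustration, percentages are passed as integers to keep the math exact, and 1 PB is taken as 10^6 GB (decimal units).

```python
GB_PER_PB = 1_000_000  # assumption: decimal units, 1 PB = 10^6 GB


def flash_needed(total_pb, unique_pct, idle_pct, predictable_pct, usd_per_gb):
    """Return (unique PB, non-idle PB, unpredictable PB, flash cost in USD)."""
    unique_pb = total_pb * unique_pct // 100                   # drop copies/replicas
    active_pb = unique_pb * (100 - idle_pct) // 100            # drop idle data
    random_pb = active_pb * (100 - predictable_pct) // 100     # drop predictable access
    cost_usd = random_pb * GB_PER_PB * usd_per_gb
    return unique_pb, active_pb, random_pb, cost_usd


# Inputs taken from the slide: 1.2 million PB, 25% unique, 80% idle,
# 80% predictable, flash at $1/GB.
unique_pb, active_pb, random_pb, cost_usd = flash_needed(
    total_pb=1_200_000, unique_pct=25, idle_pct=80, predictable_pct=80, usd_per_gb=1
)
print(unique_pb, active_pb, random_pb)
print(f"${cost_usd / 1e9:.0f}b")
```

Integer division rounds down, which is harmless at this scale but worth keeping in mind for small inputs.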
  
Conclusion
•  About 80% of stored data will never be accessed again
•  About 80% of the rest will be accessed predictably
•  That leaves (maybe) 4% of stored data that potentially requires “quick” random access
•  => Buy as much flash as you can afford, use disks for the rest
n.b. 73.6% of all statistics are made up, do the calculations for your own environments, your mileage may vary
www.businessinsider.com/736-of-all-statistics-are-made-up-2010-2
•  In the battle of rust vs. silicon, both will survive
rust picture by Jos Faber via flickr/cc
silicon picture from “Chip bug vs chip bug” by Windell Oskay via flickr/cc
http://www.emc.com/storage/atmos/atmos.htm
a brief word from my sponsors
  

 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 

Death of Disk Panel Session - HEC-FSIO Workshop

  • 1. The Death of Disk
      Panel Session
      Erik Riedel, EMC
      HEC FSIO Workshop, August 2011
      top picture “floppy disks for breakfast” by Blude via flickr/cc
      right picture by Austin Marshall via flickr/cc
  • 2. Conclusion
      • About 80% of stored data will never be accessed again
      • About 80% of the rest will be accessed predictably
      • That leaves (maybe) 4% of stored data that potentially requires “quick” random access
      • => Buy as much flash as you can afford, use disks for the rest
  • 3. Most Data Is Idle
      • About 80% of stored data will never be accessed again
      • Disk drives have long been designed around this key fact of the digital world
      • Amortize a relatively small amount of expensive read/write electronics and fancy material science over a large and cheap magnetic media
  • 4. David Anderson, James Dykes, Erik Riedel, “SCSI vs. ATA - More than an interface”, 2nd Conference on File and Storage Technology (FAST). San Francisco, CA. April 2003. www.cs.cmu.edu/~riedel
  • 5. Consumer Example (At My House)
      Dinosaur Train, Sid The Science Kid, Super Why!, Steelers Games, Meet the Press, Nova, Baby Einstein
  • 6. Most Data Access Is Predictable
      • Caching
      • Prefetching
      • Tiering
      • Staging
      • Hierarchical Storage Mgmt
      • all these tools have been known for years
      • just need to open our toolbox, sharpen some of them to apply to today’s infrastructure/apps
  • 7. New Tools In the “Cloud”
      Marketing buzz – IaaS – Infrastructure as a Service
  • 8. New Tools In the “Cloud” (2)
      Marketing buzz – PaaS – Platform as a Service
  • 9. New Tools in the “Cloud” (3)
      • Key takeaways
          – both IaaS and PaaS are “closed loop” infrastructures
          – apps cannot be deployed except at the “direction” of the system
          – logging and monitoring are constant
              • need to get high utilization rates ($$)
              • need to send out bills ($$)
              • want high rates of “multi-tenancy” to be efficient ($$)
          – this leads to a significant level of “predictability”
  • 10. Get Predictability Into Storage
      • Key challenge is how to translate what “the system” knows about apps and behaviors and “SLAs” into guidance for our system-level tools (caching, prefetching, tiering, etc.)
      • Secondary challenge is avoiding “surprises”
          – where performance or availability or durability don’t meet the SLAs (“quality of service”)
      • Good news is that the new infrastructures have some powerful new ways to help us
  • 11. One Example New Tool – Stunning
      • “The amount of time the virtual machine is stunned is dependent on the amount of memory to be written to disk for such an operation, and the speed and responsiveness of the datastore's backing storage.” – VMware KnowledgeBase
      http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=1013163
      picture by Yamashita Yohei via flickr/cc
  • 12. What About Tape?
      pictures by Gill Wildman via flickr/cc
  • 13. What About Tape?
      • Tapes are not a commodity technology
      • 2011 total worldwide market for tape cartridges is about 8m units (just under $1b annual revenue)
      • Compare to the HDD business at 650m units in 2010 (close to $40b annual revenue)
      • 80 disk drives are manufactured for each tape cartridge; robots are complicated
      • Fits particular application segments very well, but is not a general-purpose solution
      http://www.storagenewsletter.com/news/tapes/sccg-ww-tape-market-lto-1q11
      http://techreport.com/discussions.x/20890
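The unit comparison on the tape slide can be checked with a few lines of Python. This is only a rough sketch: it rounds "just under $1b" and "close to $40b" to round figures, the function names are illustrative, and the tape numbers are for 2011 while the HDD numbers are for 2010.

```python
# Figures from the slide, rounded; names are illustrative, not from the talk.
TAPE_UNITS = 8_000_000            # tape cartridges, 2011
TAPE_REVENUE_USD = 1_000_000_000  # "just under $1b", rounded up
HDD_UNITS = 650_000_000           # disk drives, 2010
HDD_REVENUE_USD = 40_000_000_000  # "close to $40b", rounded up


def units_ratio(hdd_units, tape_units):
    """Disk drives shipped per tape cartridge shipped."""
    return hdd_units / tape_units


def revenue_per_unit(revenue_usd, units):
    """Average revenue per shipped unit, in dollars."""
    return revenue_usd / units


if __name__ == "__main__":
    print(units_ratio(HDD_UNITS, TAPE_UNITS))
    print(revenue_per_unit(TAPE_REVENUE_USD, TAPE_UNITS))
    print(revenue_per_unit(HDD_REVENUE_USD, HDD_UNITS))
```

With these rounded inputs the ratio comes out slightly above the slide's "80 disk drives per cartridge", which is consistent with the slide rounding down.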
  • 14. Conclusion
      • About 80% of stored data will never be accessed again
      • About 80% of the rest will be accessed predictably
      • That leaves (maybe) 4% of stored data that potentially requires “quick” random access
  • 15. Pragmatic Issues
      • Power
          – if the data is predictably idle, then don’t spin it
      • Wearout
          – look at the data occasionally (once/month, once/yr); such access for “scrubbing” is very predictable
      • Backup
          – backup 1) is not an “application” and 2) is predictable
      • Replication
          – estimates run to 75% of stored data is copies/replicas, only 25% unique bytes; replication is predictable
      www.zdnet.com/blog/service-oriented/size-of-the-data-universe-12-zettabytes-and-growing-fast/4750
  • 16. Summary – How Much Data
      • 1.2 million PB estimated in 2010
      • 25% unique => leaves 300,000 PB
      • 80% idle => leaves 60,000 PB
      • 80% predictable => leaves 12,000 PB
      • at $1/GB for flash, that requires $12b
      • is that affordable?
      • (remember the world bought ~$40b of HDD in 2010)
      www.zdnet.com/blog/service-oriented/size-of-the-data-universe-12-zettabytes-and-growing-fast/4750
      www.computerworld.com/s/article/9180943/NAND_flash_memory_pricing_to_plummet_to_1_per_GB
      recently – “the price of flash has not been dropping as fast as the suppliers predicted”, August 2011
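The chain of percentages on the "How Much Data" slide can be reproduced as a short calculation. The sketch below assumes decimal units (1 PB = 1,000,000 GB) and uses the slide's own inputs; the constant and function names are illustrative, and the percentages are the speaker's estimates, so substitute figures from your own environment.

```python
# Inputs from the slide; integer percentages keep the arithmetic exact.
TOTAL_PB = 1_200_000   # estimated data stored in 2010, in PB
UNIQUE_PCT = 25        # 75% of stored data is copies/replicas
NOT_IDLE_PCT = 20      # 80% of stored data is never accessed again
RANDOM_PCT = 20        # 80% of the rest is accessed predictably
FLASH_USD_PER_GB = 1   # assumed flash price
GB_PER_PB = 1_000_000  # decimal units (assumption)


def flash_estimate(total_pb=TOTAL_PB):
    """Return (unique PB, non-idle PB, random-access PB, flash cost in USD)."""
    unique_pb = total_pb * UNIQUE_PCT // 100
    not_idle_pb = unique_pb * NOT_IDLE_PCT // 100
    random_pb = not_idle_pb * RANDOM_PCT // 100
    cost_usd = random_pb * GB_PER_PB * FLASH_USD_PER_GB
    return unique_pb, not_idle_pb, random_pb, cost_usd


if __name__ == "__main__":
    unique_pb, not_idle_pb, random_pb, cost_usd = flash_estimate()
    print(f"unique: {unique_pb} PB, not idle: {not_idle_pb} PB, "
          f"random access: {random_pb} PB, flash cost: ${cost_usd}")
```

The result, 12,000 PB and $12b of flash, matches the slide and is roughly 30% of the ~$40b the slide cites for 2010 HDD purchases.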
  • 17. Conclusion
      • About 80% of stored data will never be accessed again
      • About 80% of the rest will be accessed predictably
      • That leaves (maybe) 4% of stored data that potentially requires “quick” random access
      • => Buy as much flash as you can afford, use disks for the rest
      n.b. 73.6% of all statistics are made up, do the calculations for your own environments, your mileage may vary
      www.businessinsider.com/736-of-all-statistics-are-made-up-2010-2
  • 18. • In the battle of rust vs. silicon, both will survive
      rust picture by Jos Faber via flickr/cc
      silicon picture from “Chip bug vs chip bug” by Windell Oskay via flickr/cc
  • 19. http://www.emc.com/storage/atmos/atmos.htm
      a brief word from my sponsors