11. The Ruger Fault Equivalency: time = money; fault tolerance = time² - risk tolerance. Also known as: 'Fast, good and cheap: pick two'. 5/27/2011
12. System design philosophy: leverage proven, open-source tech in the cloud to build a scalable, reliable, secure operational foundation quickly
13. So how do you achieve the right level of fault tolerance in the cloud? 3 tenets
16. Tenet #1: prepare a fault-tolerant foundation with scripted repeatability, aka automation
17. From the start: script the non-interactive install of your tools and OS; custom AMI; Debian: great package management; based on Eric Hammond's work, http://alestic.com/
18. which will allow you to script the setup/tear-down of your stack
19. which will allow you to script system tests: integrity (3-4K tests); performance (30-40K tests); load, capacity (2-4M requests)
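The chain in slides 17-19 (scripted install, then scripted setup/tear-down, then scripted tests) can be sketched roughly as follows. This is a minimal illustration, not the speaker's tooling: `scripted_stack`, the step names, and the no-op setup/teardown callables are all hypothetical stand-ins for real provisioning scripts.

```python
from contextlib import ExitStack, contextmanager


@contextmanager
def scripted_stack(steps, log):
    """Bring a stack up step by step and always tear it down in reverse.

    steps: list of (name, setup_fn, teardown_fn) tuples. The callables are
    hypothetical placeholders for real install/provisioning scripts.
    log: list that records what happened, in order.
    """
    with ExitStack() as stack:
        for name, setup, teardown in steps:
            setup()
            log.append(f"up:{name}")
            # Callbacks run last-in, first-out when the stack exits, so the
            # teardown runs first and the log entry is written right after it.
            stack.callback(log.append, f"down:{name}")
            stack.callback(teardown)
        yield  # system tests run here, against the fully built stack


if __name__ == "__main__":
    def noop():
        pass

    events = []
    with scripted_stack([("db", noop, noop), ("web", noop, noop)], events):
        events.append("tests")  # placeholder for integrity/performance/load runs
    print(events)
```

If a setup step fails partway through, the steps that did come up are still torn down, which is what makes repeated test runs cheap.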
21. That's how 1 person set up and managed a network of roughly 90 server instances for 1.5 years, while serving various other roles, without having to leave their chair. Try that with real hardware.
27. Zone fail-over best practices: are you using auto-scaling? No: distribute server instances evenly between 2 or more zones. Yes: trigger scaling on network I/O or custom metrics.
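For the "no auto-scaling" case, spreading instances evenly can be as simple as round-robin assignment. The sketch below is an illustration under assumptions: `assign_zones` and `survivors` are hypothetical helpers, and the instance and zone names are examples only; no cloud API is called.

```python
from collections import Counter
from itertools import cycle


def assign_zones(instance_names, zones):
    """Round-robin instances across zones so zone counts differ by at most one."""
    if len(zones) < 2:
        raise ValueError("use at least 2 zones for fail-over")
    return dict(zip(instance_names, cycle(zones)))


def survivors(placement, failed_zone):
    """Instances still running if one whole zone goes down."""
    return [name for name, zone in placement.items() if zone != failed_zone]


if __name__ == "__main__":
    names = [f"web-{i}" for i in range(5)]           # illustrative names
    placement = assign_zones(names, ["zone-a", "zone-b"])
    print(Counter(placement.values()))               # instances per zone
    print(survivors(placement, "zone-a"))            # what is left after a zone failure
```

With an even split across two zones, losing one zone leaves roughly half the capacity, so the remaining half has to be sized to carry the load until replacements start.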
29. So it's actually all about reduction of the right SPOFs for your business context. Just adding the ability to fail-over and have backups within a region is huge! Probably enough for most. What about Fred?