1. The Outlook isCloudyBenefits, pitfalls and issues in building large-scale clouds Terry Harmer 12 May 2011 EduServ 2011 Symposium 1 http://www.besc.ac.uk
2. What do I do? Technical Directorof Belfast e-Science Develop project ideas for digital economy applications Form consortia to bid for funding usually write the project funding proposals funding from EPSRC, TSB, LDA and private companies Lead Technical architect for projects Project Manager … also do software development I propose, design, manage and (help) build large-scale service centric applications. These projects are (and are increasingly) cloud based using utility infrastructure consisting of owned and multiple utility vendors. 12 May 2011 EduServ 2011 Symposium 2
7. Issues, advantages, problems, pitfalls…Organisation BeSC? How we started with clouds Project Overviews Financial services Digital media Issues 12 May 2011 EduServ 2011 Symposium 3 Cloud – Hype or Reality? For us this has been part of how we been doing things for a number of years.
8. Belfast e-Science Centre? Belfast e-Science was established in 2002 with funding from EPSRC and the DTI under the UK e-Science programme. EPSRC Platform Award funded and by TSB, EPSRC, INI, MoD, QinetiQ BeSC is entirely self funding(and has been since 2002) Don’t really use shared resources within a University infrastructure Have close connections with companies, commercial users and organisations We have the attitude and tend to operate like a small R&D company Have a tight budget and (perhaps too) big ambitions. The accidental decision to focus on commercial / industrial applications was driven by the challenges they offered. The challenges made us unique We met a bunch of people who got the idea! But they wanted to do large-scale and practical examples. 12 May 2011 EduServ 2011 Symposium 4
9. BeSC Infrastructure – Shared Services Email has been hosted by a provider (Jan 2007) Project calendars, email lists and chat rooms are all hosted by utility providers (besc.ac.uk) All of our project shared services have migrated to utility resources (Autumn 2007 onward). …projects have progressively moved to utility cloud services Use a range of commercial providers Turning off internal kit that we own and shared with our commercial partners
14. Digital media Working in the evolving on-demand media environment Started pre- iPlayer and YouTube! Concern early was on better resource utilisation in and expensive and highly dynamic environment. Early model of pooled resources Most recently in on-demand media infrastructures Project PRISM with BBC/QinetiQ/BT (completed 2009) Supported game console to Phone to set-top box access. Currently supporting film and on-demand services. Stats BBC archive ~52PB iPlayer 7PB of data transfer per month 12 May 2011 EduServ 2011 Symposium 10
20. A Dynamic Utility Resource Cloud (2007) 12 May 2011 EduServ 2011 Symposium 16
21. Infrastructure Summary Dynamic collections of services Managing real user groups Services scale and contract to established SLAs We attempt to keep our deployed infrastructure low Cheaper … and… a small attack surface Our infrastructure is a mix of owned and utility infrastructure increasingly the utility part is the majority Buying capacity and storage on demand is our norm. Play the provider market… 12 May 2011 EduServ 2011 Symposium 17
22. Issues - General Utility resource market is immature The offerings can be difficult to compare no standard unit of compute/storage simply specifying 1 GiBis pretty meaningless, need to talk about uptime, chance of data loss, etc Prices will be dependant on the user usage pattern What you get and what you can buy varies widely Some attempts at customer lock-in to providers Multi-provider clouds can be (relatively) expensive Not being multi-provider can be expensive in reliability Provider APIs and features constantly changing. No standard API New services and providers appearing. APIs not very well documented Weak SLAs from providers best SLAs guarantee 100% uptime and refund when there's downtime. There's always downtime. The cost of your downtime will be much higher than what you are paying for the resources. 12 May 2011 EduServ 2011 Symposium 18
23. Issues - Technical Machine performance unpredictable. CPU features especially unpredictable and can make a big difference to compute-heavy tasks e.g. we are heavy video transcoding users. individual instances can be unreliable (hosts DO crash) Bandwidth unpredictable and can be costly relative to JANET…more reasonable relative to real commercial rates Required to manage OS images proliferation of images; using anything but vendor images requires trust in creator. nobody has a trust framework-have to trust that user creating own images (or using other peoples) means more machines to keep up to date! 12 May 2011 EduServ 2011 Symposium 19
24. Issues - Technical/development You need significant metadata on your applications OS versions, Software stack, security configuration, operational behaviour, versions and compatibility What you are prepared to pay and how to model We develop this metadata as part of our software development process As software is developed and integrated Develop, manage, test costing models and scenarios for deployment Your applications need to be developed to cope with and recover from failure MTTR rather than MTTF
25.
26. Issues - Staff Utility cloud approaches require Staff with more rounded understanding of end-to-end deployment Understand software development Understand and can model operational behaviour Understand security Deployment In service management Fewer staff but ones that are better qualified …and harder to find…and harder to keep…and more expensive 12 May 2011 EduServ 2011 Symposium 22
27. Don’t go cloud Half-heartedly Picking up your server room and placing it in a provider Here is my cloud kit…I have a cloud without knowing why you are doing it and what you want to achieve without assessing the process and organisational changes you will need to make it work without knowing your applications and their behaviour because you think you will save money quickly You probably won’t initially… and think in terms of resources Clouds of services provide a better model of use Integrate owned and commercial services to meet user needs and adopt a single vendor or provider approach Single point of failure your technical roadmap will be controlled by someone else Academic applications have different characteristics to commercial applications 12 May 2011 EduServ 2011 Symposium 23
28. Headline Advantages You own what you need to own for as long as you need to own it and it can be configured for your needs. Focus on application rather on the supporting resources Requires applications to address resilience and surviving failure Use and be part of a marketplace of application services Can enable the application to remain immune from underlying technology changes Develop an infrastructure that suits the application we are deploying. The cost of ownership is pretty low. R&D organisation we can punch above our small size and relatively small budget. Reach out to real user groups Experiment with great flexibility ….. Unconstrained by (often entirely justified) corporate/academic infrastructure procedures. Security and procedure appropriate for the application 12 May 2011 EduServ 2011 Symposium 24