SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Adventures in Research
Joel Merrick
BBC Research & Development
OpenNebula Conference 2013
Thursday, 26 September 13
About me
• From Manchester, UK
• Sysadmin by day, Project Lead for Internal Cloud by night
• Involved with Sahana Foundation in 2008, helping with
administration
• First production release running on KVM during 2010
Haiti Earthquake.
• It’s ready for prime-time
Thursday, 26 September 13
About BBC R&D
• Established in 1922, shortly after main organisation
• Initially 2 divisions, Research Department and Development
• Grew rapidly, moving homes several times
• Eventually settled at Kingswood Warren, Surrey
• Amalgamated to R&D in 1993
• Now 3 sites - Centre House, MediaCity UK, 1 Euston
Square
Thursday, 26 September 13
About BBC R&D
Kingswood Warren, Surrey
Centre
House,
London
MediaCity,
Manchester
1ES, London
Thursday, 26 September 13
Previous Technologies Developed
• Noise Cancelling Microphones
• Conversion from 405-line to 625 line
• Colour Television
• Transatlantic Cable & Satellite
• BBC Micro
• NICAM Stereo
• DAB Digital / DTV / Freeview
• YouView
Thursday, 26 September 13
Collaboration
SuperHiVision with NHK for London 2012 Olympic Games
http://www.bbc.co.uk/blogs/researchanddevelopment/2012/08/the-olympics-in-super-hi-visio.shtml
Thursday, 26 September 13
Areas of Research
Capture
This area covers learning how to recognise and isolate objects within audio and video files automatically, such
as individual sound sources or the motion of an actor or athlete, as well as how best to record and store
media so it is durable and compatible with other systems.
• Produce
Our research in this area helps keep costs down and make production more efficient by developing the kinds
of technology that might radically improve the way programmes are made in the future.
• Deliver
This research aims to develop new ways to distribute our programmes, while ensuring audiences receive them
in the best possible quality, wherever they are, whenever they want them and whatever device they are using.
• Discover
This area sees us experimenting with new types of programmes and, with the BBC about to open more than
70 years’ worth of archives, how audiences might find and interact with them.
• Experience
How our audiences experience BBC programmes is our focus here. In this area we anticipate their future
expectations and ensure new technology, however complex, is easy to use and accessible for everyone
Thursday, 26 September 13
Every Day is Different
• We don’t have one specific kind of
workload on the shared platform
• Make is as flexible as possible, but also keep
it performant
• Most users don’t really care about backend
technology, they just want a simple, yet
effective service.
Thursday, 26 September 13
Some Current Projects(not all, by any means!)
• IP Studio
• Object Based Audio
• Enhanced Subtitling
• World Service ArchiveVoice Analysis &
Scrubbing
Thursday, 26 September 13
Challenges
• Engineers left with flexibility to do their own thing
• Silos of knowledge, hinders cross-team interactions
• Time taken to provision
• Inconsistencies
• Harder to manage asset utilisation
• Demand for compute resources and storage will
only increase
Thursday, 26 September 13
Legacy
• Robust internal systems
• Virtualisation in use, but only really single nodes
and in ad hoc situations
• Each team had their favourite distribution
• Very little / no config management or deployment
tools in most project areas
Thursday, 26 September 13
A Different Approach
• Reduce the time drains
• Automate Everything (eventually!)
• Try to standardize where appropriate
• Take ownership of assets
• Make it easy to extend and reproduce the
platform
Thursday, 26 September 13
Early Stages
• Project been running for about 6 months
• Available to users for only 2 months
• 2 clusters currently online
• Project teams already committing to procurement
• Pan-BBC Interest
• Opportunity to develop best practice as well as better
interactions with other areas of the organisation
Thursday, 26 September 13
Current Uses
• Started hosting Internal Systems Infrastructure
• Build slaves
• Indexing (100GBVM!)
• General hosting
• Hacking on ideas!
Thursday, 26 September 13
Why Build a Cloud?
• We have ownership!
• We can be more confident in security policy
• Can be guaranteed of the execution venue, so legal
stipulations can be met
• Network access is much faster for users, latency
is a lot better
Thursday, 26 September 13
High Level ComponentView
• OpenNebula 4.2
• KVM
• Ceph (rbd forVMs) - Using snapshot layering driver
and custom libvirt
• Ubuntu 13.04 - may transition back to LTS
Thursday, 26 September 13
Hardware
Compute Nodes
• Dell R720 - 32x Xeon(R) CPU E5-2670 0 @ 2.60GHz /
128GB RAM
Storage Nodes
• Dell R320 - 8x Xeon(R) CPU E5-1410 0 @ 2.80GHz /
24GB RAM
• LSI SAS2308 HBA
• SuperMicro JBOD Chassis
Thursday, 26 September 13
Network
• All hosts have 10Gbit
interconnectivity
• Intel Corporation 82599EB 10-Gigabit
SFP+
• Copper TwinAx
• Cisco Nexus 5020 ‘brains’
• FEX 2232 (Fabric Extender) as ToR
switch
Thursday, 26 September 13
OpenNebula Setup
• Currently running 4.2
• Main user interaction is via Sunstone
• Users authenticate against LDAP
• Default view for users is ‘cloud’
• Ceph RBD asVM block storage
• CephFS as System Datastore
• OpenVSwitch
Thursday, 26 September 13
Storage Node / Ceph Setup
“Ceph is a distributed object store and file system designed to provide
excellent performance, reliability and scalability.”
• XFS based OSDs (not btrfs)
• 12TB per node initial, growth to 24/48TB per node
• Around 1/8th Petabyte currently
• No SSDs
• Journals on Disk
• Deployed using ceph-deploy (much better now)
• RBD writeback caching (also writethough available)
• OSDs on all, MON’s on a small subset, MDS on inverse.
Thursday, 26 September 13
Snapshot Layering
Thursday, 26 September 13
Ceph’s Future
• Can only get better!
• Better REST admin API’s
• 8x speed increase in CRC functions in testing
• OpenZFS to leverage journaling?
• Erasure encoding to reduce space requirements
• Mutli-site replication
• RBD client side SSD caching (specifically for OS deployment)
Thursday, 26 September 13
Deployment
• Generally Puppet Managed
• VM Image generated usingVeeWee
https://github.com/jedi4ever/veewee
"A great tool for creating and configuring lightweight,
reproducible, portable virtual machine environments
- often used with the addition of automation tools
such as Chef or Puppet."
Thursday, 26 September 13
Oversubscription
• Not allVMs have CPU intensive workloads
• Makes financial sense to over-commit resource
when applicable
• Shared resources have CPU over-commited by 4x
• Memory is not over-commited
• Project teams can manage their own level on
their own equipment
Thursday, 26 September 13
Future Work - OpenNebula
• Hypervisor-side SSD caching (B-cache, flash-cache,
enhanceio etc.)... possibly
• Better ceph integration (attach_disk etc)
• Multiple Ceph Pools for tiered storage
• SSD based local storage
• Leverage more of radosgw for S3 compliant storage
• IntegrateVM generator into Sunstone/ONE?
• Move to virtio-scsi
Thursday, 26 September 13
Future Work - Hardware Pools
• PCI Passthrough Pooling
• Mainly used for SR-IOV Network adapters
• Allow PCI capture devices to be bound to aVM
• Drive the SDI Matrix to attach a given soft-patch
• Other use cases?
Thursday, 26 September 13
Thanks!
Any Question?
Thursday, 26 September 13

Weitere ähnliche Inhalte

Andere mochten auch

Opening the Path to Technical Excellence
Opening the Path to Technical ExcellenceOpening the Path to Technical Excellence
Opening the Path to Technical Excellence
NETWAYS
 

Andere mochten auch (8)

Opening the Path to Technical Excellence
Opening the Path to Technical ExcellenceOpening the Path to Technical Excellence
Opening the Path to Technical Excellence
 
Monitoring of OpenNebula installations
Monitoring of OpenNebula installationsMonitoring of OpenNebula installations
Monitoring of OpenNebula installations
 
Enabling Scientific Workflows on FermiCloud using OpenNebula
Enabling Scientific Workflows on FermiCloud using OpenNebulaEnabling Scientific Workflows on FermiCloud using OpenNebula
Enabling Scientific Workflows on FermiCloud using OpenNebula
 
CentOS and OpenNebula, a Perfect Match
CentOS and OpenNebula, a Perfect MatchCentOS and OpenNebula, a Perfect Match
CentOS and OpenNebula, a Perfect Match
 
Welcome talk unleashing the future of open-source enterprise cloud computing
Welcome talk   unleashing the future of open-source enterprise cloud computingWelcome talk   unleashing the future of open-source enterprise cloud computing
Welcome talk unleashing the future of open-source enterprise cloud computing
 
Making Clouds: Turning OpenNebula into a Product
Making Clouds: Turning OpenNebula into a ProductMaking Clouds: Turning OpenNebula into a Product
Making Clouds: Turning OpenNebula into a Product
 
High Performance Computing Cloud at SURFsara: Experiences with OpenNebula 3.x
High Performance Computing Cloud at SURFsara: Experiences with OpenNebula 3.xHigh Performance Computing Cloud at SURFsara: Experiences with OpenNebula 3.x
High Performance Computing Cloud at SURFsara: Experiences with OpenNebula 3.x
 
Contrail Virtual Execution Platform
Contrail Virtual Execution PlatformContrail Virtual Execution Platform
Contrail Virtual Execution Platform
 

Ähnlich wie Adventures in Research

Cloud stack design camp on jun 15
Cloud stack design camp on jun 15Cloud stack design camp on jun 15
Cloud stack design camp on jun 15
Isaac Chiang
 
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ivan Zoratti
 

Ähnlich wie Adventures in Research (20)

Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Cloud stack design camp on jun 15
Cloud stack design camp on jun 15Cloud stack design camp on jun 15
Cloud stack design camp on jun 15
 
Introduction to Amazon Web Services (AWS)
Introduction to Amazon Web Services (AWS)Introduction to Amazon Web Services (AWS)
Introduction to Amazon Web Services (AWS)
 
Apereo OAE - Architectural overview
Apereo OAE - Architectural overviewApereo OAE - Architectural overview
Apereo OAE - Architectural overview
 
201311 - Middleware
201311 - Middleware201311 - Middleware
201311 - Middleware
 
Making Cloudy Peanut Butter Cups: Apache CloudStack + Riak CS
Making Cloudy Peanut Butter Cups: Apache CloudStack + Riak CSMaking Cloudy Peanut Butter Cups: Apache CloudStack + Riak CS
Making Cloudy Peanut Butter Cups: Apache CloudStack + Riak CS
 
GRP 19 - Nautilus, IceCube and LIGO
GRP 19 - Nautilus, IceCube and LIGOGRP 19 - Nautilus, IceCube and LIGO
GRP 19 - Nautilus, IceCube and LIGO
 
Sergey Dzyuban "To Build My Own Cloud with Blackjack…"
Sergey Dzyuban "To Build My Own Cloud with Blackjack…"Sergey Dzyuban "To Build My Own Cloud with Blackjack…"
Sergey Dzyuban "To Build My Own Cloud with Blackjack…"
 
What's New in Grizzly & Deploying OpenStack with Puppet
What's New in Grizzly & Deploying OpenStack with PuppetWhat's New in Grizzly & Deploying OpenStack with Puppet
What's New in Grizzly & Deploying OpenStack with Puppet
 
CloudLab Overview
CloudLab OverviewCloudLab Overview
CloudLab Overview
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
 
Flexible compute
Flexible computeFlexible compute
Flexible compute
 
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
Ora mysql bothGetting the best of both worlds with Oracle 11g and MySQL Enter...
 
Introducing Container Technology to TSUBAME3.0 Supercomputer
Introducing Container Technology to TSUBAME3.0 SupercomputerIntroducing Container Technology to TSUBAME3.0 Supercomputer
Introducing Container Technology to TSUBAME3.0 Supercomputer
 
OpenStack at Bloomberg
OpenStack at BloombergOpenStack at Bloomberg
OpenStack at Bloomberg
 
Yow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with NotesYow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with Notes
 
CloudStack Collaboration Conference Europe 2014 参加リポート
CloudStack Collaboration Conference Europe 2014 参加リポートCloudStack Collaboration Conference Europe 2014 参加リポート
CloudStack Collaboration Conference Europe 2014 参加リポート
 
Tech 2 Tech IPv6 presentation
Tech 2 Tech IPv6 presentationTech 2 Tech IPv6 presentation
Tech 2 Tech IPv6 presentation
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Adventures in Research

  • 1. Adventures in Research Joel Merrick BBC Research & Development OpenNebula Conference 2013 Thursday, 26 September 13
  • 2. About me • From Manchester, UK • Sysadmin by day, Project Lead for Internal Cloud by night • Involved with Sahana Foundation in 2008, helping with administration • First production release running on KVM during 2010 Haiti Earthquake. • It’s ready for prime-time Thursday, 26 September 13
  • 3. About BBC R&D • Established in 1922, shortly after main organisation • Initially 2 divisions, Research Department and Development • Grew rapidly, moving homes several times • Eventually settled at Kingswood Warren, Surrey • Amalgamated to R&D in 1993 • Now 3 sites - Centre House, MediaCity UK, 1 Euston Square Thursday, 26 September 13
  • 4. About BBC R&D Kingswood Warren, Surrey Centre House, London MediaCity, Manchester 1ES, London Thursday, 26 September 13
  • 5. Previous Technologies Developed • Noise Cancelling Microphones • Conversion from 405-line to 625 line • Colour Television • Transatlantic Cable & Satellite • BBC Micro • NICAM Stereo • DAB Digital / DTV / Freeview • YouView Thursday, 26 September 13
  • 6. Collaboration SuperHiVision with NHK for London 2012 Olympic Games http://www.bbc.co.uk/blogs/researchanddevelopment/2012/08/the-olympics-in-super-hi-visio.shtml Thursday, 26 September 13
  • 7. Areas of Research Capture This area covers learning how to recognise and isolate objects within audio and video files automatically, such as individual sound sources or the motion of an actor or athlete, as well as how best to record and store media so it is durable and compatible with other systems. • Produce Our research in this area helps keep costs down and make production more efficient by developing the kinds of technology that might radically improve the way programmes are made in the future. • Deliver This research aims to develop new ways to distribute our programmes, while ensuring audiences receive them in the best possible quality, wherever they are, whenever they want them and whatever device they are using. • Discover This area sees us experimenting with new types of programmes and, with the BBC about to open more than 70 years’ worth of archives, how audiences might find and interact with them. • Experience How our audiences experience BBC programmes is our focus here. In this area we anticipate their future expectations and ensure new technology, however complex, is easy to use and accessible for everyone Thursday, 26 September 13
  • 8. Every Day is Different • We don’t have one specific kind of workload on the shared platform • Make is as flexible as possible, but also keep it performant • Most users don’t really care about backend technology, they just want a simple, yet effective service. Thursday, 26 September 13
  • 9. Some Current Projects(not all, by any means!) • IP Studio • Object Based Audio • Enhanced Subtitling • World Service ArchiveVoice Analysis & Scrubbing Thursday, 26 September 13
  • 10. Challenges • Engineers left with flexibility to do their own thing • Silos of knowledge, hinders cross-team interactions • Time taken to provision • Inconsistencies • Harder to manage asset utilisation • Demand for compute resources and storage will only increase Thursday, 26 September 13
  • 11. Legacy • Robust internal systems • Virtualisation in use, but only really single nodes and in ad hoc situations • Each team had their favourite distribution • Very little / no config management or deployment tools in most project areas Thursday, 26 September 13
  • 12. A Different Approach • Reduce the time drains • Automate Everything (eventually!) • Try to standardize where appropriate • Take ownership of assets • Make it easy to extend and reproduce the platform Thursday, 26 September 13
  • 13. Early Stages • Project been running for about 6 months • Available to users for only 2 months • 2 clusters currently online • Project teams already committing to procurement • Pan-BBC Interest • Opportunity to develop best practice as well as better interactions with other areas of the organisation Thursday, 26 September 13
  • 14. Current Uses • Started hosting Internal Systems Infrastructure • Build slaves • Indexing (100GBVM!) • General hosting • Hacking on ideas! Thursday, 26 September 13
  • 15. Why Build a Cloud? • We have ownership! • We can be more confident in security policy • Can be guaranteed of the execution venue, so legal stipulations can be met • Network access is much faster for users, latency is a lot better Thursday, 26 September 13
  • 16. High Level ComponentView • OpenNebula 4.2 • KVM • Ceph (rbd forVMs) - Using snapshot layering driver and custom libvirt • Ubuntu 13.04 - may transition back to LTS Thursday, 26 September 13
  • 17. Hardware Compute Nodes • Dell R720 - 32x Xeon(R) CPU E5-2670 0 @ 2.60GHz / 128GB RAM Storage Nodes • Dell R320 - 8x Xeon(R) CPU E5-1410 0 @ 2.80GHz / 24GB RAM • LSI SAS2308 HBA • SuperMicro JBOD Chassis Thursday, 26 September 13
  • 18. Network • All hosts have 10Gbit interconnectivity • Intel Corporation 82599EB 10-Gigabit SFP+ • Copper TwinAx • Cisco Nexus 5020 ‘brains’ • FEX 2232 (Fabric Extender) as ToR switch Thursday, 26 September 13
  • 19. OpenNebula Setup • Currently running 4.2 • Main user interaction is via Sunstone • Users authenticate against LDAP • Default view for users is ‘cloud’ • Ceph RBD asVM block storage • CephFS as System Datastore • OpenVSwitch Thursday, 26 September 13
  • 20. Storage Node / Ceph Setup “Ceph is a distributed object store and file system designed to provide excellent performance, reliability and scalability.” • XFS based OSDs (not btrfs) • 12TB per node initial, growth to 24/48TB per node • Around 1/8th Petabyte currently • No SSDs • Journals on Disk • Deployed using ceph-deploy (much better now) • RBD writeback caching (also writethough available) • OSDs on all, MON’s on a small subset, MDS on inverse. Thursday, 26 September 13
  • 22. Ceph’s Future • Can only get better! • Better REST admin API’s • 8x speed increase in CRC functions in testing • OpenZFS to leverage journaling? • Erasure encoding to reduce space requirements • Mutli-site replication • RBD client side SSD caching (specifically for OS deployment) Thursday, 26 September 13
  • 23. Deployment • Generally Puppet Managed • VM Image generated usingVeeWee https://github.com/jedi4ever/veewee "A great tool for creating and configuring lightweight, reproducible, portable virtual machine environments - often used with the addition of automation tools such as Chef or Puppet." Thursday, 26 September 13
  • 24. Oversubscription • Not allVMs have CPU intensive workloads • Makes financial sense to over-commit resource when applicable • Shared resources have CPU over-commited by 4x • Memory is not over-commited • Project teams can manage their own level on their own equipment Thursday, 26 September 13
  • 25. Future Work - OpenNebula • Hypervisor-side SSD caching (B-cache, flash-cache, enhanceio etc.)... possibly • Better ceph integration (attach_disk etc) • Multiple Ceph Pools for tiered storage • SSD based local storage • Leverage more of radosgw for S3 compliant storage • IntegrateVM generator into Sunstone/ONE? • Move to virtio-scsi Thursday, 26 September 13
  • 26. Future Work - Hardware Pools • PCI Passthrough Pooling • Mainly used for SR-IOV Network adapters • Allow PCI capture devices to be bound to aVM • Drive the SDI Matrix to attach a given soft-patch • Other use cases? Thursday, 26 September 13