SlideShare a Scribd company logo
1 of 22
Using OpenStack
In A Traditional Hosting Environment
Jun Park (Ph.D.), Sr. Systems Architect
Mike Wilson, Sr. Systems Architect
EIG/Bluehost
OpenStack Summit 2013
EIG/Bluehost 2
• Scale @10x what we were in about a year’s time
• Needed a system to manage it all
• Provisioning these systems had to be automated
• System had to scale to multiple data centers
• Btw, we have 2 months to come up with this
High level requirements
• Centralized management, horizontal scalability.
• Abstractions for physical and logical deployments of
devices and resources.
• Open-source project with lots of support and momentum
• Typical Cloud features
(migration, imaging, provisioning, etc.)
Easy transition into future cloud offering with support for
industry standard APIs
=
OpenStack!
3EIG/Bluehost
Bluehost Environment
• Scale
– More than 10,000 physical
servers
– Adding hundreds of nodes
per day
• Network
– Public network directly attached to VMs
– Plan on adding private network later
• OpenStack/Folsom Components
– Nova (compute, api, scheduler)
– Quantum (network)
– AMQP: Qpid (messaging)
– Mysql (db)
4EIG/Bluehost
Outline
• Scalability/Stability
• Rethink Quantum Network
• Operational Issues
• Wrap-up/Conclusions
5EIG/Bluehost
Why Difficult To Scale up?
• Components that don’t scale
– Messaging system
– Mysql server
– Heavy APIs
• Hard to Diagnose
– No simulator/emulator for high scalability testing
– No quantitative guide as to how to scale up
– No detailed error message or even not at all
– Requires detailed knowledge of codebase
6EIG/Bluehost
Look at that line!
Nova Enhancements
• Monitoring/troubleshooting
– Added service ping (similar to grizzly)
– Additional task_states
– Better errors to instance_faults
• Functionality
– Added LVM resize and injection
– Added stop_soft and stop_hard similar to reboot
• Stability/Scalability
– Some bug fixes to OVS VIF driver
– Added custom scheduler
7EIG/Bluehost
MySQL/Innodb Concurrency
• Nova behavior
– Direct connection to DB (addressed in grizzly though)
– Too MANY queries; much more read than write
– Some queries often return huge number of results
– Periodicity of loads due to periodic tasks
• Mysql behavior
– innodb_thread_concurrency = num of threads that can
execute queries in the queue
– innodb_concurrency_tickets = number of rows you can
return before requeue (default 500!)
– With default settings your mysql server will fall over with a
few thousand compute nodes
EIG/Bluehost 8
9EIG/Bluehost
So the answer is just increase concurrency and tickets right?
Sadly, no.
Tuning concurrency and tickets is a must! But we still requeue.
We don’t know why, but we have a workaround.
Our workaround:
Send most read queries to a cluster of mysql slaves.
I say most because many queries would be sensitive to
possible slave replication lag
Messaging System
• Qpid Chosen
– Speed and easy cluster ability
– Incredibly unstable at large scale (Even at small scale)
– Possibly poorly configured
– Older release in CentOS6
• Problem
– Broker bottleneck
– Unnecessarily complex
• Recommendation
– AVOID BROKER!
– ZeroMQ as replacement
EIG/Bluehost 10
Server Farm
> 10,000 physical servers
BlueHost OpenStack
Group of Controllers
• Nova-api
• Nova-scheduler
• Quantum-server
• Keystone
Read-only mysql slave
Read-only mysql slave
Read-only mysql slaves
mysql master
Qpid servers
Load Balanced
EIG/Bluehost
Host
BH-enhanced Nova-
compute
BH-enhanced OVS
Quantum Plugin
Rethink Quantum Network
• Problem
– Quantum API only for DB; no efficient API for Nova
– No API-around design for actual network objects
– Premature OpenvSwitch
(OVS) quantum plugin
• Our approach
– Adding intelligence to OVS plugin
– Optimizing API
• E.g., get_instance_nw_info() in nova; Python-level join operation
– Expand current implicit communication with nova-
compute (although not ideal)
12EIG/Bluehost
OpenvSwitch (OVS)
• Support
– OpenFlow 1.0 (1.2, 1.3 in experiment)
– Various Hypervisors including KVM
– Merged into main stream since Linux 3.3
• Functionality
– Filtering rules and associated actions
• E.g., anti-IP spoofing, DMAC filtering
– Replacement or superset of ebtables/iptables/linuxbridge
– QoS or Linux traffic control (TC)
– Various third party controllers for full-fledged OF
functionality
13EIG/Bluehost
Controller
Host
Quantum With Nova-Compute
Quantum-server
Quantum-agent
Nova-compute
Quantum
DB
Nova-api server
2. Call quantum-api to create port
4. Create OVS tap
5. Set up external-ids on tap
1. Monitor any change of external-
ids of the existing taps
1. Nova boot API
3. Modify external-ids on tap &
setup OVS flows
2. Call quantum-api to get port info
3. Create port in DB
Why nova-compute
still create tap?
14EIG/Bluehost
6. Do the rest of steps
for creating a new VM
Implicit communication
with nova-compute
BH-Enhanced OVS Quantum Plugin
• Idiosyncratic Requirements
– Focus on a Zone; plan on addressing multiple-datacenters issue
– Direct public IP using flat shared provider network with No NAT
– Still required to be isolated while allowing intra-traffic among VMs
• Our Approach
– Developing “Distributed OpenFlow controller” using OVS plugin
– Either no-tag or 24bit ID (e.g., QinQ, VxLAN, GRE), but NO VLAN
– Caveats: Neither API-around approach nor Virtual Appliance
• New Features
– Anti-IP/ARP spoofing OF flows
– Multiple-IPs per public port
– Network bandwidth throttling (QoS)
– Optimal Intra-traffic flows
15EIG/Bluehost
BH-Enhanced OVS Quantum Plugin
br-int
tap1
tap2
br-int-eth0
br-eth0
phy-br-eth0
eth0
pair of veth
• DMAC matching for incoming
packets
• TPA matching in ARP query
• Anti-IP
spoofing: SRC
IP matching for
outgoing
packetsVM1
VM2
Public Networks
• Deploy QoS for
outgoing packets
10 Mbps
50 Mbps
Int-loopback phy-loopback
pair of veth
• DMAC matching for
internal traffic to VMs
• DMAC matching for
internal traffic to VMs
16EIG/Bluehost
Tag=1 (MTU size bug)
Tag=2
Operational Issues
• Reboot hosts
– Problem
• Circular dependencies between libvirtd and nova-compute
• OVS bug, allows to add non-existing tap interfaces
– Solution
• A simple workaround to restart services in rc.local
• Restart services
– Problem
• Quantum recreates taps, resulting in network hiccups
– Solution
• Check out existing taps, no touch when unnecessary
17EIG/Bluehost
Operational Issues
• Monitor Health
– Problem
• Hard to track down
– Solution
• Adding health check APIs
• XML customization
– Problem
• No way to modify XML on the fly
• Not even allowed for static customization
– Solution
• Add more parameters
18EIG/Bluehost
Wrap-up/Conclusions
EIG/Bluehost 19
Problem vs. Solution
EIG/Bluehost 20
Problem Solution
Monitoring/troubleshootin
g
Service ping, task_states, etc.
Nova
No LVM
Overloaded scheduler
Add LVM driver
Custom scheduler
OVS VIF driver issue
Mysql
Fix bugs
Overloaded Read-only Mysql slave server
Innodb issue Optimized confs
Qpid
Instability issue Clustered qpid
Future plan No broker, ZeroMQ
Quantum
Heavy API Optimized API
Premature OVS plugin Add features (Anti-IP, No-tag flows, QoS)
Operations
Reboot, restart of services Simple workarounds
XML XML customization (cpu custom)
For True OpenStack Success
Scalability
 Scalable Messaging System & DB Infra
Networks
 Good Networks Abstraction
So … We don’t do live demo until …
21EIG/Bluehost
We’re sorry, we haven’t had time to contribute a whole lot YET. But here’s
some code:
https://github.com/upoopoo for BH nova
https://github.com/JunPark for BH Quantum (as in live production)
Browse the branches, many have to do with features discussed today
22EIG/Bluehost

More Related Content

What's hot

USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a MonthUSENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a MonthNicolas Brousse
 
Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2Chartbeat
 
Dockerizing the Hard Services: Neutron and Nova
Dockerizing the Hard Services: Neutron and NovaDockerizing the Hard Services: Neutron and Nova
Dockerizing the Hard Services: Neutron and Novaclayton_oneill
 
Kubernetes at Datadog the very hard way
Kubernetes at Datadog the very hard wayKubernetes at Datadog the very hard way
Kubernetes at Datadog the very hard wayLaurent Bernaille
 
Building a FaaS with pulsar
Building a FaaS with pulsarBuilding a FaaS with pulsar
Building a FaaS with pulsarStreamNative
 
Streaming millions of Contact Center interactions in (near) real-time with Pu...
Streaming millions of Contact Center interactions in (near) real-time with Pu...Streaming millions of Contact Center interactions in (near) real-time with Pu...
Streaming millions of Contact Center interactions in (near) real-time with Pu...Frank Kelly
 
Accelerated dataplanes integration and deployment
Accelerated dataplanes integration and deploymentAccelerated dataplanes integration and deployment
Accelerated dataplanes integration and deploymentOPNFV
 
Real-time Cloud Management with SaltStack
Real-time Cloud Management with SaltStackReal-time Cloud Management with SaltStack
Real-time Cloud Management with SaltStackSaltStack
 
[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...
[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...
[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...OpenStack Korea Community
 
Chef cookbooks for OpenStack HA
Chef cookbooks for OpenStack HAChef cookbooks for OpenStack HA
Chef cookbooks for OpenStack HAAdam Spiers
 
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...OPNFV
 
Kafka Reliability - When it absolutely, positively has to be there
Kafka Reliability - When it absolutely, positively has to be thereKafka Reliability - When it absolutely, positively has to be there
Kafka Reliability - When it absolutely, positively has to be thereGwen (Chen) Shapira
 
A fun cup of joe with open liberty
A fun cup of joe with open libertyA fun cup of joe with open liberty
A fun cup of joe with open libertyAndy Mauer
 
SaltConf14 - Brendan Burns, Google - Management at Google Scale
SaltConf14 - Brendan Burns, Google - Management at Google ScaleSaltConf14 - Brendan Burns, Google - Management at Google Scale
SaltConf14 - Brendan Burns, Google - Management at Google ScaleSaltStack
 
Mike Guthrie - Revamping Your 10 Year Old Nagios Installation
Mike Guthrie - Revamping Your 10 Year Old Nagios InstallationMike Guthrie - Revamping Your 10 Year Old Nagios Installation
Mike Guthrie - Revamping Your 10 Year Old Nagios InstallationNagios
 
Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021
Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021
Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021StreamNative
 
WebLogic Stability; Detect and Analyse Stuck Threads
WebLogic Stability; Detect and Analyse Stuck ThreadsWebLogic Stability; Detect and Analyse Stuck Threads
WebLogic Stability; Detect and Analyse Stuck ThreadsMaarten Smeets
 
Consumer offset management in Kafka
Consumer offset management in KafkaConsumer offset management in Kafka
Consumer offset management in KafkaJoel Koshy
 
High Availability for OpenStack
High Availability for OpenStackHigh Availability for OpenStack
High Availability for OpenStackKamesh Pemmaraju
 
Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...
Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...
Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...StreamNative
 

What's hot (20)

USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a MonthUSENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
 
Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2Tuning TCP and NGINX on EC2
Tuning TCP and NGINX on EC2
 
Dockerizing the Hard Services: Neutron and Nova
Dockerizing the Hard Services: Neutron and NovaDockerizing the Hard Services: Neutron and Nova
Dockerizing the Hard Services: Neutron and Nova
 
Kubernetes at Datadog the very hard way
Kubernetes at Datadog the very hard wayKubernetes at Datadog the very hard way
Kubernetes at Datadog the very hard way
 
Building a FaaS with pulsar
Building a FaaS with pulsarBuilding a FaaS with pulsar
Building a FaaS with pulsar
 
Streaming millions of Contact Center interactions in (near) real-time with Pu...
Streaming millions of Contact Center interactions in (near) real-time with Pu...Streaming millions of Contact Center interactions in (near) real-time with Pu...
Streaming millions of Contact Center interactions in (near) real-time with Pu...
 
Accelerated dataplanes integration and deployment
Accelerated dataplanes integration and deploymentAccelerated dataplanes integration and deployment
Accelerated dataplanes integration and deployment
 
Real-time Cloud Management with SaltStack
Real-time Cloud Management with SaltStackReal-time Cloud Management with SaltStack
Real-time Cloud Management with SaltStack
 
[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...
[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...
[OpenInfra Days Korea 2018] Day 2 - E6: "SONA: ONOS SDN Controller 기반 OpenSta...
 
Chef cookbooks for OpenStack HA
Chef cookbooks for OpenStack HAChef cookbooks for OpenStack HA
Chef cookbooks for OpenStack HA
 
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
 
Kafka Reliability - When it absolutely, positively has to be there
Kafka Reliability - When it absolutely, positively has to be thereKafka Reliability - When it absolutely, positively has to be there
Kafka Reliability - When it absolutely, positively has to be there
 
A fun cup of joe with open liberty
A fun cup of joe with open libertyA fun cup of joe with open liberty
A fun cup of joe with open liberty
 
SaltConf14 - Brendan Burns, Google - Management at Google Scale
SaltConf14 - Brendan Burns, Google - Management at Google ScaleSaltConf14 - Brendan Burns, Google - Management at Google Scale
SaltConf14 - Brendan Burns, Google - Management at Google Scale
 
Mike Guthrie - Revamping Your 10 Year Old Nagios Installation
Mike Guthrie - Revamping Your 10 Year Old Nagios InstallationMike Guthrie - Revamping Your 10 Year Old Nagios Installation
Mike Guthrie - Revamping Your 10 Year Old Nagios Installation
 
Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021
Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021
Simplifying Migration from Kafka to Pulsar - Pulsar Summit NA 2021
 
WebLogic Stability; Detect and Analyse Stuck Threads
WebLogic Stability; Detect and Analyse Stuck ThreadsWebLogic Stability; Detect and Analyse Stuck Threads
WebLogic Stability; Detect and Analyse Stuck Threads
 
Consumer offset management in Kafka
Consumer offset management in KafkaConsumer offset management in Kafka
Consumer offset management in Kafka
 
High Availability for OpenStack
High Availability for OpenStackHigh Availability for OpenStack
High Availability for OpenStack
 
Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...
Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...
Take Kafka-on-Pulsar to Production at Internet Scale: Improvements Made for P...
 

Viewers also liked

Open stack in_production__the_good,_the_bad_&_the_ugly
Open stack in_production__the_good,_the_bad_&_the_uglyOpen stack in_production__the_good,_the_bad_&_the_ugly
Open stack in_production__the_good,_the_bad_&_the_uglyOpenStack Foundation
 
OpenStack Database as a Service - Juno Updates
OpenStack Database as a Service - Juno UpdatesOpenStack Database as a Service - Juno Updates
OpenStack Database as a Service - Juno UpdatesOpenStack Foundation
 
Blue host using openstack in a traditional hosting environment
Blue host using openstack in a traditional hosting environmentBlue host using openstack in a traditional hosting environment
Blue host using openstack in a traditional hosting environmentOpenStack Foundation
 
Mark Collier Keynote - OpenStack Day London June 2014
Mark Collier Keynote -  OpenStack Day London June 2014Mark Collier Keynote -  OpenStack Day London June 2014
Mark Collier Keynote - OpenStack Day London June 2014OpenStack Foundation
 
Top 10 Things We Learned Implementing OpenStack
Top 10 Things We Learned Implementing OpenStackTop 10 Things We Learned Implementing OpenStack
Top 10 Things We Learned Implementing OpenStackOpenStack Foundation
 
Stacking up with OpenStack: Building for High Availability
Stacking up with OpenStack: Building for High AvailabilityStacking up with OpenStack: Building for High Availability
Stacking up with OpenStack: Building for High AvailabilityOpenStack Foundation
 
Deploy from OpenStack Trunk into a Production Environment
Deploy from OpenStack Trunk into a Production EnvironmentDeploy from OpenStack Trunk into a Production Environment
Deploy from OpenStack Trunk into a Production EnvironmentOpenStack Foundation
 
Consideration for Building a Private Cloud
Consideration for Building a Private CloudConsideration for Building a Private Cloud
Consideration for Building a Private CloudOpenStack Foundation
 
Bercovici top 10 things net app learned 0416133
Bercovici top 10 things net app learned 0416133Bercovici top 10 things net app learned 0416133
Bercovici top 10 things net app learned 0416133OpenStack Foundation
 
Best Practices for Integrating a Third party Portal with OpenStack
Best Practices for Integrating a Third party Portal with OpenStackBest Practices for Integrating a Third party Portal with OpenStack
Best Practices for Integrating a Third party Portal with OpenStackOpenStack Foundation
 
Introduction to OpenStack Architecture
Introduction to OpenStack ArchitectureIntroduction to OpenStack Architecture
Introduction to OpenStack ArchitectureOpenStack Foundation
 

Viewers also liked (17)

Open stack in_production__the_good,_the_bad_&_the_ugly
Open stack in_production__the_good,_the_bad_&_the_uglyOpen stack in_production__the_good,_the_bad_&_the_ugly
Open stack in_production__the_good,_the_bad_&_the_ugly
 
OpenStack Database as a Service - Juno Updates
OpenStack Database as a Service - Juno UpdatesOpenStack Database as a Service - Juno Updates
OpenStack Database as a Service - Juno Updates
 
Blue host using openstack in a traditional hosting environment
Blue host using openstack in a traditional hosting environmentBlue host using openstack in a traditional hosting environment
Blue host using openstack in a traditional hosting environment
 
Mark Collier Keynote - OpenStack Day London June 2014
Mark Collier Keynote -  OpenStack Day London June 2014Mark Collier Keynote -  OpenStack Day London June 2014
Mark Collier Keynote - OpenStack Day London June 2014
 
Top 10 Things We Learned Implementing OpenStack
Top 10 Things We Learned Implementing OpenStackTop 10 Things We Learned Implementing OpenStack
Top 10 Things We Learned Implementing OpenStack
 
Clouds in High Energy
Clouds in High EnergyClouds in High Energy
Clouds in High Energy
 
Using *Grimoire to Analyze
Using *Grimoire to AnalyzeUsing *Grimoire to Analyze
Using *Grimoire to Analyze
 
Hadoop For OpenStack Log Analysis
Hadoop For OpenStack Log AnalysisHadoop For OpenStack Log Analysis
Hadoop For OpenStack Log Analysis
 
Stacking up with OpenStack: Building for High Availability
Stacking up with OpenStack: Building for High AvailabilityStacking up with OpenStack: Building for High Availability
Stacking up with OpenStack: Building for High Availability
 
Deploy from OpenStack Trunk into a Production Environment
Deploy from OpenStack Trunk into a Production EnvironmentDeploy from OpenStack Trunk into a Production Environment
Deploy from OpenStack Trunk into a Production Environment
 
OpenStack in Production
OpenStack in ProductionOpenStack in Production
OpenStack in Production
 
Consideration for Building a Private Cloud
Consideration for Building a Private CloudConsideration for Building a Private Cloud
Consideration for Building a Private Cloud
 
Clouds in High Energy Physics
Clouds in High Energy PhysicsClouds in High Energy Physics
Clouds in High Energy Physics
 
Bercovici top 10 things net app learned 0416133
Bercovici top 10 things net app learned 0416133Bercovici top 10 things net app learned 0416133
Bercovici top 10 things net app learned 0416133
 
Chef For OpenStack Overview
Chef For OpenStack OverviewChef For OpenStack Overview
Chef For OpenStack Overview
 
Best Practices for Integrating a Third party Portal with OpenStack
Best Practices for Integrating a Third party Portal with OpenStackBest Practices for Integrating a Third party Portal with OpenStack
Best Practices for Integrating a Third party Portal with OpenStack
 
Introduction to OpenStack Architecture
Introduction to OpenStack ArchitectureIntroduction to OpenStack Architecture
Introduction to OpenStack Architecture
 

Similar to Using OpenStack In a Traditional Hosting Environment

Blue host openstacksummit_2013
Blue host openstacksummit_2013Blue host openstacksummit_2013
Blue host openstacksummit_2013Jun Park
 
Capacity Management/Provisioning (Cloud's full, Can't build here)
Capacity Management/Provisioning (Cloud's full, Can't build here)Capacity Management/Provisioning (Cloud's full, Can't build here)
Capacity Management/Provisioning (Cloud's full, Can't build here)andyhky
 
Presentation oracle net services
Presentation    oracle net servicesPresentation    oracle net services
Presentation oracle net servicesxKinAnx
 
Understanding and deploying Network Virtualization
Understanding and deploying Network VirtualizationUnderstanding and deploying Network Virtualization
Understanding and deploying Network VirtualizationSDN Hub
 
OpenKilda: Stream Processing Meets Openflow
OpenKilda: Stream Processing Meets OpenflowOpenKilda: Stream Processing Meets Openflow
OpenKilda: Stream Processing Meets OpenflowAPNIC
 
DOE Magellan OpenStack user story
DOE Magellan OpenStack user storyDOE Magellan OpenStack user story
DOE Magellan OpenStack user storylaurabeckcahoon
 
Managing Open vSwitch Across a Large Heterogenous Fleet
Managing Open vSwitch Across a Large Heterogenous FleetManaging Open vSwitch Across a Large Heterogenous Fleet
Managing Open vSwitch Across a Large Heterogenous Fleetandyhky
 
The Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale NetworkThe Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale NetworkOpen Networking Summits
 
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...Edward Burns
 
[En] IPVS for Docker Containers
[En] IPVS for Docker Containers[En] IPVS for Docker Containers
[En] IPVS for Docker ContainersAndrey Sibirev
 
IPVS for Docker Containers
IPVS for Docker ContainersIPVS for Docker Containers
IPVS for Docker ContainersBob Sokol
 
Data Center Network Trends - Lin Nease
Data Center Network Trends - Lin NeaseData Center Network Trends - Lin Nease
Data Center Network Trends - Lin NeaseHPDutchWorld
 
Ruslan Belkin And Sean Dawson on LinkedIn's Network Updates Uncovered
Ruslan Belkin And Sean Dawson on LinkedIn's Network Updates UncoveredRuslan Belkin And Sean Dawson on LinkedIn's Network Updates Uncovered
Ruslan Belkin And Sean Dawson on LinkedIn's Network Updates UncoveredLinkedIn
 
How we scaled Rudder to 10k, and the road to 50k
How we scaled Rudder to 10k, and the road to 50kHow we scaled Rudder to 10k, and the road to 50k
How we scaled Rudder to 10k, and the road to 50kRUDDER
 
OpenStack Networking and Automation
OpenStack Networking and AutomationOpenStack Networking and Automation
OpenStack Networking and AutomationAdam Johnson
 
FreeSWITCH as a Microservice
FreeSWITCH as a MicroserviceFreeSWITCH as a Microservice
FreeSWITCH as a MicroserviceEvan McGee
 
07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...
07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...
07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...Indonesia Network Operators Group
 

Similar to Using OpenStack In a Traditional Hosting Environment (20)

Blue host openstacksummit_2013
Blue host openstacksummit_2013Blue host openstacksummit_2013
Blue host openstacksummit_2013
 
Neutron scaling
Neutron scalingNeutron scaling
Neutron scaling
 
Scale Kubernetes to support 50000 services
Scale Kubernetes to support 50000 servicesScale Kubernetes to support 50000 services
Scale Kubernetes to support 50000 services
 
Capacity Management/Provisioning (Cloud's full, Can't build here)
Capacity Management/Provisioning (Cloud's full, Can't build here)Capacity Management/Provisioning (Cloud's full, Can't build here)
Capacity Management/Provisioning (Cloud's full, Can't build here)
 
Presentation oracle net services
Presentation    oracle net servicesPresentation    oracle net services
Presentation oracle net services
 
Understanding and deploying Network Virtualization
Understanding and deploying Network VirtualizationUnderstanding and deploying Network Virtualization
Understanding and deploying Network Virtualization
 
OpenKilda: Stream Processing Meets Openflow
OpenKilda: Stream Processing Meets OpenflowOpenKilda: Stream Processing Meets Openflow
OpenKilda: Stream Processing Meets Openflow
 
DOE Magellan OpenStack user story
DOE Magellan OpenStack user storyDOE Magellan OpenStack user story
DOE Magellan OpenStack user story
 
Managing Open vSwitch Across a Large Heterogenous Fleet
Managing Open vSwitch Across a Large Heterogenous FleetManaging Open vSwitch Across a Large Heterogenous Fleet
Managing Open vSwitch Across a Large Heterogenous Fleet
 
The Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale NetworkThe Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale Network
 
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
HTTP/2 Comes to Java: Servlet 4.0 and what it means for the Java/Jakarta EE e...
 
[En] IPVS for Docker Containers
[En] IPVS for Docker Containers[En] IPVS for Docker Containers
[En] IPVS for Docker Containers
 
IPVS for Docker Containers
IPVS for Docker ContainersIPVS for Docker Containers
IPVS for Docker Containers
 
Data Center Network Trends - Lin Nease
Data Center Network Trends - Lin NeaseData Center Network Trends - Lin Nease
Data Center Network Trends - Lin Nease
 
OVS-LinuxCon 2013.pdf
OVS-LinuxCon 2013.pdfOVS-LinuxCon 2013.pdf
OVS-LinuxCon 2013.pdf
 
Ruslan Belkin And Sean Dawson on LinkedIn's Network Updates Uncovered
Ruslan Belkin And Sean Dawson on LinkedIn's Network Updates UncoveredRuslan Belkin And Sean Dawson on LinkedIn's Network Updates Uncovered
Ruslan Belkin And Sean Dawson on LinkedIn's Network Updates Uncovered
 
How we scaled Rudder to 10k, and the road to 50k
How we scaled Rudder to 10k, and the road to 50kHow we scaled Rudder to 10k, and the road to 50k
How we scaled Rudder to 10k, and the road to 50k
 
OpenStack Networking and Automation
OpenStack Networking and AutomationOpenStack Networking and Automation
OpenStack Networking and Automation
 
FreeSWITCH as a Microservice
FreeSWITCH as a MicroserviceFreeSWITCH as a Microservice
FreeSWITCH as a Microservice
 
07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...
07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...
07 (IDNOG02) SDN Research activity in Institut Teknologi Bandung by Affan Bas...
 

More from OpenStack Foundation

Sponsor Webinar - OpenStack Summit Vancouver 2018
Sponsor Webinar  - OpenStack Summit Vancouver 2018Sponsor Webinar  - OpenStack Summit Vancouver 2018
Sponsor Webinar - OpenStack Summit Vancouver 2018OpenStack Foundation
 
OpenStack Summits 101: A Guide For Attendees
OpenStack Summits 101: A Guide For AttendeesOpenStack Summits 101: A Guide For Attendees
OpenStack Summits 101: A Guide For AttendeesOpenStack Foundation
 
OpenStack Marketing Plan - Community Presentation
OpenStack Marketing Plan - Community PresentationOpenStack Marketing Plan - Community Presentation
OpenStack Marketing Plan - Community PresentationOpenStack Foundation
 
OpenStack 5th Birthday - User Group Parties
OpenStack 5th Birthday - User Group PartiesOpenStack 5th Birthday - User Group Parties
OpenStack 5th Birthday - User Group PartiesOpenStack Foundation
 
Liberty release: Preliminary marketing materials & messages
Liberty release: Preliminary marketing materials & messagesLiberty release: Preliminary marketing materials & messages
Liberty release: Preliminary marketing materials & messagesOpenStack Foundation
 
OpenStack Foundation 2H 2015 Marketing Plan
OpenStack Foundation 2H 2015 Marketing PlanOpenStack Foundation 2H 2015 Marketing Plan
OpenStack Foundation 2H 2015 Marketing PlanOpenStack Foundation
 
OpenStack Summit Tokyo Sponsor Webinar
OpenStack Summit Tokyo Sponsor Webinar OpenStack Summit Tokyo Sponsor Webinar
OpenStack Summit Tokyo Sponsor Webinar OpenStack Foundation
 
Neutron Updates - Liberty Edition
Neutron Updates - Liberty Edition Neutron Updates - Liberty Edition
Neutron Updates - Liberty Edition OpenStack Foundation
 
Searchlight Updates - Liberty Edition
Searchlight Updates - Liberty EditionSearchlight Updates - Liberty Edition
Searchlight Updates - Liberty EditionOpenStack Foundation
 
Congress Updates - Liberty Edition
Congress Updates - Liberty EditionCongress Updates - Liberty Edition
Congress Updates - Liberty EditionOpenStack Foundation
 
Release Cycle Management Updates - Liberty Edition
Release Cycle Management Updates - Liberty EditionRelease Cycle Management Updates - Liberty Edition
Release Cycle Management Updates - Liberty EditionOpenStack Foundation
 
OpenStack Day CEE 2015: Real-World Use Cases
OpenStack Day CEE 2015: Real-World Use CasesOpenStack Day CEE 2015: Real-World Use Cases
OpenStack Day CEE 2015: Real-World Use CasesOpenStack Foundation
 

More from OpenStack Foundation (20)

Sponsor Webinar - OpenStack Summit Vancouver 2018
Sponsor Webinar  - OpenStack Summit Vancouver 2018Sponsor Webinar  - OpenStack Summit Vancouver 2018
Sponsor Webinar - OpenStack Summit Vancouver 2018
 
OpenStack Summits 101: A Guide For Attendees
OpenStack Summits 101: A Guide For AttendeesOpenStack Summits 101: A Guide For Attendees
OpenStack Summits 101: A Guide For Attendees
 
OpenStack Marketing Plan - Community Presentation
OpenStack Marketing Plan - Community PresentationOpenStack Marketing Plan - Community Presentation
OpenStack Marketing Plan - Community Presentation
 
OpenStack 5th Birthday - User Group Parties
OpenStack 5th Birthday - User Group PartiesOpenStack 5th Birthday - User Group Parties
OpenStack 5th Birthday - User Group Parties
 
Liberty release: Preliminary marketing materials & messages
Liberty release: Preliminary marketing materials & messagesLiberty release: Preliminary marketing materials & messages
Liberty release: Preliminary marketing materials & messages
 
OpenStack Foundation 2H 2015 Marketing Plan
OpenStack Foundation 2H 2015 Marketing PlanOpenStack Foundation 2H 2015 Marketing Plan
OpenStack Foundation 2H 2015 Marketing Plan
 
OpenStack Summit Tokyo Sponsor Webinar
OpenStack Summit Tokyo Sponsor Webinar OpenStack Summit Tokyo Sponsor Webinar
OpenStack Summit Tokyo Sponsor Webinar
 
Cinder Updates - Liberty Edition
Cinder Updates - Liberty Edition Cinder Updates - Liberty Edition
Cinder Updates - Liberty Edition
 
Glance Updates - Liberty Edition
Glance Updates - Liberty EditionGlance Updates - Liberty Edition
Glance Updates - Liberty Edition
 
Heat Updates - Liberty Edition
Heat Updates - Liberty EditionHeat Updates - Liberty Edition
Heat Updates - Liberty Edition
 
Neutron Updates - Liberty Edition
Neutron Updates - Liberty Edition Neutron Updates - Liberty Edition
Neutron Updates - Liberty Edition
 
Nova Updates - Liberty Edition
Nova Updates - Liberty EditionNova Updates - Liberty Edition
Nova Updates - Liberty Edition
 
Sahara Updates - Liberty Edition
Sahara Updates - Liberty EditionSahara Updates - Liberty Edition
Sahara Updates - Liberty Edition
 
Searchlight Updates - Liberty Edition
Searchlight Updates - Liberty EditionSearchlight Updates - Liberty Edition
Searchlight Updates - Liberty Edition
 
Trove Updates - Liberty Edition
Trove Updates - Liberty EditionTrove Updates - Liberty Edition
Trove Updates - Liberty Edition
 
OpenStack: five years in
OpenStack: five years inOpenStack: five years in
OpenStack: five years in
 
Swift Updates - Liberty Edition
Swift Updates - Liberty EditionSwift Updates - Liberty Edition
Swift Updates - Liberty Edition
 
Congress Updates - Liberty Edition
Congress Updates - Liberty EditionCongress Updates - Liberty Edition
Congress Updates - Liberty Edition
 
Release Cycle Management Updates - Liberty Edition
Release Cycle Management Updates - Liberty EditionRelease Cycle Management Updates - Liberty Edition
Release Cycle Management Updates - Liberty Edition
 
OpenStack Day CEE 2015: Real-World Use Cases
OpenStack Day CEE 2015: Real-World Use CasesOpenStack Day CEE 2015: Real-World Use Cases
OpenStack Day CEE 2015: Real-World Use Cases
 

Recently uploaded

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 

Recently uploaded (20)

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 

Using OpenStack In a Traditional Hosting Environment

  • 1. Using OpenStack In A Traditional Hosting Environment Jun Park (Ph.D.), Sr. Systems Architect Mike Wilson, Sr. Systems Architect EIG/Bluehost OpenStack Summit 2013
  • 2. EIG/Bluehost 2 • Scale @10x what we were in about a year’s time • Needed a system to manage it all • Provisioning these systems had to be automated • System had to scale to multiple data centers • Btw, we have 2 months to come up with this
  • 3. High level requirements • Centralized management, horizontal scalability. • Abstractions for physical and logical deployments of devices and resources. • Open-source project with lots of support and momentum • Typical Cloud features (migration, imaging, provisioning, etc.) Easy transition into future cloud offering with support for industry standard APIs = OpenStack! 3EIG/Bluehost
  • 4. Bluehost Environment • Scale – More than 10,000 physical servers – Adding hundreds of nodes per day • Network – Public network directly attached to VMs – Plan on adding private network later • OpenStack/Folsom Components – Nova (compute, api, scheduler) – Quantum (network) – AMQP: Qpid (messaging) – Mysql (db) 4EIG/Bluehost
  • 5. Outline • Scalability/Stability • Rethink Quantum Network • Operational Issues • Wrap-up/Conclusions 5EIG/Bluehost
  • 6. Why Difficult To Scale up? • Components that don’t scale – Messaging system – Mysql server – Heavy APIs • Hard to Diagnose – No simulator/emulator for high scalability testing – No quantitative guide as to how to scale up – No detailed error message or even not at all – Requires detailed knowledge of codebase 6EIG/Bluehost Look at that line!
  • 7. Nova Enhancements • Monitoring/troubleshooting – Added service ping (similar to grizzly) – Additional task_states – Better errors to instance_faults • Functionality – Added LVM resize and injection – Added stop_soft and stop_hard similar to reboot • Stability/Scalability – Some bug fixes to OVS VIF driver – Added custom scheduler 7EIG/Bluehost
  • 8. MySQL/Innodb Concurrency • Nova behavior – Direct connection to DB (addressed in grizzly though) – Too MANY queries; much more read than write – Some queries often return huge number of results – Periodicity of loads due to periodic tasks • Mysql behavior – innodb_thread_concurrency = num of threads that can execute queries in the queue – innodb_concurrency_tickets = number of rows you can return before requeue (default 500!) – With default settings your mysql server will fall over with a few thousand compute nodes EIG/Bluehost 8
  • 9. 9EIG/Bluehost So the answer is just increase concurrency and tickets right? Sadly, no. Tuning concurrency and tickets is a must! But we still requeue. We don’t know why, but we have a workaround. Our workaround: Send most read queries to a cluster of mysql slaves. I say most because many queries would be sensitive to possible slave replication lag
  • 10. Messaging System • Qpid Chosen – Speed and easy cluster ability – Incredibly unstable at large scale (Even at small scale) – Possibly poorly configured – Older release in CentOS6 • Problem – Broker bottleneck – Unnecessarily complex • Recommendation – AVOID BROKER! – ZeroMQ as replacement EIG/Bluehost 10
  • 11. Server Farm > 10,000 physical servers BlueHost OpenStack Group of Controllers • Nova-api • Nova-scheduler • Quantum-server • Keystone Read-only mysql slave Read-only mysql slave Read-only mysql slaves mysql master Qpid servers Load Balanced EIG/Bluehost Host BH-enhanced Nova- compute BH-enhanced OVS Quantum Plugin
  • 12. Rethink Quantum Network • Problem – Quantum API only for DB; no efficient API for Nova – No API-around design for actual network objects – Premature OpenvSwitch (OVS) quantum plugin • Our approach – Adding intelligence to OVS plugin – Optimizing API • E.g., get_instance_nw_info() in nova; Python-level join operation – Expand current implicit communication with nova- compute (although not ideal) 12EIG/Bluehost
  • 13. OpenvSwitch (OVS) • Support – OpenFlow 1.0 (1.2, 1.3 in experiment) – Various Hypervisors including KVM – Merged into main stream since Linux 3.3 • Functionality – Filtering rules and associated actions • E.g., anti-IP spoofing, DMAC filtering – Replacement or superset of ebtables/iptables/linuxbridge – QoS or Linux traffic control (TC) – Various third party controllers for full-fledged OF functionality 13EIG/Bluehost
  • 14. Controller Host Quantum With Nova-Compute Quantum-server Quantum-agent Nova-compute Quantum DB Nova-api server 2. Call quantum-api to create port 4. Create OVS tap 5. Set up external-ids on tap 1. Monitor any change of external- ids of the existing taps 1. Nova boot API 3. Modify external-ids on tap & setup OVS flows 2. Call quantum-api to get port info 3. Create port in DB Why nova-compute still create tap? 14EIG/Bluehost 6. Do the rest of steps for creating a new VM Implicit communication with nova-compute
  • 15. BH-Enhanced OVS Quantum Plugin • Idiosyncratic Requirements – Focus on a Zone; plan on addressing multiple-datacenters issue – Direct public IP using flat shared provider network with No NAT – Still required to be isolated while allowing intra-traffic among VMs • Our Approach – Developing “Distributed OpenFlow controller” using OVS plugin – Either no-tag or 24bit ID (e.g., QinQ, VxLAN, GRE), but NO VLAN – Caveats: Neither API-around approach nor Virtual Appliance • New Features – Anti-IP/ARP spoofing OF flows – Multiple-IPs per public port – Network bandwidth throttling (QoS) – Optimal Intra-traffic flows 15EIG/Bluehost
  • 16. BH-Enhanced OVS Quantum Plugin br-int tap1 tap2 br-int-eth0 br-eth0 phy-br-eth0 eth0 pair of veth • DMAC matching for incoming packets • TPA matching in ARP query • Anti-IP spoofing: SRC IP matching for outgoing packetsVM1 VM2 Public Networks • Deploy QoS for outgoing packets 10 Mbps 50 Mbps Int-loopback phy-loopback pair of veth • DMAC matching for internal traffic to VMs • DMAC matching for internal traffic to VMs 16EIG/Bluehost Tag=1 (MTU size bug) Tag=2
  • 17. Operational Issues • Reboot hosts – Problem • Circular dependencies between libvirtd and nova-compute • OVS bug, allows to add non-existing tap interfaces – Solution • A simple workaround to restart services in rc.local • Restart services – Problem • Quantum recreates taps, resulting in network hiccups – Solution • Check out existing taps, no touch when unnecessary 17EIG/Bluehost
  • 18. Operational Issues • Monitor Health – Problem • Hard to track down – Solution • Adding health check APIs • XML customization – Problem • No way to modify XML on the fly • Not even allowed for static customization – Solution • Add more parameters 18EIG/Bluehost
  • 20. Problem vs. Solution EIG/Bluehost 20 Problem Solution Monitoring/troubleshootin g Service ping, task_states, etc. Nova No LVM Overloaded scheduler Add LVM driver Custom scheduler OVS VIF driver issue Mysql Fix bugs Overloaded Read-only Mysql slave server Innodb issue Optimized confs Qpid Instability issue Clustered qpid Future plan No broker, ZeroMQ Quantum Heavy API Optimized API Premature OVS plugin Add features (Anti-IP, No-tag flows, QoS) Operations Reboot, restart of services Simple workarounds XML XML customization (cpu custom)
  • 21. For True OpenStack Success Scalability  Scalable Messaging System & DB Infra Networks  Good Networks Abstraction So … We don’t do live demo until … 21EIG/Bluehost
  • 22. We’re sorry, we haven’t had time to contribute a whole lot YET. But here’s some code: https://github.com/upoopoo for BH nova https://github.com/JunPark for BH Quantum (as in live production) Browse the branches, many have to do with features discussed today 22EIG/Bluehost

Editor's Notes

  1. Love!30mins talk + 10 minsqna during the main talk we have short answers, have longer answers in qna. May have off-line chats for more details.Take a quick poll: How many people with openstack and cloudstack?Time: Tell our story… main tasks remaining …
  2. Quick poll, how many nodes?
  3. Quick poll : how many nodes?
  4. LOVE!!!Just a small step for us to try to help community and share what we’ve done.
  5. Quick poll: how many people know about OVS plugin? How many people use OVS in production?
  6. The main motivation of quantum network is to pull out network functionality from nova-compute as much as possible. Because of this, nova-compute still needs to know all the details of OVS. No code reuse, confusing, and causing inconsistency, succinct failures when VMs do not have network connectivity. With some garbled taps, quantum-agent thinks it needs to do something even for them.
  7. Let me introduce our BH-enhanced OVS plugin. NO NAT, WHY? Avoid single point of failure, easy to diagnose, meet some specific software requirement such as cPanel.
  8. Vlan tagging No tag or double-tag or VxLAN whatever…Avoiding O(n^2) solution….
  9. Let’s get this straightened out!Structurally means most distributed structure among all other cloud platforms.