SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Swimming upstream -
Doctor project case study
Gerald Kunzmann, NTT DOCOMO
Carlos Goncalves, NEC
Tomi Juvonen, Nokia
AGENDA
– Introduction to OPNFV Doctor project
– Detailed work flow
– Pain points
– Best practises for upstream contributions
– Summary
OPNFV Doctor project - Introduction
• Goal:
– Develop and build fault management and maintenance framework for high
availability of Network Services running on top of virtualized infrastructure.
 Proposed with a very clear target / key feature:
– Immediate notification of unavailability of virtualized resources
from VIM to Consumer
• Members:
– NEC (PTL: Ryota Mibu), AT&T, Cisco, Cloudbase Solutions, Corenova, Ericsson,
Hephaex, Huawei, Intel, KDDI, KT, Nokia, NTT DOCOMO, Spirent, Sprint,
Telecom Italia, Vmsec, ZTE
• https://wiki.opnfv.org/display/doctor/
OPNFV Doctor project – Timeline
Q3 Q4
Q1
2015
Q2 Q3 Q4
Q1
2016
Q2 Q3 Q4
Q1
2017
Q2
OPNFV launch
30 Sep, 2014
Doctor creation
2 Dec, 2014
Arno release
4 May, 2015
OPNFV Summit 2015
9 Nov, 2015
Bramahputra release
1 Mar, 2016
OPNFV Summit 2016
12 Jun, 2016
Colorado release
16 Sep, 2016
OpenStack Summit
Barcelona 2017
25 Oct, 2016 Danube release
4 Apr, 2017
ARNO
- Requirement document
BRAHMAPUTRA
- Ceilometer „Immediate
Notification“
- Nova „Mark Host Down“
- Functional test cases
- PoC demo at OPNFV Summit
- Documentation updates
COLORADO
- Nova:
„Get valid server state“
- Integration of Congress
as Doctor Inspector
- Extended functional tests
- PoC demo at OPNFV Summit and
OpenStack Summit Barcelona
- Documentation updates
DANUBE
- Neutron „Port Status update“
- Inspector design guidelines
- Performance profiler
- Documentation updates
Detailed work flow
1. Doctor requirements document
• Use cases and scenarios
– Active-Standby configuration (1+1 redundancy):
• Consumer of infrastructure has configured ACT-STBY
• Fault in virtualized infrastructure (NFVI)  inform the Consumer to switch to STBY instance
– Prevention actions based on fault prediction: Switch to STBY in case of a predicted fault
– NFVI maintenance: inform Consumer(s) of affected hardware about planned maintenance
• Requirements
1. Monitor physical and virtual resources and detect problems
2. Correlate faults and identify affected virtual resources
3. Notification of Consumer(s) of affected virtual resources
4. Execute steps 1-3 in less than e.g. 1 second to avoid service disruption
Consumer C1 Consumer C2 Consumer C3
Virtualized Infrastructure Manager
(VIM), e.g. OpenStack
Resource
Map
Server – VM mapping
Server S1 VM-1, VM-2
Server S2 VM-7
Server S3 VM-4
Ownership information
VM-1, VM-7 Consumer C1
VM-2 Consumer C2
VM-4 Consumer C3
Resource Pool
Hypervisor
Hardware
Server S1
VM-1
Hypervisor
Hardware
Server S2
Hypervisor
Hardware
Server S3
VM-2 VM-7 VM-4
X 1. Fault Monitoring
- Hardware fault
- Hypervisor fault
- Host OS fault
6. Execute Instruction
- e.g. migrate VM
2. Inform the Consumer?
If YES, find owner of
affected VMs from database
OpenStack Northbound Interface
3. FaultNotification
(VM ID, Fault ID)
5. Instruction
(VM ID)
4. Switch to SBY configuration
Doctor architecture and OpenStack mapping
Monitor
Notifier
Manager /
Consumer
Virtualized Infrastructure
(Resource Pool)
Alarm
Conf.
3. Update State
2. Find Affected
Application
Controller
Controller
Controller
Resource
Map
1. Raw Failure
Inspector
4. Notify all
5. Notify Error
0. Set Alarm
6. Action
Failure
Policy
Monitor
Monitor
Aodh
Vitrage
e.g. Zabbix
Cinder
Neutron
Nova
Congress
Resource state – missing feature
To be:
• Nova API shall support to change nova-compute state
• User shall be able to read OpenStack states and trust they are correct
As is (Kilo release):
• When a VM goes down due to a host HW, host OS or hypervisor failure,
nothing happens in OpenStack. The VMs of a crashed host/hypervisor
are reported to be live and OK through the OpenStack API.
• nova-compute state might change too slowly or the state is not reliable
if expecting also VMs to be down.
3. Gap analysis and solution brainstorming
Solution brainstorming:
• Discussion with experts on best way to address the gap
•  BP spec for „mark nova-compute down“
Immediate notification – deficiency in operation
To be:
• VIM to immediately notify unavailability of virtual resource to VIM user.
• User shall only receive fault notifications related to owned resource(s).
As is (Kilo release):
• OpenStack Metering ‘Ceilometer’ can notify unavailability of resource.
• Due to innerworking of Ceilometer (polling of events), notification of
faults takes seconds to few minutes (depending on polling interval).
• Performance issue for Ceilometer in medium to large scale deployments.
4. Reach upstream
Requirement Draft spec; review internally Submit upstream
5. Test cases and user manual
• End to end test cases
– Upstream: unit tests and scope-restricted functional tests upstream
– Downstream: E2E functional tests will validate full systems
integration
• Manuals
– How to use implemented blueprint
– How it plays within the framework
– How to run the tests and interpret results
– Greatly promotes technology adoption!
6. PoCs, demos and hackfests
Make people
aware of the
work
Collect
feedback
Find
supporters
Work
upstream
Integrate and
validate
7. Upstream achievements
Project Blueprint Spec Drafter Lead Developer Status
Aodh Event Alarm Evaluator Ryota Mibu (NEC) Ryota Mibu (NEC) Completed (Liberty)
Nova New nova API call to mark nova-compute down Tomi Juvonen (Nokia) Roman Dobosz (Intel) Completed (Liberty)
Support forcing service down Tomi Juvonen (Nokia) Carlos Goncalves (NEC) Completed (Liberty)
Get valid server state Tomi Juvonen (Nokia) Tomi Juvonen (Nokia) Completed (Mitaka)
Add notification for service status change Balazs Gibizer (Ericsson) Balazs Gibizer (Ericsson) Completed (Mitaka)
Congress Push Type Datasource Driver Masahito Muroi (NTT) Masahito Muroi (NTT) Completed (Mitaka)
Adds Doctor Driver Masahito Muroi (NTT) Masahito Muroi (NTT) Completed (Mitaka)
Neutron Port data plane status Carlos Goncalves (NEC) Carlos Goncalves (NEC) Completed (Pike)
8. Summary: typical work flow
Requirement
Architecture
& Gaps
Solution &
review
internally
Reach
upstream
Integrate,
test and
document
Present
demos,
collect
feedback
Pain points
•North America, Europe, Asia
Time zone
•OPNFV has handful of installers
•Each use different configuration management tools (e.g. Puppet, Ansible, Charm)
•Different default configurations (e.g. Ceilometer/Aodh, Nova)
Installers
•Internal and cross-project meetings
•Release planning and deliverables
•Alignment and collaboration with relevant SDOs (e.g. ETSI NFV)
Overhead
•OPNFV ships ~6-month old OpenStack release
•Need to backport bugs and features
•Address each installer individually
Backports
•Making sure system integration’s in place
•End to end scenarios require great knowledge across several upstream projectsTesting
•Sometimes forsaken
Documentation
Upstream contributions
Best practises
Contribute to a Project
• If you want to have something done, you should
do it yourself
• Contribute to OpenStack project you are planning
to have your change
– You will gain respect in the community and people
get to know you
– You get to understand how everything works
• Project on-boarding sessions
5/19/2017 16
From OPNFV Requirements to a Spec
• Explain your problem well and have a good material
linked to tell things in more detail
• Make use cases carefully
• Try to find the benefit for a wider audience
• Use common-ground terminology rather than NFV-specific
• Make proposal for the implementation. Review will shape it right
5/19/2017 17
Discuss Your Change
• Join openstack-dev mailing list, Project IRC and weekly
meetings
• OpenStack has PTG just before the start of the next release
cycle. There each project team meets face-to-face to actually
work on the new features, not just talk. There are also cross-
project sessions
• OpenStack summit is arranged 2 months after release cycle
starts. There is now a forum with the opportunity to discuss
requirements with the project and cross-project development
teams for the current +1 release (now Queens' Forum). For the
first time, everyone will have the opportunity to discuss
requirements face to face, in time to get them into the current
+1 release
5/19/2017 18
Need a Wider View
• Email to OpenStack ops and dev mailing list
• Telco reviewers
• Talk with operators
– IRC meetings
– OpenStack Operators Telco and NFV Working
Group
– OpenStack Operators Mid-Cycle
– OpenStack Operators sessions in Forum
5/19/2017 19
Pike Release Timeline - Steps With Your Change
→ Just before the next release cycle starts, have your idea ready in your
head and start to draft the spec. Join the PTG if necessary to discuss
face-to-face with the project. Start making the code if you didn’t already
→ Pike-1: You need to have the spec merged. Next you should have the
first version of your code ready for review. Note! Project cores do not
have much time, make all the test cases and code as good as you can
to have it trough reviews in time. There is usually earlier similar patches
as an example
→ Pike-2: Earlier non-priority code freeze, but more relaxed in current
release
→ Pike-3: Feature freeze. Everything have to be merged before this date
→ Pike release: Might drop some follow up patches
5/19/2017 20
(Feb 20)
(Apr 10)
(Jun 05)
(Jul 24)
(Aug 28)
Lessons Learned
• Be prepared
• POC code
• Scope it right
• Discuss and communicate; Avoid wasted effort
5/19/2017 21
THANKS

Weitere ähnliche Inhalte

Was ist angesagt?

Challenge in asia region connecting each testbed and poc of distributed nfv ...
Challenge in asia region  connecting each testbed and poc of distributed nfv ...Challenge in asia region  connecting each testbed and poc of distributed nfv ...
Challenge in asia region connecting each testbed and poc of distributed nfv ...
OPNFV
 
OPNFV scenarios challenges and opportunities
OPNFV scenarios  challenges and opportunitiesOPNFV scenarios  challenges and opportunities
OPNFV scenarios challenges and opportunities
OPNFV
 

Was ist angesagt? (20)

Challenge in asia region connecting each testbed and poc of distributed nfv ...
Challenge in asia region  connecting each testbed and poc of distributed nfv ...Challenge in asia region  connecting each testbed and poc of distributed nfv ...
Challenge in asia region connecting each testbed and poc of distributed nfv ...
 
Open Platform for NFV (developer)
Open Platform for NFV (developer)Open Platform for NFV (developer)
Open Platform for NFV (developer)
 
How OPNFV Uses OpenStack & How It's Useful
How OPNFV Uses OpenStack & How It's UsefulHow OPNFV Uses OpenStack & How It's Useful
How OPNFV Uses OpenStack & How It's Useful
 
OPNFV: Platform Performance Acceleration
OPNFV: Platform Performance AccelerationOPNFV: Platform Performance Acceleration
OPNFV: Platform Performance Acceleration
 
Evolution of OPNFV CI System: What already exists and what can be introduced
Evolution of OPNFV CI System: What already exists and what can be introduced  Evolution of OPNFV CI System: What already exists and what can be introduced
Evolution of OPNFV CI System: What already exists and what can be introduced
 
Functest in Depth
Functest in DepthFunctest in Depth
Functest in Depth
 
Automatic Integration, Testing and Certification of NFV in China Mobile
Automatic Integration, Testing and Certification of NFV in China MobileAutomatic Integration, Testing and Certification of NFV in China Mobile
Automatic Integration, Testing and Certification of NFV in China Mobile
 
Software-defined migration how to migrate bunch of v-ms and volumes within a...
Software-defined migration  how to migrate bunch of v-ms and volumes within a...Software-defined migration  how to migrate bunch of v-ms and volumes within a...
Software-defined migration how to migrate bunch of v-ms and volumes within a...
 
OPNFV: Open Source Carrier Networking Panel
OPNFV: Open Source Carrier Networking PanelOPNFV: Open Source Carrier Networking Panel
OPNFV: Open Source Carrier Networking Panel
 
Operating OPNFV
Operating OPNFVOperating OPNFV
Operating OPNFV
 
OPNFV scenarios challenges and opportunities
OPNFV scenarios  challenges and opportunitiesOPNFV scenarios  challenges and opportunities
OPNFV scenarios challenges and opportunities
 
Summit 16: The Open Source NFV Eco-system and OPNFV's Role Therein
Summit 16: The Open Source NFV Eco-system and OPNFV's Role ThereinSummit 16: The Open Source NFV Eco-system and OPNFV's Role Therein
Summit 16: The Open Source NFV Eco-system and OPNFV's Role Therein
 
What is OPNFV? An Introduction
What is OPNFV? An IntroductionWhat is OPNFV? An Introduction
What is OPNFV? An Introduction
 
OPNFV overview
OPNFV overviewOPNFV overview
OPNFV overview
 
OPNFV: Getting down to business
OPNFV: Getting down to businessOPNFV: Getting down to business
OPNFV: Getting down to business
 
OPNFV: A Multi-Vendor, Interoperable, NFV Solution
OPNFV: A Multi-Vendor, Interoperable, NFV SolutionOPNFV: A Multi-Vendor, Interoperable, NFV Solution
OPNFV: A Multi-Vendor, Interoperable, NFV Solution
 
Summit 16: ARM Mini-Summit - OPNFV vision, contributions and offerings - Enea
Summit 16: ARM Mini-Summit - OPNFV vision, contributions and offerings - EneaSummit 16: ARM Mini-Summit - OPNFV vision, contributions and offerings - Enea
Summit 16: ARM Mini-Summit - OPNFV vision, contributions and offerings - Enea
 
Summit 16: Carrier Grade Testing Integration
Summit 16: Carrier Grade Testing IntegrationSummit 16: Carrier Grade Testing Integration
Summit 16: Carrier Grade Testing Integration
 
Summit 16: ARM Mini-Summit - OpenDataPlane Monarch Release - Linaro
Summit 16: ARM Mini-Summit -   OpenDataPlane Monarch Release - LinaroSummit 16: ARM Mini-Summit -   OpenDataPlane Monarch Release - Linaro
Summit 16: ARM Mini-Summit - OpenDataPlane Monarch Release - Linaro
 
ONAP integration with opnfv via opera
ONAP integration with opnfv via opera ONAP integration with opnfv via opera
ONAP integration with opnfv via opera
 

Andere mochten auch

Crossing the river by feeling the stones from legacy to cloud native applica...
Crossing the river by feeling the stones  from legacy to cloud native applica...Crossing the river by feeling the stones  from legacy to cloud native applica...
Crossing the river by feeling the stones from legacy to cloud native applica...
OPNFV
 

Andere mochten auch (20)

Demo how to efficiently evaluate nf-vi performance by leveraging opnfv testi...
Demo  how to efficiently evaluate nf-vi performance by leveraging opnfv testi...Demo  how to efficiently evaluate nf-vi performance by leveraging opnfv testi...
Demo how to efficiently evaluate nf-vi performance by leveraging opnfv testi...
 
Summit 16: Keynote: Huawei - Road to All- Cloud Carrier Network
Summit 16: Keynote: Huawei - Road to All- Cloud Carrier NetworkSummit 16: Keynote: Huawei - Road to All- Cloud Carrier Network
Summit 16: Keynote: Huawei - Road to All- Cloud Carrier Network
 
Summit 16: ARM Mini-Summit - OpenFastPath is Open and Fast - Nokia
Summit 16: ARM Mini-Summit - OpenFastPath is Open and Fast - NokiaSummit 16: ARM Mini-Summit - OpenFastPath is Open and Fast - Nokia
Summit 16: ARM Mini-Summit - OpenFastPath is Open and Fast - Nokia
 
Summit 16: Open-O Mini-Summit - TOSCA and YANG Data Modeling for NFV
Summit 16: Open-O Mini-Summit - TOSCA and YANG Data Modeling for NFVSummit 16: Open-O Mini-Summit - TOSCA and YANG Data Modeling for NFV
Summit 16: Open-O Mini-Summit - TOSCA and YANG Data Modeling for NFV
 
Summit 16: Open-O Mini-Summit - Open Source, Orchestration, and OPNFV
Summit 16: Open-O Mini-Summit - Open Source, Orchestration, and OPNFVSummit 16: Open-O Mini-Summit - Open Source, Orchestration, and OPNFV
Summit 16: Open-O Mini-Summit - Open Source, Orchestration, and OPNFV
 
Crossing the river by feeling the stones from legacy to cloud native applica...
Crossing the river by feeling the stones  from legacy to cloud native applica...Crossing the river by feeling the stones  from legacy to cloud native applica...
Crossing the river by feeling the stones from legacy to cloud native applica...
 
Summit 16: Automated Platform for Testing VNF Performance and Interoperabili...
Summit 16: Automated Platform for  Testing VNF Performance and Interoperabili...Summit 16: Automated Platform for  Testing VNF Performance and Interoperabili...
Summit 16: Automated Platform for Testing VNF Performance and Interoperabili...
 
Summit 16: ARM Mini-Summit - NFV for the Masses - Marvell
Summit 16: ARM Mini-Summit - NFV for the Masses - MarvellSummit 16: ARM Mini-Summit - NFV for the Masses - Marvell
Summit 16: ARM Mini-Summit - NFV for the Masses - Marvell
 
Summit 16: ARM Mini-Summit - Efficient NFV solutions for Cloud and Edge - Cavium
Summit 16: ARM Mini-Summit - Efficient NFV solutions for Cloud and Edge - CaviumSummit 16: ARM Mini-Summit - Efficient NFV solutions for Cloud and Edge - Cavium
Summit 16: ARM Mini-Summit - Efficient NFV solutions for Cloud and Edge - Cavium
 
Summit 16: ARM Mini-Summit - NXP QorIQ NFV Solutions - NXP Semiconductors
Summit 16: ARM Mini-Summit - NXP QorIQ NFV Solutions - NXP SemiconductorsSummit 16: ARM Mini-Summit - NXP QorIQ NFV Solutions - NXP Semiconductors
Summit 16: ARM Mini-Summit - NXP QorIQ NFV Solutions - NXP Semiconductors
 
Open stack gluon + opnfv netready
Open stack gluon + opnfv netreadyOpen stack gluon + opnfv netready
Open stack gluon + opnfv netready
 
OPNFV: Overview and Approach to Upstream Integration
OPNFV: Overview and Approach to Upstream IntegrationOPNFV: Overview and Approach to Upstream Integration
OPNFV: Overview and Approach to Upstream Integration
 
Summit 16: Open-O Mini-Summit - Welcome & Introduction
Summit 16: Open-O Mini-Summit - Welcome & IntroductionSummit 16: Open-O Mini-Summit - Welcome & Introduction
Summit 16: Open-O Mini-Summit - Welcome & Introduction
 
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.io
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.ioFast datastacks - fast and flexible nfv solution stacks leveraging fd.io
Fast datastacks - fast and flexible nfv solution stacks leveraging fd.io
 
Summit 16: Open-O Mini-Summit - VF Event Streaming Project Proposal
Summit 16: Open-O Mini-Summit - VF Event Streaming Project ProposalSummit 16: Open-O Mini-Summit - VF Event Streaming Project Proposal
Summit 16: Open-O Mini-Summit - VF Event Streaming Project Proposal
 
Summit 16: Open-O Mini-Summit - Open Source Evolution for Carriers
Summit 16: Open-O Mini-Summit - Open Source Evolution for CarriersSummit 16: Open-O Mini-Summit - Open Source Evolution for Carriers
Summit 16: Open-O Mini-Summit - Open Source Evolution for Carriers
 
Summit 16: ARM Mini-Summit - Intro & Overview
Summit 16: ARM Mini-Summit - Intro & OverviewSummit 16: ARM Mini-Summit - Intro & Overview
Summit 16: ARM Mini-Summit - Intro & Overview
 
Requirement analysis of vim platform reliability in a three-layer decoupling ...
Requirement analysis of vim platform reliability in a three-layer decoupling ...Requirement analysis of vim platform reliability in a three-layer decoupling ...
Requirement analysis of vim platform reliability in a three-layer decoupling ...
 
MEF's inter-domain orchestration delivering dynamic third networks [presente...
MEF's  inter-domain orchestration delivering dynamic third networks [presente...MEF's  inter-domain orchestration delivering dynamic third networks [presente...
MEF's inter-domain orchestration delivering dynamic third networks [presente...
 
Challenges in testing for composite vim platforms
Challenges in testing for composite vim platformsChallenges in testing for composite vim platforms
Challenges in testing for composite vim platforms
 

Ähnlich wie Swimming upstream: OPNFV Doctor project case study

Onos summit roadmap dec 9
Onos summit  roadmap dec 9Onos summit  roadmap dec 9
Onos summit roadmap dec 9
ONOS Project
 
Resume_Thoota_Phani (2)
Resume_Thoota_Phani (2)Resume_Thoota_Phani (2)
Resume_Thoota_Phani (2)
Phani Thoota
 

Ähnlich wie Swimming upstream: OPNFV Doctor project case study (20)

OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander DibboOpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
OpenNebulaConf2015 1.07 Cloud for Scientific Computing @ STFC - Alexander Dibbo
 
OpenStack Enabling DevOps
OpenStack Enabling DevOpsOpenStack Enabling DevOps
OpenStack Enabling DevOps
 
Onos summit roadmap dec 9
Onos summit  roadmap dec 9Onos summit  roadmap dec 9
Onos summit roadmap dec 9
 
StarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
StarlingX - A Platform for the Distributed Edge | Ildiko VancsaStarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
StarlingX - A Platform for the Distributed Edge | Ildiko Vancsa
 
Recap of OpenStack Tokyo Summit
Recap of OpenStack Tokyo SummitRecap of OpenStack Tokyo Summit
Recap of OpenStack Tokyo Summit
 
OpenStack London Meetup, 18 Nov 2015
OpenStack London Meetup, 18 Nov 2015OpenStack London Meetup, 18 Nov 2015
OpenStack London Meetup, 18 Nov 2015
 
Planning open stack-poc
Planning open stack-pocPlanning open stack-poc
Planning open stack-poc
 
Lessons learned so far in operationalizing NFV
Lessons learned so far in operationalizing NFVLessons learned so far in operationalizing NFV
Lessons learned so far in operationalizing NFV
 
Tacker vancouver project onboarding new
Tacker vancouver project onboarding newTacker vancouver project onboarding new
Tacker vancouver project onboarding new
 
OPNFV CI and Challenges: How we solved them - if we solved them at all!
OPNFV CI and Challenges: How we solved them - if we solved them at all!OPNFV CI and Challenges: How we solved them - if we solved them at all!
OPNFV CI and Challenges: How we solved them - if we solved them at all!
 
OpenStack at NTT Resonant: Lessons Learned in Web Infrastructure
OpenStack at NTT Resonant: Lessons Learned in Web InfrastructureOpenStack at NTT Resonant: Lessons Learned in Web Infrastructure
OpenStack at NTT Resonant: Lessons Learned in Web Infrastructure
 
Resume_Thoota_Phani (2)
Resume_Thoota_Phani (2)Resume_Thoota_Phani (2)
Resume_Thoota_Phani (2)
 
Faster, Higher, Stronger – Accelerating Fault Management to the Next Level
Faster, Higher, Stronger – Accelerating Fault Management to the Next LevelFaster, Higher, Stronger – Accelerating Fault Management to the Next Level
Faster, Higher, Stronger – Accelerating Fault Management to the Next Level
 
OpenStack at EBSCO
OpenStack at EBSCOOpenStack at EBSCO
OpenStack at EBSCO
 
Configuration Management Evolution at CERN
Configuration Management Evolution at CERNConfiguration Management Evolution at CERN
Configuration Management Evolution at CERN
 
Three years of OFELIA - taking stock
Three years of OFELIA - taking stockThree years of OFELIA - taking stock
Three years of OFELIA - taking stock
 
StarlingX - Project Onboarding
StarlingX - Project OnboardingStarlingX - Project Onboarding
StarlingX - Project Onboarding
 
Vishal_Resume
Vishal_ResumeVishal_Resume
Vishal_Resume
 
SDN and NFV
SDN and NFVSDN and NFV
SDN and NFV
 
Lessons Learned Running The Largest OpenStack Clouds
Lessons Learned Running The Largest OpenStack CloudsLessons Learned Running The Largest OpenStack Clouds
Lessons Learned Running The Largest OpenStack Clouds
 

Mehr von OPNFV

Being Brave: Deploying OpenStack from Master
Being Brave: Deploying OpenStack from MasterBeing Brave: Deploying OpenStack from Master
Being Brave: Deploying OpenStack from Master
OPNFV
 
Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...
Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...
Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...
OPNFV
 

Mehr von OPNFV (20)

How to Reuse OPNFV Testing Components in Telco Validation Chain
How to Reuse OPNFV Testing Components in Telco Validation ChainHow to Reuse OPNFV Testing Components in Telco Validation Chain
How to Reuse OPNFV Testing Components in Telco Validation Chain
 
Energy Audit aaS with OPNFV
Energy Audit aaS with OPNFVEnergy Audit aaS with OPNFV
Energy Audit aaS with OPNFV
 
Hands-On Testing: How to Integrate Tests in OPNFV
Hands-On Testing: How to Integrate Tests in OPNFVHands-On Testing: How to Integrate Tests in OPNFV
Hands-On Testing: How to Integrate Tests in OPNFV
 
Storage Performance Indicators - Powered by StorPerf and QTIP
Storage Performance Indicators - Powered by StorPerf and QTIPStorage Performance Indicators - Powered by StorPerf and QTIP
Storage Performance Indicators - Powered by StorPerf and QTIP
 
Big Data for Testing - Heading for Post Process and Analytics
Big Data for Testing - Heading for Post Process and AnalyticsBig Data for Testing - Heading for Post Process and Analytics
Big Data for Testing - Heading for Post Process and Analytics
 
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
Testing, CI Gating & Community Fast Feedback: The Challenge of Integration Pr...
 
How Many Ohs? (An Integration Guide to Apex & Triple-o)
How Many Ohs? (An Integration Guide to Apex & Triple-o)How Many Ohs? (An Integration Guide to Apex & Triple-o)
How Many Ohs? (An Integration Guide to Apex & Triple-o)
 
Being Brave: Deploying OpenStack from Master
Being Brave: Deploying OpenStack from MasterBeing Brave: Deploying OpenStack from Master
Being Brave: Deploying OpenStack from Master
 
Upstream Testing Collaboration
Upstream Testing Collaboration Upstream Testing Collaboration
Upstream Testing Collaboration
 
Enabling Carrier-Grade Availability Within a Cloud Infrastructure
Enabling Carrier-Grade Availability Within a Cloud InfrastructureEnabling Carrier-Grade Availability Within a Cloud Infrastructure
Enabling Carrier-Grade Availability Within a Cloud Infrastructure
 
Learnings From the First Year of the OPNFV Internship Program
Learnings From the First Year of the OPNFV Internship ProgramLearnings From the First Year of the OPNFV Internship Program
Learnings From the First Year of the OPNFV Internship Program
 
OPNFV and OCP: Perfect Together
OPNFV and OCP: Perfect TogetherOPNFV and OCP: Perfect Together
OPNFV and OCP: Perfect Together
 
The Return of QTIP, from Brahmaputra to Danube
The Return of QTIP, from Brahmaputra to DanubeThe Return of QTIP, from Brahmaputra to Danube
The Return of QTIP, from Brahmaputra to Danube
 
Improving POD Usage in Labs, CI and Testing
Improving POD Usage in Labs, CI and TestingImproving POD Usage in Labs, CI and Testing
Improving POD Usage in Labs, CI and Testing
 
Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...
Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...
Run OPNFV Danube on ODCC Scorpio Multi-node Server - Open Software on Open Ha...
 
Distributed vnf management architecture and use-cases
Distributed vnf management  architecture and use-casesDistributed vnf management  architecture and use-cases
Distributed vnf management architecture and use-cases
 
Securing your nfv and sdn integrated open stack cloud- challenges, use-cases ...
Securing your nfv and sdn integrated open stack cloud- challenges, use-cases ...Securing your nfv and sdn integrated open stack cloud- challenges, use-cases ...
Securing your nfv and sdn integrated open stack cloud- challenges, use-cases ...
 
My network functions are virtualized, but are they cloud-ready
My network functions are virtualized, but are they cloud-readyMy network functions are virtualized, but are they cloud-ready
My network functions are virtualized, but are they cloud-ready
 
Accelerated dataplanes integration and deployment
Accelerated dataplanes integration and deploymentAccelerated dataplanes integration and deployment
Accelerated dataplanes integration and deployment
 
Openstack Tacker - Moving into Pike
Openstack Tacker - Moving into PikeOpenstack Tacker - Moving into Pike
Openstack Tacker - Moving into Pike
 

Kürzlich hochgeladen

AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
VishalKumarJha10
 

Kürzlich hochgeladen (20)

Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
ManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide DeckManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide Deck
 
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptxBUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 

Swimming upstream: OPNFV Doctor project case study

  • 1. Swimming upstream - Doctor project case study Gerald Kunzmann, NTT DOCOMO Carlos Goncalves, NEC Tomi Juvonen, Nokia
  • 2. AGENDA – Introduction to OPNFV Doctor project – Detailed work flow – Pain points – Best practises for upstream contributions – Summary
  • 3. OPNFV Doctor project - Introduction • Goal: – Develop and build fault management and maintenance framework for high availability of Network Services running on top of virtualized infrastructure.  Proposed with a very clear target / key feature: – Immediate notification of unavailability of virtualized resources from VIM to Consumer • Members: – NEC (PTL: Ryota Mibu), AT&T, Cisco, Cloudbase Solutions, Corenova, Ericsson, Hephaex, Huawei, Intel, KDDI, KT, Nokia, NTT DOCOMO, Spirent, Sprint, Telecom Italia, Vmsec, ZTE • https://wiki.opnfv.org/display/doctor/
  • 4. OPNFV Doctor project – Timeline Q3 Q4 Q1 2015 Q2 Q3 Q4 Q1 2016 Q2 Q3 Q4 Q1 2017 Q2 OPNFV launch 30 Sep, 2014 Doctor creation 2 Dec, 2014 Arno release 4 May, 2015 OPNFV Summit 2015 9 Nov, 2015 Bramahputra release 1 Mar, 2016 OPNFV Summit 2016 12 Jun, 2016 Colorado release 16 Sep, 2016 OpenStack Summit Barcelona 2017 25 Oct, 2016 Danube release 4 Apr, 2017 ARNO - Requirement document BRAHMAPUTRA - Ceilometer „Immediate Notification“ - Nova „Mark Host Down“ - Functional test cases - PoC demo at OPNFV Summit - Documentation updates COLORADO - Nova: „Get valid server state“ - Integration of Congress as Doctor Inspector - Extended functional tests - PoC demo at OPNFV Summit and OpenStack Summit Barcelona - Documentation updates DANUBE - Neutron „Port Status update“ - Inspector design guidelines - Performance profiler - Documentation updates
  • 6. 1. Doctor requirements document • Use cases and scenarios – Active-Standby configuration (1+1 redundancy): • Consumer of infrastructure has configured ACT-STBY • Fault in virtualized infrastructure (NFVI)  inform the Consumer to switch to STBY instance – Prevention actions based on fault prediction: Switch to STBY in case of a predicted fault – NFVI maintenance: inform Consumer(s) of affected hardware about planned maintenance • Requirements 1. Monitor physical and virtual resources and detect problems 2. Correlate faults and identify affected virtual resources 3. Notification of Consumer(s) of affected virtual resources 4. Execute steps 1-3 in less than e.g. 1 second to avoid service disruption Consumer C1 Consumer C2 Consumer C3 Virtualized Infrastructure Manager (VIM), e.g. OpenStack Resource Map Server – VM mapping Server S1 VM-1, VM-2 Server S2 VM-7 Server S3 VM-4 Ownership information VM-1, VM-7 Consumer C1 VM-2 Consumer C2 VM-4 Consumer C3 Resource Pool Hypervisor Hardware Server S1 VM-1 Hypervisor Hardware Server S2 Hypervisor Hardware Server S3 VM-2 VM-7 VM-4 X 1. Fault Monitoring - Hardware fault - Hypervisor fault - Host OS fault 6. Execute Instruction - e.g. migrate VM 2. Inform the Consumer? If YES, find owner of affected VMs from database OpenStack Northbound Interface 3. FaultNotification (VM ID, Fault ID) 5. Instruction (VM ID) 4. Switch to SBY configuration
  • 7. Doctor architecture and OpenStack mapping Monitor Notifier Manager / Consumer Virtualized Infrastructure (Resource Pool) Alarm Conf. 3. Update State 2. Find Affected Application Controller Controller Controller Resource Map 1. Raw Failure Inspector 4. Notify all 5. Notify Error 0. Set Alarm 6. Action Failure Policy Monitor Monitor Aodh Vitrage e.g. Zabbix Cinder Neutron Nova Congress
  • 8. Resource state – missing feature To be: • Nova API shall support to change nova-compute state • User shall be able to read OpenStack states and trust they are correct As is (Kilo release): • When a VM goes down due to a host HW, host OS or hypervisor failure, nothing happens in OpenStack. The VMs of a crashed host/hypervisor are reported to be live and OK through the OpenStack API. • nova-compute state might change too slowly or the state is not reliable if expecting also VMs to be down. 3. Gap analysis and solution brainstorming Solution brainstorming: • Discussion with experts on best way to address the gap •  BP spec for „mark nova-compute down“ Immediate notification – deficiency in operation To be: • VIM to immediately notify unavailability of virtual resource to VIM user. • User shall only receive fault notifications related to owned resource(s). As is (Kilo release): • OpenStack Metering ‘Ceilometer’ can notify unavailability of resource. • Due to innerworking of Ceilometer (polling of events), notification of faults takes seconds to few minutes (depending on polling interval). • Performance issue for Ceilometer in medium to large scale deployments.
  • 9. 4. Reach upstream Requirement Draft spec; review internally Submit upstream
  • 10. 5. Test cases and user manual • End to end test cases – Upstream: unit tests and scope-restricted functional tests upstream – Downstream: E2E functional tests will validate full systems integration • Manuals – How to use implemented blueprint – How it plays within the framework – How to run the tests and interpret results – Greatly promotes technology adoption!
  • 11. 6. PoCs, demos and hackfests Make people aware of the work Collect feedback Find supporters Work upstream Integrate and validate
  • 12. 7. Upstream achievements Project Blueprint Spec Drafter Lead Developer Status Aodh Event Alarm Evaluator Ryota Mibu (NEC) Ryota Mibu (NEC) Completed (Liberty) Nova New nova API call to mark nova-compute down Tomi Juvonen (Nokia) Roman Dobosz (Intel) Completed (Liberty) Support forcing service down Tomi Juvonen (Nokia) Carlos Goncalves (NEC) Completed (Liberty) Get valid server state Tomi Juvonen (Nokia) Tomi Juvonen (Nokia) Completed (Mitaka) Add notification for service status change Balazs Gibizer (Ericsson) Balazs Gibizer (Ericsson) Completed (Mitaka) Congress Push Type Datasource Driver Masahito Muroi (NTT) Masahito Muroi (NTT) Completed (Mitaka) Adds Doctor Driver Masahito Muroi (NTT) Masahito Muroi (NTT) Completed (Mitaka) Neutron Port data plane status Carlos Goncalves (NEC) Carlos Goncalves (NEC) Completed (Pike)
  • 13. 8. Summary: typical work flow Requirement Architecture & Gaps Solution & review internally Reach upstream Integrate, test and document Present demos, collect feedback
  • 14. Pain points •North America, Europe, Asia Time zone •OPNFV has handful of installers •Each use different configuration management tools (e.g. Puppet, Ansible, Charm) •Different default configurations (e.g. Ceilometer/Aodh, Nova) Installers •Internal and cross-project meetings •Release planning and deliverables •Alignment and collaboration with relevant SDOs (e.g. ETSI NFV) Overhead •OPNFV ships ~6-month old OpenStack release •Need to backport bugs and features •Address each installer individually Backports •Making sure system integration’s in place •End to end scenarios require great knowledge across several upstream projectsTesting •Sometimes forsaken Documentation
  • 16. Contribute to a Project • If you want to have something done, you should do it yourself • Contribute to OpenStack project you are planning to have your change – You will gain respect in the community and people get to know you – You get to understand how everything works • Project on-boarding sessions 5/19/2017 16
  • 17. From OPNFV Requirements to a Spec • Explain your problem well and have a good material linked to tell things in more detail • Make use cases carefully • Try to find the benefit for a wider audience • Use common-ground terminology rather than NFV-specific • Make proposal for the implementation. Review will shape it right 5/19/2017 17
  • 18. Discuss Your Change • Join openstack-dev mailing list, Project IRC and weekly meetings • OpenStack has PTG just before the start of the next release cycle. There each project team meets face-to-face to actually work on the new features, not just talk. There are also cross- project sessions • OpenStack summit is arranged 2 months after release cycle starts. There is now a forum with the opportunity to discuss requirements with the project and cross-project development teams for the current +1 release (now Queens' Forum). For the first time, everyone will have the opportunity to discuss requirements face to face, in time to get them into the current +1 release 5/19/2017 18
  • 19. Need a Wider View • Email to OpenStack ops and dev mailing list • Telco reviewers • Talk with operators – IRC meetings – OpenStack Operators Telco and NFV Working Group – OpenStack Operators Mid-Cycle – OpenStack Operators sessions in Forum 5/19/2017 19
  • 20. Pike Release Timeline - Steps With Your Change → Just before the next release cycle starts, have your idea ready in your head and start to draft the spec. Join the PTG if necessary to discuss face-to-face with the project. Start making the code if you didn’t already → Pike-1: You need to have the spec merged. Next you should have the first version of your code ready for review. Note! Project cores do not have much time, make all the test cases and code as good as you can to have it trough reviews in time. There is usually earlier similar patches as an example → Pike-2: Earlier non-priority code freeze, but more relaxed in current release → Pike-3: Feature freeze. Everything have to be merged before this date → Pike release: Might drop some follow up patches 5/19/2017 20 (Feb 20) (Apr 10) (Jun 05) (Jul 24) (Aug 28)
  • 21. Lessons Learned • Be prepared • POC code • Scope it right • Discuss and communicate; Avoid wasted effort 5/19/2017 21

Hinweis der Redaktion

  1. NOVA MARK HOST DOWN BP (Tomi, Roman, Carlos) Have your proposals reviewed internally first Listen to other point of views and converge Smooths out lot of technical and editorial issues Be more inclusive Listen to other point of views and converge Be willing to make compromises Use common-ground terminology rather than NFV-specific Which benefits does your proposal bring to non-NFV users? Formulate solid use cases covering heterogeneous environments Higher chances of acceptance and engineering resources