Swarming and Cynefin
SRVision 2019, Utrecht
Jon Hall
Principal Product Manager, Digital Service Management
© Copyright 2019 BMC Software, Inc.
@jonhall_
Recap: Tiered support
[Diagram: Level 1 Support, Level 2 Support and Level 3 Specialists, with escalation between tiers]
Swarming involves removing the tiers
of support, and calling on the collective
expertise of a “swarm” of analysts.
https://www.serviceinnovation.org/intelligent-swarming/
Swarming defined
Swarming example: BMC’s Sev-1 and Dispatch Swarms
[Diagram: Prioritise, Severity 1 Swarm, Local Dispatch Swarm, Local Product Line Support Teams]
• Rapid responders
• Three agents on a scheduled one-week rotation
• Primary focus: Provide immediate response, and resolve as soon as
possible
Swarm lead: Communications
Other members: Research, coordinate, test
Severity 1 Swarm
• “Cherry pickers”
• Meet every 60-90 minutes
• Primary focus: Can new tickets be resolved immediately?
• Also: Validation of ticket details before assignment to specialists
[Diagram: Experienced analyst, Less-experienced analyst]
Dispatch Swarm
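The Dispatch Swarm's two questions can be read as a simple triage rule. The sketch below is hypothetical: the `Ticket` fields and the three outcome labels are invented for illustration and do not describe BMC's actual tooling.

```python
from dataclasses import dataclass


@dataclass
class Ticket:
    summary: str
    has_required_details: bool  # hypothetical flag: ticket details were validated
    known_fix: bool             # hypothetical flag: the swarm recognises a quick fix


def dispatch(ticket: Ticket) -> str:
    """Return an illustrative outcome label for a newly arrived ticket."""
    # Primary focus: can the ticket be resolved immediately?
    if ticket.known_fix:
        return "resolve now"
    # Secondary focus: validate details before a specialist sees the ticket.
    if not ticket.has_required_details:
        return "request details"
    return "assign to specialists"
```

Checking for an immediate fix first mirrors the slide's ordering, where validation is the secondary ("Also") activity.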
Swarming example: BMC’s “Backlog Swarms”
[Diagram: Backlog Swarms and Local Product Line Support Teams]
• Global fixers of troublesome tickets
• Meet regularly (often several times a day)
• Primary focus: Challenging 3rd-line tickets
• Replace reassignments and individual assignments
Experienced analysts, R&D Engineers
Backlog Swarms
Swarming Example: Drop-in SME support for Service Desk
[Diagram: Service Desk Agents handling customer chat sessions, with chat channels to Subject Experts]
• Regional chat-based service desk at a global Telco
• Agents can put customer on hold for 3 minutes
• Subject experts wait in “always-on” chat channels
Swarming Example: Auto manufacturer’s connected cars team
Challenge: Scaling from small beginnings to millions of vehicles
[Diagram: First Responder, Engineering Team A, Engineering Team B, 3rd Party Suppliers]
• First responder initiates and coordinates swarms for big issues
• Other teams have 1 person on rotation for swarming
• Swarms may also involve 3rd parties (e.g. Amazon, Microsoft)
• Swarm grows and shrinks as needed
Swarming Example: “Always-on” Swarming
Scenario: Government agency with a growing DevOps initiative
Before transformation…
[Diagram: Application1, Application2; Developers, Support Specialists, Operations Team]
• Traditional tiered teams for Operations and Support
• Common pool of developers, assigned and reassigned to tasks across multiple projects
Swarming Example: “Always-on” Swarming
Scenario: Government agency with a growing DevOps initiative
After transformation…
[Diagram: Application1, Application2; Developer, Operator, Support Specialist]
• Product, not project thinking
• Team leaders have autonomy to create and change teams
• Support professionals embedded in full-stack teams
• Work-in-progress queues
• Asynchronous communication
• Single role teams
• Individual over-exposure
• Lack of knowledge sharing
How to annoy a DevOps practitioner
DevOps is mainstream. Sample speakers from DevOps Enterprise 2018:
Deployment frequency: 46x higher
Change lead time: 2555x faster
Mean time to recover: 2604x faster
Change failure rate: 7x lower
ITSM is under significant pressure from DevOps…
2018 State of DevOps Report
But… Service Management has a lot to offer to DevOps
• New services and applications suddenly appear
• Lost visibility when issues go to developers
• Lack of knowledge sharing
• New kinds of customer, especially external
DevOps challenges Service Desk orthodoxies…
• Scaling customer support
• Understanding the context of an issue
• Adaptation to life “on call”
• What to prioritise? Fix bugs or build new stuff?
• How to process alerts, particularly if noisy/low-quality.
…but enterprise realities challenge DevOps
DevOps teams aren’t as ITSM-phobic as some think
“I need to understand drifts, timelines…”
“The person who is on call at 4am needs to know who has been doing what”
“Context is a trigger word for me... in a company of 4000 people, things can get out of hand really fast if you don't have context”
“What is actually running on an environment?”
“If you're dropped in the middle of something, how did you get here?”
(Real quotes from conversations at Configuration Management Camp, Ghent)
“The enterprise space doesn’t move slowly
because they’re stupid, or they hate technology.
It’s because they have users”
Luke Kanies, Puppet Founder, Configuration Management Camp 2015, Belgium.
Swarming aligns really well to DevOps
• Autonomy and self-organisation
• Knowledge transfer and skills development
• ChatOps, not email
• Prevention of accumulation of queued work
• Protection of individuals from burnout
We face an issue:
The tiered support system constrains ITSM’s
ability to adapt to new practices and thinking.
• Pronounced “kuh-nev-in”
• Developed by Dave Snowden while at IBM in 1999
• “A decision support framework which comes from a
mixture of complexity theory and cognitive
science… the opposite of a one-size fits all model”
Cynefin: An example of new thinking
• Obvious and Complicated domains:
• Repeating relationship between cause and effect
• With Complicated you need to do analysis to find
that relationship
• Complex domain:
• Understanding the problem requires
experimentation and analysis.
• May, over time, be able to move to Complicated
• Chaotic domain:
• Dramatic and unconstrained
• Focus on damage limitation, try to move to
another domain
Cynefin “Domains” – an overview
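As a rough illustration of the overview above, the four domains and their decision sequences can be written as a small lookup. This is a sketch, not part of Cynefin itself: the `classify` function and its three boolean flags are hypothetical simplifications introduced here, and "Act, Sense, Respond" is the usual Cynefin shorthand for the Chaotic advice to act immediately and observe the impact.

```python
from enum import Enum


class Domain(Enum):
    OBVIOUS = "obvious"
    COMPLICATED = "complicated"
    COMPLEX = "complex"
    CHAOTIC = "chaotic"


# Decision sequence associated with each domain.
DECISION_SEQUENCE = {
    Domain.OBVIOUS: ("Sense", "Categorise", "Respond"),
    Domain.COMPLICATED: ("Sense", "Analyse", "Respond"),
    Domain.COMPLEX: ("Probe", "Sense", "Respond"),
    Domain.CHAOTIC: ("Act", "Sense", "Respond"),
}


def classify(cause_effect_repeats: bool, needs_analysis: bool, unconstrained: bool) -> Domain:
    """Hypothetical triage rule; the three flags are illustrative assumptions."""
    if unconstrained:
        # Dramatic and unconstrained: focus on damage limitation.
        return Domain.CHAOTIC
    if not cause_effect_repeats:
        # No repeating cause-and-effect relationship: experiment to learn.
        return Domain.COMPLEX
    # Cause and effect repeat; analysis is needed only in Complicated.
    return Domain.COMPLICATED if needs_analysis else Domain.OBVIOUS
```

In practice, deciding which domain an issue belongs to is a human judgement; the flags only make the distinctions in the bullets explicit.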
“Obvious” Domain
• “Sense, Categorise, Respond”
• Can apply best practice
• Template/knowledge-driven resolution
• Self service
“Complicated” Domain
• “Sense, Analyse, Respond”
• Good practice.
• Dispatch-type swarm – pair up agents with varied experience
• Capture detailed knowledge for organizational learning
• Suits a “Dispatch Swarm” type approach?
[Diagram: Swarm Lead and Swarm Assistant]
• Not acting is not an option: act immediately, observe impact
• Try to move from Chaotic to Complex by introducing constraints
• Chaos may be an opportunity to innovate
[Diagram: Response Lead; Customer Liaison; Damage limitation/restoration; Innovation; Planned Response]
Swarming in response to a Chaotic situation
Enterprise systems are complex
The impact of Complexity
Charity Majors - Observability for emerging infra
Config Management Camp, Ghent 2018
“Distributed systems have an infinite list of almost impossible failure scenarios”
Some Complexity theory…
• Complex systems contain mixtures of latent failures
• It’s impossible not to have multiple flaws
• The failures change constantly
• Complex systems run as broken
• Operating complex systems needs human expertise
• Issues have multiple causes, not a single root-cause
“How Complex Systems Fail” (1998) - Richard I. Cook, MD
Cognitive Technologies Laboratory, University of Chicago
Complex systems fail in complex ways
“All twenty app services have 10% of nodes enter a simultaneous crash
loop cycle, about five times a day, at unpredictable intervals.
It clears up before we can debug it, every time”
“We run a platform, and it’s hard to distinguish between problems that
users are inflicting on themselves, and problems in our own code,
since they all manifest as the same errors or timeouts”.
“I have 20 microservices and three datastores across three regions, and
everything seems to be getting a little slower over the past 2 weeks
…but nothing has changed that we know of.
Latency is usually back to the historical norm on Tuesdays”
Who would you assign to?
Charity Majors - Observability for emerging infra
Config Management Camp, Ghent 2018
Cynefin approach to a Complex issue
• “Probe, Sense, Respond”
• Identify multiple hypotheses
• Gain understanding of the system by interacting with it
• Create predictability, increase constraints, try to move to Complicated
Identify “coherent” hypotheses
Convene “safe to fail” experiments
Observe and monitor impact
Amplify good patterns, dampen bad
[Diagram: swarm membership changing across the four phases; members shown include Swarm Lead, Assistant Lead, Network Specialist, Developer, Vendor Agent and Server Technician]
1. Initiate Analysis
• Detect complexity
• Clarify context
• Identify initial team
2. Establish Theories
• Gather information
• Form hypotheses
• Identify subgroups
3. Experiment and observe
• Parallel safe-to-fail experiments
• Observe and measure
• Amplify or dampen outcomes
4. Respond
• Assemble resolution team
• Release non-necessary people
• Resolve issue, document steps
Swarming in response to complex issue
“Probe, Sense, Respond”
This could not work in a siloed, tiered structure!
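Phases 3 and 4 above can be sketched in a few lines of Python. Everything named here is an assumption for illustration: the `Experiment` type, the idea that each probe reports a change in error rate, and the sample hypotheses. The experiments run one after another for simplicity, whereas the slide calls for running them in parallel.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Experiment:
    hypothesis: str
    # Hypothetical probe: returns the observed change in error rate
    # (a negative value means the situation improved).
    run: Callable[[], float]


def experiment_and_observe(experiments: List[Experiment]) -> Dict[str, str]:
    """Phase 3: run each safe-to-fail experiment, then amplify or dampen."""
    decisions = {}
    for exp in experiments:
        delta = exp.run()
        decisions[exp.hypothesis] = "amplify" if delta < 0 else "dampen"
    return decisions


def respond(swarm: List[str], needed: List[str]) -> List[str]:
    """Phase 4: keep the resolution team, release everyone else."""
    return [person for person in swarm if person in needed]


# Illustrative data only; the hypotheses and observed values are made up.
swarm = ["Swarm Lead", "Assistant Lead", "Network Specialist", "Developer", "Vendor Agent"]
results = experiment_and_observe([
    Experiment("connection pool too small", lambda: -0.4),
    Experiment("stale DNS cache", lambda: 0.1),
])
team = respond(swarm, ["Swarm Lead", "Developer"])
```

The point of the sketch is the shape of the loop: several cheap probes, a measured outcome for each, and a swarm that shrinks once the resolution path is clear.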
The way forward
• ITSM must adapt to retain relevance and credibility
• Over-constrained, inflexible practices will stifle this adaptation
• ITIL® v4 is a good step forward: giving more room to develop new
approaches to practices
• It’s a good time to be an innovative thinker
Swarming appearing in ITSM frameworks
ITIL® 4 Foundation (2019)
VeriSM – A service management
approach for the digital age (2017)
Some more information
(I’ve just tweeted these links)
Consortium for Service Innovation: Intelligent Swarming
serviceinnovation.org/intelligent-swarming
Boss Level Podcast: Dave Snowden on Cynefin (@snowded)
http://www.bosslevelpodcast.com/dave-snowden-on-complexity-theory-and-astrology/
Long-form blog on why Swarming works better for DevOps
http://medium.com/@jonhall_
Charity Majors at #cfgmgmtcamp: Observability for emerging infra (@mipsytipsy)
https://www.youtube.com/watch?v=fOdtgHu_KeA
THANK YOU

SRVision 2019, Utrecht: Swarming and Cynefin

  • 1. Swarming and Cynefin. SRVision 2019, Utrecht. Jon Hall (@jonhall_), Principal Product Manager, Digital Service Management. © Copyright 2019 BMC Software, Inc.
  • 2. Recap: Tiered support. Diagram: Level 1 Support → Escalation → Level 2 Support → Escalation → Level 3 Specialists.
  • 3. Swarming defined: Swarming involves removing the tiers of support, and calling on the collective expertise of a “swarm” of analysts. https://www.serviceinnovation.org/intelligent-swarming/
  • 4. Swarming example: BMC’s Sev-1 and Dispatch Swarms. Diagram labels: Prioritise, Severity 1 Swarm, Local Dispatch Swarm, Local Product Line Support Teams.
  • 5. Severity 1 Swarm • Rapid responders • Three agents on a scheduled one-week rotation • Primary focus: Provide immediate response, and resolve as soon as possible • Roles: Swarm lead (communications); other members (research, coordinate, test)
  • 6. Dispatch Swarm • “Cherry pickers” • Meet every 60-90 minutes • Primary focus: Can new tickets be resolved immediately? • Also: Validation of ticket details before assignment to specialists • Members: Experienced analyst, less-experienced analyst
  • 7. Swarming example: BMC’s “Backlog Swarms”. Diagram labels: Local Product Line Support Teams, Backlog Swarm.
  • 8. Backlog Swarms • Global fixers of troublesome tickets • Meet regularly (often several times a day) • Primary focus: Challenging 3rd-line tickets • Replace reassignments and individual assignments • Members: Experienced analysts, R&D Engineers
  • 9. Swarming example: Drop-in SME support for Service Desk • Regional chat-based service desk at a global Telco • Agents can put customer on hold for 3 minutes • Subject experts wait in “always-on” chat channels. Diagram labels: Customer chat sessions, Service Desk Agents, Chat channels, Subject Experts.
  • 10. Swarming example: Auto manufacturer’s connected cars team. Challenge: Scaling from small beginnings to millions of vehicles • First responder initiates and coordinates swarms for big issues • Other teams have 1 person on rotation for swarming • Swarms may also involve 3rd parties (e.g. Amazon, Microsoft) • Swarm grows and shrinks as needed. Diagram labels: First Responder, Engineering Team A, Engineering Team B, 3rd Party Suppliers.
  • 11. Swarming example: “Always-on” Swarming. Scenario: Government agency with a growing DevOps initiative. Before transformation… • Traditional tiered teams for Operations and Support • Common pool of developers, assigned and reassigned to tasks across multiple projects. Diagram labels: Application 1, Application 2, Developers, Operations Team, Support Specialists.
  • 12. Swarming example: “Always-on” Swarming. Scenario: Government agency with a growing DevOps initiative. After transformation… • Product, not project thinking • Team leaders have autonomy to create and change teams • Support professionals embedded in full-stack teams. Diagram labels: Application 1, Application 2, Developer, Operator, Support Specialist.
  • 13. How to annoy a DevOps practitioner • Work-in-progress queues • Asynchronous communication • Single role teams • Individual over-exposure • Lack of knowledge sharing
  • 14. DevOps is mainstream. Sample speakers from DevOps Enterprise 2018:
  • 15. ITSM is under significant pressure from DevOps… • Deployment frequency: 46x higher • Change lead time: 2555x faster • Mean time to recover: 2604x faster • Change failure rate: 7x lower (2018 State of DevOps Report)
  • 16. But… Service Management has a lot to offer to DevOps (2018 State of DevOps Report)
  • 17. DevOps challenges Service Desk orthodoxies… • New services and applications suddenly appear • Lost visibility when issues go to developers • Lack of knowledge sharing • New kinds of customer, especially external
  • 18. …but enterprise realities challenge DevOps • Scaling customer support • Understanding the context of an issue • Adaptation to life “on call” • What to prioritise? Fix bugs or build new stuff? • How to process alerts, particularly if noisy/low-quality
  • 19. DevOps teams aren’t as ITSM-phobic as some think • “I need to understand drifts, timelines…” • “The person who is on call at 4am needs to know who has been doing what” • “Context is a trigger word for me... in a company of 4000 people, things can get out of hand really fast if you don't have context” • “What is actually running on an environment?” • “If you're dropped in the middle of something, how did you get here?” (Real quotes from conversations at Configuration Management Camp, Ghent)
  • 20. “The enterprise space doesn’t move slowly because they’re stupid, or they hate technology. It’s because they have users.” Luke Kanies, Puppet Founder, Configuration Management Camp 2015, Belgium.
  • 21. Swarming aligns really well to DevOps • Autonomy and self-organisation • Knowledge transfer and skills development • ChatOps, not email • Prevention of accumulation of queued work • Protection of individuals from burnout
  • 22. We face an issue: The tiered support system constrains ITSM’s ability to adapt to new practices and thinking.
  • 23. Cynefin: An example of new thinking • Pronounced “kuh-nev-in” • Developed by Dave Snowden while at IBM in 1999 • “A decision support framework which comes from a mixture of complexity theory and cognitive science… the opposite of a one-size fits all model”
  • 24. Cynefin “Domains”: an overview • Obvious and Complicated domains: repeating relationship between cause and effect; with Complicated you need to do analysis to find that relationship • Complex domain: understanding the problem requires experimentation and analysis; may, over time, be able to move to Complicated • Chaotic domain: dramatic and unconstrained; focus on damage limitation, try to move to another domain
  • 25. “Obvious” Domain • “Sense, Categorise, Respond” • Can apply best practice • Template/knowledge-driven resolution • Self service
  • 26. “Complicated” Domain • “Sense, Analyse, Respond” • Good practice • Dispatch-type swarm: pair up agents with varied experience • Capture detailed knowledge for organizational learning • Suits a “Dispatch Swarm” type approach? • Roles: Swarm Lead, Swarm Assistant
  • 27. Swarming in response to a Chaotic situation • Not acting is not an option: act immediately, observe impact • Try to move from Chaotic to Complex by introducing constraints • Chaos may be an opportunity to innovate. Diagram labels: Response Lead, Customer Liaison, Damage limitation/restoration, Innovation, Planned Response.
  • 28. Enterprise systems are complex
  • 29. The impact of Complexity: “Distributed systems have an infinite list of almost impossible failure scenarios” (Charity Majors, Observability for emerging infra, Config Management Camp, Ghent 2019)
  • 30. Some Complexity theory… • Complex systems contain mixtures of latent failures • It’s impossible not to have multiple flaws • The failures change constantly • Complex systems run as broken • Operating complex systems needs human expertise • Issues have multiple causes, not a single root-cause. Source: “How Complex Systems Fail” (1998), Richard I. Cook, MD, Cognitive Technologies Laboratory, University of Chicago
  • 31. Complex systems fail in complex ways. Who would you assign to? • “All twenty app services have 10% of nodes enter a simultaneous crash loop cycle, about five times a day, at unpredictable intervals. It clears up before we can debug it, every time” • “We run a platform, and it’s hard to distinguish between problems that users are inflicting on themselves, and problems in our own code, since they all manifest as the same errors or timeouts” • “I have 20 microservices and three datastores across three regions, and everything seems to be getting a little slower over the past 2 weeks …but nothing has changed that we know of. Latency is usually back to the historical norm on Tuesdays” (Charity Majors, Observability for emerging infra, Config Management Camp, Ghent 2018)
  • 32. Cynefin approach to a Complex issue • “Probe, Sense, Respond” • Identify multiple hypotheses • Gain understanding of the system by interacting with it • Create predictability, increase constraints, try to move to Complicated. Cycle: Identify “coherent” hypotheses → Convene “safe to fail” experiments → Observe and monitor impact → Amplify good patterns, dampen bad
  • 33. Swarming in response to complex issue: “Probe, Sense, Respond” • 1. Initiate Analysis: detect complexity; clarify context; identify initial team • 2. Establish Theories: gather information; form hypotheses; identify subgroups • 3. Experiment and observe: parallel safe-to-fail experiments; observe and measure; amplify or dampen outcomes • 4. Respond: assemble resolution team; release non-necessary people; resolve issue, document steps. This could not work in a siloed, tiered structure! Diagram labels: Swarm Lead, Assistant Lead, Network Specialist, Developer, Vendor Agent, Server Technician.
  • 34. The way forward • ITSM must adapt to retain relevance and credibility • Over-constrained, inflexible practices will stifle this adaptation • ITIL® v4 is a good step forward: giving more room to develop new approaches to practices • It’s a good time to be an innovative thinker
  • 35. Swarming appearing in ITSM frameworks • ITIL® 4 Foundation (2019) • VeriSM: A service management approach for the digital age (2017)
  • 36. Some more information (I’ve just tweeted these links) • Consortium for Service Innovation, Intelligent Swarming: serviceinnovation.org/intelligent-swarming • Boss Level Podcast, Dave Snowden on Cynefin (@snowded): http://www.bosslevelpodcast.com/dave-snowden-on-complexity-theory-and-astrology/ • Long-form blog on why Swarming works better for DevOps: http://medium.com/@jonhall_ • Charity Majors at #cfgmgmtcamp, Observability for emerging infra (@mipsytipsy): https://www.youtube.com/watch?v=fOdtgHu_KeA
  • 37. THANK YOU. © Copyright 2019 BMC Software, Inc.
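
The domain descriptions on slides 24 to 33 can be read as a lookup from Cynefin domain to response sequence and swarm style. The following is a minimal Python sketch of that reading, not something shown in the talk. The three yes/no triage questions in `classify` and all names (`Domain`, `Approach`, `APPROACHES`) are hypothetical, and the "act, sense, respond" sequence for the Chaotic domain comes from the Cynefin framework itself, since the slides only say "act immediately, observe impact".

```python
from dataclasses import dataclass
from enum import Enum


class Domain(Enum):
    OBVIOUS = "obvious"
    COMPLICATED = "complicated"
    COMPLEX = "complex"
    CHAOTIC = "chaotic"


@dataclass(frozen=True)
class Approach:
    sequence: tuple   # the three-step response pattern for the domain
    handling: str     # the kind of support response the slides suggest


# Mapping taken from slides 25-27 and 32-33; wording is paraphrased.
APPROACHES = {
    Domain.OBVIOUS: Approach(("sense", "categorise", "respond"),
                             "template/knowledge-driven resolution or self service"),
    Domain.COMPLICATED: Approach(("sense", "analyse", "respond"),
                                 "dispatch-type swarm pairing agents of varied experience"),
    Domain.COMPLEX: Approach(("probe", "sense", "respond"),
                             "multi-skilled swarm running parallel safe-to-fail experiments"),
    Domain.CHAOTIC: Approach(("act", "sense", "respond"),
                             "immediate damage-limitation swarm"),
}


def classify(situation_stable: bool, cause_known: bool,
             cause_findable_by_analysis: bool) -> Domain:
    """Hypothetical triage questions; a real assessment is a judgment call."""
    if not situation_stable:
        return Domain.CHAOTIC
    if cause_known:
        return Domain.OBVIOUS
    if cause_findable_by_analysis:
        return Domain.COMPLICATED
    return Domain.COMPLEX


if __name__ == "__main__":
    domain = classify(situation_stable=True, cause_known=False,
                      cause_findable_by_analysis=False)
    approach = APPROACHES[domain]
    print(domain.value, approach.sequence, approach.handling)
```

A table like this is only a prompt for discussion: the framework stresses that issues can move between domains, so the classification would need to be revisited as the swarm learns more.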

Editor's notes

  1. https://www.slideshare.net/JonHall7/is-devops-really-changing-it-support
  2. Teams map to Applications. Each team is fully responsible for an Application. Teams form around applications. Team Leaders determine who’s in the team and what they do.
  3. No more assignment to individuals
  4. Cynefin signifies the multiple factors in our environment and our experience that influence us in ways we can never understand
  5. Linux, Apache, MySQL, PHP. Experiments should be parallel; otherwise, because you’re doing something novel, it is likely to be seen to be successful. Experiments might be naïve. Enabling constraints: channel activity, focus it, enable people to do what they wouldn’t normally. Dispositional, not causal. Can make statements about the present.
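
The notes say experiments should run in parallel, and slide 33 lists "observe and measure" followed by "amplify or dampen outcomes". As a rough illustration only, the sketch below runs several probes concurrently and labels each result against a baseline. Everything here is an assumption made for the example: the function name, the choice of a single numeric metric where lower is better (for instance latency in milliseconds), and the probe names and values, which are invented and not from the talk.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict


def run_parallel_experiments(experiments: Dict[str, Callable[[], float]],
                             baseline: float) -> Dict[str, str]:
    """Run every probe concurrently, then label each outcome.

    Each probe returns one measurement where lower is better (an assumption).
    Results better than the baseline are marked "amplify", the rest "dampen".
    """
    with ThreadPoolExecutor() as pool:
        # Submit all probes before collecting any result, so they overlap in time.
        futures = {name: pool.submit(probe) for name, probe in experiments.items()}
        results = {name: future.result() for name, future in futures.items()}
    return {name: "amplify" if value < baseline else "dampen"
            for name, value in results.items()}


if __name__ == "__main__":
    # Invented probes with fixed return values, standing in for real measurements.
    example_probes = {
        "restart_cache_tier": lambda: 180.0,
        "pin_dns_resolver": lambda: 260.0,
    }
    print(run_parallel_experiments(example_probes, baseline=220.0))
```

In practice each probe would have to be genuinely safe to fail, and a single threshold is a simplification of the "dispositional, not causal" judgment the notes describe.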