Video and slides synchronized, mp3 and slide download available at URL http://bit.ly/2xNcGlf.
Nicky Wrightson gives some practical insight into how to handle failure in today's more complex distributed microservice systems. This includes looking at approaches to resiliency, understanding a system, understanding the requirements for fault tolerance, and the developers' mindset necessary for this. She shares real-world examples, and an occasional war story along the way too. Filmed at qconlondon.com.
Nicky Wrightson is a principal engineer working at River Island. She passionately drives forward cloud native architectures and approaches that allow engineers to deliver business value quickly whilst also reducing the support overhead needed for complex distributed systems.
2. InfoQ.com: News & Community Site
Watch the video with slide
synchronization on InfoQ.com!
https://www.infoq.com/presentations/
microservices-failure-insights/
• Over 1,000,000 software developers, architects and CTOs read the site world-
wide every month
• 250,000 senior developers subscribe to our weekly newsletter
• Published in 4 languages (English, Chinese, Japanese and Brazilian
Portuguese)
• Post content from our QCon conferences
• 2 dedicated podcast channels: The InfoQ Podcast, with a focus on
Architecture and The Engineering Culture Podcast, with a focus on building
• 96 deep dives on innovative topics packed as downloadable emags and
minibooks
• Over 40 new content items per week
3. Purpose of QCon
- to empower software development by facilitating the spread of
knowledge and innovation
Strategy
- practitioner-driven conference designed for YOU: influencers of
change and innovation in your teams
- speakers and topics driving the evolution and innovation
- connecting and catalyzing the influencers and innovators
Highlights
- attended by more than 12,000 delegates since 2007
- held in 9 cities worldwide
Presented at QCon London
www.qconlondon.com
11. @nickywrightson
2014
Consumers add a
caching layer to
protect against our
outages
2019
Out of hours calls
to 3rd line have all
but disappeared
2018
Migration to
Kubernetes
completed
2017
Our services were
given an SLA of
15mins recovery
time
20. @nickywrightson
“The quality of a system will appear to
be declining unless it is rigorously
maintained”
Lehmans Laws of Software
Evolution
“Declining Quality” (1996)
1
21. @nickywrightson
As system evolves, its complexity
increases unless work is done to
maintain or reduce it
Lehmans Laws of Software
Evolution cont.
"Increasing Complexity" (1974)
1
79. @nickywrightson
Resources
Testing Microservices, the sane way by Cindy Sridharan
https://medium.com/@copyconstruct/testing-microservices-the-sane-
way-9bb31d158c16
Microservices trade offs by Martin Fowler
https://martinfowler.com/articles/microservice-trade-offs.html
https://medium.com/netflix-techblog/vizceral-open-source-acc0c32113fe
80. Watch the video with slide
synchronization on InfoQ.com!
https://www.infoq.com/presentations/
microservices-failure-insights/