SlideShare ist ein Scribd-Unternehmen logo
1 von 35
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited.
SHOW ME
THE
METRICS James Brooks
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited.
Born in 1999
UK, Romania, Portugal, Ireland, Malta, Gibraltar, USA
Engineering Blog: www.betsandbits.com
700+ Engineers and Growing
Exchange / Sportsbook /Games
~750k active users
~70M daily transactions
~2.5Bn daily API calls
~100k/s Monitoring Points
E2E P95 transaction times <4ms
One of UK’s most successful .coms with ~£2Bn Market Cap
76
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited.
OpenTSDB &
OpenTSP at Betfair
77
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 78
What is OpenTSDB?
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 79
What is OpenTSDB?
A database optimised for
storing and querying Time
Series Metrics
os.filesystem.size 1413978032 912234340
host=fe-108 cluster=bf_c mount=/ type=Free
Name Timestamp Value
metadata
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 80
OpenTSDB Architecture
Hadoop
• TSDB Read
• TSDB Write
Frontend
Server
• TSDB Read
• TSDB Write
Frontend
Server
• TSDB Read
• TSDB Write
Frontend
Server
LoadBalancer
Metrics
Queries
TSDB
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 81
OpenTSDB Architecture
Metrics
Queries
Original
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 82
OpenTSDB Architecture (ctd)
TSDB
Source :http://mo.nitor.me/hadoop-hbase-opentsdb/
Original
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 84
OpenTSDB Architecture (ctd)
Metrilyx
TSDB
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 85
Original
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 86
OpenTSDB Architecture (ctd)
Metrilyx
TSDB Visualiser
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 87
tcollector
collect-os
collect-jmx
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 88
OpenTSDB Architecture (ctd)
TSDB
tcollector
collect-os
collect-jmx
tcollector
collect-os
collect-jmx
•  TSDB Use Cases
•  Troubleshooting / Root Cause analysis
•  Performance Analysis
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 89
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 90
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 91
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 92
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 93
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 94
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 95
•  TSDB Use Cases
•  Troubleshooting / Root Cause analysis
•  Performance Analysis
•  Monitoring and Alerting
•  Limitations
•  Load
•  Recency
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 96
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 97
•  TSDB Use Cases
•  Troubleshooting / Root Cause analysis
•  Performance Analysis
•  Monitoring and Alerting
•  Limitations
•  Load
•  Recency
•  TSP
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 98
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 99
OpenTSP
tcollector
collect-os
collect-jmx
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 100
OpenTSP
TSDB
tcollector
collect-os
collect-jmx
forwarder
collect-os
collect-jmx
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 101
OpenTSP Architecture: Forwarder
TAG
FILTER
collect-statse
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 102
OpenTSP Architecture: Statse
application collect-statse
count
mean
median
p95
p99
forwarder
collect-statse
count
mean
median
p95
p99
forwarder
application collect-statse
count
mean
median
p95
p99
forwarder
application
poller
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 103
OpenTSP Architecture: Poller
collect-snmp
collect-f5
collect-netscaler
switches
routers
firewalls
load balancers
storage
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 104
OpenTSP Architecture: Aggregator
forwarder
forwarder
poller
aggregator
collect-site
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 105
OpenTSP Architecture: Controller
controller
collectors aggregator
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 106
[james@ohlookahostname~]$1CONTROL_HOST=tspconfig.inf.betfair1tspDforwarder1Dt1
{1
11111111"Filter":1[1
1111111111111111{1
111111111111111111111111"Match":1[1
11111111111111111111111111111111"",1
11111111111111111111111111111111"host",1
11111111111111111111111111111111"^$"1
111111111111111111111111],1
111111111111111111111111"Set":1[1
11111111111111111111111111111111"",1
11111111111111111111111111111111"host",1
11111111111111111111111111111111“ohlookahostname.inf.betfair"1
111111111111111111111111]1
1111111111111111}1
11111111],1
11111111"LogPath":1"/var/log/tsp/forwarder.log",1
11111111"Relay":1{1
1111111111111111"aggregator":1{1
111111111111111111111111"DropRepeats":1false,1
111111111111111111111111"Host":1“tspaggregator.inf.betfair",1
111111111111111111111111"MaxConnsPerHost":11,1
111111111111111111111111"OnQueueFull":1"Drop"1
1111111111111111},1
1111111111111111"tsd":1{1
111111111111111111111111"DropRepeats":1true,1
111111111111111111111111"Host":1"tsdb.inf.betfair",1
111111111111111111111111"MaxConnsPerHost":11,1
111111111111111111111111"OnQueueFull":1"Drop"1
1111111111111111}1
11111111},1
11111111"CollectPath":1"/etc/tsp/collect.d"1
}1
Visualise
r
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 107
TSP at Betfaircontroller
collectors TSDB Visualiser
tsdb2mysql mysqld
feedhealth
Riemann
future?
aggregator
aggregator
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 108
Open Source
•  tsp-forwarder
•  collect-statse
•  tsp-poller
•  collect-f5
•  collect-netscaler
•  collect-snmp
•  tsp-aggregator
•  tsp-controller
Apache Licence v2
http://github.com/betfair/opentsp
james.brooks@betfair.com
CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 109

Weitere ähnliche Inhalte

Ähnlich wie James Brooks (Betfair) - Show me the Metrics

Wellington APAC Groundbreakers tour - Upgrading to the 12c Optimizer
Wellington APAC Groundbreakers tour - Upgrading to the 12c OptimizerWellington APAC Groundbreakers tour - Upgrading to the 12c Optimizer
Wellington APAC Groundbreakers tour - Upgrading to the 12c OptimizerConnor McDonald
 
How to Take Advantage of Optimizer Improvements in MySQL 8.0
How to Take Advantage of Optimizer Improvements in MySQL 8.0How to Take Advantage of Optimizer Improvements in MySQL 8.0
How to Take Advantage of Optimizer Improvements in MySQL 8.0Norvald Ryeng
 
NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...
NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...
NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...japjaca
 
Agilis_CorporateBrochure
Agilis_CorporateBrochureAgilis_CorporateBrochure
Agilis_CorporateBrochureBrett Stoia
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...Chief Analytics Officer Forum
 
PDO Predictive Analytics Share for the Annual Research Forum 2015
PDO Predictive Analytics Share for the Annual Research Forum 2015PDO Predictive Analytics Share for the Annual Research Forum 2015
PDO Predictive Analytics Share for the Annual Research Forum 2015Faris Al-Kharusi
 
RESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEER
RESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEERRESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEER
RESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEERAlok Saxena
 
Pattern Matching with SQL - APEX World Rotterdam 2019
Pattern Matching with SQL - APEX World Rotterdam 2019Pattern Matching with SQL - APEX World Rotterdam 2019
Pattern Matching with SQL - APEX World Rotterdam 2019Connor McDonald
 
OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...
OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...
OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...Altinity Ltd
 
Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...
Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...
Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...CA Technologies
 
Zentech Manufacturing Capabilities
Zentech Manufacturing CapabilitiesZentech Manufacturing Capabilities
Zentech Manufacturing CapabilitiesTransline Technology
 
Top 10 SQL Performance tips & tricks for Java Developers
Top 10 SQL Performance tips & tricks for Java DevelopersTop 10 SQL Performance tips & tricks for Java Developers
Top 10 SQL Performance tips & tricks for Java Developersgvenzl
 
Acceptance Test-Driven Development: Mastering Agile Testing
Acceptance Test-Driven Development: Mastering Agile TestingAcceptance Test-Driven Development: Mastering Agile Testing
Acceptance Test-Driven Development: Mastering Agile TestingTechWell
 
WorldLine Digital-New Technology for Transit Advertising
WorldLine Digital-New Technology for Transit Advertising WorldLine Digital-New Technology for Transit Advertising
WorldLine Digital-New Technology for Transit Advertising minh đức
 
Trillium Flow Technologies: Alloa Service Centre Capabilities
Trillium Flow Technologies: Alloa Service Centre CapabilitiesTrillium Flow Technologies: Alloa Service Centre Capabilities
Trillium Flow Technologies: Alloa Service Centre CapabilitiesMatthew Hewett
 

Ähnlich wie James Brooks (Betfair) - Show me the Metrics (20)

Wellington APAC Groundbreakers tour - Upgrading to the 12c Optimizer
Wellington APAC Groundbreakers tour - Upgrading to the 12c OptimizerWellington APAC Groundbreakers tour - Upgrading to the 12c Optimizer
Wellington APAC Groundbreakers tour - Upgrading to the 12c Optimizer
 
EMMMV - Caging the Gorilla
EMMMV - Caging the GorillaEMMMV - Caging the Gorilla
EMMMV - Caging the Gorilla
 
How to Take Advantage of Optimizer Improvements in MySQL 8.0
How to Take Advantage of Optimizer Improvements in MySQL 8.0How to Take Advantage of Optimizer Improvements in MySQL 8.0
How to Take Advantage of Optimizer Improvements in MySQL 8.0
 
NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...
NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...
NTM Corp: Mobil SHC Presentation on power cost reduction using mobil shc - cu...
 
Agilis_CorporateBrochure
Agilis_CorporateBrochureAgilis_CorporateBrochure
Agilis_CorporateBrochure
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
 
Mathworks CAE simulation suite – case in point from automotive and aerospace.
Mathworks CAE simulation suite – case in point from automotive and aerospace.Mathworks CAE simulation suite – case in point from automotive and aerospace.
Mathworks CAE simulation suite – case in point from automotive and aerospace.
 
PDO Predictive Analytics Share for the Annual Research Forum 2015
PDO Predictive Analytics Share for the Annual Research Forum 2015PDO Predictive Analytics Share for the Annual Research Forum 2015
PDO Predictive Analytics Share for the Annual Research Forum 2015
 
RESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEER
RESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEERRESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEER
RESUME - ALOK SAXENA - CNC - ELECTRONICS ENGINEER
 
Julius job details report
Julius job details report Julius job details report
Julius job details report
 
Pattern Matching with SQL - APEX World Rotterdam 2019
Pattern Matching with SQL - APEX World Rotterdam 2019Pattern Matching with SQL - APEX World Rotterdam 2019
Pattern Matching with SQL - APEX World Rotterdam 2019
 
GPS 2023
GPS 2023GPS 2023
GPS 2023
 
OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...
OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...
OSA Con 2022 - Specifics of data analysis in Time Series Databases - Roman Kh...
 
Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...
Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...
Case Study: bet365 Improves Its Odds for Site Reliability With Unified Monito...
 
Manual licor 6200 condensado
Manual licor 6200 condensadoManual licor 6200 condensado
Manual licor 6200 condensado
 
Zentech Manufacturing Capabilities
Zentech Manufacturing CapabilitiesZentech Manufacturing Capabilities
Zentech Manufacturing Capabilities
 
Top 10 SQL Performance tips & tricks for Java Developers
Top 10 SQL Performance tips & tricks for Java DevelopersTop 10 SQL Performance tips & tricks for Java Developers
Top 10 SQL Performance tips & tricks for Java Developers
 
Acceptance Test-Driven Development: Mastering Agile Testing
Acceptance Test-Driven Development: Mastering Agile TestingAcceptance Test-Driven Development: Mastering Agile Testing
Acceptance Test-Driven Development: Mastering Agile Testing
 
WorldLine Digital-New Technology for Transit Advertising
WorldLine Digital-New Technology for Transit Advertising WorldLine Digital-New Technology for Transit Advertising
WorldLine Digital-New Technology for Transit Advertising
 
Trillium Flow Technologies: Alloa Service Centre Capabilities
Trillium Flow Technologies: Alloa Service Centre CapabilitiesTrillium Flow Technologies: Alloa Service Centre Capabilities
Trillium Flow Technologies: Alloa Service Centre Capabilities
 

Mehr von Outlyer

Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...
Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...
Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...Outlyer
 
How & When to Feature Flag
How & When to Feature FlagHow & When to Feature Flag
How & When to Feature FlagOutlyer
 
Why You Need to Stop Using "The" Staging Server
Why You Need to Stop Using "The" Staging ServerWhy You Need to Stop Using "The" Staging Server
Why You Need to Stop Using "The" Staging ServerOutlyer
 
How GitHub combined with CI empowers rapid product delivery at Credit Karma
How GitHub combined with CI empowers rapid product delivery at Credit Karma How GitHub combined with CI empowers rapid product delivery at Credit Karma
How GitHub combined with CI empowers rapid product delivery at Credit Karma Outlyer
 
Packaging Services with Nix
Packaging Services with NixPackaging Services with Nix
Packaging Services with NixOutlyer
 
Minimum Viable Docker: our journey towards orchestration
Minimum Viable Docker: our journey towards orchestrationMinimum Viable Docker: our journey towards orchestration
Minimum Viable Docker: our journey towards orchestrationOutlyer
 
Ops is dead. long live ops.
Ops is dead. long live ops.Ops is dead. long live ops.
Ops is dead. long live ops.Outlyer
 
The service mesh: resilient communication for microservice applications
The service mesh: resilient communication for microservice applicationsThe service mesh: resilient communication for microservice applications
The service mesh: resilient communication for microservice applicationsOutlyer
 
Microservices: Why We Did It (and should you?)
Microservices: Why We Did It (and should you?) Microservices: Why We Did It (and should you?)
Microservices: Why We Did It (and should you?) Outlyer
 
Renan Dias: Using Alexa to deploy applications to Kubernetes
Renan Dias: Using Alexa to deploy applications to KubernetesRenan Dias: Using Alexa to deploy applications to Kubernetes
Renan Dias: Using Alexa to deploy applications to KubernetesOutlyer
 
Alex Dias: how to build a docker monitoring solution
Alex Dias: how to build a docker monitoring solution Alex Dias: how to build a docker monitoring solution
Alex Dias: how to build a docker monitoring solution Outlyer
 
How to build a container monitoring solution - David Gildeh, CEO and Co-Found...
How to build a container monitoring solution - David Gildeh, CEO and Co-Found...How to build a container monitoring solution - David Gildeh, CEO and Co-Found...
How to build a container monitoring solution - David Gildeh, CEO and Co-Found...Outlyer
 
Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group
Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group
Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group Outlyer
 
Anatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDuty
Anatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDutyAnatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDuty
Anatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDutyOutlyer
 
A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...
A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...
A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...Outlyer
 
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik Outlyer
 
Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...
Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...
Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...Outlyer
 
Zero Downtime Postgres Upgrades
Zero Downtime Postgres UpgradesZero Downtime Postgres Upgrades
Zero Downtime Postgres UpgradesOutlyer
 
DOXLON November 2016: Facebook Engineering on cgroupv2
DOXLON November 2016: Facebook Engineering on cgroupv2DOXLON November 2016: Facebook Engineering on cgroupv2
DOXLON November 2016: Facebook Engineering on cgroupv2Outlyer
 
DOXLON November 2016 - ELK Stack and Beats
DOXLON November 2016 - ELK Stack and Beats DOXLON November 2016 - ELK Stack and Beats
DOXLON November 2016 - ELK Stack and Beats Outlyer
 

Mehr von Outlyer (20)

Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...
Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...
Murat Karslioglu, VP Solutions @ OpenEBS - Containerized storage for containe...
 
How & When to Feature Flag
How & When to Feature FlagHow & When to Feature Flag
How & When to Feature Flag
 
Why You Need to Stop Using "The" Staging Server
Why You Need to Stop Using "The" Staging ServerWhy You Need to Stop Using "The" Staging Server
Why You Need to Stop Using "The" Staging Server
 
How GitHub combined with CI empowers rapid product delivery at Credit Karma
How GitHub combined with CI empowers rapid product delivery at Credit Karma How GitHub combined with CI empowers rapid product delivery at Credit Karma
How GitHub combined with CI empowers rapid product delivery at Credit Karma
 
Packaging Services with Nix
Packaging Services with NixPackaging Services with Nix
Packaging Services with Nix
 
Minimum Viable Docker: our journey towards orchestration
Minimum Viable Docker: our journey towards orchestrationMinimum Viable Docker: our journey towards orchestration
Minimum Viable Docker: our journey towards orchestration
 
Ops is dead. long live ops.
Ops is dead. long live ops.Ops is dead. long live ops.
Ops is dead. long live ops.
 
The service mesh: resilient communication for microservice applications
The service mesh: resilient communication for microservice applicationsThe service mesh: resilient communication for microservice applications
The service mesh: resilient communication for microservice applications
 
Microservices: Why We Did It (and should you?)
Microservices: Why We Did It (and should you?) Microservices: Why We Did It (and should you?)
Microservices: Why We Did It (and should you?)
 
Renan Dias: Using Alexa to deploy applications to Kubernetes
Renan Dias: Using Alexa to deploy applications to KubernetesRenan Dias: Using Alexa to deploy applications to Kubernetes
Renan Dias: Using Alexa to deploy applications to Kubernetes
 
Alex Dias: how to build a docker monitoring solution
Alex Dias: how to build a docker monitoring solution Alex Dias: how to build a docker monitoring solution
Alex Dias: how to build a docker monitoring solution
 
How to build a container monitoring solution - David Gildeh, CEO and Co-Found...
How to build a container monitoring solution - David Gildeh, CEO and Co-Found...How to build a container monitoring solution - David Gildeh, CEO and Co-Found...
How to build a container monitoring solution - David Gildeh, CEO and Co-Found...
 
Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group
Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group
Heresy in the church of - Corey Quinn, Principal at The Quinn Advisory Group
 
Anatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDuty
Anatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDutyAnatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDuty
Anatomy of a real-life incident -Alex Solomon, CTO and Co-Founder of PagerDuty
 
A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...
A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...
A Holistic View of Operational Capabilities—Roy Rapoport, Insight Engineering...
 
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik
 
Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...
Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...
Building a production-ready, fully-scalable Docker Swarm using Terraform & Pa...
 
Zero Downtime Postgres Upgrades
Zero Downtime Postgres UpgradesZero Downtime Postgres Upgrades
Zero Downtime Postgres Upgrades
 
DOXLON November 2016: Facebook Engineering on cgroupv2
DOXLON November 2016: Facebook Engineering on cgroupv2DOXLON November 2016: Facebook Engineering on cgroupv2
DOXLON November 2016: Facebook Engineering on cgroupv2
 
DOXLON November 2016 - ELK Stack and Beats
DOXLON November 2016 - ELK Stack and Beats DOXLON November 2016 - ELK Stack and Beats
DOXLON November 2016 - ELK Stack and Beats
 

Kürzlich hochgeladen

Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 

Kürzlich hochgeladen (20)

Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 

James Brooks (Betfair) - Show me the Metrics

  • 1. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. SHOW ME THE METRICS James Brooks
  • 2. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. Born in 1999 UK, Romania, Portugal, Ireland, Malta, Gibraltar, USA Engineering Blog: www.betsandbits.com 700+ Engineers and Growing Exchange / Sportsbook /Games ~750k active users ~70M daily transactions ~2.5Bn daily API calls ~100k/s Monitoring Points E2E P95 transaction times <4ms One of UK’s most successful .coms with ~£2Bn Market Cap 76
  • 3. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. OpenTSDB & OpenTSP at Betfair 77
  • 4. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 78 What is OpenTSDB?
  • 5. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 79 What is OpenTSDB? A database optimised for storing and querying Time Series Metrics os.filesystem.size 1413978032 912234340 host=fe-108 cluster=bf_c mount=/ type=Free Name Timestamp Value metadata
  • 6. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 80 OpenTSDB Architecture Hadoop • TSDB Read • TSDB Write Frontend Server • TSDB Read • TSDB Write Frontend Server • TSDB Read • TSDB Write Frontend Server LoadBalancer Metrics Queries
  • 7. TSDB CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 81 OpenTSDB Architecture Metrics Queries
  • 8. Original CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 82 OpenTSDB Architecture (ctd) TSDB
  • 10. Original CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 84 OpenTSDB Architecture (ctd) Metrilyx TSDB
  • 11. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 85
  • 12. Original CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 86 OpenTSDB Architecture (ctd) Metrilyx TSDB Visualiser
  • 13. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 87
  • 14. tcollector collect-os collect-jmx CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 88 OpenTSDB Architecture (ctd) TSDB tcollector collect-os collect-jmx tcollector collect-os collect-jmx
  • 15. •  TSDB Use Cases •  Troubleshooting / Root Cause analysis •  Performance Analysis CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 89
  • 16. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 90
  • 17. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 91
  • 18. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 92
  • 19. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 93
  • 20. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 94
  • 21. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 95
  • 22. •  TSDB Use Cases •  Troubleshooting / Root Cause analysis •  Performance Analysis •  Monitoring and Alerting •  Limitations •  Load •  Recency CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 96
  • 23. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 97
  • 24. •  TSDB Use Cases •  Troubleshooting / Root Cause analysis •  Performance Analysis •  Monitoring and Alerting •  Limitations •  Load •  Recency •  TSP CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 98
  • 25. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 99 OpenTSP
  • 26. tcollector collect-os collect-jmx CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 100 OpenTSP TSDB tcollector collect-os collect-jmx forwarder collect-os collect-jmx
  • 27. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 101 OpenTSP Architecture: Forwarder TAG FILTER collect-statse
  • 28. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 102 OpenTSP Architecture: Statse application collect-statse count mean median p95 p99 forwarder collect-statse count mean median p95 p99 forwarder application collect-statse count mean median p95 p99 forwarder application
  • 29. poller CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 103 OpenTSP Architecture: Poller collect-snmp collect-f5 collect-netscaler switches routers firewalls load balancers storage
  • 30. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 104 OpenTSP Architecture: Aggregator forwarder forwarder poller aggregator collect-site
  • 31. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 105 OpenTSP Architecture: Controller controller collectors aggregator
  • 32. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 106 [james@ohlookahostname~]$1CONTROL_HOST=tspconfig.inf.betfair1tspDforwarder1Dt1 {1 11111111"Filter":1[1 1111111111111111{1 111111111111111111111111"Match":1[1 11111111111111111111111111111111"",1 11111111111111111111111111111111"host",1 11111111111111111111111111111111"^$"1 111111111111111111111111],1 111111111111111111111111"Set":1[1 11111111111111111111111111111111"",1 11111111111111111111111111111111"host",1 11111111111111111111111111111111“ohlookahostname.inf.betfair"1 111111111111111111111111]1 1111111111111111}1 11111111],1 11111111"LogPath":1"/var/log/tsp/forwarder.log",1 11111111"Relay":1{1 1111111111111111"aggregator":1{1 111111111111111111111111"DropRepeats":1false,1 111111111111111111111111"Host":1“tspaggregator.inf.betfair",1 111111111111111111111111"MaxConnsPerHost":11,1 111111111111111111111111"OnQueueFull":1"Drop"1 1111111111111111},1 1111111111111111"tsd":1{1 111111111111111111111111"DropRepeats":1true,1 111111111111111111111111"Host":1"tsdb.inf.betfair",1 111111111111111111111111"MaxConnsPerHost":11,1 111111111111111111111111"OnQueueFull":1"Drop"1 1111111111111111}1 11111111},1 11111111"CollectPath":1"/etc/tsp/collect.d"1 }1
  • 33. Visualise r CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 107 TSP at Betfaircontroller collectors TSDB Visualiser tsdb2mysql mysqld feedhealth Riemann future? aggregator aggregator
  • 34. CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 108 Open Source •  tsp-forwarder •  collect-statse •  tsp-poller •  collect-f5 •  collect-netscaler •  collect-snmp •  tsp-aggregator •  tsp-controller Apache Licence v2 http://github.com/betfair/opentsp
  • 35. james.brooks@betfair.com CONFIDENTIAL and not for reproduction without prior written consent. © of The Sporting Exchange Limited. 109