Opening talk at Monitorama, talks about the problems of monitoring, challenges of creating monitoring tools and why monitoring vendors keep getting disrupted. Ended with a discussion of simulation testing and serverless architectures - Monitorless.
Adrian CockcroftTechnology Fellow at Battery Ventures um Battery Ventures
2. What does @adrianco do?
@adrianco
Technology Due
Diligence on
Deals
Presentations at
Companies and
Conferences
Tech and Board
Advisor
Support for
Portfolio
Companies
Consulting and
Training
Networking with
Interesting PeopleTinkering with
Technologies
Vendor
Relationships
Previously: Netflix, eBay, Sun Microsystems, Cambridge Consultants, City University London - BSc Applied Physics
4. Monitorama 2016
What problems does monitoring address?
Why isn’t this a solved problem already?
Who gets disrupted by what?
Stuff I’ve been tinkering with
13. Why isn’t there one
standard for monitoring?
We tried that once, immediately obsoleted by rise of Windows NT
X/Open Universal Measurement Architecture - 1997
http://pubs.opengroup.org/onlinepubs/009657299/c427-1/front.htm
16. 1970’s Mainframes
1980’s Minicomputers
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
17. 1990’s Unix Servers
1970’s Mainframes
1980’s Minicomputers
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
18. 1990’s Unix Servers
1970’s Mainframes
2000’s Windows on x86
1980’s Minicomputers
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
19. 1990’s Unix Servers
1970’s Mainframes
2000’s Windows on x86
1980’s Minicomputers
2000’s Linux on x86
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
20. 1990’s Unix Servers
1970’s Mainframes
2000’s Windows on x86
1980’s Minicomputers
2000’s Linux on x86
2000’s VMware on blades
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
21. 1990’s Unix Servers
1970’s Mainframes
2000’s Windows on x86
1980’s Minicomputers
2000’s Linux on x86
2000’s VMware on blades
2010’s Public cloud
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
22. 1990’s Unix Servers
1970’s Mainframes
2000’s Windows on x86
1980’s Minicomputers
2000’s Linux on x86
2000’s VMware on blades
2010’s Public cloud
2010’s Containers
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
23. 1990’s Unix Servers
1970’s Mainframes
2000’s Windows on x86
1980’s Minicomputers
2000’s Linux on x86
2000’s VMware on blades
2010’s Public cloud
2010’s Containers
2010’s Serverless
Monitoring Evolution
Challenges
Platform - Entities - Hierarchy
Interfaces - Metrics - Schema
Scale - Ephemerality
Different vendors and tools in
each generation…
25. Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
26. $Millions (illustrative order of magnitude costs)
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
27. $Millions (illustrative order of magnitude costs)
$1M
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
28. $100K
$Millions (illustrative order of magnitude costs)
$1M
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
29. $100K
$Millions (illustrative order of magnitude costs)
$10K
$1M
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
30. $100K
$Millions (illustrative order of magnitude costs)
$10K
$1M
$5K
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
31. $100K
$Millions (illustrative order of magnitude costs)
$10K
$1M
$5K
$1K per core
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
32. $100K
$Millions (illustrative order of magnitude costs)
$10K
$1M
$5K
$1K per core
$100’s per month
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
33. $100K
$Millions (illustrative order of magnitude costs)
$10K
$1M
$5K
$1K per core
$100’s per month
$10’s per month
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
34. $100K
$Millions (illustrative order of magnitude costs)
$10K
$1M
$5K
$1K per core
$100’s per month
$10’s per month
$1’s per month
Cost per node drops
Revenue opportunity decreases
Waves of disruption
New vendors have new
schema’s, an order of
magnitude lower cost per node,
and many more shorter lived
nodes to monitor
36. A Tragic Quadrant
Ability to scale
Ability to
handle
rapidly
changing
microservices
In-house tools
at web scale
companies
Most current
monitoring & APM
tools
Next generation
APM
Next generation
Monitoring
Datacenter
Cloud
Containers
100s 1,000s 10,000s 100,000s
Lambda
37. A Tragic Quadrant
Ability to scale
Ability to
handle
rapidly
changing
microservices
In-house tools
at web scale
companies
Most current
monitoring & APM
tools
Next generation
APM
Next generation
Monitoring
Datacenter
Cloud
Containers
100s 1,000s 10,000s 100,000s
Lambda
Vendors - tell me where you belong on this plot…
39. Simulated Microservices
Model and visualize microservices
Simulate interesting architectures
Generate large scale configurations
Stress test real monitoring tools
Code: github.com/adrianco/spigo
Simulate Protocol Interactions in Go
Simian Army Visualizations
ELB Load Balancer
Zuul
API Proxy
Karyon
Business Logic
Staash
Data Access Layer
Priam
Cassandra Datastore
Three
Availability
Zones
Denominator
DNS Endpoint
43. memcached hit %
memcached response mysql response
service cpu time
memcached hit mode
mysql cache hit mode
mysql disk access mode
Hit rates: memcached 40% mysql 70%
Guesstimate
51. Serverless Programming Model
Event driven functions
Role based permissions
Whitelisted API based security
Good for simple single threaded code
52. Serverless Cost Efficiencies
100% useful work, no agents, overheads
100% utilization, no charge between requests
No need for extra capacity for peak traffic
Anecdotal costs ~1% of conventional system
Ideal for low traffic, Corp IT, spiky workloads
53. Serverless Work in Progress
Tooling for ease of use
Multi-region HA/DR patterns
Debugging and testing frameworks
Monitoring, end to end tracing
Using AWS Lambda to monitor AWS
54. DIY On-Premise
Serverless Operating Challenges
Scheduling and startup latency
Execution and monitoring overhead
Charging model
Capacity planning
58. Security
Visit http://www.battery.com/our-companies/ for a full list of all portfolio companies in which all Battery Funds have invested.
Palo Alto
Networks
Enterprise IT
Operations &
Management
Big DataCompute
Networking
Storage