14. Why graphing?
Discover trends and patterns
What time of the day do we get the most users?
When x happened, what was the effect on y?
How many hits am I getting per hour?
How does this compare to last week? last month?
Predict future events
When will we need to add more servers? Databases?
Negative feedback
Did the release into production fix problem x?
15. Cacti SUCKS
A few reasons:
Ancient user interface (no javascript/ajax), terrible workflow, cannot push metrics, no
formulas, no graph introspection, cannot push metrics, cannot feed out of sequence
metrics, ugly graphs, no API, expose system/os metrics on host via snmp, no graph
composer, no custom graphs, predefine metrics, predefine graphs, static polling interval,
unscalable, tons of work to create one graph, no 3rd party ecosystem, etc.
47. A Common Graphite Stack
Graphite-web
Collectd
Poller(s)
Applications
Carbon Whisper
Dashboards
Statsd
Scripts
Nagios
48. Collectd
Agent for system/hardware level metrics
Growing repository of plugins for a wide variety
of applications:
disk i/o, disk space, cpu, memory, mysql,
JMX, java, Redis, file sizes, load, etc.
https://collectd.org/wiki/index.php/Table_of_Plugins
Write your custom plugin in python
49. Nagios integration
You can write Nagios plugins that can alert off
of metrics values
Nagios can also feed graphite
performance data, events (ie: update
counter each time email is sent), etc.