Zabbix is an open-source monitoring tool that has evolved rapidly over the last few years. We will talk about the growth of the product and look at it from several perspectives:
● technical – how Zabbix has developed functionally, and which important decisions were made
● project management – which processes help to improve software quality and which ones help less
● community – how open Zabbix is, and how that has changed over the years, towards both more and less openness
The talk will illustrate these points with examples from the Zabbix community as well as from extensive Zabbix use at Nokia.
18. Value preprocessing - regexp
Previously available for log monitoring
Now expanded to any item type
Possibilities:
● cut off " °C" from bad SNMP devices
● extract data from Apache, HAProxy status page
Original value:
33 °C
Regexp parsing:
([0-9]+), \1
Value:
33
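The transformation on this slide can be sketched in Python. This is only an approximation: Zabbix applies its own regular expression engine on the server, and the helper name `preprocess_regexp` is hypothetical. The pattern and the `\1` output template are the ones from the slide.

```python
import re


def preprocess_regexp(value, pattern, output):
    """Search `value` for `pattern` and expand group references in `output`.

    Returns None when the pattern does not match (hypothetical behaviour
    chosen for this sketch, not taken from Zabbix itself).
    """
    match = re.search(pattern, value)
    if match is None:
        return None
    # match.expand() replaces \1, \2, ... with the captured groups.
    return match.expand(output)


if __name__ == "__main__":
    print(preprocess_regexp("33 °C", r"([0-9]+)", r"\1"))
```

The same helper covers the status page case: a pattern with one capture group picks a single number out of a longer text response.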
19. Value preprocessing – XPath, JSON path
Extract values from XML, JSON
Status pages, API output...
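As a rough illustration of what path-based extraction does, the sketch below pulls one value out of a JSON document and one out of an XML document using only the Python standard library. The sample documents, the dotted-path syntax and the function names are assumptions made for this example; they are not Zabbix's JSON path or XPath implementations.

```python
import json
import xml.etree.ElementTree as ET


def json_value(document, dotted_path):
    """Follow a simple dotted path such as 'server.connections.active'."""
    node = json.loads(document)
    for key in dotted_path.split("."):
        # List elements are addressed by index, object members by name.
        node = node[int(key)] if isinstance(node, list) else node[key]
    return node


def xml_text(document, path):
    """Return the text of the first element matching a limited XPath."""
    element = ET.fromstring(document).find(path)
    return None if element is None else element.text


if __name__ == "__main__":
    # Made-up status page and API payloads, for illustration only.
    status_json = '{"server": {"connections": {"active": 12}}}'
    status_xml = "<status><workers><busy>7</busy></workers></status>"
    print(json_value(status_json, "server.connections.active"))
    print(xml_text(status_xml, "./workers/busy"))
```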
21. Extract for displaying purposes
41904:20171112:042106.873 [Z3005] query failed: [2006]
MySQL server has gone away [begin;]
Extract the value
{{ITEM.VALUE}.regsub("(.*\[[0-9]+\] )(.*)\[.*", Error: \2)}
Displayed or sent in the alert
(stored value not changed)
Error: MySQL server has gone away
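The extraction can be reproduced approximately with Python's `re` module, writing the square-bracket escapes out explicitly. Assumptions in this sketch: the two log lines from the slide are joined with a single space, the helper name `regsub` merely mirrors the macro function, and returning an empty string when nothing matches is a choice made for this example.

```python
import re


def regsub(value, pattern, output):
    """Expand `output` (with \\1, \\2, ... references) against the first match."""
    match = re.search(pattern, value)
    # Empty string on no match is an assumption of this sketch.
    return "" if match is None else match.expand(output)


if __name__ == "__main__":
    line = (
        "41904:20171112:042106.873 [Z3005] query failed: [2006] "
        "MySQL server has gone away [begin;]"
    )
    # Group 1 swallows everything up to the numeric error code,
    # group 2 is the human-readable message before the trailing "[...]".
    print(regsub(line, r"(.*\[[0-9]+\] )(.*)\[.*", r"Error: \2"))
```

Only the displayed text changes; the original log line stays untouched, which matches the "stored value not changed" note on the slide.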
22. Polling frequency
Before – seconds only
Now – time suffixes (5m, 2h, etc.)
LLD/user macros supported
● Custom frequency based on discovery
● Custom frequency based on variables
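To make the suffix notation concrete, here is a small sketch that converts an interval string to seconds. It assumes the suffixes s, m, h, d and w and treats a bare number as seconds; the function name `to_seconds` is hypothetical, and the real parsing happens inside Zabbix.

```python
import re

# Seconds per unit for each suffix handled by this sketch.
UNIT_SECONDS = {"s": 1, "m": 60, "h": 3600, "d": 86400, "w": 604800}


def to_seconds(interval):
    """Convert strings such as '30', '5m' or '2h' to a number of seconds."""
    match = re.fullmatch(r"(\d+)([smhdw]?)", interval.strip())
    if match is None:
        raise ValueError(f"unsupported interval: {interval!r}")
    number, suffix = match.groups()
    # A missing suffix falls back to seconds, the pre-suffix behaviour.
    return int(number) * UNIT_SECONDS[suffix or "s"]


if __name__ == "__main__":
    for text in ("30", "5m", "2h"):
        print(text, "=", to_seconds(text), "seconds")
```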
23. Same template, easy dependencies
Core router – pinged every 10 seconds
Edge router – pinged every 30 seconds
Printer – pinged every 60 seconds
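One way to picture how a single template can yield different ping frequencies is a user macro with a template-level default and host-level overrides. The macro name `{$PING.INTERVAL}`, the host names and the resolution function below are all hypothetical; this is a sketch of the idea, not of how Zabbix resolves macros internally.

```python
# Default defined once on the (hypothetical) template.
TEMPLATE_MACROS = {"{$PING.INTERVAL}": "60s"}

# Host-level values override the template default where present.
HOST_MACROS = {
    "core-router": {"{$PING.INTERVAL}": "10s"},
    "edge-router": {"{$PING.INTERVAL}": "30s"},
    "printer": {},  # no override, so the template default applies
}


def resolve_interval(host, item_interval="{$PING.INTERVAL}"):
    """Return the effective update interval of the ping item for `host`."""
    merged = {**TEMPLATE_MACROS, **HOST_MACROS.get(host, {})}
    # A literal interval such as '5m' is not a macro and passes through.
    return merged.get(item_interval, item_interval)


if __name__ == "__main__":
    for host in HOST_MACROS:
        print(host, "->", resolve_interval(host))
```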
26. Trigger per app or common
Application Sugar stopped
Application Honey_Honey stopped
Application Sugar started
Application Honey_Honey started
Alert
Nothing
Resolve
Nothing
27. Correlate by tags
Application Sugar stopped
Application Honey_Honey stopped
Application Sugar started
Application Honey_Honey started
Application=Sugar -> Problem
Application=Honey_Honey -> Problem
Application=Sugar -> Resolve
Application=Honey_Honey -> Resolve
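The difference from the single common trigger can be sketched as a small state machine keyed by the tag value: a "started" event resolves only the problem that carries the same Application tag. The event format and the function name `correlate` are assumptions made for this illustration.

```python
def correlate(events):
    """Turn (application, state) events into per-tag problem/resolve actions."""
    open_problems = set()
    actions = []
    for application, state in events:
        if state == "stopped" and application not in open_problems:
            open_problems.add(application)
            actions.append(f"Application={application} -> Problem")
        elif state == "started" and application in open_problems:
            # Only the problem with the matching tag value is closed.
            open_problems.remove(application)
            actions.append(f"Application={application} -> Resolve")
    return actions


if __name__ == "__main__":
    events = [
        ("Sugar", "stopped"),
        ("Honey_Honey", "stopped"),
        ("Sugar", "started"),
        ("Honey_Honey", "started"),
    ]
    for action in correlate(events):
        print(action)
```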
44. Moar rezilience
More data logged when the agent crashes
One process to query IPMI
Parallel alerting - no blocking, faster spamming
Alert on broken items missing data
45. New filtering in the GUI
● host groups
● templates
● global scripts
● maintenance periods
● actions
● network discovery rules
● proxies
● user groups
● users
● media types
46. Upgrade process
Easy – start the server
7344:20170630:183622.330 current database version (mandatory/optional):
7344:20170630:183622.330 required mandatory version: 03000000
7344:20170630:183622.330 starting automatic database upgrade
7344:20170630:183622.335 completed 0% of database upgrade
7344:20170630:183622.372 completed 1% of database upgrade
...
7344:20170630:183623.698 completed 99% of database upgrade
7344:20170630:183623.698 completed 100% of database upgrade
7344:20170630:183623.698 database upgrade fully completed