5. Infrastructure Monitoring
Observe your entire IT in one Place
Automation
Monitor massive amounts of data.
Cloud Monitoring
Monitor the private, the public and hybrid.
Metrics & Logs
Get the context and recognize trends.
Automation
Analyze data and perform proactively.
Notifications
Get alerted and react with confidence.
The Icinga Stack
15. Icinga 2.14 – Redundancy Groups
• Ability to model dependencies into groups
• Groups and “usual” dependencies can be mixed
• Thanks to Edgar Fuss for the contribution
21. Icinga for Windows
• Release of 1.11 in August
• Breaks compatibility with all previous generated performance data
• New versions for Plugins
• Hyper-V
• MSSQL
• Cluster
• Dashboards for Grafana
28. Icinga Certificate Monitoring
• Release of Icinga Certificate Monitoring 1.3.0 in October
• PostgreSQL support
• Filter / Search bar
• Revamped UI, specifically the certificate dashboard
• New UI for creating job schedules using the new IPL component
• Jobs and schedules are now stored to the database
36. Icinga DB Web
• Release of Icinga DB Web 1.1 in September
• Keyboard Navigation
• Host- and Servicegroup grid views
• Enhanced Check Execution Details
• PHP 8.2 support
40. Icinga Cube 1.3
• Release of Icinga Cube 1.3.0 in March
• Search: Filter the cube results according to your
requirements.
• Problems only: Focus on the essentials by switching to
"Problems Only" mode.
• Sorting: Sort the cube by dimension name or severity.
• Now it's possible to add host dimensions for service cubes
44. Icinga Business Process Modelling
• Release of Icinga Business Process Modelling 2.5 in October
• Unified searchbar for large number of objects
• Extended Business Detail
• Show critical count in Web menu
• PHP 8.2 support
54. Icinga Module for vSphere®
• Release of Icinga Module for vSphere® 1.7.1 in April
• UI improvements
• Disk- and Storage-related aggregations
• Advanced JSON Export
• Guest IP stack is now being fetched and visualized
• Hosts, Virtual Machines and Datastores can now be exported via REST API
57. Icinga Reporting
• Release of Icinga Reporting 1.0 in October
• New scheduling widget
• Create reports in Icinga DB Web (1.1)
• List or export reports using the CLI
• PHP 8.2 support
61. Icinga Web Jira Integration
• Release of Icinga Web Jira Integration 1.3.1 in June
• Jira configuration within Icinga Web
• Differentiate between Jira Cloud and Jira on-prem
• Create templates to be used when creating Jira tickets
66. Icinga Notifications
We shift our focus to true incident management and start the next
transition of Icinga. Notifications move into the center of monitoring.
67. Our Goals
• Simplified contacts management
• Duty management (on-call)
• Well maintained notification channels
• Central hub for notifications
68. Teamwork Manager
Ensure good communication processes
and efficient workflows.
• Enhanced management for Contacts and Groups
• Define and apply escalation rules
• Assign teams to specific parts of your infrastructure
• Finally define channels and recipients
74. Channel Genius
• Pick any notification channel you like. Icinga
impresses with flexibility towards integrations.
• Ticketing systems
• Platforms like PagerDuty, OpsGenie or ServiceNow
• SMS, Slack, Telegram and other messengers
76. Next Steps
• Preview release by the end of this year
• Collect feedback
• 1.0 release next year
• First notifications channels right after the first release
77.
78. Open Topics
• High Availability
• Attach AD/LDAP
• Incident aggegration
• Upgrade modules step by step
84. K8s Monitoring– What?
• Hosts (where K8s components run)
• K8s itself
• Services, e.g. Deployments, *Sets, Jobs
• Pods and Containers
• Key metrics
Not only infrastructure but also workloads
85. K8s Monitoring – Complexity
• Loads of resource types
• Multiple components and layers
• Different failure points
• Understanding of the entire stack
Via hosts, services and check plugins?
87. K8s Monitoring – Probes
Liveness probes periodically check container liveness and
restart containers that fail it.
Readiness probes indicate container readiness and remove
failing ones from their service endpoints.
Startup probes defer the execution of liveness and readiness
probes and restarts containers that fail it.
88. K8s Monitoring – Approaches
• Poll K8s APIs
• Agent per node via DaemonSet
• Agent per pod (sidecar container)
• Events
• Metrics
• Logs
• APM
89. Possible K8s Metric Sources
• Node metrics from Prometheus node exporter
• Container metrics from cAdvisor (or metrics-server)
• K8s metrics
• API server
• etcd
• scheduler
• controller manager
• kube-state-metrics