Monitoring Docker containers - Docker NYC Feb 2015

•Als PPTX, PDF herunterladen•

14 gefällt mir•3,187 views

Alexis goals this presentation are three-fold: 1) Dive into key Docker metrics 2) Explain operational complexity. In other words I want to take what we have seen on the field and show you where the pain points will be. 3) Rethink monitoring of Docker containers. The old tricks won’t work.

Technologie

Monitoring and Running
Docker Containers at Scale
Docker NYC Meetup
February 25th, 2015

Datadog
• Monitoring service
• Made for the cloud
• Aggregates everything
• Support for Docker (since 1.0)

Goal of this talk
Rethink the monitoring of Docker containers

Agenda
1.A (very) brief history of containers
2.Operational complexity
3.Monitoring Docker effectively
4.Demo

Containers in a nutshell
• Been around for a long time
– jails, zones, cgroups
• No full-virtualization overhead
• Used for runtime isolation (e.g. jails)
• Docker is an Escape from Dependency Hell

Escape from dependency hell
a.out
shared libs
packages
omnibus
Docker ==
?

Mini-host or über-process?
Process Container Host
Spec Source Dockerfile Kickstart
On disk .TEXT /var/lib/docker /
In memory PID Container ID Hostname
In the network Socket veth* eth*
Runtime
context
server core host data center

Combinatorial multiplication
Hardware
OS
Off-the-shelf
Your Application
Hardware
Hypervisor
Off-the-
shelf
App
OS OS
Off-the-
shelf
App
Hardware
Hypervisor
OS OS
A A A A
Containers
O O O O

Operational complexity
• Average containers per host: N (N=5, 10/2014)
• N-times as many “hosts” to manage
• Affects
– provisioning: prep’ing & building containers
– configuration: passing config to containers
– orchestration: deciding where/when containers
run
– monitoring: making sure containers run
properly

Complexity increases with...
1. Number of things to measure
2. Velocity of change

Number of things to measure
• 1 Amazon EC2 instance
– 10 CloudWatch metrics
• 1 operating system (e.g. linux)
– 100 metrics
•N containers
– 100*N metrics
•110 + 100*N metrics per instance

Combinatorial multiplication
100 500instances containers
Assuming only 5 containers per instance

Combinatorial multiplication
160 610metrics
per host
metrics
per host
Assuming only 5 containers per
instance

Combinatorial multiplication
100 61,000instances metrics
Assuming only 5 containers per instance

Velocity
hours,
days,
months
minutes,
hours,
days
Host half-life Container half-life

Aggravating factors
• Registry-based provisioning
– new images as fast as you can git commit
• Autonomic orchestration
– from imperative to declarative
– automated
– individual containers don’t matter
– e.g. kubernetes, mesos

If your monitoring is still centered on individual hosts or
instances…

Host-centric monitoring
Monitor
Monitor
GA
P
Hypervisor
OS OS
A A A A
Containers
O O O O

Layers of monitoring
Monitor
Hypervisor
OS OS
A A A A
Containers
O O O O

Layers of monitoring
CloudWatch
Infrastructure
Monitoring
APM
Hypervisor
OS OS
A A A A
Containers
O O O O

Layers of monitoring
cpu/net/io
filesystem
docker mem
docker cpu
db queries
web requests
app throughput
CloudWatch
Infrastructure
Monitoring
APM
e.g
.
Hypervisor
OS OS
A A A A
Containers
O O O O

Layers of monitoring
• Access to metrics from all the layers
• Amazon CloudWatch, OS metrics, Docker metrics,
app metrics in 1 place
• Shared timeline

If monitoring
does not cover all
layers,
pain.

Tags (a.k.a. labels)
You (probably) already use them

Tags
• Monitoring is like Auto-Scaling Groups
• Monitoring is like Docker orchestration
• From imperative to declarative
• Query-based
• Queries operate on tags

Monitoring with tags and queries
“Monitor all Docker containers running image web”
“… in region us-west-2 across all availability zones”
“… and make sure resident set size < 1GB on c3.xl”

Monitoring with tags and queries
“Monitor all Docker containers running image web”
“… in region us-west-2 across all availability zones”
“… that use more than 1.5x the average on c3.xl”

Take-aways
1. Docker increases operational complexity by an
order of magnitude unless…
2. You have layered monitoring, from the instance to
the container and to the application, and…
3. You monitor using tags and queries

Weitere ähnliche Inhalte

Was ist angesagt?

CoreOS: The Inside and Outside of Linux Containers

Ramit Surana

Tupperware: Containerized Deployment at FB

Docker, Inc.

Docker containers add portability but can also introduce complexity into your environment. In this session learn about why monitoring your container environment is essential to maintaining service reliability, and how Splunk software can help you monitor different layers of infrastructure running in a Docker environment, including third-party tools, instances, and custom code. Learn how to use Splunk software to collect, search and correlate container data with other infrastructure data for better service context, root cause monitoring and reporting. Additionally, receive introduction to the product integrations between Splunk and Docker such as the Splunk Logging Driver, Splunk Forwarder, and Splunk Logging Libraries.

Take an Analytics-driven Approach to Container Performance with Splunk for Co...

Docker, Inc.

Fully Automated Kubernetes Deployment and Management (Peng Jiang, Rancher Labs) - Kubernetes is rapidly gaining popularity as a powerful container orchestration and scheduling platform. But deploying and managing Kubernetes clusters is still a challenge for many organizations.How to ensure Kubernetes clusters in different clouds and data centers can communicate with each other? How to automate the deployment of multiple Kubernetes clusters? How to incorporate the new Kubernetes Federation into multi cloud and multi datacenter deployments? How to manage the health of Kubernetes cluster itself? etc. In this talk, Peng will share his experience on how to automate and simplify Kubernetes deployments, and discuss how some of the latest community projects (such as kubeadm and self-hosting Kubernetes) will help address the problems in the future.

Fully automated kubernetes deployment and management

LinuxCon ContainerCon CloudOpen China

Docker for Ops: Operationalize your Docker Built Apps in Production by Evan H...

Docker, Inc.

Stateful set in kubernetes implementation & usecases

Krishna-Kumar

Docker for Ops: Docker Networking Deep Dive, Considerations and Troubleshooti...

Docker, Inc.

With tools like Docker Toolbox, the entry barrier to Docker and containers is rather low. However, it takes a lot more to design, build and run an entire container platform, at scale, for production applications. This talk will focus on why it is important to have a well-defined reference model for building container platforms that guides container engineers and architects through the process of identifying platform concerns, patterns, components as well as the interactions between them in order to deliver a set of platform capabilities (service discovery, load balancing, security, and others) to support containerized applications using existing tooling. As part of this session will also see how a container architecture has enabled real projects in their delivery of container platforms.

Structured Container Delivery by Oscar Renalias, Accenture

Docker, Inc.

Velocity NYC 2016 - Containers @ Netflix

aspyker

How to Build Your First Web App in Go

All Things Open

1&1, Europe’s largest web hosting company, has been automatically deploying and managing multi-tenant server environments for 20 years. These servers support millions of active websites and services around the world. Historically software stacks were pre-installed using estimates of what was considered good, taking a ‘one size fits all’ approach. I am going to show how we are now combining Git, Gitlab, Openshift and Docker to revolutionise our approach to large scale hosting, providing greater power and flexibility without increasing support overhead. This includes showing: · Transforming the legacy multi-tenant LAMP environment into many single-tenant Docker projects · Managing thousands of projects on behalf of tenants · Gitlab CI for testing Docker containers · Testing container interactions and upgrade cycle

Application Deployment and Management at Scale with 1&1 by Matt Baldwin

Docker, Inc.

Fluentd and docker monitoring

Vinay Krishna

Members from over all over the world streamed over forty-two billion hours of Netflix content last year. Various Netflix batch jobs and an increasing number of service applications use containers for their processing. In this session, Netflix presents a deep dive on the motivations and the technology powering container deployment on top of Amazon Web Services. The session covers our approach to resource management and scheduling with the open source Fenzo library, along with details of how we integrate Docker and Netflix container scheduling running on AWS. We cover the approach we have taken to deliver AWS platform features to containers such as IAM roles, VPCs, security groups, metadata proxies, and user data. We want to take advantage of native AWS container resource management using Amazon ECS to reduce operational responsibilities. We are delivering these integrations in collaboration with the Amazon ECS engineering team. The session also shares some of the results so far, and lessons learned throughout our implementation and operations.

Re:invent 2016 Container Scheduling, Execution and AWS Integration

aspyker

K8S in prod

Mageshwaran Rajendran

Container Orchestration with Docker Swarm and Kubernetes

Will Hall

Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra ---------------------------------- Presentation: @TIAD Paris -- http://tiad.io/ ---------------------------------- Source code: https://github.com/rogaha/data-processing-pipeline ---------------------------------- Resources: https://www.docker.com/products/docker / https://www.docker.com/technologies/overview

Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka ...

Roberto Hashioka

Netflix Container Runtime - Titus - for Container Camp 2016

aspyker

In deploying apps that have been containerized, you have a lot to think about regarding what to use in production. There are a lot of things to manage, so orchestrators become a huge help. providing many services together such as scheduling, container communication, scaling, health, and more. There are major platforms to consider from Kubernetes, Swarm to ECS. In this talk we'll go through the overview of orchestrators and some of the differences between the big players. You should come out of the talk knowing where to go next in determining your orchestrator needs.

Container orchestration overview

Wyn B. Van Devanter

Introducing Chef | An IT automation for speed and awesomeness

Ramit Surana

From the Philly Kubernetes December 2016 Meetup. https://www.meetup.com/Kubernetes-Philly/events/234829676/ Kubernetes accelerates technical and business innovation through rapid development and deployment of applications. Learn how to deploy, scale, and manage your applications in a containerized environments using Kubernetes. In this 60-minute workshop, Ross Kukulinski will review fundamental Kubernetes concepts and architecture and then will show how to containerize and deploy a multi-tier web application to Kubernetes. Topics that will be covered include: • Working with the Kubernetes CLI (kubectl) • Pods, Deployments, & Services • Manual & Automated Application Scaling • Troubleshooting and debugging • Persistent storage

Kubernetes 101 for Developers

Ross Kukulinski

Was ist angesagt? (20)

CoreOS: The Inside and Outside of Linux Containers

Tupperware: Containerized Deployment at FB

Take an Analytics-driven Approach to Container Performance with Splunk for Co...

Fully automated kubernetes deployment and management

Docker for Ops: Operationalize your Docker Built Apps in Production by Evan H...

Stateful set in kubernetes implementation & usecases

Docker for Ops: Docker Networking Deep Dive, Considerations and Troubleshooti...

Structured Container Delivery by Oscar Renalias, Accenture

Velocity NYC 2016 - Containers @ Netflix

How to Build Your First Web App in Go

Application Deployment and Management at Scale with 1&1 by Matt Baldwin

Fluentd and docker monitoring

Re:invent 2016 Container Scheduling, Execution and AWS Integration

K8S in prod

Container Orchestration with Docker Swarm and Kubernetes

Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka ...

Netflix Container Runtime - Titus - for Container Camp 2016

Container orchestration overview

Introducing Chef | An IT automation for speed and awesomeness

Kubernetes 101 for Developers

Andere mochten auch

FOWA London 2015 Micro-service systems deliver wonderful adaptability to business needs, easy scalability, and low-risk deployment. What's not to like? You also end up with a system that's hard to understand, measure and predict. Traditional approaches to monitoring simply aren't powerful enough to handle the emergent properties of a system with lots of moving parts. The solution is to apply the scientific method! Anything can be measured. Uncertainty can be reduced, and stability can be an emergent property. We just have to learn the lessons that the natural world can teach us.

Measuring Micro-services. Richard Rodger

Future Insights

Performance monitoring for Docker Challenges - Anomaly detection - CoScale demo For more info about how to use CoScale Docker monitoring, some reading material here: http://www.coscale.com/blog/how-to-monitor-docker-containers-with-coscale and http://www.coscale.com/blog/how-to-monitor-your-kubernetes-cluster A summary of CoScale Docker performance monitoring can be found here: http://www.coscale.com/docker-monitoring

Performance monitoring for Docker - Lucerne meetup

Stijn Polfliet

Nagios Conference 2014 - Spenser Reinhardt - Detecting Security Breaches With...

Nagios

Monitoring docker container and dockerized applications

Ananth Padmanabhan

Docker Indy Meetup Monitoring 30-Aug-2016

Matt Bentley

Monitoring docker containers and dockerized applications

Satya Sanjibani Routray

ContainerDays NYC 2016: "Observability and Manageability in a Container Envir...

DynamicInfraDays

Voxxed Days Thessaloniki 2016 - Microservices in production

Voxxed Days Thessaloniki

2008 "An overview of Methods for analysis of Identifiability and Observabilit...

Steinar Elgsæter

At SoundCloud we managed to break away from the monolith while delivering key business features. Our journey towards a microservices architecture has not been a straightforward one. We experimented a lot to reach the set of tools and technologies that we use today. We changed how we build our applications. We introduced specific apis for our mobile and web clients. We call them BFFs (backend for the frontend). They became the central piece of SoundCloud’s architecture. We rethought how we monitor our services. We created a service registry for knowledge sharing. While making all these changes, we benefited from the learnings of our peer companies. This talk will share our learnings from this journey: what worked for us and what we moved away from.

BFF Pattern in Action: SoundCloud’s Microservices

Bora Tunca

Microservice Architecture

Engin Yoeyen

Tracing 2000+ polyglot microservices at Uber with Jaeger and OpenTracing

Yuri Shkuro

Often what you monitor and get alerted on is defined by your tools, rather than what makes the most sense to you and your organisation. Alerts on metrics such as CPU usage which are noisy and rarely spot real problems, while outages go undetected. Monitoring systems can also be challenging to maintain, and overall provide a poor return on investment. In the past few years several new monitoring systems have appeared with more powerful semantics and which are easier to run, which offer a way to vastly improve how your organisation operates Prometheus is one such system. This talk will look at the monitoring ideal and how whitebox monitoring with a time series database, multi-dimensional labels and a powerful querying/alerting language can free you from midnight pages.

Monitoring What Matters: The Prometheus Approach to Whitebox Monitoring (Berl...

Brian Brazil

In this session we’ll leave the need for performance a foregone conclusion and take a whirlwind tour through the complexity of modern Internet architectures. The complexities lead to evil optimization problems and significant challenges troubleshooting production issues to a speedy and successful end. Starting with the simple facts that you can’t fix what you can’t see and you can’t improve what you can’t measure, we’ll discuss what needs monitoring and why. We’ll talk about unlikely allies in the fight for time and budget to instrument systems, applications and processes for observability. You’ll leave the session with a better understanding of what it looks like to troubleshoot the storm of a malfunctioning large architecture and some tools and techniques you can use to not be swallowed by the Kraken.

Monitoring and observability

Theo Schlossnagle

Microservices promise to increase time-to-market, support growth and foster innovation by enforcing Agile, product-centered and self-enabled teams. However, building a system of microservices that actually works is not an easy endeavour - after all, you're building a highly dynamic, distributed and fault-tolerant system. In this presentation I'll share important learnings around microservices and how to use the Dynatrace digital performance management platform on Red Hat's OpenShift to manage the inherent complexities of microservices-oriented architectures.

Monitoring Microservices at Scale on OpenShift (OpenShift Commons Briefing #52)

Martin Etmajer

Delivered at the FISL13 conference in Brazil: http://www.youtube.com/watch?v=K9w2cipqfvc This talk introduces the USE Method: a simple strategy for performing a complete check of system performance health, identifying common bottlenecks and errors. This methodology can be used early in a performance investigation to quickly identify the most severe system performance issues, and is a methodology the speaker has used successfully for years in both enterprise and cloud computing environments. Checklists have been developed to show how the USE Method can be applied to Solaris/illumos-based and Linux-based systems. Many hardware and software resource types have been commonly overlooked, including memory and I/O busses, CPU interconnects, and kernel locks. Any of these can become a system bottleneck. The USE Method provides a way to find and identify these. This approach focuses on the questions to ask of the system, before reaching for the tools. Tools that are ultimately used include all the standard performance tools (vmstat, iostat, top), and more advanced tools, including dynamic tracing (DTrace), and hardware performance counters. Other performance methodologies are included for comparison: the Problem Statement Method, Workload Characterization Method, and Drill-Down Analysis Method.

Performance Analysis: The USE Method

Brendan Gregg

Talk from SREcon2016 by Brendan Gregg. Video: https://www.usenix.org/conference/srecon16/program/presentation/gregg . "There's limited time for performance analysis in the emergency room. When there is a performance-related site outage, the SRE team must analyze and solve complex performance issues as quickly as possible, and under pressure. Many performance tools and techniques are designed for a different environment: an engineer analyzing their system over the course of hours or days, and given time to try dozens of tools: profilers, tracers, monitoring tools, benchmarks, as well as different tunings and configurations. But when Netflix is down, minutes matter, and there's little time for such traditional systems analysis. As with aviation emergencies, short checklists and quick procedures can be applied by the on-call SRE staff to help solve performance issues as quickly as possible. In this talk, I'll cover a checklist for Linux performance analysis in 60 seconds, as well as other methodology-derived checklists and procedures for cloud computing, with examples of performance issues for context. Whether you are solving crises in the SRE war room, or just have limited time for performance engineering, these checklists and approaches should help you find some quick performance wins. Safe flying."

SREcon 2016 Performance Checklists for SREs

Brendan Gregg

If you are interested to know more about AWS Chicago Summit, please use the following to register: http://amzn.to/1RooPPL Amazon Kinesis is a fully managed, cloud-based service for real-time data processing over large, distributed data streams. AWS Lambda is a compute service that runs your code in response to events and automatically manages the compute resources for you. AWS Lambda can run code in response to data in Amazon Kinesis streams, making it easy to build big data applications that respond quickly to new information. In this webinar, we will cover key Kinesis and Lambda features, walk through sample use cases for stream processing, and discuss best practices on using the services together. We'll then demonstrate setting up an Amazon Kinesis stream and an associated Lambda function to capture and perform custom computations on click-stream data, all without setting up any infrastructure. Learning Objectives: • Understand key Amazon Kinesis and AWS Lambda features • Learn how to setup streaming data capture and processing framework using AWS Lambda • Learn sample use cases, best practices and tips on using AWS Lambda with Amazon Kinesis Who Should Attend: • Developers, Devops Engineers, IT Operations Professionals

AWS May Webinar Series - Streaming Data Processing with Amazon Kinesis and AW...

Amazon Web Services

Amazon EC2 provides a broad selection of instance types to accommodate a diverse mix of workloads. In this session, we provide an overview of the Amazon EC2 instance platform, key platform features, and the concept of instance generations. We dive into the current generation design choices of the different instance families, including the General Purpose, Compute Optimized, Storage Optimized, Memory Optimized, and GPU instance families. We also detail best practices and share performance tips for getting the most out of your Amazon EC2 instances.

AWS re:Invent 2016: Deep Dive on Amazon EC2 Instances, Featuring Performance ...

Amazon Web Services

Just as we got a hang of monitoring our server-based applications, they take away the server. How do you monitor something that doesn’t exist? Which metrics matter most in a serverless world? In this session, we will look at how applications are different in an AWS Lambda-based world and how to monitor them. Join us as we work our way through the stack and demonstrate how to capture the health and performance of your services. The focus of this session is not tool-specific. Attendees will learn production-tested lessons and leave with frameworks they can implement with their serverless workloads, no matter which platforms and tools they use. This session sponsored by Datadog. AWS Competency Partner

AWS re:Invent 2016: Monitoring, Hold the Infrastructure: Getting the Most fro...

Amazon Web Services

Andere mochten auch (20)

Measuring Micro-services. Richard Rodger

Performance monitoring for Docker - Lucerne meetup

Nagios Conference 2014 - Spenser Reinhardt - Detecting Security Breaches With...

Monitoring docker container and dockerized applications

Docker Indy Meetup Monitoring 30-Aug-2016

Monitoring docker containers and dockerized applications

ContainerDays NYC 2016: "Observability and Manageability in a Container Envir...

Voxxed Days Thessaloniki 2016 - Microservices in production

2008 "An overview of Methods for analysis of Identifiability and Observabilit...

BFF Pattern in Action: SoundCloud’s Microservices

Microservice Architecture

Tracing 2000+ polyglot microservices at Uber with Jaeger and OpenTracing

Monitoring What Matters: The Prometheus Approach to Whitebox Monitoring (Berl...

Monitoring and observability

Monitoring Microservices at Scale on OpenShift (OpenShift Commons Briefing #52)

Performance Analysis: The USE Method

SREcon 2016 Performance Checklists for SREs

AWS May Webinar Series - Streaming Data Processing with Amazon Kinesis and AW...

AWS re:Invent 2016: Deep Dive on Amazon EC2 Instances, Featuring Performance ...

AWS re:Invent 2016: Monitoring, Hold the Infrastructure: Getting the Most fro...

Ähnlich wie Monitoring Docker containers - Docker NYC Feb 2015

Docker is the developer-friendly container technology that enables creation of your application stack: OS, JVM, app server, app, database and all your custom configuration. So you are a Java developer but how comfortable are you and your team taking Docker from development to production? Are you hearing developers say, “But it works on my machine!” when code breaks in production? And if you are, how many hours are then spent standing up an accurate test environment to research and fix the bug that caused the problem? This workshop/session explains how to package, deploy, and scale Java applications using Docker.

Devoxx 2016 - Docker Nuts and Bolts

Patrick Chanezon

Intro Docker october 2013

dotCloud

Docker introduction

dotCloud

Incident response is generally predicated on the ability to examine a system post-breach, pull memory dumps, file system artifacts, system logs, etc. But what happens when that system was part of a fleet of containers? How do you pull a memory dump from an ephemeral container? How do you do forensics when the container and the host that ran the container have been gone for days? Even assuming you catch an intrusion while it's ongoing, how do you respond effectively if you can't access the systems in question because they are read-only, no SSH access? Coinbase has spent the last year attacking these challenges in a AWS-based, immutable and fully containerized infrastructure that stores over a billion dollars of digital currency. Come see how we do it.

Dock ir incident response in a containerized, immutable, continually deploy...

Shakacon

Write Once and REALLY Run Anywhere | OpenStack Summit HK 2013

dotCloud

OpenStack Summit

Docker, Inc.

Docker Presentation at the OpenStack Austin Meetup | 2013-09-12

dotCloud

Application Deployment on Openstack

Docker, Inc.

What's New in Docker - February 2017

Patrick Chanezon

Live recording with the demos: https://www.youtube.com/watch?v=0XRcmJEiZOM Contents - The application distribution challenge - The current solutions - Introduction to Docker, Containers, and the Matrix from Hell - Why people care: Separation of Concerns - Technical Discussion - Ecosystem, momentum - How to build Docker images - How to make containers talk to each other, how to handle data persistence - Demo 1: isolation - Demo 2: real case - installing Go Math! Academy, tail –f containers, unit tests

The challenge of application distribution - Introduction to Docker (2014 dec ...

Sébastien Portebois

State of the Container Ecosystem

Vinay Rao

Webinar Docker Tri Series

Newt Global Consulting LLC

Detailed Introduction To Docker

nklmish

Docker is the world's leading software containerization platform. This is a comprehensive introduction to Docker, suitable for delivering in introductory meetups to an audience who does not know about docker. In case you want to deliver this presentation somewhere, kindly drop me a mail at aditya.konarde@gmail.com You can contact me at: Connect with me onLinkedIN: https://www.linkedin.com/in/adityakonarde Add me on Facebook: https://www.facebook.com/Aditya.Konarde Tweet to me @aditya_konarde

Introduction to Docker

Aditya Konarde

Docker-Intro

Sujai Sivasamy

Adopting Docker for production applications and services used to be hard. You had to hand-roll a lot of the underlying infrastructure and write lots of custom code for service discovery, load balancing, orchestration, desired state, etc. Today, with the rise of open source container orchestration platforms and cloud-native offerings, it's a lot easier to get up and running. Github repo for demo: https://github.com/elabor8/dockertalk

Using Docker in production: Get started today!

Clarence Bakirtzidis

Containing the world with Docker

Giuseppe Piccolo

Docker & Daily DevOps

Satria Ady Pradana

Docker and-daily-devops

Satria Ady Pradana

Docker-Hanoi @DKT , Presentation about Docker Ecosystem

Van Phuc

Ähnlich wie Monitoring Docker containers - Docker NYC Feb 2015 (20)

Devoxx 2016 - Docker Nuts and Bolts

Intro Docker october 2013

Docker introduction

Dock ir incident response in a containerized, immutable, continually deploy...

Write Once and REALLY Run Anywhere | OpenStack Summit HK 2013

OpenStack Summit

Docker Presentation at the OpenStack Austin Meetup | 2013-09-12

Application Deployment on Openstack

What's New in Docker - February 2017

The challenge of application distribution - Introduction to Docker (2014 dec ...

State of the Container Ecosystem

Webinar Docker Tri Series

Detailed Introduction To Docker

Introduction to Docker

Docker-Intro

Using Docker in production: Get started today!

Containing the world with Docker

Docker & Daily DevOps

Docker and-daily-devops

Docker-Hanoi @DKT , Presentation about Docker Ecosystem

Mehr von Datadog

Webinar that took place on July 12 2017. The emergence of cloud-based infrastructure has dramatically reshaped the IT landscape for managed service providers and their customers. Infrastructure is now dynamic, elastic, and instantly available to any individual or organization. Customers are becoming increasingly aware of the value of cloud services, and with this heightened awareness comes the desire to partner with providers who can guide them toward innovative business solutions and high-performance environments. But in this new landscape, gaining insight into the status and performance of dynamic infrastructure and applications is more challenging than ever. Join us as we host Thomas Robinson, Solutions Architect at Amazon Web Services, and Patrick Hannah, VP of Engineering at CloudHesive, to discuss what it means to be a next-generation managed service provider and how Datadog provides visibility into modern cloud infrastructure and helps you adopt new approaches to remain competitive in this ever-changing environment.

What it Means to be a Next-Generation Managed Service Provider

Datadog

Monitoring kubernetes across data center and cloud

Datadog

Datadog + VictorOps Webinar

Datadog

Dataday Texas 2016 - Datadog

Datadog

Monitoring even a modestly-sized systems infrastructure quickly becomes untenable without automated alerting. For many metrics it is nontrivial to define ahead of time what constitutes “normal” versus “abnormal” values. This is especially true for metrics whose baseline value fluctuates over time. To make this problem more tractable, Datadog provides outlier detection functionality to automatically identify any host (or group of hosts) that is behaving abnormally compared to its peers. These slides cover the algorithms we use for outlier detection, and show how easy they are to implement using Python. This presentation also covers the lessons we've learned from using outlier detection on our own systems, along with some real-life examples on how to avoid false positives and negatives. Learn more at www.datadoghq.com.

PyData NYC 2015 - Automatically Detecting Outliers with Datadog

Datadog

Treating Infrastructure as Garbage

Datadog

Events and metrics the Lifeblood of Webops

Datadog

Big (IT) data

Datadog

Deep dive into Nagios analytics

Datadog

Just enough web ops for web developers

Datadog

Customer Ops: DevOps <3 customer support

Datadog

I <3 graphs in 20 slides

Datadog

Effective monitoring with StatsD

Datadog

Alerting: more signal, less noise, less pain

Datadog

Fact based monitoring

Datadog

Your configuration management is fact-based. Your orchestration is fact-based. Is your monitoring fact-based? What does that even mean? Monitoring is very similar to configuration, at least in its expression. Configuration cares about files, services, and hosts being present and in a certain state (""nginx should be running with the following configuration""). Monitoring cares about services being present, running, and in a certain state. Both describe your infrastructure as it should be (""nginx should be running and respond in less than 200ms""). Fact-based monitoring is about being able to control monitoring with the same facts that Puppet uses (""monitor nginx latency wherever Puppet says it should run""). This is in contrast with imperative monitoring (""monitor nginx on host a, b and c"") that gets out of sync and leads to mailbox meltdowns from spurious alerts. Using open source and commercial examples, this talk will help you express your monitoring in a way that will feel very natural to your Puppet configuration.

Fact-Based Monitoring

Datadog

NGINX just works and that's why we use it. That does not mean that it should be left unmonitored. As a web server, it plays a central role in a modern infrastructure. As a gatekeeper, it sees every interaction with the application. If you monitor it properly it can explain a lot about what is happening in the rest of your infrastructure. In this talk you will learn more about NGINX (plus) metrics, what they mean and how to use them. You will also learn different methods (status, statsd, logs) to monitor NGINX with their pros and cons, illustrated with real data coming from real servers.

Monitoring NGINX (plus): key metrics and how-to

Datadog

What’s in this Cookbook? - Mike Fiedler

Datadog

I Love Graphs - Alexis Lê-Quôc

Datadog

Virtualization at Gilt - Rangarajan Radhakrishnan

Datadog

Mehr von Datadog (20)

What it Means to be a Next-Generation Managed Service Provider

Monitoring kubernetes across data center and cloud

Datadog + VictorOps Webinar

Dataday Texas 2016 - Datadog

PyData NYC 2015 - Automatically Detecting Outliers with Datadog

Treating Infrastructure as Garbage

Events and metrics the Lifeblood of Webops

Big (IT) data

Deep dive into Nagios analytics

Just enough web ops for web developers

Customer Ops: DevOps <3 customer support

I <3 graphs in 20 slides

Effective monitoring with StatsD

Alerting: more signal, less noise, less pain

Fact based monitoring

Fact-Based Monitoring

Monitoring NGINX (plus): key metrics and how-to

What’s in this Cookbook? - Mike Fiedler

I Love Graphs - Alexis Lê-Quôc

Virtualization at Gilt - Rangarajan Radhakrishnan

Kürzlich hochgeladen

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

Juan lago vázquez

MINDCTI Revenue Release Quarter One 2024

MIND CTI

The microservices honeymoon is over. When starting a new project or revamping a legacy monolith, teams started looking for alternatives to microservices. The Modular Monolith, or 'Modulith', is an architecture that reaps the benefits of (vertical) functional decoupling without the high costs associated with separate deployments. This talk will delve into the advantages and challenges of this progressive architecture, beginning with exploring the concept of a 'module', its internal structure, public API, and inter-module communication patterns. Supported by spring-modulith, the talk provides practical guidance on addressing the main challenges of a Modultith Architecture: finding and guarding module boundaries, data decoupling, and integration module-testing. You should not miss this talk if you are a software architect or tech lead seeking practical, scalable solutions. About the author With two decades of experience, Victor is a Java Champion working as a trainer for top companies in Europe. Five thousands developers in 120 companies attended his workshops, so he gets to debate every week the challenges that various projects struggle with. In return, Victor summarizes key points from these workshops in conference talks and online meetups for the European Software Crafters, the world’s largest developer community around architecture, refactoring, and testing. Discover how Victor can help you on victorrentea.ro : company training catalog, consultancy and YouTube playlists.

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

Victor Rentea

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

MadyBayot

Webinar Recording: https://www.panagenda.com/webinars/why-teams-call-analytics-is-critical-to-your-entire-business Nothing is as frustrating and noticeable as being in an important call and being unable to see or hear the other person. Not surprising then, that issues with Teams calls are among the most common problems users call their helpdesk for. Having in depth insight into everything relevant going on at the user’s device, local network, ISP and Microsoft itself during the call is crucial for good Microsoft Teams Call quality support. To ensure a quick and adequate solution and to ensure your users get the most out of their Microsoft 365. But did you know that ‘bad calls’ are also an excellent indicator of other problems arising? Precisely because it is so noticeable!? Like the canary in the mine, bad calls can be early indicators of problems. Problems that might otherwise not have been noticed for a while but can have a big impact on productivity and satisfaction. Join this session by Christoph Adler to learn how true Microsoft Teams call quality analytics helped other organizations troubleshoot bad calls and identify and fix problems that impacted Teams calls or the use of Microsoft365 in general. See what it can do to keep your users happy and productive! In this session we will cover - Why CQD data alone is not enough to troubleshoot call problems - The importance of attributing call problems to the right call participant - What call quality analytics can do to help you quickly find, fix-, and prevent problems - Why having retrospective detailed insights matters - Real life examples of how others have used Microsoft Teams call quality monitoring to problem shoot problems with their ISP, network, device health and more.

Why Teams call analytics are critical to your entire business

panagenda

presentation ICT roal in 21st century education

jfdjdjcjdnsjd

DBX First Quarter 2024 Investor Presentation

Dropbox

Tracing the root cause of a performance issue requires a lot of patience, experience, and focus. It’s so hard that we sometimes attempt to guess by trying out tentative fixes, but that usually results in frustration, messy code, and a considerable waste of time and money. This talk explains how to correctly zoom in on a performance bottleneck using three levels of profiling: distributed tracing, metrics, and method profiling. After we learn to read the JVM profiler output as a flame graph, we explore a series of bottlenecks typical for backend systems, like connection/thread pool starvation, invisible aspects, blocking code, hot CPU methods, lock contention, and Virtual Thread pinning, and we learn to trace them even if they occur in library code you are not familiar with. Attend this talk and prepare for the performance issues that will eventually hit any successful system. About authorWith two decades of experience, Victor is a Java Champion working as a trainer for top companies in Europe. Five thousands developers in 120 companies attended his workshops, so he gets to debate every week the challenges that various projects struggle with. In return, Victor summarizes key points from these workshops in conference talks and online meetups for the European Software Crafters, the world’s largest developer community around architecture, refactoring, and testing. Discover how Victor can help you on victorrentea.ro : company training catalog, consultancy and YouTube playlists.

Finding Java's Hidden Performance Traps @ DevoxxUK 2024

Victor Rentea

Following the popularity of "Cloud Revolution: Exploring the New Wave of Serverless Spatial Data," we're thrilled to announce this much-anticipated encore webinar. In this sequel, we'll dive deeper into the Cloud-Native realm by uncovering practical applications and FME support for these new formats, including COGs, COPC, FlatGeoBuf, GeoParquet, STAC, and ZARR. Building on the foundation laid by industry leaders Michelle Roby of Radiant Earth and Chris Holmes of Planet in the first webinar, this second part offers an in-depth look at the real-world application and behind-the-scenes dynamics of these cutting-edge formats. We will spotlight specific use-cases and workflows, showcasing their efficiency and relevance in practical scenarios. Discover the vast possibilities each format holds, highlighted through detailed discussions and demonstrations. Our expert speakers will dissect the key aspects and provide critical takeaways for effective use, ensuring attendees leave with a thorough understanding of how to apply these formats in their own projects. Elevate your understanding of how FME supports these cutting-edge technologies, enhancing your ability to manage, share, and analyze spatial data. Whether you're building on knowledge from our initial session or are new to the serverless spatial data landscape, this webinar is your gateway to mastering cloud-native formats in your workflows.

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Safe Software

Corporate and higher education. Two industries that, in the past, have had a clear divide with very little crossover. The difference in goals, learning styles and objectives paved the way for differing learning technologies platforms to evolve. Now, those stark lines are blurring as both sides are discovering they have content that’s relevant to the other. Join Tammy Rutherford as she walks through the pros and cons of corporate and higher ed collaborating. And the challenges of these different technology platforms working together for a brighter future.

Corporate and higher education May webinar.pptx

Rustici Software

Elevate Developer Efficiency & build GenAI Application with Amazon Q

Bhuvaneswari Subramani

The value of a flexible API Management solution for Open Banking Steve Melan, Manager for IT Innovation and Architecture - State's and Saving's Bank of Luxembourg Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - The value of a flexible API Management solution for O...

apidays

The Good, the Bad and the Governed - Why is governance a dirty word? David O'Neill, Chief Operating Officer - APIContext Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

apidays

[BuildWithAI] Introduction to Gemini.pdf

Sandro Moreira

💉💊+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI}}+971581248768 +971581248768 Mtp-Kit (500MG) Prices » Dubai [(+971581248768**)] Abortion Pills For Sale In Dubai, UAE, Mifepristone and Misoprostol Tablets Available In Dubai, UAE CONTACT DR.Maya Whatsapp +971581248768 We Have Abortion Pills / Cytotec Tablets /Mifegest Kit Available in Dubai, Sharjah, Abudhabi, Ajman, Alain, Fujairah, Ras Al Khaimah, Umm Al Quwain, UAE, Buy cytotec in Dubai +971581248768''''Abortion Pills near me DUBAI | ABU DHABI|UAE. Price of Misoprostol, Cytotec” +971581248768' Dr.DEEM ''BUY ABORTION PILLS MIFEGEST KIT, MISOPROTONE, CYTOTEC PILLS IN DUBAI, ABU DHABI,UAE'' Contact me now via What's App…… abortion Pills Cytotec also available Oman Qatar Doha Saudi Arabia Bahrain Above all, Cytotec Abortion Pills are Available In Dubai / UAE, you will be very happy to do abortion in Dubai we are providing cytotec 200mg abortion pill in Dubai, UAE. Medication abortion offers an alternative to Surgical Abortion for women in the early weeks of pregnancy. We only offer abortion pills from 1 week-6 Months. We then advise you to use surgery if its beyond 6 months. Our Abu Dhabi, Ajman, Al Ain, Dubai, Fujairah, Ras Al Khaimah (RAK), Sharjah, Umm Al Quwain (UAQ) United Arab Emirates Abortion Clinic provides the safest and most advanced techniques for providing non-surgical, medical and surgical abortion methods for early through late second trimester, including the Abortion By Pill Procedure (RU 486, Mifeprex, Mifepristone, early options French Abortion Pill), Tamoxifen, Methotrexate and Cytotec (Misoprostol). The Abu Dhabi, United Arab Emirates Abortion Clinic performs Same Day Abortion Procedure using medications that are taken on the first day of the office visit and will cause the abortion to occur generally within 4 to 6 hours (as early as 30 minutes) for patients who are 3 to 12 weeks pregnant. When Mifepristone and Misoprostol are used, 50% of patients complete in 4 to 6 hours; 75% to 80% in 12 hours; and 90% in 24 hours. We use a regimen that allows for completion without the need for surgery 99% of the time. All advanced second trimester and late term pregnancies at our Tampa clinic (17 to 24 weeks or greater) can be completed within 24 hours or less 99% of the time without the need surgery. The procedure is completed with minimal to no complications. Our Women's Health Center located in Abu Dhabi, United Arab Emirates, uses the latest medications for medical abortions (RU-486, Mifeprex, Mifegyne, Mifepristone, early options French abortion pill), Methotrexate and Cytotec (Misoprostol). The safety standards of our Abu Dhabi, United Arab Emirates Abortion Doctors remain unparalleled. They consistently maintain the lowest complication rates throughout the nation. Our Physicians and staff are always available to answer questions and care for women in one of the most difficult times in their lives. The decision to have an abortion at the Abortion Cl

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

MS Copilot expands with MS Graph connectors

Nanddeep Nachan

Exploring Multimodal Embeddings with Milvus

Zilliz

FWD Group - Insurer Innovation Award 2024

The Digital Insurer

Whatsapp Number Escorts Call girls 8617370543 Available 24x7 Mcleodganj Call Girls Service Offer Genuine VIP Model Escorts Call Girls in Your Budget. Mcleodganj Call Girls Service Provide Real Call Girls Number. Make Your Sexual Pleasure Memorable with Our Mcleodganj Call Girls at Affordable Price. Top VIP Escorts Call Girls, High Profile Independent Escorts Call Girls, Housewife Women Escorts Call Girl, College Girls Escorts Call Girls, Russian Escorts Call girls Service in Your Budget.

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model

Deepika Singh

Passkeys: Developing APIs to enable passwordless authentication Cody Salas, Sr Developer Advocate | Solutions Architect - Yubico Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...

apidays

Kürzlich hochgeladen (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

MINDCTI Revenue Release Quarter One 2024

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

Why Teams call analytics are critical to your entire business

presentation ICT roal in 21st century education

DBX First Quarter 2024 Investor Presentation

Finding Java's Hidden Performance Traps @ DevoxxUK 2024

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Corporate and higher education May webinar.pptx

Elevate Developer Efficiency & build GenAI Application with Amazon Q

Apidays New York 2024 - The value of a flexible API Management solution for O...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

[BuildWithAI] Introduction to Gemini.pdf

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

MS Copilot expands with MS Graph connectors

Exploring Multimodal Embeddings with Milvus

FWD Group - Insurer Innovation Award 2024

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...

Monitoring Docker containers - Docker NYC Feb 2015

1. Monitoring and Running Docker Containers at Scale Docker NYC Meetup February 25th, 2015

2. @alq — CTO at Datadog

3. Datadog • Monitoring service • Made for the cloud • Aggregates everything • Support for Docker (since 1.0)

4. Goal of this talk Rethink the monitoring of Docker containers

5. Agenda 1.A (very) brief history of containers 2.Operational complexity 3.Monitoring Docker effectively 4.Demo

6. A brief history of containers

7. Containers in a nutshell • Been around for a long time – jails, zones, cgroups • No full-virtualization overhead • Used for runtime isolation (e.g. jails) • Docker is an Escape from Dependency Hell

8. Escape from dependency hell a.out shared libs packages omnibus Docker == ?

9. Mini-host or über-process? Process Container Host Spec Source Dockerfile Kickstart On disk .TEXT /var/lib/docker / In memory PID Container ID Hostname In the network Socket veth* eth* Runtime context server core host data center

10. Mini-host or über-process?

11. Operational complexity

12. Combinatorial multiplication Hardware OS Off-the-shelf Your Application Hardware Hypervisor Off-the- shelf App OS OS Off-the- shelf App Hardware Hypervisor OS OS A A A A Containers O O O O

13. Operational complexity • Average containers per host: N (N=5, 10/2014) • N-times as many “hosts” to manage • Affects – provisioning: prep’ing & building containers – configuration: passing config to containers – orchestration: deciding where/when containers run – monitoring: making sure containers run properly

14. Complexity increases with... 1. Number of things to measure 2. Velocity of change

15. Number of things to measure • 1 Amazon EC2 instance – 10 CloudWatch metrics • 1 operating system (e.g. linux) – 100 metrics •N containers – 100*N metrics •110 + 100*N metrics per instance

16. Combinatorial multiplication 100 500instances containers Assuming only 5 containers per instance

17. Combinatorial multiplication 160 610metrics per host metrics per host Assuming only 5 containers per instance

18. Combinatorial multiplication 100 61,000instances metrics Assuming only 5 containers per instance

19. Velocity hours, days, months minutes, hours, days Host half-life Container half-life

20. Aggravating factors • Registry-based provisioning – new images as fast as you can git commit • Autonomic orchestration – from imperative to declarative – automated – individual containers don’t matter – e.g. kubernetes, mesos

21. A lot more, A lot faster.

22. If your monitoring is still centered on individual hosts or instances…

23. Host-centric monitoring Monitor Monitor GA P Hypervisor OS OS A A A A Containers O O O O

24. A lot more pain, A lot faster.

25. Monitoring containers effectively

26. A new approach to container monitoring

27. Layers + Tags

28. Layers of monitoring Monitor Hypervisor OS OS A A A A Containers O O O O

29. Layers of monitoring CloudWatch Infrastructure Monitoring APM Hypervisor OS OS A A A A Containers O O O O

30. Layers of monitoring cpu/net/io filesystem docker mem docker cpu db queries web requests app throughput CloudWatch Infrastructure Monitoring APM e.g . Hypervisor OS OS A A A A Containers O O O O

31. Layers of monitoring • Access to metrics from all the layers • Amazon CloudWatch, OS metrics, Docker metrics, app metrics in 1 place • Shared timeline

32. If monitoring does not cover all layers, pain.

33. Tags (a.k.a. labels) You (probably) already use them

34. Tags • Monitoring is like Auto-Scaling Groups • Monitoring is like Docker orchestration • From imperative to declarative • Query-based • Queries operate on tags

35. Monitoring with tags and queries “Monitor all Docker containers running image web” “… in region us-west-2 across all availability zones” “… and make sure resident set size < 1GB on c3.xl”

36. Monitoring with tags and queries “Monitor all Docker containers running image web” “… in region us-west-2 across all availability zones” “… and make sure resident set size < 1GB on c3.xl”

37. Monitoring with tags and queries “Monitor all Docker containers running image web” “… in region us-west-2 across all availability zones” “… that use more than 1.5x the average on c3.xl”

38. Demo: layers & tags

39. Take-aways 1. Docker increases operational complexity by an order of magnitude unless… 2. You have layered monitoring, from the instance to the container and to the application, and… 3. You monitor using tags and queries

Hinweis der Redaktion

My name is Alexis. I’m the CTO of Datadog. We monitor cloud-based infrastructures. We have been monitoring containers for a few years now (lxc then docker)
Datadog is a monitoring service made for cloud environments, such as AWS, Azure, Google Cloud, etc. By that I mean that Datadog understands that your infrastructure can change at any time and deals with it naturally. To be able to monitor effectively, Datadog acts as an aggregator: it aggregates everything, it speaks native Cloudwatch and over 100 different other sources, like databases, web servers, etc.
My goals for this talk are three-fold. Dive into key Docker metrics Explain operational complexity. In other words I want to take what we have seen on the field and show you where the pain points will be. Rethink monitoring of Docker containers. The old tricks won’t work.
Here’s what I would like to talk about today. I will start with very brief history of containers and docker. This is a popular topic so I will only focus on operational matters, including key metrics that containers expose. I will focus on the inherent complexity that comes with running fleets of containers. I will illustrate this with what we see out there, in the real world. We have a particular vantage point that gives us good insight into this.
Containers, as lightweight virtual runtimes have been around for a while without going back all the way to the mainframe. Depending on the operating system, they go by the name of jails, zones, cgroups and are like traditional VMs, without the flexibility but also without the overhead. They were initially designed for security reasons (e.g. jails) but most recently have been used to escape dependency hell.
Dependency hell is this state where you end up having tens or hundreds of dependencies on shared code. Before shared libraries we had compile-time dependencies to build static executables. Shared libraries were a good idea when the size of a library was commensurate to the amount of RAM available in a machine. Now, obviously, there is a lot less memory pressure. Still, that has remained the default way to build software. Then, packages came: apt, yum, rvm, virtualenv, etc. as a partial solution to have a group of binaries that reliably work together. That proved too slow, having to wait for upstream updates so people started to bundle their code and dependencies into /opt. Then a way to make self-contained packages. And now we are back full-circle to static binaries, when we realized how much baggage we carried in shared code.
When you look at it a container is a hybrid between a process and a full-blown host. It has a Dockerfile, which is a manifest or a recipe to build the container, much like source code builds a binary and kickstart, chef or puppet build a full-blown host. Then you have the actual binary representation of the container on disk, in /var/lib/docker. For a binary, it’s the .text section. For a host it’s its filesystem. Finally when it runs a container has a unique ID, much like a process has a PID and a host has a hostname. So a container is this intermediary between a single binary and a full-blown host. It’s lik a static binary with a fully-functioning IP stack. To put it simply if you look at it from a dev point of view, a container looks like a binary. If you are think about it from an operations point of view, a container is closer to a host.
Let’s recap for a minute. We know that a container is a lightweight VM We know roughly what current deployments look like in number of containers per instance. We know how to measure the performance of a single container. How do we monitor the whole thing. Here I want to make the case that Docker introduces operational complexity
This is how the stack has evolved over the past 15 years. On the left, without virtualization. Off-the-shelf could be your J2EE runtime, or your database. Then when virtualization and services like EC2 were introduced, in the middle. It’s allowed better utilization and quasi-instant provisioning but for an engineer, few things have changed. And now running Docker containers inside EC2 instances on top of real hardware. There is a clear trend here toward a lot more moving parts than before. It also puts engineering much closer to operations.
Specifically by an order of magnitude or so given the 5 containers per instance on average.. This affects a lot of different things at run-time. provisioning: docker configuration: etcd, confd, consul, etc. orchestration: kubernetes, mesos monitoring: where I can contribute the most
Let’s look at monitoring an EC2 instance. I counted 10 CloudWatch metrics, about 100 metrics coming from the OS, 50 metrics coming from a container, 10-15 of which are critical to monitor, and let’s say 50 metrics for an off-the-shelf component, for instance a database. This is a conservative estimate as we see our customers use many more metrics per instance.
Now let’s plug in some numbers. Assuming you have 100 instances, and 5 containers per instance, you have 500 containers to manage and monitor. And remember, from a management standpoint, containers behave like hosts. Single-purpose hosts, but hosts none the less.
So for a given instance, you have moved from 160 metrics per instance, to about 410. Again assuming, 5 containers per host and being conservative on the number of metrics you need to keep an eye on.
If I recap, 100 instances, 41,000 metrics generated. That’s already 3x what you had before.
And it gets worse. Much worse Let’s talk about velocity. If you compare the “half-life” of an EC2 instance, and by half-life I mean the median uptime of your instances. You’re likely having a mix of hourly instances and long-lived instances that will go on for months. Compare this to containers. A container’s half-life can be in minutes, days at the most.
On top of that, you’ll have to layer in much faster provisioning, where new versions of containers are created on a daily basis, so you rotate your container fleet on a daily basis between versions. Much faster and much more often than doing an OS upgrade. And you add autonomic orchestration that go from imperative to declarative. So you can say, I need 1 container of this kind per instance per zone, at all times. And the scheduler makes sure it’s always the case. If you use mesos or kubernetes, this is your new reality
In summary, from a management and monitoring standpoint, it means a lot more and a lot faster. More moving parts that change pretty much all the time with limited predictability.
If your monitoring is still centered around hosts, this is what your world view looks like: complicated. When we talk to customers, they feel that the move to EC2 was a key factor to rethink their monitoring. Because instances come and go, different groups within their organization would spin up new stacks with little advance notice. Imagine if you throw containers in the mix. The old, host-centric monitoring practice simply stops working altogether. The host-centric monitoring practice that has you track individual hosts. It’s a bit like ptolemaic astronomy. Put the earth at the center of the universe and account for the movement of the planets. It gets pretty complicated.
In other words host-centric monitoring does not really understand containers, so either you treat them as hosts, and you have a lot of hosts that come and go every few minutes, which makes your life miserable because the host-centric monitoring system thinks half of your infrastructure is on fire. Or you don’t track containers, and you essentially have a gap. You see the OS, you see the app, and what happens in the middle, well…
So in short, if you think about monitoring containers like you’ve monitored hosts before, you’re in for a painful ride very very quickly.
So how do we do it properly?
We need a new approach, that does not treat everything like a host. The picture here, as you’ve guessed, comes from Copernicus. He suggested a radical approach to simplifying the universe. Don’t put the earth at the center of it… Compared to putting the earth at the center of the universe, this one is striking in clarity and simplicity.
So what’s the secret sauce? It’s simple: forget about hosts, think in layers and tags. What do you I mean by that…
Using a layered monitoring approach is pretty simple. This is where you want to be: have coverage from the bottom of the stack all the way to the top.
Which means using monitoring tools that don’t leave any gap. At the bottom, CloudWatch to know about the VMs. In the middle, an infrastructure monitoring system that understands containers. Ad at the top, an application performance monitoring tool.
So in terms of what you can see through these tools: At the bottom, raw resources like cpu, network, io of the VM. In the middle, anything from the OS to docker metrics. At the top, application throughput.
The key here is to have 1 shared timeline for everything. You want to get CloudWatch metrics, OS metrics, Docker metrics and app metrics, ideally in 1 place, all on the same timeline so that you can see when things break, how changes ripples through the different layers.
That’s the first part of the equation. Layers.
Tags is the second half of the equation. The good news is that you use them already. How are they relevant to monitoring in general and monitoring containers in particular?
Think of monitoring like ASG. Think of monitoring like container orchestration. Don’t think “imperative”, think “declarative”. Don’t monitor host X, Y and Z. Instead, monitor everything that share a common property, for instance being located in the same AZ. Think in terms of queries and you will see that tags work beautifully because queries operate on tags.
Here’s an example: Monitor… to make sure a container does not blow up in memory.
You can see the tags: Name of container image: web AWS Region: us-west-2 Instance type: c3.xlarge Do you see how powerful this is?
Once you have queries in place, you can express even more interesting things such as: Monitor …
Ok, demo time.

Monitoring Docker containers - Docker NYC Feb 2015

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Andere mochten auch

Andere mochten auch (20)

Ähnlich wie Monitoring Docker containers - Docker NYC Feb 2015

Ähnlich wie Monitoring Docker containers - Docker NYC Feb 2015 (20)

Mehr von Datadog

Mehr von Datadog (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Monitoring Docker containers - Docker NYC Feb 2015

Hinweis der Redaktion