SlideShare ist ein Scribd-Unternehmen logo
1 von 85
Downloaden Sie, um offline zu lesen
Confidential – Oracle Internal/Restricted/Highly Restricted 1
th
강인호
inho.kang@oracle.com
Distributed Tracing :
Jaeger를 활용한 분산 Tracing
2019.06.16
12thOracle
Developer
Meetup
§ .Net Developer
§ CBD, SOA Methodology Consulting
§ ITA/EA, ISP Consulting
§ Oracle Corp.
§ Middleware
§ Cloud Native Application, Container Native
§ Emerging Technology Team
§ k8s korea user group
innoshom@gamil.com
A Table of Contents
01 Why Distributed Tracing
02 Take a First Tracing
03
Tracing Standards
and EcoSystem
PART 1
Why Distributed Tracing
History and Multi-Dimensional Evolution of Computing
아키텍처의 변화 방향
Edge
Load
Balancer
Zuul
(Proxy Svc)
Playback
(Legacy Dev.)
API
(g/w)
Middle Tier & Platform
EVCache
Cassandra
Reference: http://microservices.io
http://microservices.io
§ Patterns of Microserivce
Reference: http://microservices.io
§ Single Serviece per Host
§ Multiple Services per Host
§ Serverless deployment
§ Oracle Fx
§ AWS Lamda
§ Service-per-Container
§ Service-per-VM
§ Service Deployment Platform
§ Docker orchestration
(ex: k8s, Docker Swarm mode)
§ Discovery
§ Service Registry
§ Server-side discovery
§ Client-side discovery
§ Self Registration
§ 3rd Party Registration
§ External API
§ API Gateway
§ Backend for front end
§ Communication Style
§ Reliability
How do we know
What’s going on?
Observability
3가지 구성요소
• Logging is used to record discrete events.
For example, the debugging or error
information of an application, which is
the basis of problem diagnosis.
• Metrics is used to record data that can be
aggregated.
For example, the current depth of a queue
can be defined as a metric and updated
when an element is added to or removed
from the queue. The number of HTTP
requests can be defined as a counter that
accumulates the number when new requests
are received.
• Tracing is used to record information
within the request scope.
For example, the process and consumed time
for a remote method call, This is the
tool we use to investigate system
performance issues. Logging, metrics, and
tracing have overlapping parts as follows.
https://www.alibabacloud.com/help/doc-detail/68035.htm
Monitoring == Observing Events
Metrics - Record events as aggregates (e.g. counters)
Tracing - Record transaction-scoped events
Logging - Record uniqueevents
Low volume
High volume
Observability
3가지 구성요소
The term "observability" in control theory states that the system is observable if the internal states of
the system and, accordingly, its behavior, can be determined by only looking at its inputs and outputs
2018 Observability
Practitioners Summit,
Bryan Cantrill,
the CTO of Joyent
전통적인 모니터링 방법들
기존의 Metrics나 Logs 방법으로는 분산환경에 대응 불가
Metrics and Logs are
• Per Instance
• Lack of context
Metrics : 메트릭은 숫자 정보(count,
usage)만 모으는 것이므로 자원을
많이 잡아 먹지도 않는다.
Logs : 로그는 메트릭 보다는 더
Observability가 좋은 툴이다. 하지만
debugging 툴로는 한계가 있다.
분산환경에서 Tracing을 한다는 것은?
폴리그랏 환경에서 Request Path를 추적해서
- 어느 구간에서 병목이 발생하고
- Request가 실패하면 어디에서 Error가 발생했는지?
- Critical Pah가 뭔지를 밝히는 작업
그러나 전통적인 모니터링 툴들은 아직 이것에 적합하지
못하다.
Basic Idea of Tracing
어떻게 하면 분산환경에서
Request Path를 추적할 수 있을까?
Basic Idea of Tracing
1. 각 단위, 단위의 프로세스 마다의 실행
시간과 정보들을 Profiling하고
2. 중앙에서 프로파일 정보를 모아서,
Request 별로 연관관계를 재조합해서
3. Visualization Tool 또는 분석 하도록 한다.
Basic Idea of Tracing
Request correlation
Schema Based
- Application 별로 Event Schema를 Manual 하게 정의해서 Log을 기반으로 연관성을 찾아내는 방법
- Modern Microservice 기반에는 한계
MetaData Propagation
- Global ID(Execution ID or Request ID)를 가지고 메타데이터를 전파하는 방식
Inter-process Propagation
(1) The Handler that processes the inbound request is wrapped into instrumentation that extracts
metadata from the request and stores it in a Context object in memory.
(2) Some in-process propagation mechanism, for example, based on thread-local variables.
(3) Instrumentation wraps an RPC client and injects metadata into outbound (downstream) requests.
Anatomy of Distributed Tracing
Technical background of tracing
• Dapper (Google): Foundation for all tracers
• Zipkin (Twitter)
• Jaeger (Uber)
• StackDriver Trace (Google)
• Appdash (golang)
• X-ray (AWS)
Tracing은 1990년도 부터 있었으나, Google의 논문“Dapper, a Large-Scale Distributed System Tracing
Infrastructure"발표 이후 주류로 자리 매김하게 됨
Tracing Systems
Zipkin and OpenZipkin
§ 최초의 Open Source Tracing system
§ 2012년, Twitter 공개
§ 많은 지원 생태계 구성
§ 자체 Tracer인 Brave API 기반으로 다양한 fraework가 이를 지원 (ex: Spring, Spark, Kafka, gRPC등)
§ 다른 tracing System과는 data-format level에서 연동
Jaeger
§ 2015년 Uber에서 만들어졌고, 2017년 Open Source Project 로 공개
§ CNCF산하의 프로젝트로 OpenTracing에 집중하면서 많은 주목을 받고 있음
§ Zipkin과의 호환 (B3 metadata format)
§ 직접적인 instrumentation을 제공하지 않으며 OpenTracing-compatible tracer를 이용
CNCF에서 Observability 비중
점점 비중이 강조 되고 있음
약 20%
§ Graduated : Prometehus, Fluentd (Log collection layer)
§ Incubating : OpenTracing, Jaeger,
§ Sandbox : OpenMetrics and Cortex (Monitoring)
• Prometheus: A monitoring and alerting platform
• Fluentd: A logging data collection layer
• OpenTracing: A vendor-neutral APIs and
instrumentation for distributed tracing
• Jaeger: A distributed tracing platform
OpenTracing
The OpenTracing specification is created to address the problem of API
incompatibility between different distributed tracing systems. OpenTracing is
a lightweight standardization layer and is located between applications/class
libraries and tracing or log analysis programs.
•Advantages
OpenTracing already enters
CNCF and provides unified
concept and data standards
for global distributed
tracing systems.
•OpenTracing provides APIs
with no relation to
platforms or vendors, which
allows developers to
conveniently add (or change)
the implementation of
tracing system.
Basic Idea of Tracing
Distributed Tracing In A Nutshell
A
B
C D
E
{context}
{context}
{context}{context}
Unique ID → {context}
Edge service
A
B
E
C
D
time
TRACE
SPANS
Span Relationship
Traces in OpenTracing are defined implicitly by their Spans. In particular, a Trace can be thought of
as a directed acyclic graph (DAG) of Spans, where the edges between Spans are called References.
For example, the following is an example Trace made up of 8 Spans:
Causal relationships between Spans in a single Trace
https://opentracing.io/specification/
Distributed Tracing In A Nutshell
time
SPANS
Span A
Trace ID : 1
Parent Span : [none]
Span B
Trace ID : 1
Parent Span : A
Span E
Trace ID : 1
Parent Span : A
Span C
Trace ID : 1
Parent Span : B
Span D
Trace ID : 1
Parent Span : B
TRACE
What is a Span
The “span” is the primary building block of a distributed trace, representing an individual unit of work
done in a distributed system.
Each component of the distributed system contributes a span - a named, timed operation
representing a piece of the workflow.
Spans can (and generally do) contain “References” to other spans, which allows multiple Spans to be
assembled into one complete Trace - a visualization of the life of a request as it moves through a
distributed system.
Each span encapsulates the following state
according to the OpenTracing specification:
• An operation name
• A start timestamp and finish timestamp
• A set of key:value span Tags
• A set of key:value span Logs
• A SpanContext
What is a Span – Tags
Tags are key:value pairs that enable user-defined annotation of spans in order to query, filter, and
comprehend trace data.
Span tags should apply to the whole span. There is a list available
at semantic_conventions.md listing conventional span tags for common scenarios. Examples may
include tag keys like db.instance to identify a databse host, http.status_code to represent the HTTP
response code, or error which can be set to True if the operation represented by the Span fails.
§ Example
§ http.url = “http://google.com”
§ http.status_code = 200
§ peer.service = “mysql”
§ db.statement = “select * from users”
What is a Span – Logs
Logs are key:value pairs that are useful for capturing span-specific logging messages and other
debugging or informational output from the application itself. Logs may be useful for documenting a
specific moment or event within the span (in contrast to tags which should apply to the span as a
whole).
Describes an event at a point in time during the span lifetime.
§ OpenTracing supports structured logging
§ Contains a timestamp and a set of fields
§ Example
span.log_kv(
{'event': 'open_conn', 'port': 433}
)
OpenTracing Data Model
https://opentracing.io/specification/
Distributed Tracing In A Nutshell
Remote Procedure Call 환경
Jaeger
Jaeger
§ 2015년 Uber에서 만들어졌고, 2017년 Open Source Project 로 공개
§ CNCF산하의 프로젝트로 OpenTracing에 집중하면서 많은 주목을 받고 있음
§ Zipkin과의 호환 (B3 metadata format)
§ 직접적인 instrumentation을 제공하지 않으며 OpenTracing-compatible tracer를 이용
Jaeger
Jaeger is an open-source distributed tracing system released by Uber. It is compatible with
OpenTracing APIs.
Jaeger Architecture
• Jaeger client: Implements SDKs that conform to OpenTracing standards for different
languages. Applications use the API to write data. The client library transmits trace
information to the jaeger-agent according to the sampling policy specified by the application.
• Agent: A network daemon that monitors span data received by the UDP port and then
sends the data to the collector in batches. It is designed as a basic component and
deployed to all hosts. The agent decouples the client library and collector, shielding the client
library from collector routing and discovery details.
• Collector: Receives the data sent by the jaeger-agent and then writes the data to backend
storage. The collector is designed as a stateless component. Therefore, you can
simultaneously run an arbitrary number of jaeger-collectors.
• Data store: The backend storage is designed as a pluggable component that supports
writing data to Cassandra and Elasticsearch.
• Query: Receives query requests, retrieves trace information from the backend storage
system, and displays it in the UI. Query is stateless and you can start multiple instances. You
can deploy the instances behind load balancers such as Nginx.
Understanding Sampling
§ Tracing data > than business traffic
§ Most tracing systems sample transactions
§ Head-based sampling: the sampling decision is made just before the
trace is started, and it is respected by all nodes in the graph
§ Tail-based sampling: the sampling decision is made after the trace is
completed / collected
§ Head-based sampling donʼt catch 0.001 probability anomalous
behavior.
PART 2
Take a first Tracing
HOTROD
Sample Application을 이용한 Jaeger와 친해지기
HOTROD
Architecture 확인
§ 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
HOTROD
Sample Application을 이용한 Jaeger와 친해지기
§ https://github.com/DannyKang/OpenTracing
§ 사전 준비 사항
§ Docker
§ Go lang (소스코드에서 직접 실행을 원하는 경우)
§ Jaeger 실행
$ sudo docker run -d --name jaeger 
-p 6831:6831/udp 
-p 16686:16686 
-p 14268:14268 
jaegertracing/all-in-one:1.6
§ HOTROD 실행
$ sudo docker run --rm -it 
--link jaeger 
-p8080-8083:8080-8083 
jaegertracing/example-hotrod:1.6 
all 
--jaeger-agent.host-port=jaeger:6831
https://medium.com/opentracing/take-opentracing-for-a-hotrod-ride-f6e3141f7941
§ Jaeger 접속 : http://localhost:16686/ § HotROD 접속 : http://127.0.0.1:8080/
HOTROD
Sample Application을 이용한 Jaeger와 친해지기
§ Source Code로 실행하기 (Go가 설치되었다는 가정하에)
§ mkdir -p $GOPATH/src/github.com/jaegertracing
cd $GOPATH/src/github.com/jaegertracing
git clone git@github.com:jaegertracing/jaeger.git jaeger
cd jaeger
make install
cd examples/hotrod
go run ./main.go all
§ 이후에 옵션등을 통해서 바로 실행하기 위해서 go build 로 binary를 만든다.
go build
여기서 ‘all’ 옵션을 전제 마이크로서비스를 하나의 binary로 실행한다는 의미이다.
그리고 log가 Standard out에 쓰기 때문에 console에서 전체 내용을 확인할 수 있다.
HOTROD
Architecture 확인
§ 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
HOTROD
DataFlow 확인
§ 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
HOTROD
DataFlow 확인
§ 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
HOTROD
Traditional Log vs Contextual Logs
HOTROD
Traditional Log vs Contextual Logs
HOTROD
Traditional Log vs Contextual Logs
HOTROD
Span tags versus logs
HOTROD
Identifying source of latency
Mysql이 전체 수행
시간에 가장 많은
시간(latency)이 걸림
HOTROD
Identifying source of latency
Zoom in 을 통해서
보면 route service가
parallel이 아니라
3개씩 쌍으로
수행되는 것으로
나타남.
HOTROD
Identifying source of latency
화면에서 같은
사용자로 Request를
증가 시키면 mysql의
수행시간이 1초
이상으로 소요됨
(기존 300ms)
HOTROD
Identifying source of latency
특정 Driver의
Request를 추적
HOTROD
Identifying source of latency
내부적으로 log를 보면
block을 걸고 있는
것을 볼 수 있음
HOTROD
Identifying source of latency
examples/hotrod/services/customer/database.go
$ ./example-hotrod help
$ ./example-hotrod -M -D 100ms all
-M은 lock을 없애는 옵션
-D는 db-query-delay를
300ms에서 100ms으로
줄이는 옵션
소스코드를 보면
“d.lock.Lock”을
확인할 수 있다.
HOTROD
Identifying source of latency
Go를 설치하지 않은 경우는 Docker의 옵션으로 추가하면 된다.
$ docker run --rm -it 
--link jaeger 
-p8080-8083:8080-8083 
jaegertracing/example-hotrod:1.6 
-M -D 100ms all 
--jaeger-agent.host-port=jaeger:6831
HOTROD
Identifying source of latency
다시 다량의 Request를
발생 시키면 여전히
delay를 확인할 수 있다.
HOTROD
Identifying source of latency
HOTROD
Identifying source of latency
services/config/config.go
$ ./example-hotrod -M -D 100ms -W 100 all
W는 WorkerPoolSize이다 Default 3
변경후 다량의 Request로 결과 확인
PART 3
Tracing Standards and EcoSystem
Tracing Instrumentation
3 Type of tracing instrumentation
§ Manual Instrument
§ Framework Based Manual Instrument
§ Agent Based automatic instrument
Company X
Company X
Different meanings of tracing
Different meanings of tracing: analyzing transactions (1), recording transactions (2),
federating transactions (3), describing transactions (4), and correlating transactions (5)
Tracing Systems
Zipkin and OpenZipkin
§ 최초의 Open Source Tracing system
§ 2012년, Twitter 공개
§ 많은 지원 생태계 구성
§ 자체 Tracer인 Brave API 기반으로 다양한 fraework가 이를 지원 (ex: Spring, Spark, Kafka, gRPC등)
§ 다른 tracing System과는 data-format level에서 연동
Jaeger
§ 2015년 Uber에서 만들어졌고, 2017년 Open Source Project 로 공개
§ CNCF산하의 프로젝트로 OpenTracing에 집중하면서 많은 주목을 받고 있음
§ Zipkin과의 호환 (B3 metadata format)
§ 직접적인 instrumentation을 제공하지 않으며 OpenTracing-compatible tracer를 이용
Tracing Systems
SkyWalking
§ 상대적으로 최신 프로젝트로 중국에서 시작
§ 2017년 Apache Foundation 의 incubation project
§ Tracing 시스템으로 시작해서 full-blown APM solution으로 진화중
§ 일부 OpenTracing-compatible(only Java)
§ 중국에서 많이 사용되는 framework에 대한 Agent-based instrumentation에 집중
X-Ray, Stackdriver
§ 클라우드 사업자의 Managed 서비스
§ 기존에 Zipkin이나 Jaeger를 사용하던 On-Prem과의 연계가 문제가 됨
§ 따라서 클라우드 사업자들이 data 표준화에 많은 투자를 하기 시작함
Tracing Systems
W3C Trace Context
§ 2017년 World Wide Web Consortium(W3C) 산하 Distributed Tracing Working Group 발족
§ 댜양한 vendors, cloud providers, open source projects 참여
§ Trace tool간의 interoperability를 제공하기 위한 표준 정의
§ Trace Context :Tracing metadata를 주고받을 때 사용할 format에만 focus
§ 2018년 Trace ID conceptual model 합의
§ 2개의 protocol header 제안
1. Traceparent
spandID : 16 byte and 8 byte array
Traceparent: 00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01
2. Tracestate
tracing vendor specific 정보 저장 영역
Tracestate: vendorname1=opaqueValue1,vendorname2=opaqueValue2
3. data 표준화에 많은 투자를 하기 시작함
Tracing Systems
W3C "Data Interchange Format"
§ 데이터를 주고 받을때 사용할 수 있는 format를 정의
§ Vendor마다의 이해가 달라 진전이 없는 상태
OpenCensus
§ Google의 Census라는 library의 일부로 시작한 프로젝트
§ Misssion : "vendor-agnostic single distribution of libraries to provide metrics collection
and tracing for your services."
§ 기존 Google의 Dapper format 을 그래도 사용
§ Google Stackdrivre와 Zipkin, Jaeger의 Metadata와 일치 (Dapper의 후손)
§ 그렇지만 다른 APM Vendor와는 많이 다름
Tracing Systems
OpenTracing
- Goal of OpenTracing Project
§ To provide a single API that application and framework developers can use to instrument their
code
§ To enable truly reusable, portable, and composable instrumentation, especially for the other
open source frameworks, libraries, and systems
§ To empower other developers, not just those working on the tracing systems, to write
instrumentation for their software and be comfortable that it will be compatible with other
modules in the same application that are instrumented with OpenTracing, while not being bound
to a specific tracing vendor
Understanding Sampling
§ Tracing data > than business traffic
§ Most tracing systems sample transactions
§ Head-based sampling: the sampling decision is made just before the
trace is started, and it is respected by all nodes in the graph
§ Tail-based sampling: the sampling decision is made after the trace is
completed / collected
§ Head-based sampling donʼt catch 0.001 probability anomalous
behavior.
OpenTelemetry
OpenTelemetry is an effort to combine all three verticals into a single set of system
components and language-specific telemetry libraries. It is meant to replace both
the OpenTracing project, which focused exclusively on tracing, and the OpenCensus project,
which focused on tracing and metrics.
OpenTelemetry will not initially support logging, though we aim to incorporate this over time.
OpenTelemetry
Container Based Observability
To be Continued
Reference: https://istio.io/talks/istio_talk_gluecon_2017.pdf
Example of istio tracing
Bookinfo Application with Istio
Example of istio tracing
kubectl port-forward -n istio-system $(kubectl get pod -n istio-system -l app=jaeger -o
jsonpath='{.items[0].metadata.name}') 16686:16686 &
Example of istio Metics
kubectl -n istio-system port-forward $(kubectl -n istio-system get pod -l app=grafana -o
jsonpath='{.items[0].metadata.name}') 3000:3000 &
Example of Istio service
graph
kubectl -n istio-system port-forward $(kubectl -n istio-system get pod -l app=kiali -o
jsonpath='{.items[0].metadata.name}') 20001:20001
References
• Mastering Distributed Tracing
• Alibaba Cloud ‒ OpenTracing implementation of Jaeger : https://www.alibabacloud.com/help/doc-
detail/68035.htm
• OpenTracing Best Practices : https://opentracing.io/docs/best-practices/instrumenting-your-
application/
• OpenTracing HOTROD : https://medium.com/opentracing/take-opentracing-for-a-hotrod-ride-
f6e3141f7941
From Chapter 1
Thank You

Weitere ähnliche Inhalte

Was ist angesagt?

GitOps - Operation By Pull Request
GitOps - Operation By Pull RequestGitOps - Operation By Pull Request
GitOps - Operation By Pull RequestKasper Nissen
 
OpenTelemetry Introduction
OpenTelemetry Introduction OpenTelemetry Introduction
OpenTelemetry Introduction DimitrisFinas1
 
OpenTelemetry For Developers
OpenTelemetry For DevelopersOpenTelemetry For Developers
OpenTelemetry For DevelopersKevin Brockhoff
 
MeetUp Monitoring with Prometheus and Grafana (September 2018)
MeetUp Monitoring with Prometheus and Grafana (September 2018)MeetUp Monitoring with Prometheus and Grafana (September 2018)
MeetUp Monitoring with Prometheus and Grafana (September 2018)Lucas Jellema
 
End-to-End CI/CD at scale with Infrastructure-as-Code on AWS
End-to-End CI/CD at scale with Infrastructure-as-Code on AWSEnd-to-End CI/CD at scale with Infrastructure-as-Code on AWS
End-to-End CI/CD at scale with Infrastructure-as-Code on AWSBhuvaneswari Subramani
 
Distributed tracing using open tracing & jaeger 2
Distributed tracing using open tracing & jaeger 2Distributed tracing using open tracing & jaeger 2
Distributed tracing using open tracing & jaeger 2Chandresh Pancholi
 
Meetup OpenTelemetry Intro
Meetup OpenTelemetry IntroMeetup OpenTelemetry Intro
Meetup OpenTelemetry IntroDimitrisFinas1
 
OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdf
OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdfOSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdf
OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdfNETWAYS
 
Observability, Distributed Tracing, and Open Source: The Missing Primer
Observability, Distributed Tracing, and Open Source: The Missing PrimerObservability, Distributed Tracing, and Open Source: The Missing Primer
Observability, Distributed Tracing, and Open Source: The Missing PrimerVMware Tanzu
 
Backstage at CNCF Madison.pptx
Backstage at CNCF Madison.pptxBackstage at CNCF Madison.pptx
Backstage at CNCF Madison.pptxBrandenTimm1
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Araf Karsh Hamid
 
Airflow at lyft
Airflow at lyftAirflow at lyft
Airflow at lyftTao Feng
 
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioTHE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioDevOpsDays Tel Aviv
 
Introduction to Distributed Tracing
Introduction to Distributed TracingIntroduction to Distributed Tracing
Introduction to Distributed Tracingpetabridge
 
Introduction to Kibana
Introduction to KibanaIntroduction to Kibana
Introduction to KibanaVineet .
 

Was ist angesagt? (20)

GitOps - Operation By Pull Request
GitOps - Operation By Pull RequestGitOps - Operation By Pull Request
GitOps - Operation By Pull Request
 
Observability & Datadog
Observability & DatadogObservability & Datadog
Observability & Datadog
 
Observability
Observability Observability
Observability
 
Observability
ObservabilityObservability
Observability
 
OpenTelemetry Introduction
OpenTelemetry Introduction OpenTelemetry Introduction
OpenTelemetry Introduction
 
OpenTelemetry For Developers
OpenTelemetry For DevelopersOpenTelemetry For Developers
OpenTelemetry For Developers
 
MeetUp Monitoring with Prometheus and Grafana (September 2018)
MeetUp Monitoring with Prometheus and Grafana (September 2018)MeetUp Monitoring with Prometheus and Grafana (September 2018)
MeetUp Monitoring with Prometheus and Grafana (September 2018)
 
End-to-End CI/CD at scale with Infrastructure-as-Code on AWS
End-to-End CI/CD at scale with Infrastructure-as-Code on AWSEnd-to-End CI/CD at scale with Infrastructure-as-Code on AWS
End-to-End CI/CD at scale with Infrastructure-as-Code on AWS
 
Distributed tracing using open tracing & jaeger 2
Distributed tracing using open tracing & jaeger 2Distributed tracing using open tracing & jaeger 2
Distributed tracing using open tracing & jaeger 2
 
Grafana.pptx
Grafana.pptxGrafana.pptx
Grafana.pptx
 
Meetup OpenTelemetry Intro
Meetup OpenTelemetry IntroMeetup OpenTelemetry Intro
Meetup OpenTelemetry Intro
 
OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdf
OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdfOSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdf
OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdf
 
Observability, Distributed Tracing, and Open Source: The Missing Primer
Observability, Distributed Tracing, and Open Source: The Missing PrimerObservability, Distributed Tracing, and Open Source: The Missing Primer
Observability, Distributed Tracing, and Open Source: The Missing Primer
 
Backstage at CNCF Madison.pptx
Backstage at CNCF Madison.pptxBackstage at CNCF Madison.pptx
Backstage at CNCF Madison.pptx
 
12-Factor Apps
12-Factor Apps12-Factor Apps
12-Factor Apps
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics
 
Airflow at lyft
Airflow at lyftAirflow at lyft
Airflow at lyft
 
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioTHE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
 
Introduction to Distributed Tracing
Introduction to Distributed TracingIntroduction to Distributed Tracing
Introduction to Distributed Tracing
 
Introduction to Kibana
Introduction to KibanaIntroduction to Kibana
Introduction to Kibana
 

Ähnlich wie Distributed Tracing with Jaeger

Modern application development with oracle cloud sangam17
Modern application development with oracle cloud sangam17Modern application development with oracle cloud sangam17
Modern application development with oracle cloud sangam17Vinay Kumar
 
PinTrace Advanced AWS meetup
PinTrace Advanced AWS meetup PinTrace Advanced AWS meetup
PinTrace Advanced AWS meetup Suman Karumuri
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin Kuberton
 
Tracing Micro Services with OpenTracing
Tracing Micro Services with OpenTracingTracing Micro Services with OpenTracing
Tracing Micro Services with OpenTracingHemant Kumar
 
初探 OpenTelemetry - 蒐集遙測數據的新標準
初探 OpenTelemetry - 蒐集遙測數據的新標準初探 OpenTelemetry - 蒐集遙測數據的新標準
初探 OpenTelemetry - 蒐集遙測數據的新標準Marcus Tung
 
OGCE MSI Presentation
OGCE MSI PresentationOGCE MSI Presentation
OGCE MSI Presentationmarpierc
 
Implementing Observability for Kubernetes.pdf
Implementing Observability for Kubernetes.pdfImplementing Observability for Kubernetes.pdf
Implementing Observability for Kubernetes.pdfJose Manuel Ortega Candel
 
ThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxGrace Jansen
 
Runtime Behavior of JavaScript Programs
Runtime Behavior of JavaScript ProgramsRuntime Behavior of JavaScript Programs
Runtime Behavior of JavaScript ProgramsIRJET Journal
 
Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16
Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16
Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16AppDynamics
 
Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018
Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018
Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018Amazon Web Services Korea
 
Tracing-for-fun-and-profit.pptx
Tracing-for-fun-and-profit.pptxTracing-for-fun-and-profit.pptx
Tracing-for-fun-and-profit.pptxHai Nguyen Duy
 
Build cloud native solution using open source
Build cloud native solution using open source Build cloud native solution using open source
Build cloud native solution using open source Nitesh Jadhav
 
Streaming Processing with a Distributed Commit Log
Streaming Processing with a Distributed Commit LogStreaming Processing with a Distributed Commit Log
Streaming Processing with a Distributed Commit LogJoe Stein
 
Proactive ops for container orchestration environments
Proactive ops for container orchestration environmentsProactive ops for container orchestration environments
Proactive ops for container orchestration environmentsDocker, Inc.
 
Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...
Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...
Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...HostedbyConfluent
 
(Current22) Let's Monitor The Conditions at the Conference
(Current22) Let's Monitor The Conditions at the Conference(Current22) Let's Monitor The Conditions at the Conference
(Current22) Let's Monitor The Conditions at the ConferenceTimothy Spann
 
Debugging Microservices - key challenges and techniques - Microservices Odesa...
Debugging Microservices - key challenges and techniques - Microservices Odesa...Debugging Microservices - key challenges and techniques - Microservices Odesa...
Debugging Microservices - key challenges and techniques - Microservices Odesa...Lohika_Odessa_TechTalks
 

Ähnlich wie Distributed Tracing with Jaeger (20)

Opentracing 101
Opentracing 101Opentracing 101
Opentracing 101
 
Distributed Tracing
Distributed TracingDistributed Tracing
Distributed Tracing
 
Modern application development with oracle cloud sangam17
Modern application development with oracle cloud sangam17Modern application development with oracle cloud sangam17
Modern application development with oracle cloud sangam17
 
PinTrace Advanced AWS meetup
PinTrace Advanced AWS meetup PinTrace Advanced AWS meetup
PinTrace Advanced AWS meetup
 
Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin  Monitoring&Logging - Stanislav Kolenkin
Monitoring&Logging - Stanislav Kolenkin
 
Tracing Micro Services with OpenTracing
Tracing Micro Services with OpenTracingTracing Micro Services with OpenTracing
Tracing Micro Services with OpenTracing
 
初探 OpenTelemetry - 蒐集遙測數據的新標準
初探 OpenTelemetry - 蒐集遙測數據的新標準初探 OpenTelemetry - 蒐集遙測數據的新標準
初探 OpenTelemetry - 蒐集遙測數據的新標準
 
OGCE MSI Presentation
OGCE MSI PresentationOGCE MSI Presentation
OGCE MSI Presentation
 
Implementing Observability for Kubernetes.pdf
Implementing Observability for Kubernetes.pdfImplementing Observability for Kubernetes.pdf
Implementing Observability for Kubernetes.pdf
 
ThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptxThroughTheLookingGlass_EffectiveObservability.pptx
ThroughTheLookingGlass_EffectiveObservability.pptx
 
Runtime Behavior of JavaScript Programs
Runtime Behavior of JavaScript ProgramsRuntime Behavior of JavaScript Programs
Runtime Behavior of JavaScript Programs
 
Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16
Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16
Monitoring and Instrumentation Strategies: Tips and Best Practices - AppSphere16
 
Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018
Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018
Amazon EKS 그리고 Service Mesh (김세호 솔루션즈 아키텍트, AWS) :: Gaming on AWS 2018
 
Tracing-for-fun-and-profit.pptx
Tracing-for-fun-and-profit.pptxTracing-for-fun-and-profit.pptx
Tracing-for-fun-and-profit.pptx
 
Build cloud native solution using open source
Build cloud native solution using open source Build cloud native solution using open source
Build cloud native solution using open source
 
Streaming Processing with a Distributed Commit Log
Streaming Processing with a Distributed Commit LogStreaming Processing with a Distributed Commit Log
Streaming Processing with a Distributed Commit Log
 
Proactive ops for container orchestration environments
Proactive ops for container orchestration environmentsProactive ops for container orchestration environments
Proactive ops for container orchestration environments
 
Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...
Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...
Let’s Monitor Conditions at the Conference With Timothy Spann & David Kjerrum...
 
(Current22) Let's Monitor The Conditions at the Conference
(Current22) Let's Monitor The Conditions at the Conference(Current22) Let's Monitor The Conditions at the Conference
(Current22) Let's Monitor The Conditions at the Conference
 
Debugging Microservices - key challenges and techniques - Microservices Odesa...
Debugging Microservices - key challenges and techniques - Microservices Odesa...Debugging Microservices - key challenges and techniques - Microservices Odesa...
Debugging Microservices - key challenges and techniques - Microservices Odesa...
 

Mehr von Inho Kang

Infra as Code with Packer, Ansible and Terraform
Infra as Code with Packer, Ansible and TerraformInfra as Code with Packer, Ansible and Terraform
Infra as Code with Packer, Ansible and TerraformInho Kang
 
K8s beginner 2_advanced_ep02_201904221130_post
K8s beginner 2_advanced_ep02_201904221130_postK8s beginner 2_advanced_ep02_201904221130_post
K8s beginner 2_advanced_ep02_201904221130_postInho Kang
 
kubernetes : From beginner to Advanced
kubernetes : From beginner to Advancedkubernetes : From beginner to Advanced
kubernetes : From beginner to AdvancedInho Kang
 
CQRS and Event Sourcing
CQRS and Event Sourcing CQRS and Event Sourcing
CQRS and Event Sourcing Inho Kang
 
The elegant way of implementing microservices with istio
The elegant way of implementing microservices with istioThe elegant way of implementing microservices with istio
The elegant way of implementing microservices with istioInho Kang
 
Microservice Pattern Launguage
Microservice Pattern LaunguageMicroservice Pattern Launguage
Microservice Pattern LaunguageInho Kang
 
Network virtualization for the better understanding of Data Center Network
Network virtualization for the better understanding of Data Center NetworkNetwork virtualization for the better understanding of Data Center Network
Network virtualization for the better understanding of Data Center NetworkInho Kang
 
Mastering devops with oracle 강인호
Mastering devops with oracle 강인호Mastering devops with oracle 강인호
Mastering devops with oracle 강인호Inho Kang
 
DevOps and Microservice
DevOps and MicroserviceDevOps and Microservice
DevOps and MicroserviceInho Kang
 

Mehr von Inho Kang (9)

Infra as Code with Packer, Ansible and Terraform
Infra as Code with Packer, Ansible and TerraformInfra as Code with Packer, Ansible and Terraform
Infra as Code with Packer, Ansible and Terraform
 
K8s beginner 2_advanced_ep02_201904221130_post
K8s beginner 2_advanced_ep02_201904221130_postK8s beginner 2_advanced_ep02_201904221130_post
K8s beginner 2_advanced_ep02_201904221130_post
 
kubernetes : From beginner to Advanced
kubernetes : From beginner to Advancedkubernetes : From beginner to Advanced
kubernetes : From beginner to Advanced
 
CQRS and Event Sourcing
CQRS and Event Sourcing CQRS and Event Sourcing
CQRS and Event Sourcing
 
The elegant way of implementing microservices with istio
The elegant way of implementing microservices with istioThe elegant way of implementing microservices with istio
The elegant way of implementing microservices with istio
 
Microservice Pattern Launguage
Microservice Pattern LaunguageMicroservice Pattern Launguage
Microservice Pattern Launguage
 
Network virtualization for the better understanding of Data Center Network
Network virtualization for the better understanding of Data Center NetworkNetwork virtualization for the better understanding of Data Center Network
Network virtualization for the better understanding of Data Center Network
 
Mastering devops with oracle 강인호
Mastering devops with oracle 강인호Mastering devops with oracle 강인호
Mastering devops with oracle 강인호
 
DevOps and Microservice
DevOps and MicroserviceDevOps and Microservice
DevOps and Microservice
 

Kürzlich hochgeladen

Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Kürzlich hochgeladen (20)

Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

Distributed Tracing with Jaeger

  • 1. Confidential – Oracle Internal/Restricted/Highly Restricted 1 th 강인호 inho.kang@oracle.com Distributed Tracing : Jaeger를 활용한 분산 Tracing 2019.06.16 12thOracle Developer Meetup
  • 2. § .Net Developer § CBD, SOA Methodology Consulting § ITA/EA, ISP Consulting § Oracle Corp. § Middleware § Cloud Native Application, Container Native § Emerging Technology Team § k8s korea user group innoshom@gamil.com
  • 3. A Table of Contents 01 Why Distributed Tracing 02 Take a First Tracing 03 Tracing Standards and EcoSystem
  • 5. History and Multi-Dimensional Evolution of Computing 아키텍처의 변화 방향
  • 9. § Single Serviece per Host § Multiple Services per Host § Serverless deployment § Oracle Fx § AWS Lamda § Service-per-Container § Service-per-VM § Service Deployment Platform § Docker orchestration (ex: k8s, Docker Swarm mode)
  • 10. § Discovery § Service Registry § Server-side discovery § Client-side discovery § Self Registration § 3rd Party Registration § External API § API Gateway § Backend for front end § Communication Style § Reliability
  • 11.
  • 12.
  • 13. How do we know What’s going on?
  • 14. Observability 3가지 구성요소 • Logging is used to record discrete events. For example, the debugging or error information of an application, which is the basis of problem diagnosis. • Metrics is used to record data that can be aggregated. For example, the current depth of a queue can be defined as a metric and updated when an element is added to or removed from the queue. The number of HTTP requests can be defined as a counter that accumulates the number when new requests are received. • Tracing is used to record information within the request scope. For example, the process and consumed time for a remote method call, This is the tool we use to investigate system performance issues. Logging, metrics, and tracing have overlapping parts as follows. https://www.alibabacloud.com/help/doc-detail/68035.htm
  • 15. Monitoring == Observing Events Metrics - Record events as aggregates (e.g. counters) Tracing - Record transaction-scoped events Logging - Record uniqueevents Low volume High volume
  • 16. Observability 3가지 구성요소 The term "observability" in control theory states that the system is observable if the internal states of the system and, accordingly, its behavior, can be determined by only looking at its inputs and outputs 2018 Observability Practitioners Summit, Bryan Cantrill, the CTO of Joyent
  • 17. 전통적인 모니터링 방법들 기존의 Metrics나 Logs 방법으로는 분산환경에 대응 불가 Metrics and Logs are • Per Instance • Lack of context Metrics : 메트릭은 숫자 정보(count, usage)만 모으는 것이므로 자원을 많이 잡아 먹지도 않는다. Logs : 로그는 메트릭 보다는 더 Observability가 좋은 툴이다. 하지만 debugging 툴로는 한계가 있다.
  • 18. 분산환경에서 Tracing을 한다는 것은? 폴리그랏 환경에서 Request Path를 추적해서 - 어느 구간에서 병목이 발생하고 - Request가 실패하면 어디에서 Error가 발생했는지? - Critical Pah가 뭔지를 밝히는 작업 그러나 전통적인 모니터링 툴들은 아직 이것에 적합하지 못하다.
  • 19. Basic Idea of Tracing 어떻게 하면 분산환경에서 Request Path를 추적할 수 있을까?
  • 20. Basic Idea of Tracing 1. 각 단위, 단위의 프로세스 마다의 실행 시간과 정보들을 Profiling하고 2. 중앙에서 프로파일 정보를 모아서, Request 별로 연관관계를 재조합해서 3. Visualization Tool 또는 분석 하도록 한다.
  • 21. Basic Idea of Tracing
  • 22. Request correlation Schema Based - Application 별로 Event Schema를 Manual 하게 정의해서 Log을 기반으로 연관성을 찾아내는 방법 - Modern Microservice 기반에는 한계 MetaData Propagation - Global ID(Execution ID or Request ID)를 가지고 메타데이터를 전파하는 방식
  • 23. Inter-process Propagation (1) The Handler that processes the inbound request is wrapped into instrumentation that extracts metadata from the request and stores it in a Context object in memory. (2) Some in-process propagation mechanism, for example, based on thread-local variables. (3) Instrumentation wraps an RPC client and injects metadata into outbound (downstream) requests.
  • 25. Technical background of tracing • Dapper (Google): Foundation for all tracers • Zipkin (Twitter) • Jaeger (Uber) • StackDriver Trace (Google) • Appdash (golang) • X-ray (AWS) Tracing은 1990년도 부터 있었으나, Google의 논문“Dapper, a Large-Scale Distributed System Tracing Infrastructure"발표 이후 주류로 자리 매김하게 됨
  • 26. Tracing Systems Zipkin and OpenZipkin § 최초의 Open Source Tracing system § 2012년, Twitter 공개 § 많은 지원 생태계 구성 § 자체 Tracer인 Brave API 기반으로 다양한 fraework가 이를 지원 (ex: Spring, Spark, Kafka, gRPC등) § 다른 tracing System과는 data-format level에서 연동 Jaeger § 2015년 Uber에서 만들어졌고, 2017년 Open Source Project 로 공개 § CNCF산하의 프로젝트로 OpenTracing에 집중하면서 많은 주목을 받고 있음 § Zipkin과의 호환 (B3 metadata format) § 직접적인 instrumentation을 제공하지 않으며 OpenTracing-compatible tracer를 이용
  • 27. CNCF에서 Observability 비중 점점 비중이 강조 되고 있음 약 20% § Graduated : Prometehus, Fluentd (Log collection layer) § Incubating : OpenTracing, Jaeger, § Sandbox : OpenMetrics and Cortex (Monitoring) • Prometheus: A monitoring and alerting platform • Fluentd: A logging data collection layer • OpenTracing: A vendor-neutral APIs and instrumentation for distributed tracing • Jaeger: A distributed tracing platform
  • 28. OpenTracing The OpenTracing specification is created to address the problem of API incompatibility between different distributed tracing systems. OpenTracing is a lightweight standardization layer and is located between applications/class libraries and tracing or log analysis programs. •Advantages OpenTracing already enters CNCF and provides unified concept and data standards for global distributed tracing systems. •OpenTracing provides APIs with no relation to platforms or vendors, which allows developers to conveniently add (or change) the implementation of tracing system.
  • 29. Basic Idea of Tracing
  • 30. Distributed Tracing In A Nutshell A B C D E {context} {context} {context}{context} Unique ID → {context} Edge service A B E C D time TRACE SPANS
  • 31. Span Relationship Traces in OpenTracing are defined implicitly by their Spans. In particular, a Trace can be thought of as a directed acyclic graph (DAG) of Spans, where the edges between Spans are called References. For example, the following is an example Trace made up of 8 Spans: Causal relationships between Spans in a single Trace https://opentracing.io/specification/
  • 32. Distributed Tracing In A Nutshell time SPANS Span A Trace ID : 1 Parent Span : [none] Span B Trace ID : 1 Parent Span : A Span E Trace ID : 1 Parent Span : A Span C Trace ID : 1 Parent Span : B Span D Trace ID : 1 Parent Span : B TRACE
  • 33. What is a Span The “span” is the primary building block of a distributed trace, representing an individual unit of work done in a distributed system. Each component of the distributed system contributes a span - a named, timed operation representing a piece of the workflow. Spans can (and generally do) contain “References” to other spans, which allows multiple Spans to be assembled into one complete Trace - a visualization of the life of a request as it moves through a distributed system. Each span encapsulates the following state according to the OpenTracing specification: • An operation name • A start timestamp and finish timestamp • A set of key:value span Tags • A set of key:value span Logs • A SpanContext
  • 34. What is a Span – Tags Tags are key:value pairs that enable user-defined annotation of spans in order to query, filter, and comprehend trace data. Span tags should apply to the whole span. There is a list available at semantic_conventions.md listing conventional span tags for common scenarios. Examples may include tag keys like db.instance to identify a databse host, http.status_code to represent the HTTP response code, or error which can be set to True if the operation represented by the Span fails. § Example § http.url = “http://google.com” § http.status_code = 200 § peer.service = “mysql” § db.statement = “select * from users”
  • 35. What is a Span – Logs Logs are key:value pairs that are useful for capturing span-specific logging messages and other debugging or informational output from the application itself. Logs may be useful for documenting a specific moment or event within the span (in contrast to tags which should apply to the span as a whole). Describes an event at a point in time during the span lifetime. § OpenTracing supports structured logging § Contains a timestamp and a set of fields § Example span.log_kv( {'event': 'open_conn', 'port': 433} )
  • 37. Distributed Tracing In A Nutshell Remote Procedure Call 환경
  • 38. Jaeger Jaeger § 2015년 Uber에서 만들어졌고, 2017년 Open Source Project 로 공개 § CNCF산하의 프로젝트로 OpenTracing에 집중하면서 많은 주목을 받고 있음 § Zipkin과의 호환 (B3 metadata format) § 직접적인 instrumentation을 제공하지 않으며 OpenTracing-compatible tracer를 이용
  • 39. Jaeger Jaeger is an open-source distributed tracing system released by Uber. It is compatible with OpenTracing APIs.
  • 40. Jaeger Architecture • Jaeger client: Implements SDKs that conform to OpenTracing standards for different languages. Applications use the API to write data. The client library transmits trace information to the jaeger-agent according to the sampling policy specified by the application. • Agent: A network daemon that monitors span data received by the UDP port and then sends the data to the collector in batches. It is designed as a basic component and deployed to all hosts. The agent decouples the client library and collector, shielding the client library from collector routing and discovery details. • Collector: Receives the data sent by the jaeger-agent and then writes the data to backend storage. The collector is designed as a stateless component. Therefore, you can simultaneously run an arbitrary number of jaeger-collectors. • Data store: The backend storage is designed as a pluggable component that supports writing data to Cassandra and Elasticsearch. • Query: Receives query requests, retrieves trace information from the backend storage system, and displays it in the UI. Query is stateless and you can start multiple instances. You can deploy the instances behind load balancers such as Nginx.
  • 41. Understanding Sampling § Tracing data > than business traffic § Most tracing systems sample transactions § Head-based sampling: the sampling decision is made just before the trace is started, and it is respected by all nodes in the graph § Tail-based sampling: the sampling decision is made after the trace is completed / collected § Head-based sampling donʼt catch 0.001 probability anomalous behavior.
  • 42. PART 2 Take a first Tracing
  • 43. HOTROD Sample Application을 이용한 Jaeger와 친해지기
  • 44. HOTROD Architecture 확인 § 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
  • 45. HOTROD Sample Application을 이용한 Jaeger와 친해지기 § https://github.com/DannyKang/OpenTracing § 사전 준비 사항 § Docker § Go lang (소스코드에서 직접 실행을 원하는 경우) § Jaeger 실행 $ sudo docker run -d --name jaeger -p 6831:6831/udp -p 16686:16686 -p 14268:14268 jaegertracing/all-in-one:1.6 § HOTROD 실행 $ sudo docker run --rm -it --link jaeger -p8080-8083:8080-8083 jaegertracing/example-hotrod:1.6 all --jaeger-agent.host-port=jaeger:6831 https://medium.com/opentracing/take-opentracing-for-a-hotrod-ride-f6e3141f7941 § Jaeger 접속 : http://localhost:16686/ § HotROD 접속 : http://127.0.0.1:8080/
  • 46. HOTROD Sample Application을 이용한 Jaeger와 친해지기 § Source Code로 실행하기 (Go가 설치되었다는 가정하에) § mkdir -p $GOPATH/src/github.com/jaegertracing cd $GOPATH/src/github.com/jaegertracing git clone git@github.com:jaegertracing/jaeger.git jaeger cd jaeger make install cd examples/hotrod go run ./main.go all § 이후에 옵션등을 통해서 바로 실행하기 위해서 go build 로 binary를 만든다. go build 여기서 ‘all’ 옵션을 전제 마이크로서비스를 하나의 binary로 실행한다는 의미이다. 그리고 log가 Standard out에 쓰기 때문에 console에서 전체 내용을 확인할 수 있다.
  • 47. HOTROD Architecture 확인 § 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
  • 48. HOTROD DataFlow 확인 § 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
  • 49. HOTROD DataFlow 확인 § 전체 Architecture을 DAG(Directed Acyclic Graph)로 확인
  • 50. HOTROD Traditional Log vs Contextual Logs
  • 51. HOTROD Traditional Log vs Contextual Logs
  • 52. HOTROD Traditional Log vs Contextual Logs
  • 54. HOTROD Identifying source of latency Mysql이 전체 수행 시간에 가장 많은 시간(latency)이 걸림
  • 55. HOTROD Identifying source of latency Zoom in 을 통해서 보면 route service가 parallel이 아니라 3개씩 쌍으로 수행되는 것으로 나타남.
  • 56. HOTROD Identifying source of latency 화면에서 같은 사용자로 Request를 증가 시키면 mysql의 수행시간이 1초 이상으로 소요됨 (기존 300ms)
  • 57. HOTROD Identifying source of latency 특정 Driver의 Request를 추적
  • 58. HOTROD Identifying source of latency 내부적으로 log를 보면 block을 걸고 있는 것을 볼 수 있음
  • 59. HOTROD Identifying source of latency examples/hotrod/services/customer/database.go $ ./example-hotrod help $ ./example-hotrod -M -D 100ms all -M은 lock을 없애는 옵션 -D는 db-query-delay를 300ms에서 100ms으로 줄이는 옵션 소스코드를 보면 “d.lock.Lock”을 확인할 수 있다.
  • 60. HOTROD Identifying source of latency Go를 설치하지 않은 경우는 Docker의 옵션으로 추가하면 된다. $ docker run --rm -it --link jaeger -p8080-8083:8080-8083 jaegertracing/example-hotrod:1.6 -M -D 100ms all --jaeger-agent.host-port=jaeger:6831
  • 61. HOTROD Identifying source of latency 다시 다량의 Request를 발생 시키면 여전히 delay를 확인할 수 있다.
  • 63. HOTROD Identifying source of latency services/config/config.go $ ./example-hotrod -M -D 100ms -W 100 all W는 WorkerPoolSize이다 Default 3 변경후 다량의 Request로 결과 확인
  • 64. PART 3 Tracing Standards and EcoSystem
  • 65. Tracing Instrumentation 3 Type of tracing instrumentation § Manual Instrument § Framework Based Manual Instrument § Agent Based automatic instrument
  • 68. Different meanings of tracing Different meanings of tracing: analyzing transactions (1), recording transactions (2), federating transactions (3), describing transactions (4), and correlating transactions (5)
  • 69.
  • 70. Tracing Systems Zipkin and OpenZipkin § 최초의 Open Source Tracing system § 2012년, Twitter 공개 § 많은 지원 생태계 구성 § 자체 Tracer인 Brave API 기반으로 다양한 fraework가 이를 지원 (ex: Spring, Spark, Kafka, gRPC등) § 다른 tracing System과는 data-format level에서 연동 Jaeger § 2015년 Uber에서 만들어졌고, 2017년 Open Source Project 로 공개 § CNCF산하의 프로젝트로 OpenTracing에 집중하면서 많은 주목을 받고 있음 § Zipkin과의 호환 (B3 metadata format) § 직접적인 instrumentation을 제공하지 않으며 OpenTracing-compatible tracer를 이용
  • 71. Tracing Systems SkyWalking § 상대적으로 최신 프로젝트로 중국에서 시작 § 2017년 Apache Foundation 의 incubation project § Tracing 시스템으로 시작해서 full-blown APM solution으로 진화중 § 일부 OpenTracing-compatible(only Java) § 중국에서 많이 사용되는 framework에 대한 Agent-based instrumentation에 집중 X-Ray, Stackdriver § 클라우드 사업자의 Managed 서비스 § 기존에 Zipkin이나 Jaeger를 사용하던 On-Prem과의 연계가 문제가 됨 § 따라서 클라우드 사업자들이 data 표준화에 많은 투자를 하기 시작함
  • 72. Tracing Systems W3C Trace Context § 2017년 World Wide Web Consortium(W3C) 산하 Distributed Tracing Working Group 발족 § 댜양한 vendors, cloud providers, open source projects 참여 § Trace tool간의 interoperability를 제공하기 위한 표준 정의 § Trace Context :Tracing metadata를 주고받을 때 사용할 format에만 focus § 2018년 Trace ID conceptual model 합의 § 2개의 protocol header 제안 1. Traceparent spandID : 16 byte and 8 byte array Traceparent: 00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01 2. Tracestate tracing vendor specific 정보 저장 영역 Tracestate: vendorname1=opaqueValue1,vendorname2=opaqueValue2 3. data 표준화에 많은 투자를 하기 시작함
  • 73. Tracing Systems W3C "Data Interchange Format" § 데이터를 주고 받을때 사용할 수 있는 format를 정의 § Vendor마다의 이해가 달라 진전이 없는 상태 OpenCensus § Google의 Census라는 library의 일부로 시작한 프로젝트 § Misssion : "vendor-agnostic single distribution of libraries to provide metrics collection and tracing for your services." § 기존 Google의 Dapper format 을 그래도 사용 § Google Stackdrivre와 Zipkin, Jaeger의 Metadata와 일치 (Dapper의 후손) § 그렇지만 다른 APM Vendor와는 많이 다름
  • 74. Tracing Systems OpenTracing - Goal of OpenTracing Project § To provide a single API that application and framework developers can use to instrument their code § To enable truly reusable, portable, and composable instrumentation, especially for the other open source frameworks, libraries, and systems § To empower other developers, not just those working on the tracing systems, to write instrumentation for their software and be comfortable that it will be compatible with other modules in the same application that are instrumented with OpenTracing, while not being bound to a specific tracing vendor
  • 75. Understanding Sampling § Tracing data > than business traffic § Most tracing systems sample transactions § Head-based sampling: the sampling decision is made just before the trace is started, and it is respected by all nodes in the graph § Tail-based sampling: the sampling decision is made after the trace is completed / collected § Head-based sampling donʼt catch 0.001 probability anomalous behavior.
  • 76. OpenTelemetry OpenTelemetry is an effort to combine all three verticals into a single set of system components and language-specific telemetry libraries. It is meant to replace both the OpenTracing project, which focused exclusively on tracing, and the OpenCensus project, which focused on tracing and metrics. OpenTelemetry will not initially support logging, though we aim to incorporate this over time.
  • 80. Example of istio tracing Bookinfo Application with Istio
  • 81. Example of istio tracing kubectl port-forward -n istio-system $(kubectl get pod -n istio-system -l app=jaeger -o jsonpath='{.items[0].metadata.name}') 16686:16686 &
  • 82. Example of istio Metics kubectl -n istio-system port-forward $(kubectl -n istio-system get pod -l app=grafana -o jsonpath='{.items[0].metadata.name}') 3000:3000 &
  • 83. Example of Istio service graph kubectl -n istio-system port-forward $(kubectl -n istio-system get pod -l app=kiali -o jsonpath='{.items[0].metadata.name}') 20001:20001
  • 84. References • Mastering Distributed Tracing • Alibaba Cloud ‒ OpenTracing implementation of Jaeger : https://www.alibabacloud.com/help/doc- detail/68035.htm • OpenTracing Best Practices : https://opentracing.io/docs/best-practices/instrumenting-your- application/ • OpenTracing HOTROD : https://medium.com/opentracing/take-opentracing-for-a-hotrod-ride- f6e3141f7941 From Chapter 1