Profiling on steroids: Making Apache Spark Fast & Furious

•

1 gefällt mir•415 views

I

Ivan Kosianenko

We would like to share our story how we troubleshot our spark jobs performance using JVM profiler and InfluxDB. Speaker - Igor Mastesnyi, Senior Data Engineer @ AppsFlyer Data Group.

Daten & Analysen

- Proprietary & Confidential -
Profiling on steroids
Making Apache Spark Fast & Furious

Who am I
2
Big data engineer at Appsflyer
ML enthusiast
IoT amateur

Few things about Appsflyer
3
XX TBs data x0000 applications history
x000 EC2s
We r o g a n ti an v a er p it .
70B events a day

4
DAG
Metrics
Event timeline
Performance Optimization

Profiling
5
Find the hot path
What was really optimized
Quick look under the hood

Profiling tools
6

Profiling
7
System profilers
JVM profilers
Distributed profiling?

StatsD JVM profiler
8
JVM
Profiler agent:
ThreadMXBean.dumpThreads0
Application

9
JVM
Profiler agent:
ThreadMXBean.dumpThreads0
Application
JVM
Profiler agent:
ThreadMXBean.dumpThreads0
Application
JVM
Profiler agent:
ThreadMXBean.dumpThreads0
Application
JVM
Profiler agent:
ThreadMXBean.dumpThreads0
Application
JVM
Profiler agent:
ThreadMXBean.dumpThreads0
Application
JVM
Profiler agent:
ThreadMXBean.dumpThreads0
Application

Configuring
10
javaagent:./statsd-...jar-with-dependencies.jar=
server=influxdb.master.msp.com,
reporter=InfluxDBReporter,
database=profiler,username=profiler,password=pass,port=8086,
prefix=20190413,
tagMapping=barge,
httpServerEnabled=false,
packageBlacklist= io.netty.util.concurrent:io.netty.channel.nio:org.spark_p
roject.jetty.util.thread:org.apache.hadoop.net.unix:org.spark_project.jett
y.serve

Export
11
python influxdb_dump.py -o influxdb.imasternsinf.msp.com
-r 8086
-u profiler
-p profiler
-d profiler
-e 20190413
-t barge > test/executors-thread-dump-2019-04-13
(‘cpu.trace.com-amazonaws-AmazonWebServiceClient-computeServiceName-703.
com-amazonaws-AmazonWebServiceClient-getServiceNameIntern-676.
com-amazonaws-AmazonWebServiceClient-computeSignerByURI-278.
com-amazonaws-AmazonWebServiceClient-setEndpoint-160.
com-amazonaws-services-s3-AmazonS3Client-setEndpoint-475.com-amazonaws-services-s3-AmazonS3Client-init-447.com
-amazonaws-services-s3-AmazonS3Client-<init>-391.com-amazonaws-services-s3-AmazonS3Client-<init>-371.org-apach
e-hadoop-fs-s3a-S3AFileSystem-initialize-235.org-apache-hadoop-fs-FileSystem-createFileSystem-2669.org-apache-
hadoop-fs-FileSystem-access$200-94.org-apache-hadoop-fs-FileSystem$Cache-getInternal-2703.org-apache-hadoop-fs
-FileSystem$Cache-get-2685.org-apache-hadoop-fs-FileSystem-get-373.org-apache-hadoop-fs-Path-getFileSystem-295
.org-apache-parquet-hadoop-ParquetFileReader-<init>-565...’, 1

Flame graphs
12
Rectangle - stack fram - function on stack
Y - stack depth
Х - stack samples set. Sorted in alphabet order!
Width - % - of stack traces / total stack traces

Analyzing results

What we’ve got
19
Instrument to analyze Spark oriented code in depth
Tool to check performance of code changes/optimizations
Performance boost up to 16%

http://psy-lob-saw.blogspot.com/2016/02/why-most-sampling-java-profilers-are.html
https://github.com/etsy/statsd-jvm-profiler
https://github.com/cerndb/Hadoop-Profiler/tree/mast
er/src
http://www.brendangregg.com/
https://www.youtube.com/watch?v=QiGrTvsCZmA
Links

Thanks!igor.masternyi@appsflyer.com
@igormasternoy

Empfohlen

An experiment in agile threat modelling

An experiment in agile threat modelling

An experiment in agile threat modellingDevSecCon

Autonomous Incident and Root Cause Detection

Autonomous Incident and Root Cause Detection

Autonomous Incident and Root Cause DetectionDevOps.com

Using Machine Learning on K8s Logs to Find Root Cause Faster

Using Machine Learning on K8s Logs to Find Root Cause Faster

Using Machine Learning on K8s Logs to Find Root Cause FasterLibbySchulze

Python + MPP Database = Large Scale AI/ML Projects in Production Faster

Python + MPP Database = Large Scale AI/ML Projects in Production Faster

Python + MPP Database = Large Scale AI/ML Projects in Production FasterPaige_Roberts

HTTP Event Collector, Simplified Developer Logging

HTTP Event Collector, Simplified Developer Logging

HTTP Event Collector, Simplified Developer LoggingGlenn Block

AutoBLG by Sun Bo

AutoBLG by Sun Bo

AutoBLG by Sun Bo mori_tatsuya

Different Methodology To Recon Your Targets

Different Methodology To Recon Your Targets

Different Methodology To Recon Your TargetsEslamAkl

Application Security Workshop

Application Security Workshop

Application Security Workshop Priyanka Aash

Empfohlen

An experiment in agile threat modelling

An experiment in agile threat modelling

An experiment in agile threat modellingDevSecCon

Autonomous Incident and Root Cause Detection

Autonomous Incident and Root Cause Detection

Autonomous Incident and Root Cause DetectionDevOps.com

Using Machine Learning on K8s Logs to Find Root Cause Faster

Using Machine Learning on K8s Logs to Find Root Cause Faster

Using Machine Learning on K8s Logs to Find Root Cause FasterLibbySchulze

Python + MPP Database = Large Scale AI/ML Projects in Production Faster

Python + MPP Database = Large Scale AI/ML Projects in Production Faster

Python + MPP Database = Large Scale AI/ML Projects in Production FasterPaige_Roberts

HTTP Event Collector, Simplified Developer Logging

HTTP Event Collector, Simplified Developer Logging

HTTP Event Collector, Simplified Developer LoggingGlenn Block

AutoBLG by Sun Bo

AutoBLG by Sun Bo

AutoBLG by Sun Bo mori_tatsuya

Different Methodology To Recon Your Targets

Different Methodology To Recon Your Targets

Different Methodology To Recon Your TargetsEslamAkl

Application Security Workshop

Application Security Workshop

Application Security Workshop Priyanka Aash

Data analytics master class: predict hotel revenue

Data analytics master class: predict hotel revenue

Data analytics master class: predict hotel revenueKris Peeters

Lessons Learnt from Running Thousands of On-demand Spark Applications

Lessons Learnt from Running Thousands of On-demand Spark Applications

Lessons Learnt from Running Thousands of On-demand Spark ApplicationsItai Yaffe

Выявление и локализация проблем в сети с помощью инструментов Riverbed

Выявление и локализация проблем в сети с помощью инструментов Riverbed

Выявление и локализация проблем в сети с помощью инструментов RiverbedElena Marianenko

YOW2018 Cloud Performance Root Cause Analysis at Netflix

YOW2018 Cloud Performance Root Cause Analysis at Netflix

YOW2018 Cloud Performance Root Cause Analysis at NetflixBrendan Gregg

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습Oracle Korea

CQRS and Event Sourcing

CQRS and Event Sourcing

CQRS and Event Sourcing Inho Kang

WebSphere Technical University: Top WebSphere Problem Determination Features

WebSphere Technical University: Top WebSphere Problem Determination Features

WebSphere Technical University: Top WebSphere Problem Determination FeaturesChris Bailey

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...Martin Etmajer

Elefrant [ng-Poznan]

Elefrant [ng-Poznan]

Elefrant [ng-Poznan]Marcos Latorre

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...Databricks

Apache Eagle: Architecture Evolvement and New Features

Apache Eagle: Architecture Evolvement and New Features

Apache Eagle: Architecture Evolvement and New FeaturesHao Chen

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS Summit

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS Summit

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS SummitAmazon Web Services

Intro to open source telemetry linux con 2016

Intro to open source telemetry linux con 2016

Intro to open source telemetry linux con 2016Matthew Broberg

Sherlock Homepage - A detective story about running large web services - WebN...

Sherlock Homepage - A detective story about running large web services - WebN...

Sherlock Homepage - A detective story about running large web services - WebN...Maarten Balliauw

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Codemotion

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...Demi Ben-Ari

Dataservices: Processing Big Data the Microservice Way

Dataservices: Processing Big Data the Microservice Way

Dataservices: Processing Big Data the Microservice WayQAware GmbH

High Availability by Design

High Availability by Design

High Availability by DesignDavid Prinzing

Scalawox deeplearning

Scalawox deeplearning

Scalawox deeplearningscalawox

2021 JCConf 使用Dapr簡化Java微服務應用開發

2021 JCConf 使用Dapr簡化Java微服務應用開發

2021 JCConf 使用Dapr簡化Java微服務應用開發Rich Lee

hybrid Seed Production In Chilli & Capsicum.pptx

hybrid Seed Production In Chilli & Capsicum.pptx

hybrid Seed Production In Chilli & Capsicum.pptx9to5mart

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums

Weitere ähnliche Inhalte

Ähnlich wie Profiling on steroids: Making Apache Spark Fast & Furious

Data analytics master class: predict hotel revenue

Data analytics master class: predict hotel revenue

Data analytics master class: predict hotel revenueKris Peeters

Lessons Learnt from Running Thousands of On-demand Spark Applications

Lessons Learnt from Running Thousands of On-demand Spark Applications

Lessons Learnt from Running Thousands of On-demand Spark ApplicationsItai Yaffe

Выявление и локализация проблем в сети с помощью инструментов Riverbed

Выявление и локализация проблем в сети с помощью инструментов Riverbed

Выявление и локализация проблем в сети с помощью инструментов RiverbedElena Marianenko

YOW2018 Cloud Performance Root Cause Analysis at Netflix

YOW2018 Cloud Performance Root Cause Analysis at Netflix

YOW2018 Cloud Performance Root Cause Analysis at NetflixBrendan Gregg

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습Oracle Korea

CQRS and Event Sourcing

CQRS and Event Sourcing

CQRS and Event Sourcing Inho Kang

WebSphere Technical University: Top WebSphere Problem Determination Features

WebSphere Technical University: Top WebSphere Problem Determination Features

WebSphere Technical University: Top WebSphere Problem Determination FeaturesChris Bailey

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...Martin Etmajer

Elefrant [ng-Poznan]

Elefrant [ng-Poznan]

Elefrant [ng-Poznan]Marcos Latorre

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...Databricks

Apache Eagle: Architecture Evolvement and New Features

Apache Eagle: Architecture Evolvement and New Features

Apache Eagle: Architecture Evolvement and New FeaturesHao Chen

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS Summit

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS Summit

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS SummitAmazon Web Services

Intro to open source telemetry linux con 2016

Intro to open source telemetry linux con 2016

Intro to open source telemetry linux con 2016Matthew Broberg

Sherlock Homepage - A detective story about running large web services - WebN...

Sherlock Homepage - A detective story about running large web services - WebN...

Sherlock Homepage - A detective story about running large web services - WebN...Maarten Balliauw

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Codemotion

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...Demi Ben-Ari

Dataservices: Processing Big Data the Microservice Way

Dataservices: Processing Big Data the Microservice Way

Dataservices: Processing Big Data the Microservice WayQAware GmbH

High Availability by Design

High Availability by Design

High Availability by DesignDavid Prinzing

Scalawox deeplearning

Scalawox deeplearning

Scalawox deeplearningscalawox

2021 JCConf 使用Dapr簡化Java微服務應用開發

2021 JCConf 使用Dapr簡化Java微服務應用開發

2021 JCConf 使用Dapr簡化Java微服務應用開發Rich Lee

Ähnlich wie Profiling on steroids: Making Apache Spark Fast & Furious (20)

Data analytics master class: predict hotel revenue

Data analytics master class: predict hotel revenue

Data analytics master class: predict hotel revenue

Lessons Learnt from Running Thousands of On-demand Spark Applications

Lessons Learnt from Running Thousands of On-demand Spark Applications

Lessons Learnt from Running Thousands of On-demand Spark Applications

Выявление и локализация проблем в сети с помощью инструментов Riverbed

Выявление и локализация проблем в сети с помощью инструментов Riverbed

Выявление и локализация проблем в сети с помощью инструментов Riverbed

YOW2018 Cloud Performance Root Cause Analysis at Netflix

YOW2018 Cloud Performance Root Cause Analysis at Netflix

YOW2018 Cloud Performance Root Cause Analysis at Netflix

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습

[Hands-on] CQRS(Command Query Responsibility Segregation) 와 Event Sourcing 패턴 실습

CQRS and Event Sourcing

CQRS and Event Sourcing

CQRS and Event Sourcing

WebSphere Technical University: Top WebSphere Problem Determination Features

WebSphere Technical University: Top WebSphere Problem Determination Features

WebSphere Technical University: Top WebSphere Problem Determination Features

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...

Challenges in a Microservices Age: Monitoring, Logging and Tracing on Red Hat...

Elefrant [ng-Poznan]

Elefrant [ng-Poznan]

Elefrant [ng-Poznan]

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...

A Journey to Building an Autonomous Streaming Data Platform—Scaling to Trilli...

Apache Eagle: Architecture Evolvement and New Features

Apache Eagle: Architecture Evolvement and New Features

Apache Eagle: Architecture Evolvement and New Features

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS Summit

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS Summit

Automatically scaling Kubernetes workloads - SVC215-S - New York AWS Summit

Intro to open source telemetry linux con 2016

Intro to open source telemetry linux con 2016

Intro to open source telemetry linux con 2016

Sherlock Homepage - A detective story about running large web services - WebN...

Sherlock Homepage - A detective story about running large web services - WebN...

Sherlock Homepage - A detective story about running large web services - WebN...

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...

Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...

Dataservices: Processing Big Data the Microservice Way

Dataservices: Processing Big Data the Microservice Way

Dataservices: Processing Big Data the Microservice Way

High Availability by Design

High Availability by Design

High Availability by Design

Scalawox deeplearning

Scalawox deeplearning

Scalawox deeplearning

2021 JCConf 使用Dapr簡化Java微服務應用開發

2021 JCConf 使用Dapr簡化Java微服務應用開發

2021 JCConf 使用Dapr簡化Java微服務應用開發

Kürzlich hochgeladen

hybrid Seed Production In Chilli & Capsicum.pptx

hybrid Seed Production In Chilli & Capsicum.pptx

hybrid Seed Production In Chilli & Capsicum.pptx9to5mart

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

Probability Grade 10 Third Quarter Lessons

Probability Grade 10 Third Quarter Lessons

Probability Grade 10 Third Quarter LessonsJoseMangaJr1

Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...amitlee9823

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann

Detecting Credit Card Fraud: A Machine Learning Approach

Detecting Credit Card Fraud: A Machine Learning Approach

Detecting Credit Card Fraud: A Machine Learning ApproachBoston Institute of Analytics

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Riyadh +966572737505 get cytotec

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...amitlee9823

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...karishmasinghjnh

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7Call Girls in Nagpur High Profile Call Girls

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795

Kürzlich hochgeladen (20)

hybrid Seed Production In Chilli & Capsicum.pptx

hybrid Seed Production In Chilli & Capsicum.pptx

hybrid Seed Production In Chilli & Capsicum.pptx

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

Probability Grade 10 Third Quarter Lessons

Probability Grade 10 Third Quarter Lessons

Probability Grade 10 Third Quarter Lessons

Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...

Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK

Detecting Credit Card Fraud: A Machine Learning Approach

Detecting Credit Card Fraud: A Machine Learning Approach

Detecting Credit Card Fraud: A Machine Learning Approach

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...

➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night

Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed

Profiling on steroids: Making Apache Spark Fast & Furious

1. - Proprietary & Confidential - Profiling on steroids Making Apache Spark Fast & Furious

2. Who am I 2 Big data engineer at Appsflyer ML enthusiast IoT amateur

3. Few things about Appsflyer 3 XX TBs data x0000 applications history x000 EC2s We r o g a n ti an v a er p it . 70B events a day

4. 4 DAG Metrics Event timeline Performance Optimization

5. Profiling 5 Find the hot path What was really optimized Quick look under the hood

6. Profiling tools 6

7. Profiling 7 System profilers JVM profilers Distributed profiling?

8. StatsD JVM profiler 8 JVM Profiler agent: ThreadMXBean.dumpThreads0 Application

9. 9 JVM Profiler agent: ThreadMXBean.dumpThreads0 Application JVM Profiler agent: ThreadMXBean.dumpThreads0 Application JVM Profiler agent: ThreadMXBean.dumpThreads0 Application JVM Profiler agent: ThreadMXBean.dumpThreads0 Application JVM Profiler agent: ThreadMXBean.dumpThreads0 Application JVM Profiler agent: ThreadMXBean.dumpThreads0 Application

10. Configuring 10 javaagent:./statsd-...jar-with-dependencies.jar= server=influxdb.master.msp.com, reporter=InfluxDBReporter, database=profiler,username=profiler,password=pass,port=8086, prefix=20190413, tagMapping=barge, httpServerEnabled=false, packageBlacklist= io.netty.util.concurrent:io.netty.channel.nio:org.spark_p roject.jetty.util.thread:org.apache.hadoop.net.unix:org.spark_project.jett y.serve

11. Export 11 python influxdb_dump.py -o influxdb.imasternsinf.msp.com -r 8086 -u profiler -p profiler -d profiler -e 20190413 -t barge > test/executors-thread-dump-2019-04-13 (‘cpu.trace.com-amazonaws-AmazonWebServiceClient-computeServiceName-703. com-amazonaws-AmazonWebServiceClient-getServiceNameIntern-676. com-amazonaws-AmazonWebServiceClient-computeSignerByURI-278. com-amazonaws-AmazonWebServiceClient-setEndpoint-160. com-amazonaws-services-s3-AmazonS3Client-setEndpoint-475.com-amazonaws-services-s3-AmazonS3Client-init-447.com -amazonaws-services-s3-AmazonS3Client-<init>-391.com-amazonaws-services-s3-AmazonS3Client-<init>-371.org-apach e-hadoop-fs-s3a-S3AFileSystem-initialize-235.org-apache-hadoop-fs-FileSystem-createFileSystem-2669.org-apache- hadoop-fs-FileSystem-access$200-94.org-apache-hadoop-fs-FileSystem$Cache-getInternal-2703.org-apache-hadoop-fs -FileSystem$Cache-get-2685.org-apache-hadoop-fs-FileSystem-get-373.org-apache-hadoop-fs-Path-getFileSystem-295 .org-apache-parquet-hadoop-ParquetFileReader-<init>-565...’, 1

12. Flame graphs 12 Rectangle - stack fram - function on stack Y - stack depth Х - stack samples set. Sorted in alphabet order! Width - % - of stack traces / total stack traces

13. Analyzing results

19. What we’ve got 19 Instrument to analyze Spark oriented code in depth Tool to check performance of code changes/optimizations Performance boost up to 16%

20. http://psy-lob-saw.blogspot.com/2016/02/why-most-sampling-java-profilers-are.html https://github.com/etsy/statsd-jvm-profiler https://github.com/cerndb/Hadoop-Profiler/tree/mast er/src http://www.brendangregg.com/ https://www.youtube.com/watch?v=QiGrTvsCZmA Links

21.

22. Thanks!igor.masternyi@appsflyer.com @igormasternoy