We would like to share our story how we troubleshot our spark jobs performance using JVM profiler and InfluxDB.
Speaker - Igor Mastesnyi, Senior Data Engineer @ AppsFlyer Data Group.
12. Flame graphs
12
Rectangle - stack fram - function on stack
Y - stack depth
Х - stack samples set. Sorted in alphabet order!
Width - % - of stack traces / total stack traces
19. What we’ve got
19
Instrument to analyze Spark oriented code in depth
Tool to check performance of code changes/optimizations
Performance boost up to 16%