KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx

Known Java APIs,
Unknown
Performance impact!
Ram Lakshmanan
Architect - yCrash

System.out.println() – source code
public void println(String x) {
    synchronized (this) {
        print(x);
        newLine();
    }
}
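
Because println() holds a lock while it writes, request threads that print at the same time queue up behind each other. One way to keep console I/O off the request path is to hand messages to a single background writer. The sketch below is a minimal illustration of that idea, not how Log4j2 is implemented; the class name AsyncConsoleLogger and its methods are hypothetical.

```java
import java.io.PrintStream;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical helper: callers enqueue messages, one thread does the printing.
final class AsyncConsoleLogger implements AutoCloseable {
    // Unique sentinel object; compared by identity to signal shutdown.
    private static final String STOP = new String("__stop__");

    private final BlockingQueue<String> queue = new LinkedBlockingQueue<>();
    private final Thread writer;

    AsyncConsoleLogger(PrintStream out) {
        writer = new Thread(() -> {
            try {
                while (true) {
                    String message = queue.take();
                    if (message == STOP) {
                        break;
                    }
                    out.println(message); // only this thread touches the stream
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                out.flush();
            }
        }, "async-console-writer");
        writer.setDaemon(true);
        writer.start();
    }

    // Cheap for the caller: no console I/O happens on the calling thread.
    void log(String message) {
        queue.add(message);
    }

    // Drains everything queued before close() was called, then stops the writer.
    @Override
    public void close() {
        queue.add(STOP);
        try {
            writer.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    public static void main(String[] args) {
        AsyncConsoleLogger logger = new AsyncConsoleLogger(System.out);
        logger.log("request handled");
        logger.close();
    }
}
```

The queue is unbounded here for brevity; a real application would normally bound it and decide what to do when it fills up.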

Performance Comparison with Log4j2
[Chart: Avg Response Time, Logger vs. System.out.print (y-axis 0 to 0.35), annotated "91% degradation"]

UUID Generation
https://blog.fastthread.io/2022/03/09/java-uuid-generation-performance-impact/

java.util.UUID#randomUUID()
https://tinyurl.com/57s55zfz
Real-world Problem
Entropy

Thread dump analysis report – stack trace
STATE : BLOCKED
java.security.SecureRandom.nextBytes(SecureRandom.java:433)
java.util.UUID.randomUUID(UUID.java:159)
com.buggycompany.jtm.bp.<init>(bp.java:185)
com.buggycompany.jtm.a4.f(a4.java:94)
com.buggycompany.agent.trace.RootTracer.topComponentMethodBbuggycompanyin(RootTracer.java:439)
weblogicx.servlet.gzip.filter.GZIPFilter.doFilter(GZIPFilter.java)
weblogic.servlet.internal.FilterChainImpl.doFilter(FilterChainImpl.java:56)
weblogic.servlet.internal.WebAppServletContext$ServletInvocationAction.wrapRun(WebAppServletContext.java:3730)
weblogic.servlet.internal.WebAppServletContext$ServletInvocationAction.run(WebAppServletContext.java:3696)
weblogic.security.acl.internal.AuthenticatedSubject.doAs(AuthenticatedSubject.java:321)
weblogic.security.service.SecurityManager.runAs(SecurityManager.java:120)
weblogic.servlet.internal.WebAppServletContext.securedExecute(WebAppServletContext.java:2273)
weblogic.servlet.internal.WebAppServletContext.execute(WebAppServletContext.java:2179)
weblogic.servlet.internal.ServletRequestImpl.run(ServletRequestImpl.java:1490)
weblogic.work.ExecuteThread.execute(ExecuteThread.java:256)
weblogic.work.ExecuteThread.run(ExecuteThread.java:221)
Checking Entropy in Linux
cat /proc/sys/kernel/random/entropy_avail
If < 1000, it’s a problem
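
The same check can be done from inside a Java process, for example at startup. This is a minimal sketch that assumes a Linux host exposing the file shown above; the class name EntropyCheck is hypothetical, and the 1000 threshold is the one from this slide.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.OptionalInt;

// Hypothetical helper that reads the kernel's available-entropy counter.
final class EntropyCheck {
    // Threshold taken from the slide: below 1000 is treated as a problem.
    static final int LOW_ENTROPY_THRESHOLD = 1000;
    static final Path ENTROPY_FILE = Paths.get("/proc/sys/kernel/random/entropy_avail");

    // The file holds a single integer followed by a newline.
    static int parse(String content) {
        return Integer.parseInt(content.trim());
    }

    static boolean isLow(int entropyAvailable) {
        return entropyAvailable < LOW_ENTROPY_THRESHOLD;
    }

    // Returns empty when the file is missing or unreadable (e.g. non-Linux hosts).
    static OptionalInt read() {
        try {
            byte[] bytes = Files.readAllBytes(ENTROPY_FILE);
            return OptionalInt.of(parse(new String(bytes, StandardCharsets.US_ASCII)));
        } catch (IOException | NumberFormatException e) {
            return OptionalInt.empty();
        }
    }

    public static void main(String[] args) {
        OptionalInt entropy = read();
        if (!entropy.isPresent()) {
            System.out.println("entropy_avail not readable on this host");
        } else {
            System.out.println("entropy_avail=" + entropy.getAsInt()
                    + (isLow(entropy.getAsInt()) ? " (low)" : " (ok)"));
        }
    }
}
```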

Solution
• RHEL
• Upgrade to RHEL 7 or above
• If below RHEL 7, follow the recommendations given here
• Install the Haveged library (unpredictable random number generator)
• Use /dev/urandom instead of /dev/random
• ‘/dev/random’ and ‘/dev/urandom’ are special files that serve as pseudorandom number generators; reads from ‘/dev/random’ can block when the entropy pool runs low
• ‘/dev/urandom’ does not block. Downside: reduced security due to less randomness
• -Djava.security.egd=file:/dev/urandom

System.getProperty()
https://blog.fastthread.io/2021/10/06/performance-impact-of-java-lang-system-getproperty/

System.getProperty()
• The ‘java.lang.System.getProperty()’ API internally uses the ‘java.util.Hashtable.get()’ API, which is synchronized:
public synchronized V get(Object key) {
    :
    :
}
• If used in a critical code path, it can significantly affect application performance

Real world problem: Atlassian SDK
189 Threads Blocked

Victim Thread Stack trace
http-nio-8080-exec-293
Stack Trace is:
java.lang.Thread.State: BLOCKED (on object monitor)
at java.util.Hashtable.get(Hashtable.java:362)
- waiting to lock <0x0000000080f5e118> (a java.util.Properties)
at java.util.Properties.getProperty(Properties.java:969)
at java.lang.System.getProperty(System.java:756)
at net.java.ao.atlassian.ConverterUtils.enforceLength(ConverterUtils.java:16)
at net.java.ao.atlassian.ConverterUtils.checkLength(ConverterUtils.java:9)
:

Culprit Thread Stack trace
Camel Thread #6 – backboneThreadPool
Stack Trace is:
at java.util.Hashtable.get(Hashtable.java:362)
- locked <0x0000000080f5e118> (a java.util.Properties)
at java.lang.System.getProperty(System.java:756)
at net.java.ao.atlassian.ConverterUtils.enforceLength(ConverterUtils.java:16)
at net.java.ao.atlassian.ConverterUtils.checkLength(ConverterUtils.java:9)
:

Solution
• Upgrade to JDK 11 or above
  • The synchronized Hashtable has been replaced with ConcurrentHashMap
• Cache the values:
// Before: every call goes through System.getProperty()
public static String getAppName() {
    String app = System.getProperty("appName");
    return app;
}
// After: the property is read once, when the class is initialized
private static final String app = System.getProperty("appName");
public static String getAppName() {
    return app;
}
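
When many different properties are read on hot paths, the caching idea can be generalized to a small lock-free lookup table. This is a sketch under assumptions: the class name CachedSystemProperties is hypothetical, and it presumes the properties do not change after startup, since a cached value is never refreshed.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical helper: reads each system property once, then serves it from a cache.
final class CachedSystemProperties {
    private static final ConcurrentMap<String, String> CACHE = new ConcurrentHashMap<>();

    // The first call per key delegates to System.getProperty(); later calls hit the cache.
    // A property that is not set yields null and is not cached, so it is looked up again.
    static String get(String key) {
        return CACHE.computeIfAbsent(key, System::getProperty);
    }

    public static void main(String[] args) {
        System.setProperty("appName", "demo");
        System.out.println(get("appName"));
    }
}
```

The trade-off is staleness: code that calls System.setProperty() later will not be seen by readers of the cache.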

HashMap
https://blog.ycrash.io/2022/04/15/java-hashtable-hashmap-concurrenthashmap-performance-impact/

Interview question
• What is the difference between HashMap and Hashtable?
• But what happens when you do concurrent put() and get() on HashMap?
• How to diagnose a CPU spike?
top -H -p <PROCESS_ID> + Thread dump
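
To line up the two artifacts, the thread ID that top -H prints in decimal has to be matched against the native ID ("nid") in the thread dump. Many HotSpot versions print the nid in hexadecimal, while some newer JDKs print it in decimal, so check the format of your own dump first. A minimal conversion sketch, with a hypothetical class name:

```java
// Hypothetical helper: converts a decimal thread ID from 'top -H' to the hex nid format.
final class ThreadIdHex {
    static String toNid(long decimalThreadId) {
        return "0x" + Long.toHexString(decimalThreadId);
    }

    public static void main(String[] args) {
        // Search the thread dump for the printed value, e.g. nid=0x2eb4
        System.out.println(toNid(11956));
    }
}
```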

360° Troubleshooting artifacts
Open-source script:
https://github.com/ycrash/yc-data-script
1. GC Log
2. Thread Dump
3. Heap Dump (optional)
4. Heap Substitute
5. top
6. ps
7. top -H
8. Disk Usage
9. dmesg
10. netstat
11. ping
12. vmstat
13. iostat
14. Kernel Params
15. App Logs
16. metadata

Real case study: Major bank in Canada
• https://tinyurl.com/j5jnmrxr

Which Map to use?
• ConcurrentHashMap – Safe & fast
• https://blog.ycrash.io/2022/04/15/java-hashtable-hashmap-concurrenthashmap-performance-impact/
[Chart: Execution Time (y-axis 0 to 60)]
HashMap: 3.16
ConcurrentHashMap: 4.26
Hashtable: 56.27
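
A short sketch of why ConcurrentHashMap is the safe choice for shared state: several threads update the same key, and merge() applies each update atomically, so no increments are lost. The class and method names are hypothetical.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical demo: many threads incrementing one counter stored in a ConcurrentHashMap.
final class SharedCounter {
    static int countConcurrently(int threads, int incrementsPerThread) {
        ConcurrentHashMap<String, Integer> hits = new ConcurrentHashMap<>();
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        for (int t = 0; t < threads; t++) {
            pool.execute(() -> {
                for (int i = 0; i < incrementsPerThread; i++) {
                    // merge() is atomic on ConcurrentHashMap
                    hits.merge("requests", 1, Integer::sum);
                }
            });
        }
        pool.shutdown();
        try {
            if (!pool.awaitTermination(1, TimeUnit.MINUTES)) {
                throw new IllegalStateException("workers did not finish in time");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return hits.getOrDefault("requests", 0);
    }

    public static void main(String[] args) {
        System.out.println(countConcurrently(8, 10_000));
    }
}
```

Running the same loop against a plain HashMap is not safe: updates can be lost and the map's internal structure can be corrupted.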

java.util.Collection#clear()
https://blog.ycrash.io/2023/02/18/clear-details-on-java-collection-clear-api/

ArrayList → Object[]
[Diagram: an ArrayList backed by an Object[] array with slots 1 2 3 4 5 6 7 8 … n]
public class ArrayList<E> extends AbstractList<E> {
    :
    :
    transient Object[] elementData;
    :
    :
}

Without invoking clear()
https://tinyurl.com/5x78kvb5

Invoking clear() API
https://tinyurl.com/3fxcackb

Assigning ‘null’
https://tinyurl.com/56emdc7f

Memory Impact
[Chart: ArrayList Size (MB), y-axis 0 to 30]
Just created: 27.5
clear(): 4.64
null: 0

Real world example – Trading app
public void clear() {
    modCount++;
    // clear to let GC do its work
    for (int i = 0; i < size; i++)
        elementData[i] = null;
    size = 0;
}
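
As the source above shows, clear() nulls out the elements but keeps the backing array at its current length. A middle ground between clear() and dropping the reference is to call trimToSize() after clear(), which shrinks the backing array to the (now zero) size. The sketch below is an illustration with hypothetical class and method names; it does not measure memory itself.

```java
import java.util.ArrayList;

// Hypothetical demo of releasing the memory held by a large ArrayList.
final class ListRelease {
    static ArrayList<Long> filled(int count) {
        ArrayList<Long> list = new ArrayList<>();
        for (long i = 0; i < count; i++) {
            list.add(i);
        }
        return list;
    }

    // Empties the list and also releases the large backing array; the list stays usable.
    static int clearAndTrim(ArrayList<Long> list) {
        list.clear();      // elements become unreachable, backing array keeps its length
        list.trimToSize(); // backing array shrinks to match size 0
        return list.size();
    }

    public static void main(String[] args) {
        ArrayList<Long> records = filled(1_000_000);
        System.out.println(clearAndTrim(records));
        records = null; // alternative: drop the reference so the whole list can be collected
    }
}
```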

Threads
https://blog.fastthread.io/2023/02/22/java-virtual-threads-easy-introduction/

Typical Architecture
[Diagram: HTTP(S) request → Application Server (Server Thread Pool) → JDBC, SOAP, Mainframe, REST]

1 million threads
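// Requires: import java.util.concurrent.TimeUnit;
for (int i = 0; i < 1_000_000; i++) {
    new Thread(new Runnable() {
        @Override
        public void run() {
            try {
                // Keep each thread alive so that all of them exist at the same time
                TimeUnit.HOURS.sleep(1);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }).start();
}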

Performance Comparison
| | Thread Count | Memory Size | Thread Analysis | Heap Analysis |
| Platform Threads | 1599. After that OutOfMemoryError | 1.85 MB | https://tinyurl.com/ntfastthread | https://tinyurl.com/ntheaphero |
| Virtual Threads | 1 million. No issues | 401 MB | https://tinyurl.com/vtfastthread | https://tinyurl.com/vtheaphero |
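
For reference, a virtual-thread counterpart to the platform-thread loop might look like the sketch below. It assumes JDK 21 or later, where Thread.ofVirtual() is a final API; the class and method names are hypothetical, and the thread count and sleep time are parameters so it can be tried with small values first.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical demo (assumes JDK 21+): start many virtual threads and wait for them.
final class VirtualThreadDemo {
    // Returns how many threads finished their sleep normally.
    static int runVirtualThreads(int count, long sleepMillis) {
        AtomicInteger completed = new AtomicInteger();
        List<Thread> threads = new ArrayList<>(count);
        for (int i = 0; i < count; i++) {
            threads.add(Thread.ofVirtual().start(() -> {
                try {
                    TimeUnit.MILLISECONDS.sleep(sleepMillis);
                    completed.incrementAndGet();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }));
        }
        for (Thread thread : threads) {
            try {
                thread.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                break;
            }
        }
        return completed.get();
    }

    public static void main(String[] args) {
        System.out.println(runVirtualThreads(100_000, 100));
    }
}
```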

Threads Architecture
[Diagram: Native Memory holds six Platform Threads (PT), each paired with an Operating System Thread (OT)]
OT: Operating System Thread
PT: Platform Thread

[Diagram: Virtual Threads (VT) in the Java Heap; Platform Threads (PT) and Operating System Threads (OT) in Native Memory]
VT: Virtual Thread
OT: Operating System Thread
PT: Platform Thread

Surprise
https://blog.gceasy.io/2021/07/08/i-dont-have-to-worry-about-garbage-collection-is-it-true/
- Java 

Garbage Collection
HTTP Request
Objects
Memory
Garbage

How objects are garbage collected?
Evolution: Manual -> Automatic
3-4 decades ago: the developer wrote code to manually evict garbage
Now: the JVM automatically evicts garbage

Automatic GC sounds good, right?
Yes, except for:
• CPU consumption
• GC pauses

Real world – Long GC Pause in Top Cloud Provider
https://blog.gceasy.io/2022/03/04/garbage-collection-tuning-success-story-reducing-young-gen-size/

What is GC throughput?
Amount of time the application spends processing customer transactions
vs
Amount of time the application spends processing garbage collection activity
How does 96% GC Throughput sound?
1 day = 1440 minutes (i.e., 24 hours x 60 minutes)
96% GC throughput means the app pauses for 57.6 minutes/day (4% of 1440 minutes)
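
The arithmetic above generalizes to any throughput figure. A minimal sketch, with a hypothetical class name, that turns a GC throughput percentage into pause minutes per day:

```java
// Hypothetical helper for the GC throughput arithmetic on this slide.
final class GcThroughput {
    static final double MINUTES_PER_DAY = 24 * 60; // 1440

    // Pause time is the share of the day NOT spent processing transactions.
    static double pauseMinutesPerDay(double throughputPercent) {
        if (throughputPercent < 0 || throughputPercent > 100) {
            throw new IllegalArgumentException("throughput must be between 0 and 100");
        }
        return MINUTES_PER_DAY * (100.0 - throughputPercent) / 100.0;
    }

    public static void main(String[] args) {
        System.out.printf("96%% throughput: %.1f minutes/day paused%n", pauseMinutesPerDay(96.0));
        System.out.printf("99%% throughput: %.1f minutes/day paused%n", pauseMinutesPerDay(99.0));
    }
}
```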

Real world – Largest Automobile manufacturer
https://blog.gceasy.io/2022/08/25/automobile-company-optimizes-performance-using-gceasy/
| | Avg response time (secs) | Transactions > 25 secs (%) |
| Baseline | 1.88 | 0.7 |
| GC settings #2 | 1.36 | 0.12 |
49.46% Improvement
GC tuning improves the entire app’s response time

More Case Studies
Uber Saves Millions of $
Major Cloud Provider improves its SLA
CloudBees (Jenkins Parent company) optimizes
https://blog.gceasy.io/2019/08/01/cloudbees-gc-performance-optimized-with-gceasy/
Oracle optimizes App performance by tuning GC
https://blog.gceasy.io/2022/12/06/oracle-architect-optimizes-performance-using-gceasy/

Large SaaS Company CEO’s tweet

How to tune GC Performance?
Free Video: https://www.youtube.com/watch?v=6G0E4O5yxks
Online Training: https://ycrash.io/java-performance-training

Thank you friends!
Ram Lakshmanan
ram@tier1app.com
@tier1app
linkedin.com/company/ycrash
This deck will be published in: https://blog.fastthread.io
