Practical stress testing and tuning of Cocoon applications.
It is best to know, before you go live, whether your new site will stand up to the traffic you expect it to receive.
There is only so much you can tell about the speed and capacity of a web project from browsing it by hand.
This talk offers practical advice, garnered from real-life experience, about how to measure and tune the performance of web applications, using free tools like JMeter, the Open Source load-tester from Apache.
The talk will cover planning your tests, setting up the tools, advice on what data to capture, how to interpret it, and some of the possibilities for tuning your project.
It will help you answer questions like: How many users will it take to break my site? How can I handle more users, faster? How reliable will my site be? How can I measure the effect of my changes during development? How do I compare different implementations of the same functionality? How do I determine the right size for my server infrastructure?
The photo-credit links are missing from the Flash version, here they are:
<ul>
<li><a href="http://www.flickr.com/people/jblndl/">Môsieur J</a> (<a href="http://www.flickr.com/photos/jblndl/414818918/">Untitled</a>)</li>
<li><a href="http://www.flickr.com/people/snype451/">Brian Uhreen</a> (<a href="http://www.flickr.com/photos/snype451/126780256/">Sometimes; you have to take life by the horns...</a>)</li>
<li><a href="http://www.flickr.com/people/pgoyette/">Paul Goyette</a> (<a href="http://www.flickr.com/photos/pgoyette/168267714/">interlocking</a>)</li>
<li><a href="http://www.flickr.com/people/fabiovenni/">Fabio Venni</a> (<a href="http://www.flickr.com/photos/fabiovenni/271175015/">D'oh!</a>)</li>
<li><a href="http://www.flickr.com/people/thespeak/">Eric</a> (<a href="http://www.flickr.com/photos/thespeak/150544582/">3</a>)</li>
<li><a href="http://www.flickr.com/people/woneffe/">woneffe|Cotuit</a> (<a href="http://www.flickr.com/photos/woneffe/76448398/">demolition</a>)</li>
</ul>
Break My Site
1. a talk for CocoonGT 2007 by Jeremy Quinn
Break My Site
practical stress testing and tuning of Cocoon applications
photo credit: Môsieur J
1
This is designed as a beginner’s talk. I am the beginner.
2. scenario
[Graph: "Original" configuration. Page Load (secs), Throughput (pages/min) and Stability %, plotted against Users (1 to 100). Approximately 600 hits per data-point.]
2
My starting point for optimisation
You may be able to see how useless one person clicking in a browser is for testing; the server
never gets loaded.
Disregard the absolute speed values; this was done on my laptop.
3. scenario
[Graph: "Original" configuration, same axes as before. Annotations mark the average seconds per page climbing steeply ("OUCH !!!!") and the throughput in pages per minute collapsing ("EEEK !!!"). Approximately 600 hits per data-point.]
3
The original server configuration, taken to only 30 users, with disastrous effect.
Throughput drops to single figures, page load time climbs sky-high, and stability drops by
nearly half.
From the logs I could see there were not enough threads or memory to cope.
This is badly broken.
4. scenario
[Graph: accumulated optimisations, all panels at the same scale. Two panels, "Original" and "Memory & Thread Optimisation", each plotting Page Load (secs), Throughput (pages/min) and Stability % against Users (1 to 100), with a dashed line marking what's possible. Approximately 600 hits per data-point.]
4
Memory and Thread optimisation has made a tremendous difference.
Now I can easily test up to 100 users, before I could not go past 30.
Page Load time, Throughput etc. now scale linearly with the increase in users.
At some point this configuration will also ‘break’; you need to know where that point is
so that you can plan to avoid it.
5. scenario
[Graph: accumulated optimisations, all panels at the same scale. Three panels: "Original", "Memory & Thread Optimisation" and "Read Optimisation", each plotting Page Load (secs), Throughput (pages/min) and Stability % against Users (1 to 100), with a dashed line marking what's possible. Approximately 600 hits per data-point.]
5
The server is doing slightly less work due to this optimisation
This shows as the gradients flattening out a bit more.
6. scenario
[Graph: accumulated optimisations, all panels at the same scale. Four panels: "Original", "Memory & Thread Optimisation", "Read Optimisation" and "List Optimisation", each plotting Page Load (secs), Throughput (pages/min) and Stability % against Users (1 to 100), with a dashed line marking what's possible. Approximately 600 hits per data-point.]
6
Another optimisation, further reducing the work done on the server.
Flatter curves, better performance.
This gives you an idea of the scale of improvement you may be able to make.
In my case it is quite dramatic!
7. purpose
what do I hope to achieve?
photo credit: Brian Uhreen
7
Load testing can be used to solve different sorts of problems.
You don’t want your site broken by popularity.
Find out how much traffic will make a site break, so you can plan to cope.
8. comparing changes
• code optimisations
• different implementations
• caching strategies
• different datasources
• different server architectures
8
Load testing can be used during development to test changes.
9. test content
• test the content of pages
• test the behaviour of webapps
9
JMeter also has exhaustive tools for crawling and testing content within web pages.
It can also send parameters as POST requests etc. to help you test webapps.
10. gauge your server
• get a specification
• how many of what power of machine?
• choosing and tuning a load-balancer
10
Before you start to work out what servers you need, you have to know what level of traffic the
site is intended to cope with.
This has to come from either the implementation you are replacing or from marketing.
Test the machines you plan to use, so you know what capacity they can handle.
Round-robins with machines of different capacities may work really badly.
11. planning
what am I going to do?
photo credit: Paul Goyette
11
Plan well and hopefully you will get useful data
12. what can I change?
• implementations
• server configuration
• memory
• threads
• system architecture
12
What kind of changes do you want to compare?
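For the memory and thread settings specifically, the knobs look roughly like this. A minimal sketch only: the values below are invented starting points, not recommendations from the talk, and the right numbers can only be found by testing.

```shell
# Illustrative JVM options for the container running the webapp.
# -Xms/-Xmx set the initial/maximum heap size; -verbose:gc logs garbage
# collection, so GC pauses can be matched to spikes on the JMeter graph.
# These values are invented examples; find yours by testing.
JAVA_OPTS="-Xms512m -Xmx1024m -verbose:gc"
export JAVA_OPTS
echo "$JAVA_OPTS"
```

Note that the request-handling thread pool is normally sized in the servlet container's own configuration (for example Jetty's thread-pool settings), not with JVM flags.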
13. what will happen?
• build a mental model
• predict what you expect to happen
• learn to read the data
• test your ideas against reality
13
As your tests run, relate events in the server logs to how the graphs look
14. order of change
• only make one change at a time
• only make one change at a time
• only make one change at a time
• only make one change at a time
14
If there are several changes you can make,
it is important to plan what order you make them in.
15. order of change
• have you got that?
15
If you make more than one change at a time
you don’t know which change had what effect,
which makes it more difficult to build a mental model of what is happening.
16. order of change
• in the most meaningful order
• which depends on your purpose
16
I tested optimisations before playing with threads and memory
For each change, I ran the tests again.
Optimisations may require different ideal thread and memory settings.
17. what tools?
on MacOSX I used :
• JMeter
• Terminal
• Console
• Activity Monitor
• Grab
17
The tools I used to capture and watch real-time performance and issues
18. what tools?
on MacOSX I used :
• JMeter
• Terminal
• Console
• Activity Monitor
• Grab
18
I found it was fine to run JMeter on the server for running low level tests.
It is better to run JMeter on another machine though.
For very high throughput testing, you can aggregate results from many JMeter clients.
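Aggregating results from several clients can be as simple as interleaving their result files by timestamp. A sketch, assuming CSV output with the timestamp in the first column; the file names and the simplified four-column layout are invented for the example.

```shell
# Merge result files from two JMeter client machines into one,
# ordered by timestamp (first CSV column), for combined analysis.
cat > clientA.jtl <<'EOF'
1000,120,home,200
3000,130,home,200
EOF
cat > clientB.jtl <<'EOF'
2000,150,home,200
4000,110,home,200
EOF
# Numeric sort on the timestamp field interleaves the two runs correctly
sort -t, -k1,1n clientA.jtl clientB.jtl > combined.jtl
cat combined.jtl
```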
19. what tools?
on MacOSX I used :
• JMeter
• Terminal
• Console
• Activity Monitor
• Grab
19
I can watch Cocoon in its Terminal window.
See errors as they happen
20. what tools?
on MacOSX I used :
• JMeter
• Terminal
• Console
• Activity Monitor
• Grab
20
I used the Console to filter and watch Jetty’s logs for the repository.
21. what tools?
on MacOSX I used :
• JMeter
• Terminal
• Console
• Activity Monitor
• Grab
21
In Activity Monitor I can track threads, memory and CPU usage.
Here we can see the two Java processes.
The top one is Cocoon, the bottom one the repository.
Here you can see that, due to bottlenecks, the system is being under-utilised.
IMHO both cores should be working fully on a well-tuned system.
22. setup
what do I do to get ready?
photo credit: Fabio Venni
22
getting started
23. make a test
keep it simple, stupid
23
Here is the simplest way to use JMeter
24. start JMeter
24
Starting JMeter from the command-line
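In rough terms the commands look like this; the test-plan file name is an invented example, while `-n`, `-t` and `-l` are JMeter's standard non-GUI, test-plan and results-file flags.

```shell
# From the bin directory of the JMeter install: launch the GUI
./jmeter

# For actual load runs, non-GUI mode puts less load on the client machine:
#   -n  run in non-GUI mode
#   -t  the test plan to execute (file name here is hypothetical)
#   -l  the file to log sample results to
./jmeter -n -t break-my-site.jmx -l results.jtl
```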
25. start a plan
25
Start a new plan, give it a name
26. threads
26
Set up the number of threads you want to test with
27. request
27
Add an HTTP Request to the Thread Group
Set up the server’s IP, port and path
28. recording
28
A graph for plotting results
If you are running many tests, make sure you give everything suitable names and comments
so you know what you were testing.
The test output can be written to file for further analysis.
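Once the output is in a file, even a one-liner can pull out the failures. A sketch, assuming a simplified four-column CSV of timestamp (ms), elapsed (ms), label and response code; real JMeter result files have more columns and a configurable format.

```shell
# Scan a (simplified) JMeter results file for failed samples.
cat > sample.jtl <<'EOF'
1000,120,home,200
1500,140,home,200
2200,310,home,500
EOF
# Print any sample whose response code is not 200
awk -F, '$4 != 200 {print "FAIL at", $1, "code", $4}' sample.jtl
# prints: FAIL at 2200 code 500
```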
29. check everything
• content of datafeeds
• behaviour of pipelines
• outcome of transformations
29
Check every stage, so you know everything is working properly and set up as you expected.
Do this before you start testing.
Otherwise you might not be testing what you think you are.
30. caching
• repository?
• datasource?
• pipeline?
30
Make sure you know the state of all the caching,
so it matches what you want to test.
31. warm-up
• hit your test page(s) with a browser
• last chance for a visual check
• get Cocoon up to speed
31
Warm-up Cocoon.
Hit the pages you want to test in a browser to make sure they work.
32. be local
• clients and server on the same subnet
• avoid possible network issues
32
Be on the server's local subnet.
33. enough clients?
• creating many threads
• creates its own overhead
33
Have enough clients to perform the level of testing you need.
34. quit processes
• everything that is unnecessary
• periodic processes skew your results
34
Letting something like your email client fire off periodic checks for new mail will skew your
results.
35. record your data
• server logs
• JMeter graphs
• JMeter data file (XML or CSV ?)
• screen grabs of specific events
• test it !!!!!
35
Prepare properly, to make sure all of your data will be recorded.
Grab relevant sections of logs.
Grab graphs after a run.
Choose what data to write to a file.
36. running tests
time for a cup of tea?
photo credit: thespeak
36
No, you have lots to do.
37. run-time data
• clear log windows on each run
• keep it organised
• use a file or folder naming convention
37
Record the real-time data you need.
I clear the log windows for each run so that, after the run is finished, I can easily scroll back
looking for exceptions etc.
Use naming conventions for multiple runs.
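One possible convention (the folder layout and names here are my own invention): a timestamped directory per run, with the variable under test encoded in the name.

```shell
# One timestamped folder per run, variable under test in the folder name.
run="results/$(date +%Y%m%d-%H%M%S)-threads30-memopt"
mkdir -p "$run"
: > "$run/results.jtl"   # point JMeter at this file with -l "$run/results.jtl"
ls "$run"
```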
38. watch it run
• part of the learning process
• see where the problems happen
• see what you are fixing
• make the link between cause and effect
• take snapshots
38
Watch the tools as the tests run.
Learn to read the graphs.
39. my screen
39
I can follow errors and timings changing in Cocoon’s output.
Errors and query times changing for the repo, in Jetty’s logs.
Thread, memory and cpu usage in Activity Monitor.
40. interpreting
the results
what does it all mean?
photo credit: woneffe
40
The key for me, to begin to learn how to interpret the results, was to watch the tests live,
relating what appeared on the graph to what was happening (errors, slowdowns, garbage
collection etc.) on the server.
41. reading the graph
• load speed
• throughput
• deviation
• hits
41
NB. Unless you are testing on real hardware, these speeds have no meaning in isolation
but are useful for comparison
42. reading the graph
• load speed
• throughput
• deviation
• hits
42
Page load speeds in milliseconds.
Lower is better
On a stable system, it should go flat
43. reading the graph
• load speed
• throughput
• deviation
• hits
43
Throughput in pages per second.
Higher is better
On a stable system, it should go flat
44. reading the graph
• load speed
• throughput
• deviation
• hits
44
The deviation (variability) of the load speed in milliseconds
Lower is better
On a stable system, it should go flat
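All three of these numbers can be recomputed from the raw samples. A sketch, again assuming a simplified CSV layout of timestamp (ms), elapsed (ms), label and response code; the data is invented for the example.

```shell
# Recompute the graph's numbers from raw samples.
cat > results.jtl <<'EOF'
1000,100,home,200
2000,200,home,200
3000,300,home,200
4000,200,home,200
EOF

# Average load time, and deviation (standard deviation of elapsed times)
awk -F, '{s+=$2; q+=$2*$2; n++}
         END {m=s/n; printf "avg=%.0fms dev=%.0fms\n", m, sqrt(q/n - m*m)}' results.jtl
# prints: avg=200ms dev=71ms

# Throughput: samples per second across the span of the run
awk -F, 'NR==1 {t0=$1} {t1=$1; n++}
         END {printf "throughput=%.1f/sec\n", n*1000/(t1-t0)}' results.jtl
# prints: throughput=1.3/sec
```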
45. reading the graph
• load speed
• throughput
• deviation
• hits
45
The black dots are individual sample times
46. reading the graph
• load speed
• throughput
• deviation
• hits
46
Sometimes JMeter draws data off the graph.
I was using a beta; this may have been fixed in the latest release.
47. chaotic start
• test for long enough
• JMeter ramps threads
• JVMs warm up
• Servlet adds threads
47
Your tests need to run for long enough for you to get meaningful results.
I ran a minimum of 600 hits per test
48. what’s this sawtooth?
• hysteresis?
• randomise requests?
48
You can sometimes see a sawtooth in the throughput.
What causes this?
Is it important?
This is what I think is happening ....
Maybe you could remove the effect by randomising requests.
49. what’s this sawtooth?
processing
• hysteresis?
• randomise requests?
49
Throughput slows as jobs build up and threads process.
50. what’s this sawtooth?
• hysteresis?
• randomise requests?
sending
50
Responses come in spurts as jobs complete.
JMeter issues new requests.
51. what’s this sawtooth?
• hysteresis?
• randomise requests?
51
The change in gradient can tell you how the rate of increase in throughput is changing (?)
52. shi‘appen
52
Learn to relate problems on the server to their effect on the graph.
Big spikes indicate that you have problems.
You may see the effects of:
exceptions
garbage collection
53. good signs
• speed flattening
• throughput flattening
• deviation dropping
• no exceptions ; )
53
When the values on the graph begin to flatten out,
it shows that the system has become stable at that load.
54. more tests
• authorise
• fill in data
• upload files
• wrap tests in logic
• etc. etc.
54
I used the simplest possible configuration in JMeter.
I was only hitting one url.
It does a lot more.
55. more protocols
• JDBC
• LDAP
• FTP
• AJP
• JMS
• POP, IMAP
• etc. etc.
55
You can apply tests to many different parts of your infrastructure.
56. conclusions
• design to cope
• test early, test often
• plan well
• pay attention
• capture everything
• maintain a model
56