The Statistics of Web Performance Measurement

Introduction
Statistics - I
Statistics - II

The Statistics of Web Performance

Philip Tellis / philip@bluesmoon.info

ConFoo / 2010-03-12

ConFoo / 2010-03-12 The Statistics of Web Performance

Introduction
Statistics - I
Statistics - II

$ ﬁnger philip

Philip Tellis
philip@bluesmoon.info
@bluesmoon
yahoo
geek


Introduction
The goal
Statistics - I
Performance Measurement
Statistics - II

Introduction


Introduction
The goal
Statistics - I
Statistics - II

Accurately measure page performance
At least, as accurately as possible


Introduction
The goal
Statistics - I
Statistics - II

Be unintrusive

If you try to measure something accurately, you will change
something related
– Heisenberg’s uncertainty principle


Introduction
The goal
Statistics - I
Statistics - II

And one number to rule them all


Introduction
The goal
Statistics - I
Statistics - II

Bandwidth

Real bandwidth v/s advertised bandwidth
Bandwidth to your server, not to the ISP
Bandwidth during normal internet usage
If the user’s always watching movies, you’re not winning


Introduction
The goal
Statistics - I
Statistics - II

Latency

How long does it take a byte to get to the user?
Wired, wireless, mobile, satellite?
How many hops in between?
Speed of light is constant
This is not a battle we will soon win.
When was the last time you heard latency mentioned in a
TV ad?
http://www.stuartcheshire.org/rants/Latency.html


Introduction
The goal
Statistics - I
Statistics - II

User perceived page load time

Time from “click on a link” to “spinner stops spinning”
This is what users notice
Depends on how long your page takes to build
Depends on what’s in your page
Depends on how long components take to load
Depends on how long the browser takes to execute and
render


Introduction
The goal
Statistics - I
Statistics - II

We need to measure real user data


Introduction
The goal
Statistics - I
Statistics - II

The statistics apply to any kind of performance data though


Introduction Random Sampling
Statistics - I Margin of Error
Statistics - II Central Tendency

Statistics - I



Disclaimer

I am not a statistician



Population

All possible users of your system



Sample

Representative subset of the population



Bad sample

Sometimes it’s not



How to randomize?

Pick 10% of users at random and always test them

OR

For each user, decide at random if they should be tested
http://tech.bluesmoon.info/2010/01/statistics-of-performance-measurement.html



Select 10% of users - I

if($sessionid % 10 === 0) {
// instrument code for measurement
}

Once a user enters the measurement bucket, they stay
there until they log out
Fixed set of users, so tests may be more consistent
Error in the sample results in positive feedback



Select 10% of users - II

if(rand() < 0.1 * getrandmax()) {
// instrument code for measurement
}

For every request, a user has a 10% chance of being
tested
Gets rid of positive feedback errors, but sample size !=
10% of population



How big a sample is representative?

Select n such that
σ
1.96 √n ≤ 5%µ



Standard Deviation

Standard deviation tells you the spread of the curve
The narrower the curve, the more conﬁdent you can be



MoE at 95% conﬁdence

σ
±1.96 √n



MoE & Sample size

There is an inverse square root correlation between sample
size and margin of error



But wait... it’s not complicated enough.
We have different types of margins of error
...more about that later



Ding dong



One number

Mean (Arithmetic)
Good for symmetric curves
Affected by outliers

Mean(10, 11, 12, 11, 109) = 30



One number

Median
Middle value measures central tendency well
Not trivial to pull out of a DB

Median(10, 11, 12, 11, 109) = 11



One number

Mode
Not often used
Multi-modal distributions suggest problems

Mode(10, 11, 12, 11, 109) = 11



Other numbers

A percentile point in the distribution: 95th , 98.5th or 99th
Used to find out the worst user experience
Makes more sense if you filter data first

P95th (10, 11, 12, 11, 109) = 12



Other means

Geometric mean
Good if your data is exponential in nature
(with the tail on the right)

GMean(10, 11, 12, 11, 109) = 16.68



Wait... how did I get that?

N
ΠN xi — could lead to overﬂow
i=1

ΣN loge (xi )
i=1
N
e — computationally simpler



Other means

And there is also the Harmonic mean, but forget about that



...though consequently

We have other margins of error
Geometric margin of error
Uses geometric standard deviation
Median margin of error
Uses ranges of actual values from data set
Stick to the arithmetic MoE
– simpler to calculate, simpler to read and not incorrect


Introduction
Filtering
Statistics - I
The Log-Normal distribution
Statistics - II

Statistics - II


Introduction
Filtering
Statistics - I
Statistics - II

Outliers

Out of range data points
Nothing you can ﬁx here
There’s even a book about
them


Introduction
Filtering
Statistics - I
Statistics - II

DNS problems can cause outliers

2 or 3 DNS servers for an ISP
30 second timeout if ﬁrst fails
... 30 second increase in page load time
Maybe measure both and ﬁx what you can
http://nms.lcs.mit.edu/papers/dns-ton2002.pdf


Introduction
Filtering
Statistics - I
Statistics - II

Band-pass ﬁltering


Introduction
Filtering
Statistics - I
Statistics - II

Band-pass ﬁltering

Strip everything outside a reasonable range
Bandwidth range: 4kbps - 4Gbps
Page load time: 50ms - 120s
You may need to relook at the ranges all the time


Introduction
Filtering
Statistics - I
Statistics - II

IQR ﬁltering


Introduction
Filtering
Statistics - I
Statistics - II

IQR ﬁltering

Here, we derive the range from the data


Introduction
Filtering
Statistics - I
Statistics - II

Let’s look at some real charts


Introduction
Filtering
Statistics - I
Statistics - II

Bandwidth distribution for web devs

x-axis is linear


Introduction
Filtering
Statistics - I
Statistics - II

Now let’s use log(kbps) instead of kbps

x-axis is exponential


Introduction
Filtering
Statistics - I
Statistics - II

Exponential == Geometric

Categories/Buckets grow exponentially
Data is related geometrically
Use the geometric mean and geometric margin of error
gmean
Error _range = /gmoe , gmean ∗ gmoe
Non-linear ranges are hard for humans to grok


Introduction
Statistics - I
Statistics - II

So...


Introduction
Statistics - I
Statistics - II

Further reading

Web Performance - Not a Simple Number
http://www.netforecast.com/Articles/BCR+C25+Web+Performance+-+Not+A+Simple+Number.pdf

Revisiting statistics for web performance (introduction to
Log-Normal)
http://home.pacbell.net/ciemo/statistics/WhatDoYouMean.pdf

Random Sampling
http://tech.bluesmoon.info/2010/01/statistics-of-performance-measurement.html

Khan Academy’s tutorials on statistics
http://khanacademy.com/

Learning about Statistical Learning
http://measuringmeasures.blogspot.com/2010/01/learning-about-statistical-learning.html

Wikipedia articles on Random Sampling, Central Tendency,
Standard Error, Confounding, Means and IQR


Introduction
Statistics - I
Statistics - II

Summary

Choose a reasonable sample size and sampling factor
Tune sample size for minimal margin of error
Decide based on your data whether to use mode, median
or one of the means
Figure out whether your data is Normal, Log-Normal or
something else
Filter out anomalous outliers


Introduction
Statistics - I
Statistics - II

contact me

Philip Tellis
philip@bluesmoon.info
bluesmoon.info
@bluesmoon


Introduction
Statistics - I
Statistics - II

Photo credits

http://www.flickr.com/photos/leoffreitas/332360959/ by leoffreitas
http://www.flickr.com/photos/cobalt/56500295/ by cobalt123
http://www.flickr.com/photos/sophistechate/4264466015/ by Lisa
Brewster
http://www.flickr.com/photos/nchoz/243216008/ by nchoz


Introduction
Statistics - I
Statistics - II

List of ﬁgures

http://en.wikipedia.org/wiki/File:Standard_deviation_diagram.svg
http://en.wikipedia.org/wiki/File:Normal_Distribution_PDF.svg
http://en.wikipedia.org/wiki/File:KilroySchematic.svg
http://en.wikipedia.org/wiki/File:Boxplot_vs_PDF.png


The Statistics of Web Performance Measurement

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Ähnlich wie The Statistics of Web Performance Measurement

Ähnlich wie The Statistics of Web Performance Measurement (20)

Mehr von Philip Tellis

Mehr von Philip Tellis (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

The Statistics of Web Performance Measurement