11. TON@ONLINEDIALOGUE.COM
Make sure your testing solution has all users!
Users on template: 42186!
Users in the tool: 37652!
Users with code executed: 34312 !
100%!
89%!
81%!
17. TON@ONLINEDIALOGUE.COM
Be able to create behavioral segments!
Typical ecommerce flow example:
ü All users on your website with enough time to take action
ü All users on your website with at least some interaction
ü All users on your website with heavy interaction
ü All users on your website with clear intent to buy
ü All users on your website that are willing to buy
ü All users on your website that succeed in buying
ü All users on your website that return with intent to buy more
Funnel
+
Average
5me
30. TON@ONLINEDIALOGUE.COM
Power
Do not reject H0 Reject H0
H0 is true
Correct decision
J
Type I
False Positive (α)
H0 is false
Type II
False Negative (β)
Correct decision
J
Measured
Reality
31. TON@ONLINEDIALOGUE.COM
Power
New version is
NOT better
New version is
better
New version is
NOT better
Correct decision
J
Type I
False Positive (α)
New version is
better
Type II
False Negative (β)
Correct decision
J
Measured
Reality
32. TON@ONLINEDIALOGUE.COM
Power & Significance rule of thumb
Power
When you start: try to test on pages with a high Power
(>80%) à otherwise you don’t detect effects when there is
an effect to be detected (False negatives).
Significance
When you start: try to test against a high enough
significance level (90%) à otherwise you’ll declare winners,
when in reality there isn’t an effect (False positives).
41. TON@ONLINEDIALOGUE.COM
What does your calculation look like?!
If significant result:!
!
Extra new customers per week!
*!
52 weeks effective!
*!
Average lifetime value!
47. TON@ONLINEDIALOGUE.COM
What is your false discovery rate?!
Significance border: 90%!
100 experiments!
20 significant outcomes!
!
50%!* (it’s a little lower, this is the poor man’s calculation)!
(with every real win the number of experiments without wins becomes lower, which leads to less false positives)!
48. TON@ONLINEDIALOGUE.COM
So not really 50%!
FDR* = (Measured Wins - ((Measured Wins - !
((100% - Confidence Level) * Experiments))!
/ Confidence Level)) / Measured Wins!
!
=!
!
(20 – ((20 – ((100% - 90%) * 100)) / 90%)) / 20!
!
=!
!
44%!* (only if your power on all experiments was 100%)!
(Your Power will be lower, which means you had more real wins, but not measured (false negatives).!
This leads to less experiments without an effect, so the number of false positives will be even lower)!
51. TON@ONLINEDIALOGUE.COM
So all your experiments will bring you:!
Sum of (every winner *!
!
(100% - Type-M error % per winner))!
*!
(100% - FDR%)!
*!
Implementation % (within x months…)!
(assuming every new win is tested on the new default where all earlier wins are implemented)!
53. TON@ONLINEDIALOGUE.COM
So all your experiments will bring you:!
Sum of (every winner *!
!
(100% - Type-M error % per winner))!
*!
(100% - corrected FDR%)!
*!
Implementation % (within x months…)!
(assuming every new win is tested on the new default where all earlier wins are implemented)!
54. TON@ONLINEDIALOGUE.COM
Maximize your growth with ROI limits:!
Value of A/B-testing for Optimization!
___________________________________!
!
Costs of A/B-testing for Optimization!
= ROI!
55. TON@ONLINEDIALOGUE.COM
Finance: are you above or below your ROI limit?!
1. Above: increase budgets!
2. Below: increase knowledge!
3. Still below: decrease budgets!