SlideShare ist ein Scribd-Unternehmen logo
1 von 32
Downloaden Sie, um offline zu lesen
The basics of statistical hypothesis
testing in E-commerce.
By Anatoly Vuets
Agenda
• Why do we use (we should use) statistical hypothesis testing in e-commerce?
• Statistical test: how does it work and its main parameters
• Key features for e-commerce
Why do we need statistical
testing in e-commerce?
We need the right decisions
• A/B tests
• Ad-hoc analyses
• Building models
We need the right decisions
• A question: which of these groups makes more profit?
• What is missing here?
We need the right decisions
• A/B test: which version is better?
Statistical test: let’s recall
the basics!
• Random variable (discrete or continuous)
• Probability distribution function (PMF(x), PDF(x))
• Mean M or μ
• Standard deviation SD or σ
Basics of statistics
Basics of statistics: standard
distribution
Statistical test: uncertainty.
...
...
...
...
..................
True metrics value
Statistical population
Sample Possible samples
...
Observed value Other possible values
(distribution)
Uncertainty
We want to conclude about the statistical population based on single sample that we have
observed
Statistical population Observed sample Possible samples
Why is this important?
Distribution of metrics estimate
Statistical test: basic idea
and main parameters.
• We want to test a statement (typically existence of an effect).
• We have a set of observations (sample) from which we conclude the statement.
• Scenario, in which the statement is TRUE is called alternative hypothesis H1.
• Scenario, in which the statement is FALSE is called null hypothesis H0.
• Estimate the probability to observe the sample we have under H0.
• If the probability is high enough - we conclude that H1 can not be accepted. In the opposite
case, we accept H1.
Idea
... H0/H1𝗧(S)
H0: C = 5% H1: C > 5%
Statistical test
H0 H1
H0 Correct
P: 1 - α
Error T1
P: α
H1 Error T2
P: β
Correct
P: 1 - β
Test T(s)
Truth
• Error T1 - accept H1 when H0 is true.
• Error T2 -accept H0 when H1 is true.
• We would like to have a perfect test (α = 0, β = 0).
However as we shall see later, this is impossible in
practice. Because of this, test design and result
interpretation are crucial for proper decision
making.
Statistical test parameters
A detector can be considered as a binary classifier: passenger does not have (H0) or has metal
objects (H1) (weapon etc.)
The detector has a sensitivity knob (decision boundary).
If the sensitivity is low detector falsely detects metal in α = 5% of cases, but skips metal in β =
67% of cases.
If the sensitivity is high - it falsely detects metal in α = 50%, but skips in β = 0.3% of cases.
Intermediate sensitivity values allow choosing the trade-off between skipping a passenger
who has hidden metal objects (increases probability of an incident) and the service speed
(additional airport costs and lower passenger satisfaction).
Statistical test parameters: metal
detector in airport
Statistical tests based on data achieved from an A/B test can be treated as a classifier which is
supposed to tell whether conversion rate increased (H1) or remained the same (H0).
Question: which trade-off between α and β would you choose?
Statistical test parameters:
increasing web-page conversion rate
• H0: C = 5%, H1: C > 5%
• T(s) = c/n, n = 3600
• significance level = 5%
• P(T|H0) - ?
Theory:
Simulation:
bootstrap
How does statistical test work:
distribution P(T|H0)
How does statistical test works:
significance level and decision boundary
• H0: C = 5%, H1: C > 5%
• T(s) = c/n, n = 3600
• significance level = 5%
• P(T|H1) - ?
Hypothesis H1 consists of
infinite number of
hypotheses: C = 5.1%, C =
5.2% … Which one should
we consider?
• H1: С = 5.5%
(+ 10%, minimum expected boost)
How does statistical test works:
distribution P(T|H1)
How does statistical test work:
significance level vs power
How does statistical test work:
significance level vs power
Important features of statistical
testing in e-commerce
Growth dynamics of metrics
Significance level vs power trade-off
improvement: sample size
Significance level vs power trade-off
improvement: effect size
Question: what should we do if we choose α = 10% but got p.value = 12%?
Uncertainty of p-value
• Key parameters of the statistical test are significance level and power that correspond to the
probability of false detection and probability to miss effect.
• Increased test power can be achieved in two ways: by increasing sample size or by increasing
effect size
• Keep in mind that p-value is a random statistic! It is important to account for its uncertainty.
• Mind that some metrics (like conversion from registration to buyer) may take significant time
to measure
• Anomalies in data may dramatically impact test results
Summary
Conclusions
• In e-commerce, test power is often of the most importance (probability not to miss effect)
• In the case of high-traffic business: the required trade-off between significance level and
power can be easily achieved by increasing the sample size.
• In the case of low-traffic business: focus on features which:
1) are cheap, easy to implement and not risky, or
2) have potentially big effects.
Thank you for your attention!

Weitere ähnliche Inhalte

Was ist angesagt?

Pan europa foods - Project Management
Pan europa foods - Project ManagementPan europa foods - Project Management
Pan europa foods - Project Management
Robbi Palacios
 
case study on ERP success(cadbury) and failure(hershey's)
case study on ERP success(cadbury) and failure(hershey's)case study on ERP success(cadbury) and failure(hershey's)
case study on ERP success(cadbury) and failure(hershey's)
Chitrangada Roy
 
Role of transportation in supply chain mgmt
Role of transportation in supply chain mgmtRole of transportation in supply chain mgmt
Role of transportation in supply chain mgmt
tulasi
 
Case study on amazon
Case study on amazonCase study on amazon
Case study on amazon
Annamalai Ram
 

Was ist angesagt? (20)

Amazon Supply Chain Analysis
Amazon Supply Chain AnalysisAmazon Supply Chain Analysis
Amazon Supply Chain Analysis
 
Logistic
LogisticLogistic
Logistic
 
Eureka Forbes Limited
Eureka Forbes LimitedEureka Forbes Limited
Eureka Forbes Limited
 
Pan europa foods - Project Management
Pan europa foods - Project ManagementPan europa foods - Project Management
Pan europa foods - Project Management
 
Benchmarking the Procurement Function
Benchmarking the Procurement FunctionBenchmarking the Procurement Function
Benchmarking the Procurement Function
 
case study on ERP success(cadbury) and failure(hershey's)
case study on ERP success(cadbury) and failure(hershey's)case study on ERP success(cadbury) and failure(hershey's)
case study on ERP success(cadbury) and failure(hershey's)
 
Information technology in supply chain managemnet
Information technology in supply chain managemnetInformation technology in supply chain managemnet
Information technology in supply chain managemnet
 
Distribution network desing
Distribution network desingDistribution network desing
Distribution network desing
 
Inbound And Outbound Logistics
Inbound And Outbound LogisticsInbound And Outbound Logistics
Inbound And Outbound Logistics
 
Reverse logistic and reverse supply chain
Reverse logistic and reverse supply chainReverse logistic and reverse supply chain
Reverse logistic and reverse supply chain
 
Chap 4 Designing the Distribution Network in a Supply Chain
Chap 4 Designing the Distribution Network in a Supply ChainChap 4 Designing the Distribution Network in a Supply Chain
Chap 4 Designing the Distribution Network in a Supply Chain
 
Digital Technology Used by DHL
 Digital Technology Used by DHL Digital Technology Used by DHL
Digital Technology Used by DHL
 
Role of transportation in supply chain mgmt
Role of transportation in supply chain mgmtRole of transportation in supply chain mgmt
Role of transportation in supply chain mgmt
 
Unilever Supply Chain Management
Unilever Supply Chain ManagementUnilever Supply Chain Management
Unilever Supply Chain Management
 
Case study on amazon
Case study on amazonCase study on amazon
Case study on amazon
 
Amazon
Amazon Amazon
Amazon
 
Value Chain Analysis
Value Chain AnalysisValue Chain Analysis
Value Chain Analysis
 
Hershey's case study.: ERP Implementation Failure
Hershey's case study.: ERP Implementation FailureHershey's case study.: ERP Implementation Failure
Hershey's case study.: ERP Implementation Failure
 
The role of e business in supply chain management
The role of e business in supply chain managementThe role of e business in supply chain management
The role of e business in supply chain management
 
Multimodal transportation& Electronic Data Interchange
Multimodal transportation& Electronic Data InterchangeMultimodal transportation& Electronic Data Interchange
Multimodal transportation& Electronic Data Interchange
 

Ähnlich wie Statistical hypothesis testing in e commerce

How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...
Julián Urbano
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat Tests
Leanleaders.org
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat Tests
Leanleaders.org
 
1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference
Dev Pandey
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
MYRABACSAFRA2
 
Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)
Daniel Augustine
 

Ähnlich wie Statistical hypothesis testing in e commerce (20)

ABTest-20231020.pptx
ABTest-20231020.pptxABTest-20231020.pptx
ABTest-20231020.pptx
 
Elementary Data Analysis with MS Excel_Day-5
Elementary Data Analysis with MS Excel_Day-5Elementary Data Analysis with MS Excel_Day-5
Elementary Data Analysis with MS Excel_Day-5
 
Intro to data science
Intro to data scienceIntro to data science
Intro to data science
 
Introduction To Data Science Using R
Introduction To Data Science Using RIntroduction To Data Science Using R
Introduction To Data Science Using R
 
How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...How Significant is Statistically Significant? The case of Audio Music Similar...
How Significant is Statistically Significant? The case of Audio Music Similar...
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat Tests
 
A05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat TestsA05 Continuous One Variable Stat Tests
A05 Continuous One Variable Stat Tests
 
1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Project two guidelines and rubric.html competencyin this pr
Project two guidelines and rubric.html competencyin this prProject two guidelines and rubric.html competencyin this pr
Project two guidelines and rubric.html competencyin this pr
 
Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)Hypothesis Testing: Proportions (Compare 1:Standard)
Hypothesis Testing: Proportions (Compare 1:Standard)
 
hypothesis teesting
 hypothesis teesting hypothesis teesting
hypothesis teesting
 
Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)Chi square analysis-for_attribute_data_(01-14-06)
Chi square analysis-for_attribute_data_(01-14-06)
 
Business Research Methods Unit V
Business Research Methods Unit VBusiness Research Methods Unit V
Business Research Methods Unit V
 
Hypothsis testing
Hypothsis testingHypothsis testing
Hypothsis testing
 
Meetup_FGVA_Uplift @ Dataiku
Meetup_FGVA_Uplift @ DataikuMeetup_FGVA_Uplift @ Dataiku
Meetup_FGVA_Uplift @ Dataiku
 
Vital QMS Process Validation Statistics - OMTEC 2018
Vital QMS Process Validation Statistics - OMTEC 2018Vital QMS Process Validation Statistics - OMTEC 2018
Vital QMS Process Validation Statistics - OMTEC 2018
 
ISSTA'16 Summer School: Intro to Statistics
ISSTA'16 Summer School: Intro to StatisticsISSTA'16 Summer School: Intro to Statistics
ISSTA'16 Summer School: Intro to Statistics
 
Calculating a Sample Size
Calculating a Sample SizeCalculating a Sample Size
Calculating a Sample Size
 
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...What is the Independent Samples T Test Method of Analysis and How Can it Bene...
What is the Independent Samples T Test Method of Analysis and How Can it Bene...
 

Kürzlich hochgeladen

Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
Kayode Fayemi
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
raffaeleoman
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
amilabibi1
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
Kayode Fayemi
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
Sheetaleventcompany
 

Kürzlich hochgeladen (20)

Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio III
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animals
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Bailey
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
 
ICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdfICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdf
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
 
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
 
Causes of poverty in France presentation.pptx
Causes of poverty in France presentation.pptxCauses of poverty in France presentation.pptx
Causes of poverty in France presentation.pptx
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
 
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatment
 

Statistical hypothesis testing in e commerce

  • 1. The basics of statistical hypothesis testing in E-commerce. By Anatoly Vuets
  • 2. Agenda • Why do we use (we should use) statistical hypothesis testing in e-commerce? • Statistical test: how does it work and its main parameters • Key features for e-commerce
  • 3. Why do we need statistical testing in e-commerce?
  • 4. We need the right decisions • A/B tests • Ad-hoc analyses • Building models
  • 5. We need the right decisions • A question: which of these groups makes more profit? • What is missing here?
  • 6. We need the right decisions • A/B test: which version is better?
  • 7. Statistical test: let’s recall the basics!
  • 8. • Random variable (discrete or continuous) • Probability distribution function (PMF(x), PDF(x)) • Mean M or μ • Standard deviation SD or σ Basics of statistics
  • 9. Basics of statistics: standard distribution
  • 11. ... ... ... ... .................. True metrics value Statistical population Sample Possible samples ... Observed value Other possible values (distribution) Uncertainty
  • 12. We want to conclude about the statistical population based on single sample that we have observed Statistical population Observed sample Possible samples Why is this important?
  • 14. Statistical test: basic idea and main parameters.
  • 15. • We want to test a statement (typically existence of an effect). • We have a set of observations (sample) from which we conclude the statement. • Scenario, in which the statement is TRUE is called alternative hypothesis H1. • Scenario, in which the statement is FALSE is called null hypothesis H0. • Estimate the probability to observe the sample we have under H0. • If the probability is high enough - we conclude that H1 can not be accepted. In the opposite case, we accept H1. Idea
  • 16. ... H0/H1𝗧(S) H0: C = 5% H1: C > 5% Statistical test
  • 17. H0 H1 H0 Correct P: 1 - α Error T1 P: α H1 Error T2 P: β Correct P: 1 - β Test T(s) Truth • Error T1 - accept H1 when H0 is true. • Error T2 -accept H0 when H1 is true. • We would like to have a perfect test (α = 0, β = 0). However as we shall see later, this is impossible in practice. Because of this, test design and result interpretation are crucial for proper decision making. Statistical test parameters
  • 18. A detector can be considered as a binary classifier: passenger does not have (H0) or has metal objects (H1) (weapon etc.) The detector has a sensitivity knob (decision boundary). If the sensitivity is low detector falsely detects metal in α = 5% of cases, but skips metal in β = 67% of cases. If the sensitivity is high - it falsely detects metal in α = 50%, but skips in β = 0.3% of cases. Intermediate sensitivity values allow choosing the trade-off between skipping a passenger who has hidden metal objects (increases probability of an incident) and the service speed (additional airport costs and lower passenger satisfaction). Statistical test parameters: metal detector in airport
  • 19. Statistical tests based on data achieved from an A/B test can be treated as a classifier which is supposed to tell whether conversion rate increased (H1) or remained the same (H0). Question: which trade-off between α and β would you choose? Statistical test parameters: increasing web-page conversion rate
  • 20. • H0: C = 5%, H1: C > 5% • T(s) = c/n, n = 3600 • significance level = 5% • P(T|H0) - ? Theory: Simulation: bootstrap How does statistical test work: distribution P(T|H0)
  • 21. How does statistical test works: significance level and decision boundary
  • 22. • H0: C = 5%, H1: C > 5% • T(s) = c/n, n = 3600 • significance level = 5% • P(T|H1) - ? Hypothesis H1 consists of infinite number of hypotheses: C = 5.1%, C = 5.2% … Which one should we consider? • H1: С = 5.5% (+ 10%, minimum expected boost) How does statistical test works: distribution P(T|H1)
  • 23. How does statistical test work: significance level vs power
  • 24. How does statistical test work: significance level vs power
  • 25. Important features of statistical testing in e-commerce
  • 27. Significance level vs power trade-off improvement: sample size
  • 28. Significance level vs power trade-off improvement: effect size
  • 29. Question: what should we do if we choose α = 10% but got p.value = 12%? Uncertainty of p-value
  • 30. • Key parameters of the statistical test are significance level and power that correspond to the probability of false detection and probability to miss effect. • Increased test power can be achieved in two ways: by increasing sample size or by increasing effect size • Keep in mind that p-value is a random statistic! It is important to account for its uncertainty. • Mind that some metrics (like conversion from registration to buyer) may take significant time to measure • Anomalies in data may dramatically impact test results Summary
  • 31. Conclusions • In e-commerce, test power is often of the most importance (probability not to miss effect) • In the case of high-traffic business: the required trade-off between significance level and power can be easily achieved by increasing the sample size. • In the case of low-traffic business: focus on features which: 1) are cheap, easy to implement and not risky, or 2) have potentially big effects.
  • 32. Thank you for your attention!