SlideShare ist ein Scribd-Unternehmen logo
1 von 66
Downloaden Sie, um offline zu lesen
On p-values
Maarten van Smeden
Annual Julius Symposium 2016
About
• statistician by training
• phd (2016): diagnostic research in absence gold standard
(JC)
• post-doc: biostatistics / epidemiological methods (JC)
About this workshop
p-value?
ASA statement: why and what?
p-value alternatives?
Go to:
pvalue.presenterswall.nl
Point of departure
skeptical whenever I see a p-value
The term “inference”
p-value?
Formally defined by
The pioneers
Ronald Aylmer Fisher 

(1890 - 1962)
Jerzy Neyman 

(1894-1981)
Egon Pearson 

(1895-1980)
p-value ≥ α
“no effect”
p-value < α
“effect!”
α = .05, unless…
… the p-value fails
“arguably significant” (P = 0.07)
“direction heading to significance” (P = 0.10)
“flirting with conventional levels of significance” (P > 0.1)
“marginally significant” (P ≥ 0.1)
convenient sample from: https://mchankins.wordpress.com/2013/04/21/still-not-significant-2/
listing 509 expressions for non-significant results at α = .05 level (24 October 2016)
+ 23!!! supplementary files
Wasserstein & Lazar (2016) The ASA's Statement on p-Values: 

Context, Process, and Purpose, The American Statistician, 70:2, 129-133
A few quotes (1)
“The ASA has not previously taken positions on specific
matters of statistical practice.”

nb. founded in 1839
“Nothing in the ASA statement is new.”
from the ASA Statement
A few quotes (2)
“… process was lengthier and more controversial than
anticipated.”
“… the statement articulates in non-technical terms a few select
principles that could improve the conduct or interpretation of
quantitative science, according to widespread consensus in the
statistical community."
from the ASA Statement
p-value?

why?
Go to
pvalue.presenterswall.nl
Why do we need a statement?
‘“It’s science’s dirtiest secret: The ‘scientific method’ of testing
hypotheses by statistical analysis stands on a flimsy
foundation.”’
Quoting Siegfried (2010), Odds Are, It’s Wrong: Science Fails to Face the Shortcomings of Statistics, Science News, 177, 26.
from the ASA Statement: Wasserstein & Lazar (2016) The ASA's Statement on p-Values: 

Context, Process, and Purpose, The American Statistician, 70:2, 129-133
OK, but why now?
“… highly visible discussions over the last few years”
“The statistical community has been deeply concerned about
issues of reproducibility and replicability …”
from the ASA statement
In popular media
http://www.vox.com/2016/3/15/11225162/p-value-simple-definition-hacking
(~ 50 million unique visitors monthly)
The social sciences
Drastic measures…
NHST = Null hypothesis significance testing
P-value increasingly central in reporting
From: Chavalarias et al. JAMA. 2016;315(11):1141-1148, doi:10.1001/jama.2016.1952
Using text-mining >1.6 million abstracts
In the large (‘big’) data era
“With a combination of large datasets, confounding, flexibility in
analytical choices …, and superimposed selective reporting
bias, using a P < 0.05 threshold to declare “success,” …. 

means next to nothing.”
From ASA supplementary material, response by Ioannidis.
To summarise: why?
• p-values and the P < .05 rule are at the core of inference in
today’s science (social, biomedical, …)
• there is growing concern that these inference are often wrong
• perhaps, if we understand p-values better, we’ll be less
often wrong
p-value?

why?

what?
The statement: 6 principles
1. P-values can indicate how incompatible the data are with a specified
statistical model.
2. P-values do not measure the probability that the studied hypothesis is
true, or the probability that the data were produced by random chance
alone.
3. Scientific conclusions and business or policy decisions should not be
based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of an
effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence
regarding a model or hypothesis.
from the ASA statement
Statistical model?
• every method of statistical inference relies on a web of
assumptions which together can be viewed as a ‘statistical
model’
• the tested hypothesis is one of these assumptions. Often a
‘zero-effect’ called ‘null hypothesis’
About assumptions
the calculation of p-values always relies on assumptions
besides the hypothesis tested. It is easy to ignore/forget those
assumptions while analysing.
Your assumptions are your windows on the world.
Scrub them off every once in a while, or the light
won't come in.
Alan Alda
The statement: 6 principles
1. P-values can indicate how incompatible the data are with a specified
statistical model.
2. P-values do not measure the probability that the studied hypothesis
is true, or the probability that the data were produced by random
chance alone.
3. Scientific conclusions and business or policy decisions should not be
based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of an
effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence
regarding a model or hypothesis.
from the ASA statement
From a probability point of view
p-value*: P(Data|Hypothesis)
is not: P(Hypothesis|Data)
*Somewhat simplified, correct notation would be: P(T(X) ≥ x | Hypothesis)
Does it matter?
P(Death|Handgun)
= 5% to 20%*
P(Handgun|Death)
= 0.028%**
* from New York Times (http://www.nytimes.com article published: 2008/04/03/)
** from CBS StatLine (concerning deaths and registered gun crimes in 2015 in the Netherlands)
If there only was a way…
P(Data|Hypothesis)
P(Hypothesis|Data)
There is…
reverend Thomas Bayes

(1702-1761)
P(H|D) =
P(D|H) P(H)
P(D)
The statement: 6 principles
1. P-values can indicate how incompatible the data are with a specified
statistical model.
2. P-values do not measure the probability that the studied hypothesis is
true, or the probability that the data were produced by random chance
alone.
3. Scientific conclusions and business or policy decisions should not be
based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of an
effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence
regarding a model or hypothesis.
from the ASA statement
On bright-line rules
“Practices that reduce data analysis or scientific
inference to mechanical “bright-line” rules (such as “p <
0.05”) for justifying scientific claims or conclusions can
lead to erroneous beliefs and poor decision making. A
conclusion does not immediately become “true” on
one side of the divide and “false” on the other.”
from the ASA statement
If p ~ .05
D Colquhoun (2014). An investigation of the false discovery rate and the misinterpretation of p-values. R.Soc.opensci.1:140216.
“If you want to avoid making a fool of yourself very often, do not
regard anything greater than p < 0.001 as a demonstration that
you have discovered something”
If p > .05
The statement: 6 principles
1. P-values can indicate how incompatible the data are with a specified
statistical model.
2. P-values do not measure the probability that the studied hypothesis is
true, or the probability that the data were produced by random chance
alone.
3. Scientific conclusions and business or policy decisions should not be
based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of an
effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence
regarding a model or hypothesis.
from the ASA statement
The issue of pre-specified hypotheses
From: http://compare-trials.org/ accessed on November 20 2016
Ed Yong (2012). Replication studies: Bad copy, Nature. Data credits to: D Fanelli.
Why is this enormous positivity?
If you torture the data long enough,
it will confess to anything
Ronald Coase
besides journal editors requirement for p < .05
Multiple (potential) comparisons
aka

- p-hacking

- data fishing

- data dredging

- multiple testing

- multiplicity

- significance chasing

- significance questing

- selective inference

- etc.

Selective reporting
“Whenever a researcher chooses what to present based on
statistical results, valid interpretation of those results is
severely compromised if the reader is not informed of the choice
and its basis. Researchers should disclose the number of
hypotheses explored during the study, all data collection
decisions, all statistical analyses conducted, and all p-
values computed. Valid scientific conclusions based on p-
values and related statistics cannot be drawn without at least
knowing how many and which analyses were conducted, and
how those analyses (including p-values) were selected for
reporting.”
from the ASA statement
The statement: 6 principles
1. P-values can indicate how incompatible the data are with a specified
statistical model.
2. P-values do not measure the probability that the studied hypothesis is
true, or the probability that the data were produced by random chance
alone.
3. Scientific conclusions and business or policy decisions should not be
based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of
an effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence
regarding a model or hypothesis.
from the ASA statement
About effect size
• statistical significance does not imply practical importance
• to understand practical importance we need information on
the effect size
• Is the p-value a good measure for effect size?
Dance of the p-values
https://www.youtube.com/watch?v=5OL1RqHrZQ8&t=10s
Credits to Professor Geoff Cumming
The statement: 6 principles
1. P-values can indicate how incompatible the data are with a specified
statistical model.
2. P-values do not measure the probability that the studied hypothesis
is true, or the probability that the data were produced by random
chance alone.
3. Scientific conclusions and business or policy decisions should not
be based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of an
effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence
regarding a model or hypothesis.
from the ASA Statement
P-values in isolation
“Researchers should recognize that a p-value without context
or other evidence provides limited information. For example, a
p-value near 0.05 taken by itself offers only weak evidence
against the null hypothesis. Likewise, a relatively large p-value
does not imply evidence in favour of the null hypothesis; many
other hypotheses may be equally or more consistent with the
observed data. For these reasons, data analysis should not
end with the calculation of a p-value when other approaches
are appropriate and feasible.”
from the ASA statement
The statement: 6 principles
1. P-values can indicate how incompatible the data are with a specified
statistical model.
2. P-values do not measure the probability that the studied hypothesis
is true, or the probability that the data were produced by random
chance alone.
3. Scientific conclusions and business or policy decisions should not
be based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of an
effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence
regarding a model or hypothesis.
from the ASA statement
Agreement reached?
“you can believe me that had it been any stronger, then all but
one of the statisticians would have resigned.”
“If only the rest could have agreed with me, we would have a
much stronger statement.”
from SlideShare, by Stephen Senn: P Values and the art of herding cats (accessed on Oct 30 2016)
Stephen Senn, involved in the ASA statement
From a practical point of view
if you work with p-values (derived from the 6 ASA principles):
1. think carefully about the underlying assumptions
2. avoid statements about the truth of the tested hypothesis
3. avoid strong statements about effect based solely on p < .
05 or absence of effect based solely on p > .05
4. report no. and sequence of analyses; avoid data torture
5. avoid statements about effect size based on p-value
6. if feasible, use additional information from other inferential
tools
p-value?

why?

what?

p-value alternatives?
Other approaches
• Methods that emphasise estimation rather than testing
• confidence intervals
• prediction intervals
• credible intervals
• Bayesian methods
• Alternative measures of evidence
• likelihood ratios
• Bayes factors
• Other approaches
• Decision-theoretic modelling
• False discovery rates
From ASA statement
A too short introduction to Bayesian inference
Remember Bayes?
reverend Thomas Bayes

(1702-1761)
Using Bayes theorem
P(θ|D) =
P(D|θ) P(θ)
P(D)
P(θ|D) ∝ P(D|θ) P(θ)
“likelihood” “prior distribution”
“posterior distribution”
Rational for Bayesian inference
the posterior distribution (θ|D) is “more informative” than the
likelihood (D|θ)
However:
“Proponents of the “Bayesian revolution” should be wary of
chasing het another chimera: an apparently universal inference
procedure. A better path would be to promote both an
understanding of various devices in the “statistical toolbox” and
informed judgment to select among these.”

Gigerenzer and Marewski (2015), Surrogate Science: The Idol of a Universal Method for Scientific Inference. Journal of Management
p-value?

why?

what?

p-value alternatives?

some final remarks
The words of the pioneer
No scientific worker has a fixed level of
significance at which from year to year, and in
all circumstances, he rejects hypotheses; he rather
gives his mind to each particular case in the light of
his evidence and his ideas.
Ronald Fisher
Many initiatives to improve science…
see: http://www.scienceintransition.nl/english
and reduce waste
~ 85% of all health research is being avoidably “wasted”
see also: http://blogs.bmj.com/bmj/2016/01/14/paul-glasziou-and-iain-chalmers-is-85-of-health-research-really-wasted/,
and: Lancet’s 2014 series on increasing value, reducing waste (incl video’s etc.): http://www.thelancet.com/series/research
Conclusion
• statistical inference is inherently difficult; we should avoid
making a fool of ourselves too often
• p-values can be useful tools for inference; most often, p-
values should not be the ‘star of the inference show’
• bright line rules such as p < .05 give a false sense of
scientific objectivity
• like to play around with data? Me too! Think twice before you
publish such explorations; if you do, be honest and
transparent in reporting

Some random thoughts
• inference is thought as a primarily mathematical or
computational problem, it should not.
• we should ban the term “significant” from scientific output
for describing effects that are accompanied with p < .05.
• in applied statistics education, we should invest more time
in discussing various forms of inference (e.g., Bayesian
inference) and their merits and pitfalls
Go to:
pvalue.presenterswall.nl
Points for discussion
• is there a need for changing the way we do inference?
• if so, how and what do we change?
• education?
• journals?
• should we downplay the role of p < .05 in scientific output?

Weitere ähnliche Inhalte

Was ist angesagt?

Confidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overviewConfidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overviewRizwan S A
 
Hypothesis testing and p values 06
Hypothesis testing and p values  06Hypothesis testing and p values  06
Hypothesis testing and p values 06DrZahid Khan
 
Sensitivity, specificity, positive and negative predictive
Sensitivity, specificity, positive and negative predictiveSensitivity, specificity, positive and negative predictive
Sensitivity, specificity, positive and negative predictiveMusthafa Peedikayil
 
Basic Biostatistics and Data managment
Basic Biostatistics and Data managment Basic Biostatistics and Data managment
Basic Biostatistics and Data managment Tadesse Awoke Ayele
 
Tests of hypothesis (Statistical testing)
Tests of hypothesis (Statistical testing)Tests of hypothesis (Statistical testing)
Tests of hypothesis (Statistical testing)Rizwan S A
 
Bias and confounding
Bias and confoundingBias and confounding
Bias and confoundingIkram Ullah
 
Estimation in statistics
Estimation in statisticsEstimation in statistics
Estimation in statisticsRabea Jamal
 
Bias and confounding in Cohort and case control study
Bias and confounding in Cohort and case control studyBias and confounding in Cohort and case control study
Bias and confounding in Cohort and case control studyIkram Ullah
 
Survival analysis
Survival analysisSurvival analysis
Survival analysisHar Jindal
 
Development and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsDevelopment and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsMaarten van Smeden
 
Statistical significance
Statistical significanceStatistical significance
Statistical significanceMai Ngoc Duc
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingArnab Sadhu
 

Was ist angesagt? (20)

Confounding.pptx
Confounding.pptxConfounding.pptx
Confounding.pptx
 
Confidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overviewConfidence Intervals: Basic concepts and overview
Confidence Intervals: Basic concepts and overview
 
Types of bias
Types of biasTypes of bias
Types of bias
 
Hypothesis testing and p values 06
Hypothesis testing and p values  06Hypothesis testing and p values  06
Hypothesis testing and p values 06
 
Sensitivity, specificity, positive and negative predictive
Sensitivity, specificity, positive and negative predictiveSensitivity, specificity, positive and negative predictive
Sensitivity, specificity, positive and negative predictive
 
P value
P valueP value
P value
 
Basic Biostatistics and Data managment
Basic Biostatistics and Data managment Basic Biostatistics and Data managment
Basic Biostatistics and Data managment
 
Tests of hypothesis (Statistical testing)
Tests of hypothesis (Statistical testing)Tests of hypothesis (Statistical testing)
Tests of hypothesis (Statistical testing)
 
Mc Nemar
Mc NemarMc Nemar
Mc Nemar
 
Bias and confounding
Bias and confoundingBias and confounding
Bias and confounding
 
Bias in clinical research
Bias in clinical research Bias in clinical research
Bias in clinical research
 
Estimation in statistics
Estimation in statisticsEstimation in statistics
Estimation in statistics
 
Part 2 Cox Regression
Part 2 Cox RegressionPart 2 Cox Regression
Part 2 Cox Regression
 
Bias and confounding in Cohort and case control study
Bias and confounding in Cohort and case control studyBias and confounding in Cohort and case control study
Bias and confounding in Cohort and case control study
 
Tests of significance
Tests of significanceTests of significance
Tests of significance
 
Survival analysis
Survival analysisSurvival analysis
Survival analysis
 
Bias in Research
Bias in ResearchBias in Research
Bias in Research
 
Development and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsDevelopment and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutions
 
Statistical significance
Statistical significanceStatistical significance
Statistical significance
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 

Ähnlich wie On p-values

"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”jemille6
 
Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively jemille6
 
Mayo minnesota 28 march 2 (1)
Mayo minnesota 28 march 2 (1)Mayo minnesota 28 march 2 (1)
Mayo minnesota 28 march 2 (1)jemille6
 
Controversy Over the Significance Test Controversy
Controversy Over the Significance Test ControversyControversy Over the Significance Test Controversy
Controversy Over the Significance Test Controversyjemille6
 
importance of P value and its uses in the realtime Significance
importance of P value and its uses in the realtime Significanceimportance of P value and its uses in the realtime Significance
importance of P value and its uses in the realtime SignificanceSukumarReddy43
 
The ASA president Task Force Statement on Statistical Significance and Replic...
The ASA president Task Force Statement on Statistical Significance and Replic...The ASA president Task Force Statement on Statistical Significance and Replic...
The ASA president Task Force Statement on Statistical Significance and Replic...jemille6
 
P-Value "Reforms": Fixing Science or Threat to Replication and Falsification
P-Value "Reforms": Fixing Science or Threat to Replication and FalsificationP-Value "Reforms": Fixing Science or Threat to Replication and Falsification
P-Value "Reforms": Fixing Science or Threat to Replication and Falsificationjemille6
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testingpraveen3030
 
Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. EPINOR
 
Surviving statistics lecture 1
Surviving statistics lecture 1Surviving statistics lecture 1
Surviving statistics lecture 1MikeBlyth
 
Data Science interview questions of Statistics
Data Science interview questions of Statistics Data Science interview questions of Statistics
Data Science interview questions of Statistics Learnbay Datascience
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy jemille6
 
Overview of Statistical Concepts
Overview of Statistical ConceptsOverview of Statistical Concepts
Overview of Statistical ConceptsMichael770443
 

Ähnlich wie On p-values (20)

"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
 
Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively
 
Sti2018 jws
Sti2018 jwsSti2018 jws
Sti2018 jws
 
Mayo minnesota 28 march 2 (1)
Mayo minnesota 28 march 2 (1)Mayo minnesota 28 march 2 (1)
Mayo minnesota 28 march 2 (1)
 
Amsterdam 2008
Amsterdam 2008Amsterdam 2008
Amsterdam 2008
 
Controversy Over the Significance Test Controversy
Controversy Over the Significance Test ControversyControversy Over the Significance Test Controversy
Controversy Over the Significance Test Controversy
 
importance of P value and its uses in the realtime Significance
importance of P value and its uses in the realtime Significanceimportance of P value and its uses in the realtime Significance
importance of P value and its uses in the realtime Significance
 
The ASA president Task Force Statement on Statistical Significance and Replic...
The ASA president Task Force Statement on Statistical Significance and Replic...The ASA president Task Force Statement on Statistical Significance and Replic...
The ASA president Task Force Statement on Statistical Significance and Replic...
 
Statistical significance
Statistical significanceStatistical significance
Statistical significance
 
P-Value "Reforms": Fixing Science or Threat to Replication and Falsification
P-Value "Reforms": Fixing Science or Threat to Replication and FalsificationP-Value "Reforms": Fixing Science or Threat to Replication and Falsification
P-Value "Reforms": Fixing Science or Threat to Replication and Falsification
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error.
 
Statistics
StatisticsStatistics
Statistics
 
Surviving statistics lecture 1
Surviving statistics lecture 1Surviving statistics lecture 1
Surviving statistics lecture 1
 
Statistics
StatisticsStatistics
Statistics
 
Data Science interview questions of Statistics
Data Science interview questions of Statistics Data Science interview questions of Statistics
Data Science interview questions of Statistics
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervals
 
Overview of Statistical Concepts
Overview of Statistical ConceptsOverview of Statistical Concepts
Overview of Statistical Concepts
 
Amsterdam 11.06.2008
Amsterdam 11.06.2008Amsterdam 11.06.2008
Amsterdam 11.06.2008
 

Mehr von Maarten van Smeden

Rage against the machine learning 2023
Rage against the machine learning 2023Rage against the machine learning 2023
Rage against the machine learning 2023Maarten van Smeden
 
A gentle introduction to AI for medicine
A gentle introduction to AI for medicineA gentle introduction to AI for medicine
A gentle introduction to AI for medicineMaarten van Smeden
 
Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Maarten van Smeden
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Maarten van Smeden
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Maarten van Smeden
 
Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthMaarten van Smeden
 
Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?Maarten van Smeden
 
Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Maarten van Smeden
 
Five questions about artificial intelligence
Five questions about artificial intelligenceFive questions about artificial intelligence
Five questions about artificial intelligenceMaarten van Smeden
 
Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Maarten van Smeden
 
Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyondMaarten van Smeden
 
Correcting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingCorrecting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingMaarten van Smeden
 
Living systematic reviews: now and in the future
Living systematic reviews: now and in the futureLiving systematic reviews: now and in the future
Living systematic reviews: now and in the futureMaarten van Smeden
 
The statistics of the coronavirus
The statistics of the coronavirusThe statistics of the coronavirus
The statistics of the coronavirusMaarten van Smeden
 

Mehr von Maarten van Smeden (20)

Uncertainty in AI
Uncertainty in AIUncertainty in AI
Uncertainty in AI
 
UMC Utrecht AI Methods Lab
UMC Utrecht AI Methods LabUMC Utrecht AI Methods Lab
UMC Utrecht AI Methods Lab
 
Rage against the machine learning 2023
Rage against the machine learning 2023Rage against the machine learning 2023
Rage against the machine learning 2023
 
A gentle introduction to AI for medicine
A gentle introduction to AI for medicineA gentle introduction to AI for medicine
A gentle introduction to AI for medicine
 
Associate professor lecture
Associate professor lectureAssociate professor lecture
Associate professor lecture
 
Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...Improving epidemiological research: avoiding the statistical paradoxes and fa...
Improving epidemiological research: avoiding the statistical paradoxes and fa...
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...
 
Predictimands
PredictimandsPredictimands
Predictimands
 
Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient health
 
Algorithm based medicine
Algorithm based medicineAlgorithm based medicine
Algorithm based medicine
 
Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?Algorithm based medicine: old statistics wine in new machine learning bottles?
Algorithm based medicine: old statistics wine in new machine learning bottles?
 
Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...Clinical prediction models for covid-19: alarming results from a living syste...
Clinical prediction models for covid-19: alarming results from a living syste...
 
Five questions about artificial intelligence
Five questions about artificial intelligenceFive questions about artificial intelligence
Five questions about artificial intelligence
 
Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19Prediction models for diagnosis and prognosis related to COVID-19
Prediction models for diagnosis and prognosis related to COVID-19
 
Clinical prediction models: development, validation and beyond
Clinical prediction models:development, validation and beyondClinical prediction models:development, validation and beyond
Clinical prediction models: development, validation and beyond
 
Correcting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingCorrecting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confounding
 
Living systematic reviews: now and in the future
Living systematic reviews: now and in the futureLiving systematic reviews: now and in the future
Living systematic reviews: now and in the future
 
Voorspelmodellen en COVID-19
Voorspelmodellen en COVID-19Voorspelmodellen en COVID-19
Voorspelmodellen en COVID-19
 
The statistics of the coronavirus
The statistics of the coronavirusThe statistics of the coronavirus
The statistics of the coronavirus
 

Kürzlich hochgeladen

Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Sheetaleventcompany
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...vidya singh
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...mahaiklolahd
 
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls ServiceGENUINE ESCORT AGENCY
 
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...GENUINE ESCORT AGENCY
 
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...chetankumar9855
 
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableGENUINE ESCORT AGENCY
 
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...parulsinha
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...Arohi Goyal
 
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service AvailableGENUINE ESCORT AGENCY
 
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋TANUJA PANDEY
 
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...khalifaescort01
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Anamika Rawat
 
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service AvailableTrichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service AvailableGENUINE ESCORT AGENCY
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappInaaya Sharma
 
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...hotbabesbook
 
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...GENUINE ESCORT AGENCY
 
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...BhumiSaxena1
 

Kürzlich hochgeladen (20)

Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
 
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
Call Girls Service Jaipur {9521753030} ❤️VVIP RIDDHI Call Girl in Jaipur Raja...
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
 
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
 
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
 
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
 
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
 
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
 
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
 
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
 
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service AvailableTrichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
 
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
🌹Attapur⬅️ Vip Call Girls Hyderabad 📱9352852248 Book Well Trand Call Girls In...
 
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
Model Call Girls In Chennai WhatsApp Booking 7427069034 call girl service 24 ...
 
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
Call Girls Vasai Virar Just Call 9630942363 Top Class Call Girl Service Avail...
 
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
 

On p-values

  • 1. On p-values Maarten van Smeden Annual Julius Symposium 2016
  • 2. About • statistician by training • phd (2016): diagnostic research in absence gold standard (JC) • post-doc: biostatistics / epidemiological methods (JC)
  • 3.
  • 4. About this workshop p-value? ASA statement: why and what? p-value alternatives?
  • 6. Point of departure skeptical whenever I see a p-value
  • 10. The pioneers Ronald Aylmer Fisher 
 (1890 - 1962) Jerzy Neyman 
 (1894-1981) Egon Pearson 
 (1895-1980)
  • 11. p-value ≥ α “no effect” p-value < α “effect!” α = .05, unless…
  • 12. … the p-value fails “arguably significant” (P = 0.07) “direction heading to significance” (P = 0.10) “flirting with conventional levels of significance” (P > 0.1) “marginally significant” (P ≥ 0.1) convenient sample from: https://mchankins.wordpress.com/2013/04/21/still-not-significant-2/ listing 509 expressions for non-significant results at α = .05 level (24 October 2016)
  • 13. + 23!!! supplementary files Wasserstein & Lazar (2016) The ASA's Statement on p-Values: 
 Context, Process, and Purpose, The American Statistician, 70:2, 129-133
  • 14. A few quotes (1) “The ASA has not previously taken positions on specific matters of statistical practice.”
 nb. founded in 1839 “Nothing in the ASA statement is new.” from the ASA Statement
  • 15. A few quotes (2) “… process was lengthier and more controversial than anticipated.” “… the statement articulates in non-technical terms a few select principles that could improve the conduct or interpretation of quantitative science, according to widespread consensus in the statistical community." from the ASA Statement
  • 18. Why do we need a statement? ‘“It’s science’s dirtiest secret: The ‘scientific method’ of testing hypotheses by statistical analysis stands on a flimsy foundation.”’ Quoting Siegfried (2010), Odds Are, It’s Wrong: Science Fails to Face the Shortcomings of Statistics, Science News, 177, 26. from the ASA Statement: Wasserstein & Lazar (2016) The ASA's Statement on p-Values: 
 Context, Process, and Purpose, The American Statistician, 70:2, 129-133
  • 19. OK, but why now? “… highly visible discussions over the last few years” “The statistical community has been deeply concerned about issues of reproducibility and replicability …” from the ASA statement
  • 22. Drastic measures… NHST = Null hypothesis significance testing
  • 23.
  • 24. P-value increasingly central in reporting From: Chavalarias et al. JAMA. 2016;315(11):1141-1148, doi:10.1001/jama.2016.1952 Using text-mining >1.6 million abstracts
  • 25. In the large (‘big’) data era “With a combination of large datasets, confounding, flexibility in analytical choices …, and superimposed selective reporting bias, using a P < 0.05 threshold to declare “success,” …. 
 means next to nothing.” From ASA supplementary material, response by Ioannidis.
  • 26. To summarise: why? • p-values and the P < .05 rule are at the core of inference in today’s science (social, biomedical, …) • there is growing concern that these inference are often wrong • perhaps, if we understand p-values better, we’ll be less often wrong
  • 28. The statement: 6 principles 1. P-values can indicate how incompatible the data are with a specified statistical model. 2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold. 4. Proper inference requires full reporting and transparency. 5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result. 6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis. from the ASA statement
  • 29. Statistical model? • every method of statistical inference relies on a web of assumptions which together can be viewed as a ‘statistical model’ • the tested hypothesis is one of these assumptions. Often a ‘zero-effect’ called ‘null hypothesis’
  • 30. About assumptions the calculation of p-values always relies on assumptions besides the hypothesis tested. It is easy to ignore/forget those assumptions while analysing. Your assumptions are your windows on the world. Scrub them off every once in a while, or the light won't come in. Alan Alda
  • 31. The statement: 6 principles 1. P-values can indicate how incompatible the data are with a specified statistical model. 2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold. 4. Proper inference requires full reporting and transparency. 5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result. 6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis. from the ASA statement
  • 32. From a probability point of view p-value*: P(Data|Hypothesis) is not: P(Hypothesis|Data) *Somewhat simplified, correct notation would be: P(T(X) ≥ x | Hypothesis)
  • 33. Does it matter? P(Death|Handgun) = 5% to 20%* P(Handgun|Death) = 0.028%** * from New York Times (http://www.nytimes.com article published: 2008/04/03/) ** from CBS StatLine (concerning deaths and registered gun crimes in 2015 in the Netherlands)
  • 34. If there only was a way… P(Data|Hypothesis) P(Hypothesis|Data)
  • 35. There is… reverend Thomas Bayes
 (1702-1761) P(H|D) = P(D|H) P(H) P(D)
  • 36. The statement: 6 principles 1. P-values can indicate how incompatible the data are with a specified statistical model. 2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold. 4. Proper inference requires full reporting and transparency. 5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result. 6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis. from the ASA statement
  • 37. On bright-line rules “Practices that reduce data analysis or scientific inference to mechanical “bright-line” rules (such as “p < 0.05”) for justifying scientific claims or conclusions can lead to erroneous beliefs and poor decision making. A conclusion does not immediately become “true” on one side of the divide and “false” on the other.” from the ASA statement
  • 38. If p ~ .05 D Colquhoun (2014). An investigation of the false discovery rate and the misinterpretation of p-values. R.Soc.opensci.1:140216. “If you want to avoid making a fool of yourself very often, do not regard anything greater than p < 0.001 as a demonstration that you have discovered something”
  • 39. If p > .05
  • 40. The statement: 6 principles 1. P-values can indicate how incompatible the data are with a specified statistical model. 2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold. 4. Proper inference requires full reporting and transparency. 5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result. 6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis. from the ASA statement
  • 41. The issue of pre-specified hypotheses From: http://compare-trials.org/ accessed on November 20 2016
  • 42. Ed Yong (2012). Replication studies: Bad copy, Nature. Data credits to: D Fanelli.
  • 43. Why is this enormous positivity? If you torture the data long enough, it will confess to anything Ronald Coase besides journal editors requirement for p < .05
  • 44. Multiple (potential) comparisons aka
 - p-hacking
 - data fishing
 - data dredging
 - multiple testing
 - multiplicity
 - significance chasing
 - significance questing
 - selective inference
 - etc.

  • 45. Selective reporting “Whenever a researcher chooses what to present based on statistical results, valid interpretation of those results is severely compromised if the reader is not informed of the choice and its basis. Researchers should disclose the number of hypotheses explored during the study, all data collection decisions, all statistical analyses conducted, and all p- values computed. Valid scientific conclusions based on p- values and related statistics cannot be drawn without at least knowing how many and which analyses were conducted, and how those analyses (including p-values) were selected for reporting.” from the ASA statement
  • 46. The statement: 6 principles 1. P-values can indicate how incompatible the data are with a specified statistical model. 2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold. 4. Proper inference requires full reporting and transparency. 5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result. 6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis. from the ASA statement
  • 47. About effect size • statistical significance does not imply practical importance • to understand practical importance we need information on the effect size • Is the p-value a good measure for effect size?
  • 48. Dance of the p-values https://www.youtube.com/watch?v=5OL1RqHrZQ8&t=10s Credits to Professor Geoff Cumming
  • 49. The statement: 6 principles 1. P-values can indicate how incompatible the data are with a specified statistical model. 2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold. 4. Proper inference requires full reporting and transparency. 5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result. 6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis. from the ASA Statement
  • 50. P-values in isolation “Researchers should recognize that a p-value without context or other evidence provides limited information. For example, a p-value near 0.05 taken by itself offers only weak evidence against the null hypothesis. Likewise, a relatively large p-value does not imply evidence in favour of the null hypothesis; many other hypotheses may be equally or more consistent with the observed data. For these reasons, data analysis should not end with the calculation of a p-value when other approaches are appropriate and feasible.” from the ASA statement
  • 51. The statement: 6 principles 1. P-values can indicate how incompatible the data are with a specified statistical model. 2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone. 3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold. 4. Proper inference requires full reporting and transparency. 5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result. 6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis. from the ASA statement
  • 52. Agreement reached? “you can believe me that had it been any stronger, then all but one of the statisticians would have resigned.” “If only the rest could have agreed with me, we would have a much stronger statement.” from SlideShare, by Stephen Senn: P Values and the art of herding cats (accessed on Oct 30 2016) Stephen Senn, involved in the ASA statement
  • 53. From a practical point of view if you work with p-values (derived from the 6 ASA principles): 1. think carefully about the underlying assumptions 2. avoid statements about the truth of the tested hypothesis 3. avoid strong statements about effect based solely on p < . 05 or absence of effect based solely on p > .05 4. report no. and sequence of analyses; avoid data torture 5. avoid statements about effect size based on p-value 6. if feasible, use additional information from other inferential tools
  • 55. Other approaches • Methods that emphasise estimation rather than testing • confidence intervals • prediction intervals • credible intervals • Bayesian methods • Alternative measures of evidence • likelihood ratios • Bayes factors • Other approaches • Decision-theoretic modelling • False discovery rates From ASA statement
  • 56. A too short introduction to Bayesian inference Remember Bayes? reverend Thomas Bayes
 (1702-1761)
  • 57. Using Bayes theorem P(θ|D) = P(D|θ) P(θ) P(D) P(θ|D) ∝ P(D|θ) P(θ) “likelihood” “prior distribution” “posterior distribution”
  • 58. Rational for Bayesian inference the posterior distribution (θ|D) is “more informative” than the likelihood (D|θ) However: “Proponents of the “Bayesian revolution” should be wary of chasing het another chimera: an apparently universal inference procedure. A better path would be to promote both an understanding of various devices in the “statistical toolbox” and informed judgment to select among these.”
 Gigerenzer and Marewski (2015), Surrogate Science: The Idol of a Universal Method for Scientific Inference. Journal of Management
  • 60. The words of the pioneer No scientific worker has a fixed level of significance at which from year to year, and in all circumstances, he rejects hypotheses; he rather gives his mind to each particular case in the light of his evidence and his ideas. Ronald Fisher
  • 61. Many initiatives to improve science… see: http://www.scienceintransition.nl/english
  • 62. and reduce waste ~ 85% of all health research is being avoidably “wasted” see also: http://blogs.bmj.com/bmj/2016/01/14/paul-glasziou-and-iain-chalmers-is-85-of-health-research-really-wasted/, and: Lancet’s 2014 series on increasing value, reducing waste (incl video’s etc.): http://www.thelancet.com/series/research
  • 63. Conclusion • statistical inference is inherently difficult; we should avoid making a fool of ourselves too often • p-values can be useful tools for inference; most often, p- values should not be the ‘star of the inference show’ • bright line rules such as p < .05 give a false sense of scientific objectivity • like to play around with data? Me too! Think twice before you publish such explorations; if you do, be honest and transparent in reporting

  • 64. Some random thoughts • inference is thought as a primarily mathematical or computational problem, it should not. • we should ban the term “significant” from scientific output for describing effects that are accompanied with p < .05. • in applied statistics education, we should invest more time in discussing various forms of inference (e.g., Bayesian inference) and their merits and pitfalls
  • 66. Points for discussion • is there a need for changing the way we do inference? • if so, how and what do we change? • education? • journals? • should we downplay the role of p < .05 in scientific output?