SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Downloaden Sie, um offline zu lesen
1
Abstract
Claims coming from human medical observational studies, when tested rigorously,
most often fail to replicate. Whereas randomized clinical trials replicate over 80%
of the time, medical observational studies replicate only 10 to 20% of the time. Multiple
re-test studies reported JAMA failed to replicate. For example in the early 1990s,
Vitamin E was reported to protect against heart attacks. Large, well-conducted
randomized clinical trials did not replicate this claim. The claim that Type A Personality
leads to heart attacks failed to replicate in two separate studies, yet the myth still lives.
Clearly, there are systematic problems with how observational studies are conducted
and analyzed that need to be identified and fixed. Edwards Deming, the most famous
quality expert ever, says that any problem with a failed process is not the fault of the
workers, scientists conducting observational studies, but of management. Funding
agencies and journal editors need to fix a clearly broken process. Technical problems
are identified. Tough management solution are proposed. A simple statistical analysis
strategy is presented. Many human health problems can only be examined using
observational data. Our proposals, technical and managerial, should lead to more
reliable claims along with fair ways to judge their reliability.
NISS
22
Contact
Information
Stan Young
National Institute of Statistical Sciences
www.niss.org
young@niss.org
919 685 9328
NISS
NISS 3
Hayek, 1974, Nobel Lecture
It is often difficult enough for the expert,
and certainly in many instances impossible
for the layman, to distinguish between
legitimate and illegitimate claims advanced
in the name of science.
It is often difficult enough for the expert,
and certainly in many instances impossible
for the layman, to distinguish between
legitimate and illegitimate claims advanced
in the name of science.
NISS 4
Hayek (2)
...much effort will have to be directed toward debunking
such arrogations*, some of which have by now become
the vested interests of established university departments.
*Claims without proper foundation
5
Reliability of Literature Claims
S. Stanley Young
National Institute of Statistical Sciences
Young@niss.org, 919 685 9328
VIP Lecture
NISS
66
Science point of view
What is the meaning of life?
What is real?
What is reproducible?
Fooled (fooling) by randomness?
NISS
77
The Players
1. The workers – scientists
2. The communicators –
a. PR people
b. Bloggers
c. Reporters
d. Science writers
3. The consumers – public, regulatory
agencies, trial lawyers
4. The management – funding agencies,
journal editors
NISS
88
The Worker is not the Problem.
W. Edwards Deming,
the most visionary innovator ever on quality control, said
The worker is not the problem.
The problem is at the top! Management!
To Deming, blaming the workers—individual researchers—
is as incorrect as it is useless.
Bringing the system under control is the responsibility
of those managing it.
NISS
9
Problems with observational studies
“Everything is dangerous”
1. Data staging
2. No written analysis protocol
3. Multiple testing
4. Multiple modeling
5. Uncorrected bias
6. Self-serving paper writing
7. Self-serving press release
8. Actually believe the claims
9NISS
Assertion : Every study is positive
Data Staging
 Bias
Multiple testing
Multiple model searching
Any or all will lead to essentially all
observational studies being positive!
10NISS
11
First, data staging
Stan:
Why do you think data staging is a big issue?
Because it can be done in myriad ways, is
rarely documented, and is usually not
reproducible?
David Madigan
11NISS
12
Multiple Testing: P-value, t-test
Population,
real or theoretical
Two samples,
random
NISS
10-sided dice experiment
12/25/12
NISS
14
How do you get a “p < 0.05”?
Answer: Ask lots of questions.
61 questions
95% chance of
a positive study!
NISS
1515
Let’s run an epidemiology study!
10-sided dice simulation:
Coffee causes X.
NISS
1616
P-value plot – 60 p-values.
NISS
17
Cereal determines human gender
Really?????
17NISS
NISS
19
2 Cancer types, 48 pesticies, 96 questions
Three claims made, only one appears valid.
NISS
Multiple Modeling/Bias
Take a simple difference
Paper, data, claim
American Cancer Society Cancer Prevention Study II
No association with CV deaths, corrected for PM2.5.
Ozone associated with respiratory deaths.
21
Jerrett et al. Large Search Space
22
Covariate Adjustment
27
= 128
23
Large search space
32 x 128 = 4,096
The data used in this paper is not available.
We are asked to trust that analysis decisions
were good and claims are robust.
Any adjustment for multiple testing
and/or multiple modeling renders p-values NS.
24
2525
Crisis in science? 2011, 2012
Nature, 2012
Significance, 2011
NISS
26
Claims from observational studies tested in RCTs
27
What can funding agencies do?
Fund data generation and analysis
separately.
Fund replication studies.
Require data used in publication be
posted on publication.
28
What can journal editors do?
Quality by inspection, p-value < 0.05, is not working.
(Many workers are gaming the system.)
Management needs to re-design the system to build
quality into the product.
Papers following good manufacturing procedures and
addressing important questions, should be accepted
without regard to statistical significance.
Require data used in publication be posted on
publication.
29
What can you, the consumer, do?
(not much)
1. Be skeptical of observational study claims.
2. Read the actual paper.
3. Count the claims under consideration.
4. Ask for the data set.
5. Letter to editor : voodoo stats and trust me
science. (Educate editors.)
6. Write to funding agency.
7. Write to congressman.
29NISS
30
“New” p-value plot, - log10
(p-value), Dmitri Zaykin
31
Conclusions
Most science claims do not replicate.
Deming: Don't blame the worker
(or expect them to adopt different methods).
Funding agencies and journal editors have been AWOL.
Require data to be placed in depository on publication.
32
One irate study evaluator, 2012
Mens Sana Monograph, 2012
3333
Contact
Information
Stan Young
National Institute of Statistical Sciences
www.niss.org
young@niss.org
919 685 9328
NISS
Ozone/PM2.5 Acute Deaths LA
34
3535
Suggestions for effective management
of observational studies
No funding / publication without:
1. Public posting protocol before study initiation.
2. Public posting of data set on publication.
3. Clear statement of questions under consideration.
4. Conform to “Reproducible Research” guidelines.
5. Any claims must be independently replicated.
NISS
3636
Congressional Management:
True Science Transparency Act
Any federal agency proposing rule-making or legislation
shall specifically name each document used to support
the proposed rule-making or legislation and provide
all data used in said document for viewing by the public.
See also OSTP memorandum, 22Feb2013.
NISS

Weitere ähnliche Inhalte

Was ist angesagt?

Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...jemille6
 
Mayo: Day #2 slides
Mayo: Day #2 slidesMayo: Day #2 slides
Mayo: Day #2 slidesjemille6
 
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...jemille6
 
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist PerformanceProbing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performancejemille6
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy jemille6
 
Gelman psych crisis_2
Gelman psych crisis_2Gelman psych crisis_2
Gelman psych crisis_2jemille6
 
Replication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden ControversiesReplication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden Controversiesjemille6
 
Severe Testing: The Key to Error Correction
Severe Testing: The Key to Error CorrectionSevere Testing: The Key to Error Correction
Severe Testing: The Key to Error Correctionjemille6
 
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...jemille6
 
D. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severelyD. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severelyjemille6
 
Mayo &amp; parker spsp 2016 june 16
Mayo &amp; parker   spsp 2016 june 16Mayo &amp; parker   spsp 2016 june 16
Mayo &amp; parker spsp 2016 june 16jemille6
 
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"jemille6
 
Exploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory ResearchExploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory Researchjemille6
 
Senn repligate
Senn repligateSenn repligate
Senn repligatejemille6
 
Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively jemille6
 
Improve your study with pre-registration
Improve your study with pre-registrationImprove your study with pre-registration
Improve your study with pre-registrationDorothy Bishop
 
The Scientific Method
The Scientific Method The Scientific Method
The Scientific Method simonandisa
 
Oom not doom a novel method for improving psychological science, Bradley Woods
Oom not doom   a novel method for improving psychological science, Bradley WoodsOom not doom   a novel method for improving psychological science, Bradley Woods
Oom not doom a novel method for improving psychological science, Bradley WoodsNZ Psychological Society
 
Effect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of HomosexualityEffect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of HomosexualityJacob Wilson
 

Was ist angesagt? (20)

Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
Mayo: Evidence as Passing a Severe Test (How it Gets You Beyond the Statistic...
 
Mayo: Day #2 slides
Mayo: Day #2 slidesMayo: Day #2 slides
Mayo: Day #2 slides
 
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
D. Mayo: The Science Wars and the Statistics Wars: scientism, popular statist...
 
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist PerformanceProbing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
Probing with Severity: Beyond Bayesian Probabilism and Frequentist Performance
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
 
Gelman psych crisis_2
Gelman psych crisis_2Gelman psych crisis_2
Gelman psych crisis_2
 
Replication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden ControversiesReplication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden Controversies
 
Severe Testing: The Key to Error Correction
Severe Testing: The Key to Error CorrectionSevere Testing: The Key to Error Correction
Severe Testing: The Key to Error Correction
 
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
D. G. Mayo: The Replication Crises and its Constructive Role in the Philosoph...
 
D. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severelyD. G. Mayo: Your data-driven claims must still be probed severely
D. G. Mayo: Your data-driven claims must still be probed severely
 
Mayo &amp; parker spsp 2016 june 16
Mayo &amp; parker   spsp 2016 june 16Mayo &amp; parker   spsp 2016 june 16
Mayo &amp; parker spsp 2016 june 16
 
AstroLecture2
AstroLecture2AstroLecture2
AstroLecture2
 
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
Fusion Confusion? Comments on Nancy Reid: "BFF Four-Are we Converging?"
 
Exploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory ResearchExploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory Research
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively
 
Improve your study with pre-registration
Improve your study with pre-registrationImprove your study with pre-registration
Improve your study with pre-registration
 
The Scientific Method
The Scientific Method The Scientific Method
The Scientific Method
 
Oom not doom a novel method for improving psychological science, Bradley Woods
Oom not doom   a novel method for improving psychological science, Bradley WoodsOom not doom   a novel method for improving psychological science, Bradley Woods
Oom not doom a novel method for improving psychological science, Bradley Woods
 
Effect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of HomosexualityEffect of fMRI Scan Presentation on Perceptions of Homosexuality
Effect of fMRI Scan Presentation on Perceptions of Homosexuality
 

Ähnlich wie 02 young vpi lecture 2014

Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianLaure Wynants
 
Machine learning applied in health
Machine learning applied in healthMachine learning applied in health
Machine learning applied in healthBig Data Colombia
 
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docxMAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docxsmile790243
 
Principles of data_science
Principles of data_sciencePrinciples of data_science
Principles of data_sciencetvk66866
 
Biomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_EditorialBiomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_EditorialRathnam Chaguturu
 
The Many Lives of Data
The Many Lives of DataThe Many Lives of Data
The Many Lives of Dataljmcneill33
 
Big Data: Learning from MIMIC- Celi
Big Data: Learning from MIMIC- CeliBig Data: Learning from MIMIC- Celi
Big Data: Learning from MIMIC- Celiintensivecaresociety
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health careAravindharamanan S
 
Questions for knowledge creators
Questions for knowledge creatorsQuestions for knowledge creators
Questions for knowledge creatorsRichard Bookman
 
MIT Medical Evidence Bootcamp for Journalists 2012
MIT Medical Evidence Bootcamp for Journalists 2012MIT Medical Evidence Bootcamp for Journalists 2012
MIT Medical Evidence Bootcamp for Journalists 2012GarySchwitzer
 
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...John Hoey
 
Research methodology
Research methodologyResearch methodology
Research methodologyTosif Ahmad
 
SDR 24_2 proof
SDR 24_2 proofSDR 24_2 proof
SDR 24_2 proofminamorris
 
Believable Science
Believable ScienceBelievable Science
Believable Sciencetamsin.rose
 
TITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsProTITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsProTakishaPeck109
 
Title of the paper121 report on geriatricspro
Title of the paper121 report on geriatricsproTitle of the paper121 report on geriatricspro
Title of the paper121 report on geriatricsproraju957290
 
A Guide To Using Qualitative Research Methodology
A Guide To Using Qualitative Research MethodologyA Guide To Using Qualitative Research Methodology
A Guide To Using Qualitative Research MethodologyJim Jimenez
 

Ähnlich wie 02 young vpi lecture 2014 (20)

Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatistician
 
Machine learning applied in health
Machine learning applied in healthMachine learning applied in health
Machine learning applied in health
 
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docxMAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
MAKING SENSE OFSTATISTICSWhat statistics tell you an.docx
 
Principles of data_science
Principles of data_sciencePrinciples of data_science
Principles of data_science
 
Biomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_EditorialBiomedical Research_House of Cards_Editorial
Biomedical Research_House of Cards_Editorial
 
The Many Lives of Data
The Many Lives of DataThe Many Lives of Data
The Many Lives of Data
 
Big Data: Learning from MIMIC- Celi
Big Data: Learning from MIMIC- CeliBig Data: Learning from MIMIC- Celi
Big Data: Learning from MIMIC- Celi
 
Nurses and Data Science
Nurses and Data ScienceNurses and Data Science
Nurses and Data Science
 
Alopecia Areata Research Summit Summary
Alopecia Areata Research Summit SummaryAlopecia Areata Research Summit Summary
Alopecia Areata Research Summit Summary
 
Research questions
Research questions Research questions
Research questions
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health care
 
Questions for knowledge creators
Questions for knowledge creatorsQuestions for knowledge creators
Questions for knowledge creators
 
MIT Medical Evidence Bootcamp for Journalists 2012
MIT Medical Evidence Bootcamp for Journalists 2012MIT Medical Evidence Bootcamp for Journalists 2012
MIT Medical Evidence Bootcamp for Journalists 2012
 
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
Published Research, Flawed, Misleading, Nefarious - Use of Reporting Guidelin...
 
Research methodology
Research methodologyResearch methodology
Research methodology
 
SDR 24_2 proof
SDR 24_2 proofSDR 24_2 proof
SDR 24_2 proof
 
Believable Science
Believable ScienceBelievable Science
Believable Science
 
TITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsProTITLE OF THE PAPER121Report on GeriatricsPro
TITLE OF THE PAPER121Report on GeriatricsPro
 
Title of the paper121 report on geriatricspro
Title of the paper121 report on geriatricsproTitle of the paper121 report on geriatricspro
Title of the paper121 report on geriatricspro
 
A Guide To Using Qualitative Research Methodology
A Guide To Using Qualitative Research MethodologyA Guide To Using Qualitative Research Methodology
A Guide To Using Qualitative Research Methodology
 

Mehr von jemille6

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”jemille6
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilismjemille6
 
D. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfD. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfjemille6
 
reid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfjemille6
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022jemille6
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inferencejemille6
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?jemille6
 
What's the question?
What's the question? What's the question?
What's the question? jemille6
 
The neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metasciencejemille6
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...jemille6
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Twojemille6
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...jemille6
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testingjemille6
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredgingjemille6
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probabilityjemille6
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severityjemille6
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)jemille6
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)jemille6
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...jemille6
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (jemille6
 

Mehr von jemille6 (20)

“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”“The importance of philosophy of science for statistical science and vice versa”
“The importance of philosophy of science for statistical science and vice versa”
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
 
D. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdfD. Mayo JSM slides v2.pdf
D. Mayo JSM slides v2.pdf
 
reid-postJSM-DRC.pdf
reid-postJSM-DRC.pdfreid-postJSM-DRC.pdf
reid-postJSM-DRC.pdf
 
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
Errors of the Error Gatekeepers: The case of Statistical Significance 2016-2022
 
Causal inference is not statistical inference
Causal inference is not statistical inferenceCausal inference is not statistical inference
Causal inference is not statistical inference
 
What are questionable research practices?
What are questionable research practices?What are questionable research practices?
What are questionable research practices?
 
What's the question?
What's the question? What's the question?
What's the question?
 
The neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and MetascienceThe neglected importance of complexity in statistics and Metascience
The neglected importance of complexity in statistics and Metascience
 
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
Mathematically Elegant Answers to Research Questions No One is Asking (meta-a...
 
On Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the TwoOn Severity, the Weight of Evidence, and the Relationship Between the Two
On Severity, the Weight of Evidence, and the Relationship Between the Two
 
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
Revisiting the Two Cultures in Statistical Modeling and Inference as they rel...
 
Comparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple TestingComparing Frequentists and Bayesian Control of Multiple Testing
Comparing Frequentists and Bayesian Control of Multiple Testing
 
Good Data Dredging
Good Data DredgingGood Data Dredging
Good Data Dredging
 
The Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of ProbabilityThe Duality of Parameters and the Duality of Probability
The Duality of Parameters and the Duality of Probability
 
Error Control and Severity
Error Control and SeverityError Control and Severity
Error Control and Severity
 
The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)The Statistics Wars and Their Causalities (refs)
The Statistics Wars and Their Causalities (refs)
 
The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)The Statistics Wars and Their Casualties (w/refs)
The Statistics Wars and Their Casualties (w/refs)
 
On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...On the interpretation of the mathematical characteristics of statistical test...
On the interpretation of the mathematical characteristics of statistical test...
 
The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (The role of background assumptions in severity appraisal (
The role of background assumptions in severity appraisal (
 

Kürzlich hochgeladen

Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 

Kürzlich hochgeladen (20)

Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 

02 young vpi lecture 2014

  • 1. 1 Abstract Claims coming from human medical observational studies, when tested rigorously, most often fail to replicate. Whereas randomized clinical trials replicate over 80% of the time, medical observational studies replicate only 10 to 20% of the time. Multiple re-test studies reported JAMA failed to replicate. For example in the early 1990s, Vitamin E was reported to protect against heart attacks. Large, well-conducted randomized clinical trials did not replicate this claim. The claim that Type A Personality leads to heart attacks failed to replicate in two separate studies, yet the myth still lives. Clearly, there are systematic problems with how observational studies are conducted and analyzed that need to be identified and fixed. Edwards Deming, the most famous quality expert ever, says that any problem with a failed process is not the fault of the workers, scientists conducting observational studies, but of management. Funding agencies and journal editors need to fix a clearly broken process. Technical problems are identified. Tough management solution are proposed. A simple statistical analysis strategy is presented. Many human health problems can only be examined using observational data. Our proposals, technical and managerial, should lead to more reliable claims along with fair ways to judge their reliability. NISS
  • 2. 22 Contact Information Stan Young National Institute of Statistical Sciences www.niss.org young@niss.org 919 685 9328 NISS
  • 3. NISS 3 Hayek, 1974, Nobel Lecture It is often difficult enough for the expert, and certainly in many instances impossible for the layman, to distinguish between legitimate and illegitimate claims advanced in the name of science. It is often difficult enough for the expert, and certainly in many instances impossible for the layman, to distinguish between legitimate and illegitimate claims advanced in the name of science.
  • 4. NISS 4 Hayek (2) ...much effort will have to be directed toward debunking such arrogations*, some of which have by now become the vested interests of established university departments. *Claims without proper foundation
  • 5. 5 Reliability of Literature Claims S. Stanley Young National Institute of Statistical Sciences Young@niss.org, 919 685 9328 VIP Lecture NISS
  • 6. 66 Science point of view What is the meaning of life? What is real? What is reproducible? Fooled (fooling) by randomness? NISS
  • 7. 77 The Players 1. The workers – scientists 2. The communicators – a. PR people b. Bloggers c. Reporters d. Science writers 3. The consumers – public, regulatory agencies, trial lawyers 4. The management – funding agencies, journal editors NISS
  • 8. 88 The Worker is not the Problem. W. Edwards Deming, the most visionary innovator ever on quality control, said The worker is not the problem. The problem is at the top! Management! To Deming, blaming the workers—individual researchers— is as incorrect as it is useless. Bringing the system under control is the responsibility of those managing it. NISS
  • 9. 9 Problems with observational studies “Everything is dangerous” 1. Data staging 2. No written analysis protocol 3. Multiple testing 4. Multiple modeling 5. Uncorrected bias 6. Self-serving paper writing 7. Self-serving press release 8. Actually believe the claims 9NISS
  • 10. Assertion : Every study is positive Data Staging  Bias Multiple testing Multiple model searching Any or all will lead to essentially all observational studies being positive! 10NISS
  • 11. 11 First, data staging Stan: Why do you think data staging is a big issue? Because it can be done in myriad ways, is rarely documented, and is usually not reproducible? David Madigan 11NISS
  • 12. 12 Multiple Testing: P-value, t-test Population, real or theoretical Two samples, random NISS
  • 14. 14 How do you get a “p < 0.05”? Answer: Ask lots of questions. 61 questions 95% chance of a positive study! NISS
  • 15. 1515 Let’s run an epidemiology study! 10-sided dice simulation: Coffee causes X. NISS
  • 16. 1616 P-value plot – 60 p-values. NISS
  • 17. 17 Cereal determines human gender Really????? 17NISS
  • 18. NISS
  • 19. 19 2 Cancer types, 48 pesticies, 96 questions Three claims made, only one appears valid.
  • 21. Paper, data, claim American Cancer Society Cancer Prevention Study II No association with CV deaths, corrected for PM2.5. Ozone associated with respiratory deaths. 21
  • 22. Jerrett et al. Large Search Space 22
  • 24. Large search space 32 x 128 = 4,096 The data used in this paper is not available. We are asked to trust that analysis decisions were good and claims are robust. Any adjustment for multiple testing and/or multiple modeling renders p-values NS. 24
  • 25. 2525 Crisis in science? 2011, 2012 Nature, 2012 Significance, 2011 NISS
  • 26. 26 Claims from observational studies tested in RCTs
  • 27. 27 What can funding agencies do? Fund data generation and analysis separately. Fund replication studies. Require data used in publication be posted on publication.
  • 28. 28 What can journal editors do? Quality by inspection, p-value < 0.05, is not working. (Many workers are gaming the system.) Management needs to re-design the system to build quality into the product. Papers following good manufacturing procedures and addressing important questions, should be accepted without regard to statistical significance. Require data used in publication be posted on publication.
  • 29. 29 What can you, the consumer, do? (not much) 1. Be skeptical of observational study claims. 2. Read the actual paper. 3. Count the claims under consideration. 4. Ask for the data set. 5. Letter to editor : voodoo stats and trust me science. (Educate editors.) 6. Write to funding agency. 7. Write to congressman. 29NISS
  • 30. 30 “New” p-value plot, - log10 (p-value), Dmitri Zaykin
  • 31. 31 Conclusions Most science claims do not replicate. Deming: Don't blame the worker (or expect them to adopt different methods). Funding agencies and journal editors have been AWOL. Require data to be placed in depository on publication.
  • 32. 32 One irate study evaluator, 2012 Mens Sana Monograph, 2012
  • 33. 3333 Contact Information Stan Young National Institute of Statistical Sciences www.niss.org young@niss.org 919 685 9328 NISS
  • 35. 3535 Suggestions for effective management of observational studies No funding / publication without: 1. Public posting protocol before study initiation. 2. Public posting of data set on publication. 3. Clear statement of questions under consideration. 4. Conform to “Reproducible Research” guidelines. 5. Any claims must be independently replicated. NISS
  • 36. 3636 Congressional Management: True Science Transparency Act Any federal agency proposing rule-making or legislation shall specifically name each document used to support the proposed rule-making or legislation and provide all data used in said document for viewing by the public. See also OSTP memorandum, 22Feb2013. NISS