Nara guidelines-jr

Guidelines for statistical analysis
of arthroplasty data
Jonas Ranstam PhD

All modern medical
science publications

60%

Source: Pubmed

Randomized clinical
trial of streptomycin
and tubercolosis (1948)
Bradford Hill & MRC

Source: Pubmed

Cohort study of smoking
and lung cancer (1954)
Bradford Hill & Doll

Case-control study of
smoking and lung
cancer (1950)

Randomized clinical
trial of streptomycin
and tubercolosis (1948)
Bradford Hill & MRC

Source: Pubmed

Cohort study of smoking
and lung cancer (1954)

Case-control study of
smoking and lung
cancer (1950)
Bradford Hill & Doll Evidence based medicine
Systematic reviews and
Randomized clinical meta analyses
trial of streptomycin (The Cochrane
and tubercolosis (1948) collaboration 1993)
Bradford Hill & MRC

Source: Pubmed

Evidence levels
1. Strong evidence from at least one systematic review of multiple
well-designed randomized controlled trials.
2. Strong evidence from at least one properly designed randomized
controlled trial of appropriate size.
3. Evidence from well-designed trials such as pseudo-randomized
or non-randomized trials, cohort studies, time series or matched
case-controlled studies.
4. Evidence from well-designed non-experimental studies from more
than one center or research group or from case reports.
5. Opinions of respected authorities, based on clinical evidence,
descriptive studies or reports of expert committees.

Any claim coming from an observational
study is most likely to be wrong
12 randomised trials have tested 52 observational claims (about the
effects of vitamine B6, B12, C, D, E, beta carotene, hormone
replacement therapy, folic acid and selenium).

“They all confirmed no claims in the direction of the observational
claim. We repeat that figure: 0 out of 52. To put it in another way,
100% of the observational claims failed to replicate. In fact, five claims
(9.6%) are statistically significant in the opposite direction to the
observational claim.”

Stanley Young and Allan Karr, Significance, September 2011

Guidelines
Systematic reviews and meta analyses benefit from a
standardized, transparent and accurate reporting of studies.

STREGA, STROBE, STARD, SQUIRE, MOOSE, PRISMA,
GNOSIS, TREND, ORION, COREQ, QUOROM, REMARK,
CONSORT...

Internal validity

Internal validity by design (blocking of
Experi- known risk factors and randomization of
mental unknown)

Potential for confounding: none

Study
design

Internal validity by statistical analysis
Obser- (confounding adjustment for known and
vational measured risk factors)

Potential for confounding: massive

Confounder (or case-mix) adjustment
How much of the variation in endpoints can be explained by known
factors, and how much has unknown causes?

Unexplained variation (1-r2)

95%-99% Arthroplasty revision

85%-95% EQ-5D, SF36

50%-70% Coronary heart disease risk

Aims and characteristics

Hypothesis generation Pre-specified hypotheses

Exploration Confirmation

Academic analysis freedom Legislation, regulatory
guidelines

Uncertainty tolerance Uncertainty intolerance

Aetiology Study scope Treatment

Research areas

Experi-
Laboratory experiments Randomized clinical
mental
trials

Study
design

Obser-
vational Epidemiological Patient register
studies studies


Analysis strategies and publication guidelines

Experi-
Laboratory experiments Randomized clinical
mental
trials
ARRIVE
CONSORT
Study
design

Obser-
vational Epidemiological Patient register
studies studies
STROBE ?


Analysis strategies and publication guidelines
NARA

The Nordic Arthroplasty Register Association (NARA) study group
decided in September 2009 at a meeting in Lund, Sweden, to
develop guidelines for statistical analysis of arthroplasty quality
register data.

The guidelines were published In April, 2011.

Acta Orthopaedica 2011;82:253-267.

The NARA Guidelines
A collaborative effort by

1. Independent observations (Pulkkinen & Mäkelä )
2. Competing risks (Mehnert & Pedersen)
3. Proportional hazard rates (Espehaug & Furnes)
4. Rankable revision risk estimates (Ranstam & Kärrholm)

The NARA study group

LI Havelin, LB Engesæter AM Fenstad (NO)
S Overgaard, A Odgaard (DA)
A Eskelinen, V Remes, P Virolainen (FI)
G Garellick, M Sundberg, O Robertsson (SE)

The NARA Guidelines
have been developed to

– define a NARA consensus view on statistical analysis
– describe foreseeable problems and recommend solutions
– improve the comparability of reports
– facilitate reading, writing and reviewing of reports

The NARA Guidelines
are not intended to

– stifle creativity
– promote uniformity

NARA Guidelines
Structure

1. Review of underlying assumptions
2. Consequences of departure from these assumptions
3. Verifying that the assumptions are fulfilled
4. Possible methodological alternatives
5. Practical recommendations

1 – Independent observations

Independent observations

Pseudoreplication
Two rats are sampled
from a population with a
mean (μ) of 50 and a
standard deviation (σ) of
10, and ten measure-
ments of an arbitrary
outcome variable are
made on each rat.
- Biological variability.
- Measurement errors.

Ripatti S and Palmgren J. Estimation of multivariate frailty models using penalized
partial likelihood. Biometrics 2000, 56:1016-1022.

Schwarzer G, Schumacher M, Maurer TB and Ochsner PE. Statistical analysis of
failure times in total joint replacement. J Clin Epidemiol 2001, 54:997-1003.

Visuri T, Turula KB, Pulkkinen P and Nevalainen J. Survivorship of hip prosthesis in
primary arthrosis: influence of bilaterality and interoperative time in 45,000 hip
prostheses from the Finnish endoprosthesis register. Acta Orthop Scand 2002,
73:287-290.

Robertsson O and Ranstam J. No bias of ignored bilaterality when analysing the
revision risk of knee prostheses: Analysis of a population based sample of 44,590
patients with 55,298 knee prostheses from the national Swedish Knee Arthroplasty
Register. BMC Musculoskeletal Disorders 2003, 4:1.

Lie SA, Engesaeter LB, Havelin LI, Gjessing HK and Vollset SE. Dependency issues
in survival analyses of 55,782 primary hip replacements from 47,355 patients. Stat
Med. 2004 Oct 30;23(20):3227-40.

Recommendations

The inclusion of bilateral observations in analysis of knee- and hip
prosthesis survival does not seem to affect the reliability of the
results, but this need not be the case with other types of
prostheses.

The number of bilateral observations should always be presented.
Sensitivity analyses can be useful when the results robustness
against departures from the assumption of independence.

Competing risks
Kaplan-Meier analysis

The statistical analysis of arthroplasty failure is primarily about the
length of time from primary operation to revision.

Not all patients are revised during follow up. The length of follow
up usually differ, and some patients are withdrawn before end of
follow up; these observations are “censored”.

With Kaplan-Meier analysis censored observations are included in
the analysis, until their censoring.

Competing risks
Kaplan-Meier-analys

Competing risks
Kaplan-Meier assumption
The time at which a patient gets a revision is assumed to be
independent of the censoring mechanism. Other events than the one
studied are competing risk events if they alter the risk of being
revised.

Re-revision

Revision
Primary Death
operation
Death

Competing risks
Alternative method: Cumulative incidence
The probability that a particular event, such as revision or a
competing risk event, has occurred before a given time.
The cumulative incidence function for an event of interest can be
calculated by appropriately accounting for the presence of competing
risk events.
Censored observations can be included in the analysis.

Competing risks
With competing risk events Kaplan-Meier estimates will overestimate
the real failure risk.

Competing risks
Recommendations

With competing risks the Kaplan-Meier failure function over-estimates
the revision risk.

An alternative method can be to calculate the cumulative incidence of
revisions. However, from the patient's perspective this may be less
relevant.

The presence of competing risks should always be presented and
both the number and types of censored observations should be
described.

Competing risks

- do not condition on the future;
- do not regard individuals at risk after they have died; and
- stick to this world.

3 – Proportional hazard rates

Proportional hazard rates
Adjustment for case-mix effects

Risk estimates can be adjusted for the confounding effect of an
imbalance of known and measured risk factors using statistical
modeling.

This is usually achieved using a Cox model.

Cox model

The Cox model is a regression model for revision times (or more
specifically, hazard rates).

The purpose of the model is to explore the simultaneous effects
of different factors on the revision risk.

The Cox model is based on the
assumption of proportional
hazards (conditional revision
risks). It is also known as the
“proportional hazards model”.

The assumption of proportional
hazards is not always fulfilled.

Schoenfeld residual

The covariate value for the implant that failed minus its expected
value.

Consequences

When the effect of one or more of the prognostic factors in a Cox
regression model changes over time, the average hazard ratio for
such a prognostic factor is under- or overestimated.

Weighted estimation in Cox regression (Schemper's method) is a
parsimonious alternative without additional parameters.

Recommendations

Non-proportional hazards may be an interesting finding in itself.

In register studies with large sample sizes, analyses can usually
be performed by partitioning follow up time, by stratification, or by
including time dependent covariates.

If the average relative risk is of interest, Schemper's method can
be an alternative.

It should always be evaluated whether the assumption on
proportional hazard is fulfilled or not. Testing the Schoenfeld
residuals may be a solution.

4 – Rankable revision risk estimates

Rankable revision risk estimates
Performance monitoring

Swedish Knee Arthroplasty Register Annual Report 2011

Ranking is a problematic method for comparisons. If ranking is
performed, the uncertainty in the ranks should be clearly indicated,
preferably with confidence intervals.

Consequences of misclassification (registration errors) should be
evaluated and case-mix effects considered as far as possible.

Finally
Revisions and updates

The guidelines should be open for revision and updating.

They have been developed as a consensus and should evolve as
a consensus.

Experience and feedback is essential.

Forward your suggestions to the NARA study group!

Guidelines
Guidelines are particularly prevalent in clinical trials.

CONSORT

ICH E9 - Statistical Principles for Clinical Trials

EMA Points to Consider, on multiplicity, baseline covariates,
superiority and non-inferiority, etc. and similar documents from the
FDA

Etc.

Nara guidelines-jr

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (16)

Andere mochten auch

Andere mochten auch (8)

Ähnlich wie Nara guidelines-jr

Ähnlich wie Nara guidelines-jr (20)

Mehr von Jonas Ranstam PhD

Mehr von Jonas Ranstam PhD (20)

Nara guidelines-jr