SlideShare ist ein Scribd-Unternehmen logo
1 von 111
Downloaden Sie, um offline zu lesen
Association Mapping
                  Through local genealogies


Thomas Mailund
Bioinformatics Research Center
http://www.birc.au.dk/
Gunshot wounds
Car accidents
Smoking induced
lung cancer       “Genetic” Diseases
Cardiovascular
disease
Obesity
Diabetes 2
Alzheimer
Schizophrenia
BRCA1
breast cancer
Cystic fibrosis
Haemophilia
Disease Mapping...
Locate disease-affecting polymorphism

   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Unrealistic Assumptions
We only measure                  -A--          -C-         -A--

“unphased” data
                  --T--   --G-          -G--         -C-
                                 -A--                      -A--
                  --A--   --C-          -G--   -T-   -C-
Unrealistic Assumptions
We only measure                   -A--            -C-            -A--

“unphased” data
                  --T--   --G-           -G--            -C-
                                  -A--                           -A--
                  --A--   --C-           -G--     -T-     -C-




We first need to
infer the phase
                     --T--------G--------A----G--------C---C---A----
                     --A--------C--------A----G--------T---C---A----
Unrealistic Assumptions
We only measure                   -A--            -C-            -A--

“unphased” data
                  --T--   --G-           -G--            -C-
                                  -A--                           -A--
                  --A--   --C-           -G--     -T-     -C-




We first need to
infer the phase
                     --T--------G--------A----G--------C---C---A----
                     --A--------C--------A----G--------T---C---A----


                     --T--------G--------A----G--------T---C---A----
                     --A--------C--------A----G--------C---C---A----
Unrealistic Assumptions
We only measure                   -A--            -C-            -A--

“unphased” data
                  --T--   --G-           -G--            -C-
                                  -A--                           -A--
                  --A--   --C-           -G--     -T-     -C-




We first need to
infer the phase
                     --T--------G--------A----G--------C---C---A----
                     --A--------C--------A----G--------T---C---A----


                     --T--------G--------A----G--------T---C---A----
                     --A--------C--------A----G--------C---C---A----


                     --T--------C--------A----G--------T---C---A----
                     --A--------G--------A----G--------C---C---A----
Unrealistic Assumptions
We only measure                       -A--            -C-            -A--

“unphased” data
                      --T--   --G-           -G--            -C-
                                      -A--                           -A--
                      --A--   --C-           -G--     -T-     -C-




We first need to




                  ?
infer the phase
                         --T--------G--------A----G--------C---C---A----
                         --A--------C--------A----G--------T---C---A----


                         --A--------G--------A----G--------C---C---A----
                         --T--------C--------A----G--------T---C---A----


                         --T--------C--------A----G--------T---C---A----
                         --A--------G--------A----G--------C---C---A----
Disease Mapping...
Markers are locally correlated

   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Disease Mapping...
Search for indirect signals

   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Marker Relatedness
                 Linkage disequilibrium (LD)

Empirical Results                               Theoretical Results




                                      LD (r2)




                                                     Recombination rate

Clark et al. 2003, AJHG 73:285-300.                  Hein et al. 2005
Indirect Association
               “Tag” markers                   Unobserved marker


   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Indirect Association


   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Indirect Association


   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Indirect Association


   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Indirect Association


   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
Indirect
         Multi-Marker
          Association
   Cases (affected)
                        --A--------C--------A----G---X----T---C---A----
                        --T--------G--------A----G---X----C---C---A----
                        --A--------G--------G----G---X----C---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --T--------C--------A----G---X----T---C---A----
                        --T--------C--------A----T---X----T---A---A----

                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------A----G---X----T---C---G----
                        --T--------C--------A----T---X----T---C---A----
                        --A--------C--------A----G---X----T---C---A----
                        --A--------C--------G----T---X----C---A---A----
                        --A--------C--------A----G---X----C---C---G----

Controls (unaffected)
The Ancestral
Recombination Graph




              Hudson 1990, Griffith and Marjoram 1996
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
The Coalescent
   Process
A Reasonable Local Model
    Copyright Ó 2007 by the Genetics Society of America
    DOI: 10.1534/genetics.107.071126



     On Recombination-Induced Multiple and Simultaneous Coalescent Events

          Joanna L. Davies,1 Frantisek Simanc´k, Rune Lyngsø, Thomas Mailund and Jotun Hein
                                   ˇ        ˇı
                                   Department of Statistics, University of Oxford, Oxford, OX1 3TG, United Kingdom
                                                            Manuscript received January 18, 2007
                                                          Accepted for publication October 2, 2007


                                                                   ABSTRACT
                    Coalescent theory deals with the dynamics of how sampled genetic material has spread through a
                 population from a single ancestor over many generations and is ubiquitous in contemporary molecular
                 population genetics. Inherent in most applications is a continuous-time approximation that is derived
                 under the assumption that sample size is small relative to the actual population size. In effect, this
                 precludes multiple and simultaneous coalescent events that take place in the history of large samples. If
                 sequences do not recombine, the number of sequences ancestral to a large sample is reduced sufficiently
                 after relatively few generations such that use of the continuous-time approximation is justified. However,
                 in tracing the history of large chromosomal segments, a large recombination rate per generation will
                 consistently maintain a large number of ancestors. This can create a major disparity between discrete-time
                 and continuous-time models and we analyze its importance, illustrated with model parameters typical of
                 the human genome. The presence of gene conversion exacerbates the disparity and could seriously
                 undermine applications of coalescent theory to complete genomes. However, we show that multiple and
                 simultaneous coalescent events influence global quantities, such as total number of ancestors, but have
                 negligible effect on local quantities, such as linkage disequilibrium. Reassuringly, most applications of the
                 coalescent model with recombination (including association mapping) focus on local quantities.




    K    INGMAN (1982) models the ancestry of a sample
          of sequences with a continuous-time Markov pro-
    cess referred to as the Kingman coalescent. Lineages
                                                                                ulation size, the probability of such events occurring
                                                                                becomes nonnegligible and consequently in these
                                                                                instances the rate of coalescence is underestimated
    collide or coalesce after random exponential waiting                        by Hudson’s continuous-time model. Hudson’s model
A Reasonable Local Model
  • The “back in time” approach (in general)
    means we ignore selection
  • Implicit assumption that the disease is
    selectively neutral
    • Which may or may not be reasonable...
    • Might be okay for late onset diseases...
The ARG as a
 Statistical Model


P(                   )
The ARG as a
 Statistical Model


P( |                 )
The ARG as a
 Statistical Model


P( |                 )
The ARG as a
 Statistical Model


P( |                 )
The ARG as a
   Statistical Model


P( |     , )P(         |)
The ARG as a
      Statistical Model
          lhd( )=
          P( | )=
∫P(   |    , )P(    | )d
The ARG as a
      Statistical Model
          lhd( )=
∫P(   |     , )P(                | )d
          Integration by magic
The ARG as a
      Statistical Model
          lhd( )=
∫P(   |      , )P(              | )d
          Integration by magic
                         statistical sampling
ARG Methods

• Sampling ARGs from the coalescence
  process
• Sampling ARGs conditional on the data
  (importance sampling)
• Sampling parsimonious ARGs conditional
  on the data
ARG Methods
• Sampling ARGs from the coalescence
  process
 •   This is a no go -- you would never sample an
     ARG that can explain the data

• Sampling ARGs conditional on the data
  (importance sampling)
• Sampling parsimonious ARGs conditional
  on the data
ARG Methods
• Sampling ARGs from the coalescence
  process
• Sampling ARGs conditional on the data
  (importance sampling)
 •   Larribe, Lessard and Schork 2002 -- scales to
     tens of individuals and tens of markers

• Sampling parsimonious ARGs conditional
  on the data
ARG Methods
• Sampling parsimonious ARGs conditional on
  the data
 •   Lyngsø, Song & Hein 2005 (calculates parsimonious
     ARGs -- a 2008 paper in press for sampling)

 •   Minichiello & Durbin 2006 (samples parsimonious
     ARGs and scores local genealogies)

 •   Both preferentially selects mutations and
     coalescence events over recombinations

 •   Scales to thousands of individuals and hundreds of
     markers
Local Phylogenies
For each “point” on the chromosome, the ARG
determines a (local) tree:
Local Phylogenies
For each “point” on the chromosome, the ARG
determines a (local) tree:
Local Phylogenies
For each “point” on the chromosome, the ARG
determines a (local) tree:
Local Phylogenies
For each “point” on the chromosome, the ARG
determines a (local) tree:
Changing Phylogenies
Type 1: No change




Type 2: Change in branch lengths




Type 3: Change in topology




                                   From Hein et al. 2005
Trees and LD
Tree similarity




                                       LD r2




                  Recombination rate           Recombination rate
Can we use just the trees?
Clustering on a Tree
           Disease affecting mutation
Clustering on a Tree
  Complete penetrance


          Incomplete penetrance



  Spurious disease
Clustering on a Tree

  25%
              Case/control clustering
              is not random on the tree...
        75%




                             40%
                    60%
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)




                Zöllner & Pritchard 2005
Sampling Trees
(with recombination)
    We only sample the
    process on the left --
    much fewer events




                             Zöllner & Pritchard 2005
Using “Perfect Phylogenies”
 Use the four-gamete test to find regions that
 can be explained by a tree with no recurrent mutations




                                         Mailund, Besenbacher & Schierup 2006
Using “Perfect Phylogenies”
 Build trees for each such region




                                    Mailund, Besenbacher & Schierup 2006
Using “Perfect Phylogenies”
 Each marker splits a sub-tree in two




                                        Mailund, Besenbacher & Schierup 2006
Using “Perfect Phylogenies”
 Each marker splits a sub-tree in two




                                        Mailund, Besenbacher & Schierup 2006
Using “Perfect Phylogenies”
 Each marker splits a sub-tree in two




                                        Mailund, Besenbacher & Schierup 2006
Using “Perfect Phylogenies”



Much faster (and much cruder)

Catches the essential tree structure


                                       Mailund, Besenbacher & Schierup 2006
Scoring the Clustering

                   Red=cases
                   Green=controls



Are the case chromosomes significantly
over-represented in some clusters?
Wild-types



                   Mutation

         Mutants


We can place “mutations” on the tree edges
and partition chromosomes into “mutants”
and “wild-types” and test for different
distributions of cases and controls
Wild-types



                   Mutation

         Mutants


Use average or maximum to score the tree

Average is kosher Bayesian stats; maximum
needs to be corrected for over-fitting.
Blossoc
(BLOck aSSOCiation)
Homepage: www.birc.au.dk/~mailund/Blossoc
                           Command line and
                           graphical user interface
                           (with limited functionality)
Blossoc
(BLOck aSSOCiation)
Homepage: www.birc.au.dk/~mailund/Blossoc


                           Fast enough to analyse
                           tens of thousands of
                           individuals in hundred of
                           thousands of markers in a
                           day or two on a desktop
                           computer...
Localisation Accuracy
          A single causal mutation
 Max BF / min p-value used as point estimate
Localisation Accuracy
           Two causal mutations
 Max BF / min p-value used as point estimate
Thank you!


                More information at
http://www.birc.au.dk/~mailund/association-mapping/

Weitere ähnliche Inhalte

Andere mochten auch

HIS
HIS HIS
HIS GOOFS
 
Scrobbld(1)
Scrobbld(1)Scrobbld(1)
Scrobbld(1)scrobbld
 
КОТ - Новая Площадь
КОТ - Новая ПлощадьКОТ - Новая Площадь
КОТ - Новая Площадьguest4f99c3
 
Inholland Workshop Entrepreneurship & Internet
Inholland Workshop Entrepreneurship & InternetInholland Workshop Entrepreneurship & Internet
Inholland Workshop Entrepreneurship & InternetAyman van Bregt
 
HIS
HIS HIS
HIS GOOFS
 
A L D E A R R O D R I G O2
A L D E A R R O D R I G O2A L D E A R R O D R I G O2
A L D E A R R O D R I G O2pabloaldea
 
Secrets to harnessing innovation
Secrets to harnessing innovationSecrets to harnessing innovation
Secrets to harnessing innovationBabelfish
 
Picture Dictionary3
Picture  Dictionary3Picture  Dictionary3
Picture Dictionary3ziear
 
Diapositivas Proyecto Expositivo
Diapositivas Proyecto ExpositivoDiapositivas Proyecto Expositivo
Diapositivas Proyecto Expositivoguest34659c
 
Scrobbld.com Export Page
Scrobbld.com Export PageScrobbld.com Export Page
Scrobbld.com Export Pagescrobbld
 
Quelosepaelmundoentero
QuelosepaelmundoenteroQuelosepaelmundoentero
QuelosepaelmundoenteroPersio
 

Andere mochten auch (20)

HIS
HIS HIS
HIS
 
Scrobbld(1)
Scrobbld(1)Scrobbld(1)
Scrobbld(1)
 
КОТ - Новая Площадь
КОТ - Новая ПлощадьКОТ - Новая Площадь
КОТ - Новая Площадь
 
Inholland Workshop Entrepreneurship & Internet
Inholland Workshop Entrepreneurship & InternetInholland Workshop Entrepreneurship & Internet
Inholland Workshop Entrepreneurship & Internet
 
Mj Base
Mj BaseMj Base
Mj Base
 
HIS
HIS HIS
HIS
 
A L D E A R R O D R I G O2
A L D E A R R O D R I G O2A L D E A R R O D R I G O2
A L D E A R R O D R I G O2
 
Bonsai
BonsaiBonsai
Bonsai
 
Secrets to harnessing innovation
Secrets to harnessing innovationSecrets to harnessing innovation
Secrets to harnessing innovation
 
Picture Dictionary3
Picture  Dictionary3Picture  Dictionary3
Picture Dictionary3
 
Web20
Web20Web20
Web20
 
Diapositivas Proyecto Expositivo
Diapositivas Proyecto ExpositivoDiapositivas Proyecto Expositivo
Diapositivas Proyecto Expositivo
 
Scrobbld.com Export Page
Scrobbld.com Export PageScrobbld.com Export Page
Scrobbld.com Export Page
 
M J Base
M J  BaseM J  Base
M J Base
 
Las aves
Las avesLas aves
Las aves
 
LA SINERGIA
LA SINERGIALA SINERGIA
LA SINERGIA
 
Quelosepaelmundoentero
QuelosepaelmundoenteroQuelosepaelmundoentero
Quelosepaelmundoentero
 
Desde El Div N
Desde El Div NDesde El Div N
Desde El Div N
 
carnavalesss
carnavalessscarnavalesss
carnavalesss
 
O Monge
O MongeO Monge
O Monge
 

Ähnlich wie Epidemiologisk FredagsmøDe 15 2 2008

Association mapping using local genealogies
Association mapping using local genealogiesAssociation mapping using local genealogies
Association mapping using local genealogiesmailund
 
Brighter tab by paramore
Brighter tab by paramoreBrighter tab by paramore
Brighter tab by paramoreCristian
 
Nirvana, smells like teen spirit
Nirvana, smells like teen spiritNirvana, smells like teen spirit
Nirvana, smells like teen spiritJammy RH
 

Ähnlich wie Epidemiologisk FredagsmøDe 15 2 2008 (7)

Association mapping using local genealogies
Association mapping using local genealogiesAssociation mapping using local genealogies
Association mapping using local genealogies
 
Brighter tab by paramore
Brighter tab by paramoreBrighter tab by paramore
Brighter tab by paramore
 
مذكرة انجليزى kg2
مذكرة انجليزى kg2مذكرة انجليزى kg2
مذكرة انجليزى kg2
 
Nirvana, smells like teen spirit
Nirvana, smells like teen spiritNirvana, smells like teen spirit
Nirvana, smells like teen spirit
 
Asdff
AsdffAsdff
Asdff
 
Asdff
AsdffAsdff
Asdff
 
Asdff
AsdffAsdff
Asdff
 

Mehr von mailund

Chapter 9 divide and conquer handouts with notes
Chapter 9   divide and conquer handouts with notesChapter 9   divide and conquer handouts with notes
Chapter 9 divide and conquer handouts with notesmailund
 
Chapter 9 divide and conquer handouts
Chapter 9   divide and conquer handoutsChapter 9   divide and conquer handouts
Chapter 9 divide and conquer handoutsmailund
 
Chapter 9 divide and conquer
Chapter 9   divide and conquerChapter 9   divide and conquer
Chapter 9 divide and conquermailund
 
Chapter 7 recursion handouts with notes
Chapter 7   recursion handouts with notesChapter 7   recursion handouts with notes
Chapter 7 recursion handouts with notesmailund
 
Chapter 7 recursion handouts
Chapter 7   recursion handoutsChapter 7   recursion handouts
Chapter 7 recursion handoutsmailund
 
Chapter 7 recursion
Chapter 7   recursionChapter 7   recursion
Chapter 7 recursionmailund
 
Chapter 5 searching and sorting handouts with notes
Chapter 5   searching and sorting handouts with notesChapter 5   searching and sorting handouts with notes
Chapter 5 searching and sorting handouts with notesmailund
 
Chapter 5 searching and sorting handouts
Chapter 5   searching and sorting handoutsChapter 5   searching and sorting handouts
Chapter 5 searching and sorting handoutsmailund
 
Chapter 5 searching and sorting
Chapter 5   searching and sortingChapter 5   searching and sorting
Chapter 5 searching and sortingmailund
 
Chapter 4 algorithmic efficiency handouts (with notes)
Chapter 4   algorithmic efficiency handouts (with notes)Chapter 4   algorithmic efficiency handouts (with notes)
Chapter 4 algorithmic efficiency handouts (with notes)mailund
 
Chapter 4 algorithmic efficiency handouts
Chapter 4   algorithmic efficiency handoutsChapter 4   algorithmic efficiency handouts
Chapter 4 algorithmic efficiency handoutsmailund
 
Chapter 4 algorithmic efficiency
Chapter 4   algorithmic efficiencyChapter 4   algorithmic efficiency
Chapter 4 algorithmic efficiencymailund
 
Chapter 3 introduction to algorithms slides
Chapter 3 introduction to algorithms slidesChapter 3 introduction to algorithms slides
Chapter 3 introduction to algorithms slidesmailund
 
Chapter 3 introduction to algorithms handouts (with notes)
Chapter 3 introduction to algorithms handouts (with notes)Chapter 3 introduction to algorithms handouts (with notes)
Chapter 3 introduction to algorithms handouts (with notes)mailund
 
Chapter 3 introduction to algorithms handouts
Chapter 3 introduction to algorithms handoutsChapter 3 introduction to algorithms handouts
Chapter 3 introduction to algorithms handoutsmailund
 
Ku 05 08 2009
Ku 05 08 2009Ku 05 08 2009
Ku 05 08 2009mailund
 
Neural Networks
Neural NetworksNeural Networks
Neural Networksmailund
 
Probability And Stats Intro
Probability And Stats IntroProbability And Stats Intro
Probability And Stats Intromailund
 
Probability And Stats Intro2
Probability And Stats Intro2Probability And Stats Intro2
Probability And Stats Intro2mailund
 
Linear Regression Ex
Linear Regression ExLinear Regression Ex
Linear Regression Exmailund
 

Mehr von mailund (20)

Chapter 9 divide and conquer handouts with notes
Chapter 9   divide and conquer handouts with notesChapter 9   divide and conquer handouts with notes
Chapter 9 divide and conquer handouts with notes
 
Chapter 9 divide and conquer handouts
Chapter 9   divide and conquer handoutsChapter 9   divide and conquer handouts
Chapter 9 divide and conquer handouts
 
Chapter 9 divide and conquer
Chapter 9   divide and conquerChapter 9   divide and conquer
Chapter 9 divide and conquer
 
Chapter 7 recursion handouts with notes
Chapter 7   recursion handouts with notesChapter 7   recursion handouts with notes
Chapter 7 recursion handouts with notes
 
Chapter 7 recursion handouts
Chapter 7   recursion handoutsChapter 7   recursion handouts
Chapter 7 recursion handouts
 
Chapter 7 recursion
Chapter 7   recursionChapter 7   recursion
Chapter 7 recursion
 
Chapter 5 searching and sorting handouts with notes
Chapter 5   searching and sorting handouts with notesChapter 5   searching and sorting handouts with notes
Chapter 5 searching and sorting handouts with notes
 
Chapter 5 searching and sorting handouts
Chapter 5   searching and sorting handoutsChapter 5   searching and sorting handouts
Chapter 5 searching and sorting handouts
 
Chapter 5 searching and sorting
Chapter 5   searching and sortingChapter 5   searching and sorting
Chapter 5 searching and sorting
 
Chapter 4 algorithmic efficiency handouts (with notes)
Chapter 4   algorithmic efficiency handouts (with notes)Chapter 4   algorithmic efficiency handouts (with notes)
Chapter 4 algorithmic efficiency handouts (with notes)
 
Chapter 4 algorithmic efficiency handouts
Chapter 4   algorithmic efficiency handoutsChapter 4   algorithmic efficiency handouts
Chapter 4 algorithmic efficiency handouts
 
Chapter 4 algorithmic efficiency
Chapter 4   algorithmic efficiencyChapter 4   algorithmic efficiency
Chapter 4 algorithmic efficiency
 
Chapter 3 introduction to algorithms slides
Chapter 3 introduction to algorithms slidesChapter 3 introduction to algorithms slides
Chapter 3 introduction to algorithms slides
 
Chapter 3 introduction to algorithms handouts (with notes)
Chapter 3 introduction to algorithms handouts (with notes)Chapter 3 introduction to algorithms handouts (with notes)
Chapter 3 introduction to algorithms handouts (with notes)
 
Chapter 3 introduction to algorithms handouts
Chapter 3 introduction to algorithms handoutsChapter 3 introduction to algorithms handouts
Chapter 3 introduction to algorithms handouts
 
Ku 05 08 2009
Ku 05 08 2009Ku 05 08 2009
Ku 05 08 2009
 
Neural Networks
Neural NetworksNeural Networks
Neural Networks
 
Probability And Stats Intro
Probability And Stats IntroProbability And Stats Intro
Probability And Stats Intro
 
Probability And Stats Intro2
Probability And Stats Intro2Probability And Stats Intro2
Probability And Stats Intro2
 
Linear Regression Ex
Linear Regression ExLinear Regression Ex
Linear Regression Ex
 

Kürzlich hochgeladen

Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy GirlsCall Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girlsnehamumbai
 
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service MumbaiVIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbaisonalikaur4
 
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original PhotosBook Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Hematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes FunctionsHematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes FunctionsMedicoseAcademics
 
Glomerular Filtration and determinants of glomerular filtration .pptx
Glomerular Filtration and  determinants of glomerular filtration .pptxGlomerular Filtration and  determinants of glomerular filtration .pptx
Glomerular Filtration and determinants of glomerular filtration .pptxDr.Nusrat Tariq
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknownarwatsonia7
 
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls ServiceCall Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Servicesonalikaur4
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service SuratCall Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service Suratnarwatsonia7
 
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service MumbaiLow Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbaisonalikaur4
 
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...rajnisinghkjn
 
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
Glomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptxGlomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptxDr.Nusrat Tariq
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiNehru place Escorts
 
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceCollege Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceNehru place Escorts
 
Call Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service NagpurCall Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service NagpurRiya Pathan
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 

Kürzlich hochgeladen (20)

Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy GirlsCall Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
 
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service MumbaiVIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
VIP Call Girls Mumbai Arpita 9910780858 Independent Escort Service Mumbai
 
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original PhotosBook Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
Book Call Girls in Yelahanka - For 7001305949 Cheap & Best with original Photos
 
Hematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes FunctionsHematology and Immunology - Leukocytes Functions
Hematology and Immunology - Leukocytes Functions
 
Glomerular Filtration and determinants of glomerular filtration .pptx
Glomerular Filtration and  determinants of glomerular filtration .pptxGlomerular Filtration and  determinants of glomerular filtration .pptx
Glomerular Filtration and determinants of glomerular filtration .pptx
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
 
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service LucknowCall Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
Call Girl Lucknow Mallika 7001305949 Independent Escort Service Lucknow
 
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls ServiceCall Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
 
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
Call Girls ITPL Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service SuratCall Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
 
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service MumbaiLow Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
 
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
Noida Sector 135 Call Girls ( 9873940964 ) Book Hot And Sexy Girls In A Few C...
 
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jayanagar Just Call 7001305949 Top Class Call Girl Service Available
 
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in green park  DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in green park DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
Glomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptxGlomerular Filtration rate and its determinants.pptx
Glomerular Filtration rate and its determinants.pptx
 
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service ChennaiCall Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
Call Girls Service Chennai Jiya 7001305949 Independent Escort Service Chennai
 
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceCollege Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
 
Call Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service NagpurCall Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
Call Girl Nagpur Sia 7001305949 Independent Escort Service Nagpur
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
 

Epidemiologisk FredagsmøDe 15 2 2008

  • 1. Association Mapping Through local genealogies Thomas Mailund Bioinformatics Research Center http://www.birc.au.dk/
  • 2. Gunshot wounds Car accidents Smoking induced lung cancer “Genetic” Diseases Cardiovascular disease Obesity Diabetes 2 Alzheimer Schizophrenia BRCA1 breast cancer Cystic fibrosis Haemophilia
  • 3. Disease Mapping... Locate disease-affecting polymorphism Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 4. Unrealistic Assumptions We only measure -A-- -C- -A-- “unphased” data --T-- --G- -G-- -C- -A-- -A-- --A-- --C- -G-- -T- -C-
  • 5. Unrealistic Assumptions We only measure -A-- -C- -A-- “unphased” data --T-- --G- -G-- -C- -A-- -A-- --A-- --C- -G-- -T- -C- We first need to infer the phase --T--------G--------A----G--------C---C---A---- --A--------C--------A----G--------T---C---A----
  • 6. Unrealistic Assumptions We only measure -A-- -C- -A-- “unphased” data --T-- --G- -G-- -C- -A-- -A-- --A-- --C- -G-- -T- -C- We first need to infer the phase --T--------G--------A----G--------C---C---A---- --A--------C--------A----G--------T---C---A---- --T--------G--------A----G--------T---C---A---- --A--------C--------A----G--------C---C---A----
  • 7. Unrealistic Assumptions We only measure -A-- -C- -A-- “unphased” data --T-- --G- -G-- -C- -A-- -A-- --A-- --C- -G-- -T- -C- We first need to infer the phase --T--------G--------A----G--------C---C---A---- --A--------C--------A----G--------T---C---A---- --T--------G--------A----G--------T---C---A---- --A--------C--------A----G--------C---C---A---- --T--------C--------A----G--------T---C---A---- --A--------G--------A----G--------C---C---A----
  • 8. Unrealistic Assumptions We only measure -A-- -C- -A-- “unphased” data --T-- --G- -G-- -C- -A-- -A-- --A-- --C- -G-- -T- -C- We first need to ? infer the phase --T--------G--------A----G--------C---C---A---- --A--------C--------A----G--------T---C---A---- --A--------G--------A----G--------C---C---A---- --T--------C--------A----G--------T---C---A---- --T--------C--------A----G--------T---C---A---- --A--------G--------A----G--------C---C---A----
  • 9. Disease Mapping... Markers are locally correlated Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 10. Disease Mapping... Search for indirect signals Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 11. Marker Relatedness Linkage disequilibrium (LD) Empirical Results Theoretical Results LD (r2) Recombination rate Clark et al. 2003, AJHG 73:285-300. Hein et al. 2005
  • 12. Indirect Association “Tag” markers Unobserved marker Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 13. Indirect Association Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 14. Indirect Association Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 15. Indirect Association Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 16. Indirect Association Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 17. Indirect Multi-Marker Association Cases (affected) --A--------C--------A----G---X----T---C---A---- --T--------G--------A----G---X----C---C---A---- --A--------G--------G----G---X----C---C---A---- --A--------C--------A----G---X----T---C---A---- --T--------C--------A----G---X----T---C---A---- --T--------C--------A----T---X----T---A---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------A----G---X----T---C---G---- --T--------C--------A----T---X----T---C---A---- --A--------C--------A----G---X----T---C---A---- --A--------C--------G----T---X----C---A---A---- --A--------C--------A----G---X----C---C---G---- Controls (unaffected)
  • 18. The Ancestral Recombination Graph Hudson 1990, Griffith and Marjoram 1996
  • 19. The Coalescent Process
  • 20. The Coalescent Process
  • 21. The Coalescent Process
  • 22. The Coalescent Process
  • 23. The Coalescent Process
  • 24. The Coalescent Process
  • 25. The Coalescent Process
  • 26. The Coalescent Process
  • 27. The Coalescent Process
  • 28. The Coalescent Process
  • 29. The Coalescent Process
  • 30. The Coalescent Process
  • 31. The Coalescent Process
  • 32. The Coalescent Process
  • 33. The Coalescent Process
  • 34. The Coalescent Process
  • 35. The Coalescent Process
  • 36. The Coalescent Process
  • 37. The Coalescent Process
  • 38. The Coalescent Process
  • 39. The Coalescent Process
  • 40. The Coalescent Process
  • 41. The Coalescent Process
  • 42. The Coalescent Process
  • 43. The Coalescent Process
  • 44. The Coalescent Process
  • 45. The Coalescent Process
  • 46. The Coalescent Process
  • 47. The Coalescent Process
  • 48. The Coalescent Process
  • 49. The Coalescent Process
  • 50. The Coalescent Process
  • 51. The Coalescent Process
  • 52. The Coalescent Process
  • 53. A Reasonable Local Model Copyright Ó 2007 by the Genetics Society of America DOI: 10.1534/genetics.107.071126 On Recombination-Induced Multiple and Simultaneous Coalescent Events Joanna L. Davies,1 Frantisek Simanc´k, Rune Lyngsø, Thomas Mailund and Jotun Hein ˇ ˇı Department of Statistics, University of Oxford, Oxford, OX1 3TG, United Kingdom Manuscript received January 18, 2007 Accepted for publication October 2, 2007 ABSTRACT Coalescent theory deals with the dynamics of how sampled genetic material has spread through a population from a single ancestor over many generations and is ubiquitous in contemporary molecular population genetics. Inherent in most applications is a continuous-time approximation that is derived under the assumption that sample size is small relative to the actual population size. In effect, this precludes multiple and simultaneous coalescent events that take place in the history of large samples. If sequences do not recombine, the number of sequences ancestral to a large sample is reduced sufficiently after relatively few generations such that use of the continuous-time approximation is justified. However, in tracing the history of large chromosomal segments, a large recombination rate per generation will consistently maintain a large number of ancestors. This can create a major disparity between discrete-time and continuous-time models and we analyze its importance, illustrated with model parameters typical of the human genome. The presence of gene conversion exacerbates the disparity and could seriously undermine applications of coalescent theory to complete genomes. However, we show that multiple and simultaneous coalescent events influence global quantities, such as total number of ancestors, but have negligible effect on local quantities, such as linkage disequilibrium. Reassuringly, most applications of the coalescent model with recombination (including association mapping) focus on local quantities. K INGMAN (1982) models the ancestry of a sample of sequences with a continuous-time Markov pro- cess referred to as the Kingman coalescent. Lineages ulation size, the probability of such events occurring becomes nonnegligible and consequently in these instances the rate of coalescence is underestimated collide or coalesce after random exponential waiting by Hudson’s continuous-time model. Hudson’s model
  • 54. A Reasonable Local Model • The “back in time” approach (in general) means we ignore selection • Implicit assumption that the disease is selectively neutral • Which may or may not be reasonable... • Might be okay for late onset diseases...
  • 55. The ARG as a Statistical Model P( )
  • 56. The ARG as a Statistical Model P( | )
  • 57. The ARG as a Statistical Model P( | )
  • 58. The ARG as a Statistical Model P( | )
  • 59. The ARG as a Statistical Model P( | , )P( |)
  • 60. The ARG as a Statistical Model lhd( )= P( | )= ∫P( | , )P( | )d
  • 61. The ARG as a Statistical Model lhd( )= ∫P( | , )P( | )d Integration by magic
  • 62. The ARG as a Statistical Model lhd( )= ∫P( | , )P( | )d Integration by magic statistical sampling
  • 63. ARG Methods • Sampling ARGs from the coalescence process • Sampling ARGs conditional on the data (importance sampling) • Sampling parsimonious ARGs conditional on the data
  • 64. ARG Methods • Sampling ARGs from the coalescence process • This is a no go -- you would never sample an ARG that can explain the data • Sampling ARGs conditional on the data (importance sampling) • Sampling parsimonious ARGs conditional on the data
  • 65. ARG Methods • Sampling ARGs from the coalescence process • Sampling ARGs conditional on the data (importance sampling) • Larribe, Lessard and Schork 2002 -- scales to tens of individuals and tens of markers • Sampling parsimonious ARGs conditional on the data
  • 66. ARG Methods • Sampling parsimonious ARGs conditional on the data • Lyngsø, Song & Hein 2005 (calculates parsimonious ARGs -- a 2008 paper in press for sampling) • Minichiello & Durbin 2006 (samples parsimonious ARGs and scores local genealogies) • Both preferentially selects mutations and coalescence events over recombinations • Scales to thousands of individuals and hundreds of markers
  • 67. Local Phylogenies For each “point” on the chromosome, the ARG determines a (local) tree:
  • 68. Local Phylogenies For each “point” on the chromosome, the ARG determines a (local) tree:
  • 69. Local Phylogenies For each “point” on the chromosome, the ARG determines a (local) tree:
  • 70. Local Phylogenies For each “point” on the chromosome, the ARG determines a (local) tree:
  • 71. Changing Phylogenies Type 1: No change Type 2: Change in branch lengths Type 3: Change in topology From Hein et al. 2005
  • 72. Trees and LD Tree similarity LD r2 Recombination rate Recombination rate
  • 73. Can we use just the trees?
  • 74. Clustering on a Tree Disease affecting mutation
  • 75. Clustering on a Tree Complete penetrance Incomplete penetrance Spurious disease
  • 76. Clustering on a Tree 25% Case/control clustering is not random on the tree... 75% 40% 60%
  • 77. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 78. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 79. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 80. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 81. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 82. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 83. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 84. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 85. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 86. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 87. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 88. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 89. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 90. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 91. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 92. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 93. Sampling Trees (with recombination) Zöllner & Pritchard 2005
  • 94. Sampling Trees (with recombination) We only sample the process on the left -- much fewer events Zöllner & Pritchard 2005
  • 95. Using “Perfect Phylogenies” Use the four-gamete test to find regions that can be explained by a tree with no recurrent mutations Mailund, Besenbacher & Schierup 2006
  • 96. Using “Perfect Phylogenies” Build trees for each such region Mailund, Besenbacher & Schierup 2006
  • 97. Using “Perfect Phylogenies” Each marker splits a sub-tree in two Mailund, Besenbacher & Schierup 2006
  • 98. Using “Perfect Phylogenies” Each marker splits a sub-tree in two Mailund, Besenbacher & Schierup 2006
  • 99. Using “Perfect Phylogenies” Each marker splits a sub-tree in two Mailund, Besenbacher & Schierup 2006
  • 100. Using “Perfect Phylogenies” Much faster (and much cruder) Catches the essential tree structure Mailund, Besenbacher & Schierup 2006
  • 101. Scoring the Clustering Red=cases Green=controls Are the case chromosomes significantly over-represented in some clusters?
  • 102.
  • 103. Wild-types Mutation Mutants We can place “mutations” on the tree edges and partition chromosomes into “mutants” and “wild-types” and test for different distributions of cases and controls
  • 104. Wild-types Mutation Mutants Use average or maximum to score the tree Average is kosher Bayesian stats; maximum needs to be corrected for over-fitting.
  • 105. Blossoc (BLOck aSSOCiation) Homepage: www.birc.au.dk/~mailund/Blossoc Command line and graphical user interface (with limited functionality)
  • 106. Blossoc (BLOck aSSOCiation) Homepage: www.birc.au.dk/~mailund/Blossoc Fast enough to analyse tens of thousands of individuals in hundred of thousands of markers in a day or two on a desktop computer...
  • 107.
  • 108.
  • 109. Localisation Accuracy A single causal mutation Max BF / min p-value used as point estimate
  • 110. Localisation Accuracy Two causal mutations Max BF / min p-value used as point estimate
  • 111. Thank you! More information at http://www.birc.au.dk/~mailund/association-mapping/