SlideShare a Scribd company logo
1 of 34
Download to read offline
Journal Club – Bayes Estimators for Phylogenetic
                   Reconstruction
     Syst. Biol. 60(4), 528 – 540, 2011 doi 10.1093/sysbio/syr021



                           Leonardo de O. Martins

                               University of Vigo



                               July 22, 2011



Leo Martins (Univ. Vigo)           Journal Club               22/7   1 / 12
Outline


1 Distance as a penalty


2 Distances, everywhere


3 No phylogenetics, yet...


4 Trees as points in space


5 To the paper, then




  Leo Martins (Univ. Vigo)   Journal Club   22/7   2 / 12
Statistical Risk


                                      ˆ
The risk ρ associated with a decision θ is the expected loss of this decision
ˆ
θ (which can be, for instance, an estimate of θ).




   Leo Martins (Univ. Vigo)       Journal Club                      22/7   3 / 12
Statistical Risk


                                      ˆ
The risk ρ associated with a decision θ is the expected loss of this decision
ˆ
θ (which can be, for instance, an estimate of θ).


                                ˆ
                              ρ(θ) =        ˆ
                                       L(θ, θ) P(θ | data) dθ

(promptly called posterior expected loss)




   Leo Martins (Univ. Vigo)              Journal Club               22/7   3 / 12
Statistical Risk


                                      ˆ
The risk ρ associated with a decision θ is the expected loss of this decision
ˆ
θ (which can be, for instance, an estimate of θ).


                                ˆ
                              ρ(θ) =        ˆ
                                       L(θ, θ) P(θ | data) dθ

(promptly called posterior expected loss)
                       ˆ
The loss function L(θ, θ) is a penalty we give for ”deciding” away from the
parameter. Examples are the squared loss and the absolute loss.




   Leo Martins (Univ. Vigo)              Journal Club               22/7   3 / 12
Statistical Risk


                                      ˆ
The risk ρ associated with a decision θ is the expected loss of this decision
ˆ
θ (which can be, for instance, an estimate of θ).


                                ˆ
                              ρ(θ) =        ˆ
                                       L(θ, θ) P(θ | data) dθ

(promptly called posterior expected loss)
                       ˆ
The loss function L(θ, θ) is a penalty we give for ”deciding” away from the
parameter. Examples are the squared loss and the absolute loss.

For some loss functions, we can calculate what is the best decision (i.e.
the one that minimizes the risk, for any data).




   Leo Martins (Univ. Vigo)              Journal Club               22/7    3 / 12
Outline


1 Distance as a penalty


2 Distances, everywhere


3 No phylogenetics, yet...


4 Trees as points in space


5 To the paper, then




  Leo Martins (Univ. Vigo)   Journal Club   22/7   4 / 12
How to summarise a collection of objects?




                                          scattered points

  library ( MASS ) ;
  x <- mvrnorm ( n =1000 , mu = c (0 ,0) , Sigma = matrix ( c (1 , 0.8 , 0.9 , 1) , 2 , 2 , byrow = T ) ) ;
  plot ( x [ ,1] , x [ ,2] , pch = " . " , cex = 2 , xlab = " x " , ylab = " y " ) ;




   Leo Martins (Univ. Vigo)                      Journal Club                                    22/7     5 / 12
How to summarise a collection of objects?




                      centroid: minimizes a distance to all points

  library ( MASS ) ;
  x <- mvrnorm ( n =1000 , mu = c (0 ,0) , Sigma = matrix ( c (1 , 0.8 , 0.9 , 1) , 2 , 2 , byrow = T ) ) ;
  plot ( x [ ,1] , x [ ,2] , pch = " . " , cex = 2 , xlab = " x " , ylab = " y " ) ;




   Leo Martins (Univ. Vigo)                      Journal Club                                    22/7     5 / 12
How to summarise a collection of objects?




                  regression line: minimizes a distance to all points

  library ( MASS ) ;
  x <- mvrnorm ( n =1000 , mu = c (0 ,0) , Sigma = matrix ( c (1 , 0.8 , 0.9 , 1) , 2 , 2 , byrow = T ) ) ;
  plot ( x [ ,1] , x [ ,2] , pch = " . " , cex = 2 , xlab = " x " , ylab = " y " ) ;




   Leo Martins (Univ. Vigo)                      Journal Club                                    22/7     5 / 12
Outline


1 Distance as a penalty


2 Distances, everywhere


3 No phylogenetics, yet...


4 Trees as points in space


5 To the paper, then




  Leo Martins (Univ. Vigo)   Journal Club   22/7   6 / 12
How to summarise the posterior distribution P(X)?




  Leo Martins (Univ. Vigo)   Journal Club           22/7   7 / 12
How to summarise the posterior distribution P(X)?




Posterior mean
Minimize the expected loss under a squared loss function
                                   ˆ         ˆ
                              L(θ, θ) = (θ − θ)2

(Euclidean distance)

   Leo Martins (Univ. Vigo)        Journal Club            22/7   7 / 12
How to summarise the posterior distribution P(X)?




Posterior median
Minimize the expected loss under a linear loss function
                                   ˆ         ˆ
                              L(θ, θ) =| θ − θ |

(Manhattan distance)

   Leo Martins (Univ. Vigo)        Journal Club           22/7   7 / 12
How to summarise the posterior distribution P(X)?




Posterior mode
a.k.a. Maximum A Posteriori (MAP) estimate.
Minimize the expected loss under a delta loss function

                                           0,                    ˆ
                                                         for θ = θ
                                   ˆ
                              L(θ, θ) =
                                           1,                    ˆ
                                                         for θ = θ
   Leo Martins (Univ. Vigo)               Journal Club               22/7   7 / 12
Outline


1 Distance as a penalty


2 Distances, everywhere


3 No phylogenetics, yet...


4 Trees as points in space


5 To the paper, then




  Leo Martins (Univ. Vigo)   Journal Club   22/7   8 / 12
Distances between trees
                      D                               D
                                  C                            E
                         
                                                                
                                                                 
             € €
              €                                     € €
                                                      €        
          
                                                  
                                                   
        E                                        C
                  €                                      €
                   f
                   f                                       f
                                                           f
                     f                                       f
                       fˆˆ                                     fˆˆ
                       ¢   ˆˆ
                            ˆB                                 ¢   ˆˆ
                                                                    ˆB
                       ¢                                      ¢
                     ¢                                       ¢
                   ¢                                       ¢
                              A                            A
Trees from the article




   Leo Martins (Univ. Vigo)           Journal Club                 22/7   9 / 12
Distances between trees
                     D                               D
                                 C                            E
                        
                                                               
                                                                
            € €
             €                                     € €
                                                     €        
         
                                                 
                                                  
       E                                        C
                 €                                      €
                  f
                  f                                       f
                                                          f
                    f                                       f
                      fˆˆ                                     fˆˆ
                      ¢   ˆˆ
                           ˆB                                 ¢   ˆˆ
                                                                   ˆB
                      ¢                                      ¢
                    ¢                                       ¢
                  ¢                                       ¢
                             A                            A
RF distance
    DE|ABC and CD|ABE
    total 2 branches




  Leo Martins (Univ. Vigo)           Journal Club                 22/7   9 / 12
Distances between trees
                     D                               D
                                 C                            E
                        
                                                               
                                                                
            € €
             €                                     € €
                                                     €        
         
                                                 
                                                  
       E                                        C
                 €                                      €
                  f
                  f                                       f
                                                          f
                    f                                       f
                      fˆˆ                                     fˆˆ
                      ¢   ˆˆ
                           ˆB                                 ¢   ˆˆ
                                                                   ˆB
                      ¢                                      ¢
                    ¢                                       ¢
                  ¢                                       ¢
                             A                            A
Quartet distance
    AC|DE and AE|CD
    BC|DE and BE|CD
    4 quartets are different



  Leo Martins (Univ. Vigo)           Journal Club                 22/7   9 / 12
Distances between trees
                     D                               D
                                 C                            E
                        
                                                               
                                                                
            € €
             €                                     € €
                                                     €        
         
                                                 
                                                  
       E                                        C
                 €                                      €
                  f
                  f                                       f
                                                          f
                    f                                       f
                      fˆˆ                                     fˆˆ
                      ¢   ˆˆ
                           ˆB                                 ¢   ˆˆ
                                                                   ˆB
                      ¢                                      ¢
                    ¢                                       ¢
                  ¢                                       ¢
                             A                            A
Quartet distance
    AC|DE and AE|CD
    BC|DE and BE|CD
    4 quartets are different



  Leo Martins (Univ. Vigo)           Journal Club                 22/7   9 / 12
Distances between trees
                      D                               D
                                  C                            E
                         
                                                                
                                                                 
             € €
              €                                     € €
                                                      €        
          
                                                  
                                                   
        E                                        C
                  €                                      €
                   f
                   f                                       f
                                                           f
                     f                                       f
                       fˆˆ                                     fˆˆ
                       ¢   ˆˆ
                            ˆB                                 ¢   ˆˆ
                                                                    ˆB
                       ¢                                      ¢
                     ¢                                       ¢
                   ¢                                       ¢
                              A                            A
Path difference (number of speciations between trees)
     path from A to E is one edge longer in one tree than the other
     (...)
     the overall difference is 6



   Leo Martins (Univ. Vigo)           Journal Club                 22/7   9 / 12
Outline


1 Distance as a penalty


2 Distances, everywhere


3 No phylogenetics, yet...


4 Trees as points in space


5 To the paper, then




  Leo Martins (Univ. Vigo)   Journal Club   22/7   10 / 12
If there is a distance, there is a Bayes estimator

For points in Rn , we know that the mean minimizes the Euclidean
distance, etc.

For phylogenies:

     there are several Euclidean distances


But some distances between trees also lead to “analytical” solutions:




   Leo Martins (Univ. Vigo)       Journal Club                   22/7   11 / 12
If there is a distance, there is a Bayes estimator

For points in Rn , we know that the mean minimizes the Euclidean
distance, etc.

For phylogenies:

     there are several Euclidean distances
     the mean does not work since a tree has restrictions

But some distances between trees also lead to “analytical” solutions:




   Leo Martins (Univ. Vigo)       Journal Club                   22/7   11 / 12
If there is a distance, there is a Bayes estimator

For points in Rn , we know that the mean minimizes the Euclidean
distance, etc.

For phylogenies:

     there are several Euclidean distances
     the mean does not work since a tree has restrictions

But some distances between trees also lead to “analytical” solutions:

     the consensus tree minimizes the Robinson-Foulds distance between
     the samples




   Leo Martins (Univ. Vigo)       Journal Club                   22/7   11 / 12
If there is a distance, there is a Bayes estimator

For points in Rn , we know that the mean minimizes the Euclidean
distance, etc.

For phylogenies:

     there are several Euclidean distances
     the mean does not work since a tree has restrictions

But some distances between trees also lead to “analytical” solutions:

     the consensus tree minimizes the Robinson-Foulds distance between
     the samples
     the quartet puzzling minimizes the quartet distance




   Leo Martins (Univ. Vigo)       Journal Club                   22/7   11 / 12
If there is a distance, there is a Bayes estimator

For points in Rn , we know that the mean minimizes the Euclidean
distance, etc.

For phylogenies:

     there are several Euclidean distances
     the mean does not work since a tree has restrictions

But some distances between trees also lead to “analytical” solutions:

     the consensus tree minimizes the Robinson-Foulds distance between
     the samples
     the quartet puzzling minimizes the quartet distance
     the Buneman tree minimizes (I think) the dissimilarity map distance



   Leo Martins (Univ. Vigo)       Journal Club                   22/7   11 / 12
If there is a distance, there is a Bayes estimator

For points in Rn , we know that the mean minimizes the Euclidean
distance, etc.

For phylogenies:

     there are several Euclidean distances
     the mean does not work since a tree has restrictions

But some distances between trees also lead to “analytical” solutions:

     the consensus tree minimizes the Robinson-Foulds distance between
     the samples
     the quartet puzzling minimizes the quartet distance
     the Buneman tree minimizes (I think) the dissimilarity map distance
     some of these are hard to solve as well

   Leo Martins (Univ. Vigo)       Journal Club                   22/7   11 / 12
How do they find, then, the Bayes estimates?



    like many other softwares: hill-climbing on the space of possible
    topologies




  Leo Martins (Univ. Vigo)       Journal Club                    22/7   12 / 12
How do they find, then, the Bayes estimates?



    like many other softwares: hill-climbing on the space of possible
    topologies
    their input data is the posterior distribution of trees from MrBayes




  Leo Martins (Univ. Vigo)       Journal Club                     22/7     12 / 12
How do they find, then, the Bayes estimates?



    like many other softwares: hill-climbing on the space of possible
    topologies
    their input data is the posterior distribution of trees from MrBayes
    starting tree can be NJ, MAP tree, ML...




  Leo Martins (Univ. Vigo)       Journal Club                     22/7     12 / 12
How do they find, then, the Bayes estimates?



    like many other softwares: hill-climbing on the space of possible
    topologies
    their input data is the posterior distribution of trees from MrBayes
    starting tree can be NJ, MAP tree, ML...
    apply branch-swap (NNI) to current optimal tree, then verify distance
    to all samples




  Leo Martins (Univ. Vigo)       Journal Club                     22/7     12 / 12
How do they find, then, the Bayes estimates?



    like many other softwares: hill-climbing on the space of possible
    topologies
    their input data is the posterior distribution of trees from MrBayes
    starting tree can be NJ, MAP tree, ML...
    apply branch-swap (NNI) to current optimal tree, then verify distance
    to all samples
           the distance used is the path difference (matrix subtraction)




  Leo Martins (Univ. Vigo)           Journal Club                         22/7   12 / 12
How do they find, then, the Bayes estimates?



    like many other softwares: hill-climbing on the space of possible
    topologies
    their input data is the posterior distribution of trees from MrBayes
    starting tree can be NJ, MAP tree, ML...
    apply branch-swap (NNI) to current optimal tree, then verify distance
    to all samples
           the distance used is the path difference (matrix subtraction)
           don’t need to recalculate distance to all samples, just to matrix with
           average values




  Leo Martins (Univ. Vigo)           Journal Club                        22/7   12 / 12

More Related Content

Viewers also liked

Schaefer, Joseph, R. Fidaxomicin Presentation
Schaefer, Joseph, R. Fidaxomicin PresentationSchaefer, Joseph, R. Fidaxomicin Presentation
Schaefer, Joseph, R. Fidaxomicin Presentation
Joseph Schaefer
 

Viewers also liked (16)

20140328 TNTL journal club axion electrodynamics, TI-FI interface (nomura, ...
20140328 TNTL journal club   axion electrodynamics, TI-FI interface (nomura, ...20140328 TNTL journal club   axion electrodynamics, TI-FI interface (nomura, ...
20140328 TNTL journal club axion electrodynamics, TI-FI interface (nomura, ...
 
Pseudogene Journal Club Presentation
Pseudogene Journal Club PresentationPseudogene Journal Club Presentation
Pseudogene Journal Club Presentation
 
Schaefer, Joseph, R. Fidaxomicin Presentation
Schaefer, Joseph, R. Fidaxomicin PresentationSchaefer, Joseph, R. Fidaxomicin Presentation
Schaefer, Joseph, R. Fidaxomicin Presentation
 
Journal Club - Early versus Late Parenteral Nutrition in Critically Ill Adults
Journal Club - Early versus Late Parenteral Nutrition in Critically Ill AdultsJournal Club - Early versus Late Parenteral Nutrition in Critically Ill Adults
Journal Club - Early versus Late Parenteral Nutrition in Critically Ill Adults
 
Rituximab CJASN Journal Club
Rituximab CJASN Journal ClubRituximab CJASN Journal Club
Rituximab CJASN Journal Club
 
Parkinson's Disease Presentation
Parkinson's Disease PresentationParkinson's Disease Presentation
Parkinson's Disease Presentation
 
Azithromycin for prevention of exacerbations of copd
Azithromycin for prevention of exacerbations of copdAzithromycin for prevention of exacerbations of copd
Azithromycin for prevention of exacerbations of copd
 
Acute exacerbation of COPD
Acute exacerbation of COPDAcute exacerbation of COPD
Acute exacerbation of COPD
 
Journal Club: Daily Corticosteroids Reduce Infection-associated Relapses in F...
Journal Club: Daily Corticosteroids Reduce Infection-associated Relapses in F...Journal Club: Daily Corticosteroids Reduce Infection-associated Relapses in F...
Journal Club: Daily Corticosteroids Reduce Infection-associated Relapses in F...
 
Journal Club: Fidaxomicin versus Vancomycin for Clostridium Difficile Infection
Journal Club: Fidaxomicin versus Vancomycin for Clostridium Difficile InfectionJournal Club: Fidaxomicin versus Vancomycin for Clostridium Difficile Infection
Journal Club: Fidaxomicin versus Vancomycin for Clostridium Difficile Infection
 
Genetic Basis Of Parkinson Disease
Genetic Basis Of Parkinson DiseaseGenetic Basis Of Parkinson Disease
Genetic Basis Of Parkinson Disease
 
Prevention of Venous Thromboembolism
Prevention of Venous ThromboembolismPrevention of Venous Thromboembolism
Prevention of Venous Thromboembolism
 
Journal Club
Journal ClubJournal Club
Journal Club
 
Journal Club: Thrombin-Receptor Antagonist Vorapaxar in Acute Coronary Syndromes
Journal Club: Thrombin-Receptor Antagonist Vorapaxar in Acute Coronary SyndromesJournal Club: Thrombin-Receptor Antagonist Vorapaxar in Acute Coronary Syndromes
Journal Club: Thrombin-Receptor Antagonist Vorapaxar in Acute Coronary Syndromes
 
Parkinsons Disease
Parkinsons DiseaseParkinsons Disease
Parkinsons Disease
 
How to present a journal club
How to present a journal clubHow to present a journal club
How to present a journal club
 

Similar to Journal Club @ UVigo 2011.07.22

MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
The Statistical and Applied Mathematical Sciences Institute
 
Triangle counting handout
Triangle counting handoutTriangle counting handout
Triangle counting handout
csedays
 
Slides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometrySlides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometry
Frank Nielsen
 

Similar to Journal Club @ UVigo 2011.07.22 (20)

Habilitation à diriger des recherches
Habilitation à diriger des recherchesHabilitation à diriger des recherches
Habilitation à diriger des recherches
 
Computational Information Geometry: A quick review (ICMS)
Computational Information Geometry: A quick review (ICMS)Computational Information Geometry: A quick review (ICMS)
Computational Information Geometry: A quick review (ICMS)
 
Approximate Bayesian model choice via random forests
Approximate Bayesian model choice via random forestsApproximate Bayesian model choice via random forests
Approximate Bayesian model choice via random forests
 
Likelihood free computational statistics
Likelihood free computational statisticsLikelihood free computational statistics
Likelihood free computational statistics
 
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
MUMS Opening Workshop - Quantifying Nonparametric Modeling Uncertainty with B...
 
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
 
Bayesian_Decision_Theory-3.pdf
Bayesian_Decision_Theory-3.pdfBayesian_Decision_Theory-3.pdf
Bayesian_Decision_Theory-3.pdf
 
guenomu software -- model and agorithm in 2013
guenomu software -- model and agorithm in 2013guenomu software -- model and agorithm in 2013
guenomu software -- model and agorithm in 2013
 
MCQMC 2016 Tutorial
MCQMC 2016 TutorialMCQMC 2016 Tutorial
MCQMC 2016 Tutorial
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Multidimensional Monot...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Multidimensional Monot...MUMS: Bayesian, Fiducial, and Frequentist Conference - Multidimensional Monot...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Multidimensional Monot...
 
NBBC15, Reyjavik, June 08, 2015
NBBC15, Reyjavik, June 08, 2015NBBC15, Reyjavik, June 08, 2015
NBBC15, Reyjavik, June 08, 2015
 
3rd NIPS Workshop on PROBABILISTIC PROGRAMMING
3rd NIPS Workshop on PROBABILISTIC PROGRAMMING3rd NIPS Workshop on PROBABILISTIC PROGRAMMING
3rd NIPS Workshop on PROBABILISTIC PROGRAMMING
 
Computational Tools and Techniques for Numerical Macro-Financial Modeling
Computational Tools and Techniques for Numerical Macro-Financial ModelingComputational Tools and Techniques for Numerical Macro-Financial Modeling
Computational Tools and Techniques for Numerical Macro-Financial Modeling
 
random forests for ABC model choice and parameter estimation
random forests for ABC model choice and parameter estimationrandom forests for ABC model choice and parameter estimation
random forests for ABC model choice and parameter estimation
 
Triangle counting handout
Triangle counting handoutTriangle counting handout
Triangle counting handout
 
Workshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael MartinWorkshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael Martin
 
main
mainmain
main
 
Slides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometrySlides: Hypothesis testing, information divergence and computational geometry
Slides: Hypothesis testing, information divergence and computational geometry
 
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
 
Algebras for programming languages
Algebras for programming languagesAlgebras for programming languages
Algebras for programming languages
 

Recently uploaded

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 

Journal Club @ UVigo 2011.07.22

  • 1. Journal Club – Bayes Estimators for Phylogenetic Reconstruction Syst. Biol. 60(4), 528 – 540, 2011 doi 10.1093/sysbio/syr021 Leonardo de O. Martins University of Vigo July 22, 2011 Leo Martins (Univ. Vigo) Journal Club 22/7 1 / 12
  • 2. Outline 1 Distance as a penalty 2 Distances, everywhere 3 No phylogenetics, yet... 4 Trees as points in space 5 To the paper, then Leo Martins (Univ. Vigo) Journal Club 22/7 2 / 12
  • 3. Statistical Risk ˆ The risk ρ associated with a decision θ is the expected loss of this decision ˆ θ (which can be, for instance, an estimate of θ). Leo Martins (Univ. Vigo) Journal Club 22/7 3 / 12
  • 4. Statistical Risk ˆ The risk ρ associated with a decision θ is the expected loss of this decision ˆ θ (which can be, for instance, an estimate of θ). ˆ ρ(θ) = ˆ L(θ, θ) P(θ | data) dθ (promptly called posterior expected loss) Leo Martins (Univ. Vigo) Journal Club 22/7 3 / 12
  • 5. Statistical Risk ˆ The risk ρ associated with a decision θ is the expected loss of this decision ˆ θ (which can be, for instance, an estimate of θ). ˆ ρ(θ) = ˆ L(θ, θ) P(θ | data) dθ (promptly called posterior expected loss) ˆ The loss function L(θ, θ) is a penalty we give for ”deciding” away from the parameter. Examples are the squared loss and the absolute loss. Leo Martins (Univ. Vigo) Journal Club 22/7 3 / 12
  • 6. Statistical Risk ˆ The risk ρ associated with a decision θ is the expected loss of this decision ˆ θ (which can be, for instance, an estimate of θ). ˆ ρ(θ) = ˆ L(θ, θ) P(θ | data) dθ (promptly called posterior expected loss) ˆ The loss function L(θ, θ) is a penalty we give for ”deciding” away from the parameter. Examples are the squared loss and the absolute loss. For some loss functions, we can calculate what is the best decision (i.e. the one that minimizes the risk, for any data). Leo Martins (Univ. Vigo) Journal Club 22/7 3 / 12
  • 7. Outline 1 Distance as a penalty 2 Distances, everywhere 3 No phylogenetics, yet... 4 Trees as points in space 5 To the paper, then Leo Martins (Univ. Vigo) Journal Club 22/7 4 / 12
  • 8. How to summarise a collection of objects? scattered points library ( MASS ) ; x <- mvrnorm ( n =1000 , mu = c (0 ,0) , Sigma = matrix ( c (1 , 0.8 , 0.9 , 1) , 2 , 2 , byrow = T ) ) ; plot ( x [ ,1] , x [ ,2] , pch = " . " , cex = 2 , xlab = " x " , ylab = " y " ) ; Leo Martins (Univ. Vigo) Journal Club 22/7 5 / 12
  • 9. How to summarise a collection of objects? centroid: minimizes a distance to all points library ( MASS ) ; x <- mvrnorm ( n =1000 , mu = c (0 ,0) , Sigma = matrix ( c (1 , 0.8 , 0.9 , 1) , 2 , 2 , byrow = T ) ) ; plot ( x [ ,1] , x [ ,2] , pch = " . " , cex = 2 , xlab = " x " , ylab = " y " ) ; Leo Martins (Univ. Vigo) Journal Club 22/7 5 / 12
  • 10. How to summarise a collection of objects? regression line: minimizes a distance to all points library ( MASS ) ; x <- mvrnorm ( n =1000 , mu = c (0 ,0) , Sigma = matrix ( c (1 , 0.8 , 0.9 , 1) , 2 , 2 , byrow = T ) ) ; plot ( x [ ,1] , x [ ,2] , pch = " . " , cex = 2 , xlab = " x " , ylab = " y " ) ; Leo Martins (Univ. Vigo) Journal Club 22/7 5 / 12
  • 11. Outline 1 Distance as a penalty 2 Distances, everywhere 3 No phylogenetics, yet... 4 Trees as points in space 5 To the paper, then Leo Martins (Univ. Vigo) Journal Club 22/7 6 / 12
  • 12. How to summarise the posterior distribution P(X)? Leo Martins (Univ. Vigo) Journal Club 22/7 7 / 12
  • 13. How to summarise the posterior distribution P(X)? Posterior mean Minimize the expected loss under a squared loss function ˆ ˆ L(θ, θ) = (θ − θ)2 (Euclidean distance) Leo Martins (Univ. Vigo) Journal Club 22/7 7 / 12
  • 14. How to summarise the posterior distribution P(X)? Posterior median Minimize the expected loss under a linear loss function ˆ ˆ L(θ, θ) =| θ − θ | (Manhattan distance) Leo Martins (Univ. Vigo) Journal Club 22/7 7 / 12
  • 15. How to summarise the posterior distribution P(X)? Posterior mode a.k.a. Maximum A Posteriori (MAP) estimate. Minimize the expected loss under a delta loss function 0, ˆ for θ = θ ˆ L(θ, θ) = 1, ˆ for θ = θ Leo Martins (Univ. Vigo) Journal Club 22/7 7 / 12
  • 16. Outline 1 Distance as a penalty 2 Distances, everywhere 3 No phylogenetics, yet... 4 Trees as points in space 5 To the paper, then Leo Martins (Univ. Vigo) Journal Club 22/7 8 / 12
  • 17. Distances between trees D D C E € € € € € € E C € € f f f f f f fˆˆ fˆˆ ¢ ˆˆ ˆB ¢ ˆˆ ˆB ¢ ¢ ¢ ¢ ¢ ¢ A A Trees from the article Leo Martins (Univ. Vigo) Journal Club 22/7 9 / 12
  • 18. Distances between trees D D C E € € € € € € E C € € f f f f f f fˆˆ fˆˆ ¢ ˆˆ ˆB ¢ ˆˆ ˆB ¢ ¢ ¢ ¢ ¢ ¢ A A RF distance DE|ABC and CD|ABE total 2 branches Leo Martins (Univ. Vigo) Journal Club 22/7 9 / 12
  • 19. Distances between trees D D C E € € € € € € E C € € f f f f f f fˆˆ fˆˆ ¢ ˆˆ ˆB ¢ ˆˆ ˆB ¢ ¢ ¢ ¢ ¢ ¢ A A Quartet distance AC|DE and AE|CD BC|DE and BE|CD 4 quartets are different Leo Martins (Univ. Vigo) Journal Club 22/7 9 / 12
  • 20. Distances between trees D D C E € € € € € € E C € € f f f f f f fˆˆ fˆˆ ¢ ˆˆ ˆB ¢ ˆˆ ˆB ¢ ¢ ¢ ¢ ¢ ¢ A A Quartet distance AC|DE and AE|CD BC|DE and BE|CD 4 quartets are different Leo Martins (Univ. Vigo) Journal Club 22/7 9 / 12
  • 21. Distances between trees D D C E € € € € € € E C € € f f f f f f fˆˆ fˆˆ ¢ ˆˆ ˆB ¢ ˆˆ ˆB ¢ ¢ ¢ ¢ ¢ ¢ A A Path difference (number of speciations between trees) path from A to E is one edge longer in one tree than the other (...) the overall difference is 6 Leo Martins (Univ. Vigo) Journal Club 22/7 9 / 12
  • 22. Outline 1 Distance as a penalty 2 Distances, everywhere 3 No phylogenetics, yet... 4 Trees as points in space 5 To the paper, then Leo Martins (Univ. Vigo) Journal Club 22/7 10 / 12
  • 23. If there is a distance, there is a Bayes estimator For points in Rn , we know that the mean minimizes the Euclidean distance, etc. For phylogenies: there are several Euclidean distances But some distances between trees also lead to “analytical” solutions: Leo Martins (Univ. Vigo) Journal Club 22/7 11 / 12
  • 24. If there is a distance, there is a Bayes estimator For points in Rn , we know that the mean minimizes the Euclidean distance, etc. For phylogenies: there are several Euclidean distances the mean does not work since a tree has restrictions But some distances between trees also lead to “analytical” solutions: Leo Martins (Univ. Vigo) Journal Club 22/7 11 / 12
  • 25. If there is a distance, there is a Bayes estimator For points in Rn , we know that the mean minimizes the Euclidean distance, etc. For phylogenies: there are several Euclidean distances the mean does not work since a tree has restrictions But some distances between trees also lead to “analytical” solutions: the consensus tree minimizes the Robinson-Foulds distance between the samples Leo Martins (Univ. Vigo) Journal Club 22/7 11 / 12
  • 26. If there is a distance, there is a Bayes estimator For points in Rn , we know that the mean minimizes the Euclidean distance, etc. For phylogenies: there are several Euclidean distances the mean does not work since a tree has restrictions But some distances between trees also lead to “analytical” solutions: the consensus tree minimizes the Robinson-Foulds distance between the samples the quartet puzzling minimizes the quartet distance Leo Martins (Univ. Vigo) Journal Club 22/7 11 / 12
  • 27. If there is a distance, there is a Bayes estimator For points in Rn , we know that the mean minimizes the Euclidean distance, etc. For phylogenies: there are several Euclidean distances the mean does not work since a tree has restrictions But some distances between trees also lead to “analytical” solutions: the consensus tree minimizes the Robinson-Foulds distance between the samples the quartet puzzling minimizes the quartet distance the Buneman tree minimizes (I think) the dissimilarity map distance Leo Martins (Univ. Vigo) Journal Club 22/7 11 / 12
  • 28. If there is a distance, there is a Bayes estimator For points in Rn , we know that the mean minimizes the Euclidean distance, etc. For phylogenies: there are several Euclidean distances the mean does not work since a tree has restrictions But some distances between trees also lead to “analytical” solutions: the consensus tree minimizes the Robinson-Foulds distance between the samples the quartet puzzling minimizes the quartet distance the Buneman tree minimizes (I think) the dissimilarity map distance some of these are hard to solve as well Leo Martins (Univ. Vigo) Journal Club 22/7 11 / 12
  • 29. How do they find, then, the Bayes estimates? like many other softwares: hill-climbing on the space of possible topologies Leo Martins (Univ. Vigo) Journal Club 22/7 12 / 12
  • 30. How do they find, then, the Bayes estimates? like many other softwares: hill-climbing on the space of possible topologies their input data is the posterior distribution of trees from MrBayes Leo Martins (Univ. Vigo) Journal Club 22/7 12 / 12
  • 31. How do they find, then, the Bayes estimates? like many other softwares: hill-climbing on the space of possible topologies their input data is the posterior distribution of trees from MrBayes starting tree can be NJ, MAP tree, ML... Leo Martins (Univ. Vigo) Journal Club 22/7 12 / 12
  • 32. How do they find, then, the Bayes estimates? like many other softwares: hill-climbing on the space of possible topologies their input data is the posterior distribution of trees from MrBayes starting tree can be NJ, MAP tree, ML... apply branch-swap (NNI) to current optimal tree, then verify distance to all samples Leo Martins (Univ. Vigo) Journal Club 22/7 12 / 12
  • 33. How do they find, then, the Bayes estimates? like many other softwares: hill-climbing on the space of possible topologies their input data is the posterior distribution of trees from MrBayes starting tree can be NJ, MAP tree, ML... apply branch-swap (NNI) to current optimal tree, then verify distance to all samples the distance used is the path difference (matrix subtraction) Leo Martins (Univ. Vigo) Journal Club 22/7 12 / 12
  • 34. How do they find, then, the Bayes estimates? like many other softwares: hill-climbing on the space of possible topologies their input data is the posterior distribution of trees from MrBayes starting tree can be NJ, MAP tree, ML... apply branch-swap (NNI) to current optimal tree, then verify distance to all samples the distance used is the path difference (matrix subtraction) don’t need to recalculate distance to all samples, just to matrix with average values Leo Martins (Univ. Vigo) Journal Club 22/7 12 / 12