SlideShare ist ein Scribd-Unternehmen logo
1 von 68
Downloaden Sie, um offline zu lesen
A vanilla Rao–Blackwellisation of
Metropolis–Hastings algorithms

 Randal DOUC and Christian ROBERT
     Telecom SudParis, France

   randal.douc@it-sudparis.eu


            April 2009



                                    1 / 24
Main themes

   1   Rao–Blackwellisation on MCMC.
   2   Can be performed in any Hastings Metropolis algorithm.
   3   Asymptotically more efficient to usual MCMC with a
       controlled amount of calculations.




                                                                2 / 24
Main themes

   1   Rao–Blackwellisation on MCMC.
   2   Can be performed in any Hastings Metropolis algorithm.
   3   Asymptotically more efficient to usual MCMC with a
       controlled amount of calculations.




                                                                2 / 24
Main themes

   1   Rao–Blackwellisation on MCMC.
   2   Can be performed in any Hastings Metropolis algorithm.
   3   Asymptotically more efficient to usual MCMC with a
       controlled amount of calculations.




                                                                2 / 24
Main themes

   1   Rao–Blackwellisation on MCMC.
   2   Can be performed in any Hastings Metropolis algorithm.
   3   Asymptotically more efficient to usual MCMC with a
       controlled amount of calculations.




                                                                2 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                        3 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                        3 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                        3 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                        3 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                        3 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                        4 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation      Illustrations   Conclusion




Metropolis Hastings algorithm


           1   We wish to approximate

                                            h(x )π(x )dx
                                   I=                    =            h(x )¯ (x )dx
                                                                           π
                                              π(x )dx


           2   x → π(x ) is known but not               π(x )dx .
                                          1             n
           3   Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov
               chain with limiting distribution π .
                                                ¯
           4   Convergence obtained from Law of Large Numbers or CLT for
               Markov chains.


                                                                                                            5 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation      Illustrations   Conclusion




Metropolis Hastings algorithm


           1   We wish to approximate

                                            h(x )π(x )dx
                                   I=                    =            h(x )¯ (x )dx
                                                                           π
                                              π(x )dx


           2   x → π(x ) is known but not               π(x )dx .
                                          1             n
           3   Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov
               chain with limiting distribution π .
                                                ¯
           4   Convergence obtained from Law of Large Numbers or CLT for
               Markov chains.


                                                                                                            5 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation      Illustrations   Conclusion




Metropolis Hastings algorithm


           1   We wish to approximate

                                            h(x )π(x )dx
                                   I=                    =            h(x )¯ (x )dx
                                                                           π
                                              π(x )dx


           2   x → π(x ) is known but not               π(x )dx .
                                          1             n
           3   Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov
               chain with limiting distribution π .
                                                ¯
           4   Convergence obtained from Law of Large Numbers or CLT for
               Markov chains.


                                                                                                            5 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation      Illustrations   Conclusion




Metropolis Hastings algorithm


           1   We wish to approximate

                                            h(x )π(x )dx
                                   I=                    =            h(x )¯ (x )dx
                                                                           π
                                              π(x )dx


           2   x → π(x ) is known but not               π(x )dx .
                                          1             n
           3   Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov
               chain with limiting distribution π .
                                                ¯
           4   Convergence obtained from Law of Large Numbers or CLT for
               Markov chains.


                                                                                                            5 / 24
Introduction       Some properties of the HM algorithm   Rao–Blackwellisation         Illustrations   Conclusion




Metropolis Hasting Algorithm

       Suppose that x (t) is drawn.
           1   Simulate yt ∼ q(·|x (t) ).
           2   Set x (t+1) = yt with probability

                                                             π(yt ) q(x (t) |yt )
                                 α(x (t) , yt ) = min 1,
                                                            π(x (t) ) q(yt |x (t) )

               Otherwise, set x (t+1) = x (t) .
           3   α is such that the detailed balance equation is satisfied: ⊲ π is
                                                                           ¯
               the stationary distribution of (x (t) ).
       ◮ The accepted candidates are simulated with the rejection
       algorithm.

                                                                                                            6 / 24
Introduction       Some properties of the HM algorithm   Rao–Blackwellisation         Illustrations   Conclusion




Metropolis Hasting Algorithm

       Suppose that x (t) is drawn.
           1   Simulate yt ∼ q(·|x (t) ).
           2   Set x (t+1) = yt with probability

                                                             π(yt ) q(x (t) |yt )
                                 α(x (t) , yt ) = min 1,
                                                            π(x (t) ) q(yt |x (t) )

               Otherwise, set x (t+1) = x (t) .
           3   α is such that the detailed balance equation is satisfied: ⊲ π is
                                                                           ¯
               the stationary distribution of (x (t) ).
       ◮ The accepted candidates are simulated with the rejection
       algorithm.

                                                                                                            6 / 24
Introduction       Some properties of the HM algorithm   Rao–Blackwellisation         Illustrations   Conclusion




Metropolis Hasting Algorithm
       Suppose that x (t) is drawn.
           1   Simulate yt ∼ q(·|x (t) ).
           2   Set x (t+1) = yt with probability

                                                             π(yt ) q(x (t) |yt )
                                 α(x (t) , yt ) = min 1,
                                                            π(x (t) ) q(yt |x (t) )

               Otherwise, set x (t+1) = x (t) .
           3   α is such that the detailed balance equation is satisfied:

                                 π(x )q(y |x )α(x , y ) = π(y )q(x |y )α(y , x ).

               ⊲ π is the stationary distribution of (x (t) ).
                 ¯
       ◮ The accepted candidates are simulated with the rejection
       algorithm.
                                                                                                            6 / 24
Introduction       Some properties of the HM algorithm   Rao–Blackwellisation         Illustrations   Conclusion




Metropolis Hasting Algorithm
       Suppose that x (t) is drawn.
           1   Simulate yt ∼ q(·|x (t) ).
           2   Set x (t+1) = yt with probability

                                                             π(yt ) q(x (t) |yt )
                                 α(x (t) , yt ) = min 1,
                                                            π(x (t) ) q(yt |x (t) )

               Otherwise, set x (t+1) = x (t) .
           3   α is such that the detailed balance equation is satisfied:

                                 π(x )q(y |x )α(x , y ) = π(y )q(x |y )α(y , x ).

               ⊲ π is the stationary distribution of (x (t) ).
                 ¯
       ◮ The accepted candidates are simulated with the rejection
       algorithm.
                                                                                                            6 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                        7 / 24
Introduction      Some properties of the HM algorithm         Rao–Blackwellisation          Illustrations   Conclusion




           1   Alternative representation of the estimator δ is
                                                n                       MN
                                       1                (t)  1
                                    δ=               h(x ) =                  ni h(zi ) ,
                                       n                     N
                                               t=1                      i=1

               where
                    zi ’s are the accepted yj ’s,
                    MN is the number of accepted yj ’s till time N,
                    ni is the number of times zi appears in the sequence (x (t) )t .




                                                                                                                  8 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation    Illustrations   Conclusion




                                               α(zi , ·) q(·|zi )   q(·|zi )
                                 ˜
                                 q (·|zi ) =                      ≤          ,
                                                    p(zi )          p(zi )

       where p(zi ) =                                                           ˜
                              α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ):
           1   Propose a candidate y ∼ q(·|zi )
           2   Accept with probability

                                                        q(y |zi )
                                        ˜
                                        q (y |zi )/                  = α(zi , y )
                                                         p(zi )

               Otherwise, reject it and starts again.
           3   ◮ this is the transition of the HM algorithm.
                             ˜
       The transition kernel q admits π as a stationary distribution:
                                      ˜

                                                ˜ ˜
                                                π (x )q (y |x ) =


                                                                                                          9 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation    Illustrations   Conclusion




                                               α(zi , ·) q(·|zi )   q(·|zi )
                                 ˜
                                 q (·|zi ) =                      ≤          ,
                                                    p(zi )          p(zi )

       where p(zi ) =                                                           ˜
                              α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ):
           1   Propose a candidate y ∼ q(·|zi )
           2   Accept with probability

                                                        q(y |zi )
                                        ˜
                                        q (y |zi )/                  = α(zi , y )
                                                         p(zi )

               Otherwise, reject it and starts again.
           3   ◮ this is the transition of the HM algorithm.
                             ˜
       The transition kernel q admits π as a stationary distribution:
                                      ˜

                                                ˜ ˜
                                                π (x )q (y |x ) =


                                                                                                          9 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation    Illustrations   Conclusion




                                               α(zi , ·) q(·|zi )   q(·|zi )
                                 ˜
                                 q (·|zi ) =                      ≤          ,
                                                    p(zi )          p(zi )

       where p(zi ) =                                                           ˜
                              α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ):
           1   Propose a candidate y ∼ q(·|zi )
           2   Accept with probability

                                                        q(y |zi )
                                        ˜
                                        q (y |zi )/                  = α(zi , y )
                                                         p(zi )

               Otherwise, reject it and starts again.
           3   ◮ this is the transition of the HM algorithm.
                             ˜
       The transition kernel q admits π as a stationary distribution:
                                      ˜

                                                ˜ ˜
                                                π (x )q (y |x ) =


                                                                                                          9 / 24
Introduction      Some properties of the HM algorithm           Rao–Blackwellisation       Illustrations   Conclusion




                                               α(zi , ·) q(·|zi )   q(·|zi )
                                 ˜
                                 q (·|zi ) =                      ≤          ,
                                                    p(zi )          p(zi )
       where p(zi ) =                                                           ˜
                              α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ):
           1   Propose a candidate y ∼ q(·|zi )
           2   Accept with probability
                                                        q(y |zi )
                                        ˜
                                        q (y |zi )/                      = α(zi , y )
                                                         p(zi )
               Otherwise, reject it and starts again.
           3   ◮ this is the transition of the HM algorithm.
                             ˜
       The transition kernel q admits π as a stationary distribution:
                                      ˜
                                                  π(x )p(x ) α(x , y )q(y |x )
                        ˜ ˜
                        π (x )q (y |x ) =
                                                  π(u)p(u)du      p(x )
                                                        π (x)
                                                        ˜                       ˜
                                                                                q (y |x)

                                                                                                                 9 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation    Illustrations   Conclusion




                                               α(zi , ·) q(·|zi )   q(·|zi )
                                 ˜
                                 q (·|zi ) =                      ≤          ,
                                                    p(zi )          p(zi )

       where p(zi ) =                                                           ˜
                              α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ):
           1   Propose a candidate y ∼ q(·|zi )
           2   Accept with probability

                                                        q(y |zi )
                                        ˜
                                        q (y |zi )/                  = α(zi , y )
                                                         p(zi )

               Otherwise, reject it and starts again.
           3   ◮ this is the transition of the HM algorithm.
                             ˜
       The transition kernel q admits π as a stationary distribution:
                                      ˜

                                                        π(x )α(x , y )q(y |x )
                                 ˜ ˜
                                 π (x )q (y |x ) =
                                                             π(u)p(u)du

                                                                                                          9 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation    Illustrations   Conclusion




                                               α(zi , ·) q(·|zi )   q(·|zi )
                                 ˜
                                 q (·|zi ) =                      ≤          ,
                                                    p(zi )          p(zi )

       where p(zi ) =                                                           ˜
                              α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ):
           1   Propose a candidate y ∼ q(·|zi )
           2   Accept with probability

                                                        q(y |zi )
                                        ˜
                                        q (y |zi )/                  = α(zi , y )
                                                         p(zi )

               Otherwise, reject it and starts again.
           3   ◮ this is the transition of the HM algorithm.
                             ˜
       The transition kernel q admits π as a stationary distribution:
                                      ˜

                                                        π(y )α(y , x )q(x |y )
                                 ˜ ˜
                                 π (x )q (y |x ) =
                                                             π(u)p(u)du

                                                                                                          9 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation    Illustrations   Conclusion




                                               α(zi , ·) q(·|zi )   q(·|zi )
                                 ˜
                                 q (·|zi ) =                      ≤          ,
                                                    p(zi )          p(zi )

       where p(zi ) =                                                           ˜
                              α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ):
           1   Propose a candidate y ∼ q(·|zi )
           2   Accept with probability

                                                        q(y |zi )
                                        ˜
                                        q (y |zi )/                  = α(zi , y )
                                                         p(zi )

               Otherwise, reject it and starts again.
           3   ◮ this is the transition of the HM algorithm.
                             ˜
       The transition kernel q admits π as a stationary distribution:
                                      ˜

                                      ˜ ˜               ˜ ˜
                                      π (x )q (y |x ) = π (y )q (x |y ) ,


                                                                                                          9 / 24
Introduction       Some properties of the HM algorithm      Rao–Blackwellisation     Illustrations   Conclusion




       Lemme
       The sequence (zi , ni ) satisfies
           1   (zi , ni )i is a Markov chain;
           2   zi+1 and ni are independent given zi ;
           3   ni is distributed as a geometric random variable with probability
               parameter
                                         p(zi ) :=       α(zi , y ) q(y |zi ) dy ;                   (1)


           4   (zi )i is a Markov chain with transition kernel
                ˜            ˜
                Q(z, dy ) = q (y |z)dy and stationary distribution π such that
                                                                   ˜

                              ˜
                              q (·|z) ∝ α(z, ·) q(·|z)        and      π (·) ∝ π(·)p(·) .
                                                                       ˜


                                                                                                           10 / 24
Introduction       Some properties of the HM algorithm      Rao–Blackwellisation     Illustrations   Conclusion




       Lemme
       The sequence (zi , ni ) satisfies
           1   (zi , ni )i is a Markov chain;
           2   zi+1 and ni are independent given zi ;
           3   ni is distributed as a geometric random variable with probability
               parameter
                                         p(zi ) :=       α(zi , y ) q(y |zi ) dy ;                   (1)


           4   (zi )i is a Markov chain with transition kernel
                ˜            ˜
                Q(z, dy ) = q (y |z)dy and stationary distribution π such that
                                                                   ˜

                              ˜
                              q (·|z) ∝ α(z, ·) q(·|z)        and      π (·) ∝ π(·)p(·) .
                                                                       ˜


                                                                                                           10 / 24
Introduction       Some properties of the HM algorithm      Rao–Blackwellisation     Illustrations   Conclusion




       Lemme
       The sequence (zi , ni ) satisfies
           1   (zi , ni )i is a Markov chain;
           2   zi+1 and ni are independent given zi ;
           3   ni is distributed as a geometric random variable with probability
               parameter
                                         p(zi ) :=       α(zi , y ) q(y |zi ) dy ;                   (1)


           4   (zi )i is a Markov chain with transition kernel
                ˜            ˜
                Q(z, dy ) = q (y |z)dy and stationary distribution π such that
                                                                   ˜

                              ˜
                              q (·|z) ∝ α(z, ·) q(·|z)        and      π (·) ∝ π(·)p(·) .
                                                                       ˜


                                                                                                           10 / 24
Introduction       Some properties of the HM algorithm      Rao–Blackwellisation     Illustrations   Conclusion




       Lemme
       The sequence (zi , ni ) satisfies
           1   (zi , ni )i is a Markov chain;
           2   zi+1 and ni are independent given zi ;
           3   ni is distributed as a geometric random variable with probability
               parameter
                                         p(zi ) :=       α(zi , y ) q(y |zi ) dy ;                   (1)


           4   (zi )i is a Markov chain with transition kernel
                ˜            ˜
                Q(z, dy ) = q (y |z)dy and stationary distribution π such that
                                                                   ˜

                              ˜
                              q (·|z) ∝ α(z, ·) q(·|z)        and      π (·) ∝ π(·)p(·) .
                                                                       ˜


                                                                                                           10 / 24
Introduction   Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




                                zi−1




                                                                                                 11 / 24
Introduction   Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




                                           indep
                               zi−1                  zi


                                    indep


                               ni−1




                                                                                                 11 / 24
Introduction   Some properties of the HM algorithm        Rao–Blackwellisation      Illustrations   Conclusion




                                        indep                 indep
                            zi−1                     zi                      zi+1


                                 indep                    indep


                            ni−1                     ni




                                                                                                         11 / 24
Introduction   Some properties of the HM algorithm        Rao–Blackwellisation       Illustrations   Conclusion




                                        indep                   indep
                            zi−1                     zi                      zi+1


                                 indep                    indep


                            ni−1                     ni




                                         n                       MN
                                    1                       1
                             δ=               h(x (t) ) =              ni h(zi ) .
                                    n                       N
                                        t=1                      i=1




                                                                                                          11 / 24
Introduction   Some properties of the HM algorithm        Rao–Blackwellisation       Illustrations   Conclusion




                                        indep                   indep
                            zi−1                     zi                      zi+1


                                 indep                    indep


                            ni−1                     ni




                                         n                       MN
                                    1                       1
                             δ=               h(x (t) ) =              ni h(zi ) .
                                    n                       N
                                        t=1                      i=1




                                                                                                          11 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                       12 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation   Illustrations   Conclusion




           1   A natural idea:
                                                            MN
                                                        1         h(zi )
                                                δ∗ =                     ,
                                                        N         p(zi )
                                                            i=1




                                                                                                        13 / 24
Introduction      Some properties of the HM algorithm     Rao–Blackwellisation       Illustrations   Conclusion




           1   A natural idea:

                                              MN    h(zi )       MN   π(zi )
                                              i=1                i=1          h(zi )
                                                    p(zi )            π (zi )
                                                                      ˜
                                  δ∗ ≃                     =                         .
                                              MN     1              MN π(zi )
                                              i=1                   i=1
                                                    p(zi )               π (zi )
                                                                         ˜




                                                                                                          13 / 24
Introduction      Some properties of the HM algorithm     Rao–Blackwellisation       Illustrations   Conclusion




           1   A natural idea:

                                              MN    h(zi )       MN   π(zi )
                                              i=1                i=1          h(zi )
                                    ∗               p(zi )            π (zi )
                                                                      ˜
                                  δ ≃                      =                         .
                                              MN     1              MN π(zi )
                                              i=1                   i=1
                                                    p(zi )               π (zi )
                                                                         ˜

           2   But p not available in closed form.




                                                                                                          13 / 24
Introduction      Some properties of the HM algorithm     Rao–Blackwellisation       Illustrations   Conclusion




           1   A natural idea:

                                              MN    h(zi )       MN   π(zi )
                                              i=1                i=1          h(zi )
                                    ∗               p(zi )            π (zi )
                                                                      ˜
                                  δ ≃                      =                         .
                                              MN     1              MN π(zi )
                                              i=1                   i=1
                                                    p(zi )               π (zi )
                                                                         ˜

           2   But p not available in closed form.
           3   The geometric ni is the obvious solution that is used in the
               original Metropolis–Hastings estimate.




                                                                                                          13 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation       Illustrations   Conclusion




           1   A natural idea:

                                              MN    h(zi )         MN   π(zi )
                                              i=1                  i=1          h(zi )
                                                    p(zi )              π (zi )
                                                                        ˜
                                  δ∗ ≃                     =                           .
                                              MN     1                MN π(zi )
                                              i=1                     i=1
                                                    p(zi )                 π (zi )
                                                                           ˜

           2   But p not available in closed form.
           3   The geometric ni is the obvious solution that is used in the
               original Metropolis–Hastings estimate.

                                               ∞
                                ni = 1 +                 I {uℓ ≥ α(zi , yℓ )} ,
                                               j=1 ℓ≤j




                                                                                                            13 / 24
Introduction    Some properties of the HM algorithm       Rao–Blackwellisation   Illustrations   Conclusion




                                             ∞
                              ni = 1 +                 I {uℓ ≥ α(zi , yℓ )} ,
                                             j=1 ℓ≤j


       Lemma
       If (yj )j is an iid sequence with distribution q(y |zi ), the quantity
                                                ∞
                                 ˆ
                                 ξi = 1 +                {1 − α(zi , yℓ )}
                                               j=1 ℓ≤j


       is an unbiased estimator of 1/p(zi ) which variance, conditional on zi ,
       is lower than the conditional variance of ni , {1 − p(zi )}/p2 (zi ).




                                                                                                      13 / 24
Introduction      Some properties of the HM algorithm       Rao–Blackwellisation         Illustrations   Conclusion




                                                  ∞
                                   ˆ
                                   ξi = 1 +                {1 − α(zi , yℓ )}
                                                 j=1 ℓ≤j


           1   Infinite sum but sometimes finite:
                                                                π(yt ) q(x (t) |yt )
                                α(x (t) , yt ) = min 1,
                                                               π(x (t) ) q(yt |x (t) )

               For example: take a symetric random walk as a proposal.
           2   What if we wish to be sure that the sum is finite?




                                                                                                              14 / 24
Introduction         Some properties of the HM algorithm     Rao–Blackwellisation      Illustrations   Conclusion


Variance reduction




       Proposition
       If (yj )j is an iid sequence with distribution q(y |zi ) and (uj )j is an iid
       uniform sequence, for any k ≥ 0, the quantity
                               ∞
                ˆ
                ξik = 1 +                       {1 − α(zi , yj )}              I {uℓ ≥ α(zi , yℓ )}    (2)
                               j=1 1≤ℓ≤k ∧j                         k +1≤ℓ≤j


       is an unbiased estimator of 1/p(zi ) with an almost sure finite number
       of terms.




                                                                                                             15 / 24
Introduction         Some properties of the HM algorithm     Rao–Blackwellisation          Illustrations      Conclusion


Variance reduction




       Proposition
       If (yj )j is an iid sequence with distribution q(y |zi ) and (uj )j is an iid
       uniform sequence, for any k ≥ 0, the quantity
                               ∞
                ˆ
                ξik = 1 +                       {1 − α(zi , yj )}              I {uℓ ≥ α(zi , yℓ )}          (2)
                               j=1 1≤ℓ≤k ∧j                         k +1≤ℓ≤j


       is an unbiased estimator of 1/p(zi ) with an almost sure finite number
       of terms. Moreover, for k ≥ 1,

         ˆ        1 − p(zi ) 1 − (1 − 2p(zi ) + r (zi ))k                           2 − p(zi )
       V ξik zi =            −                                                                       (p(zi )−r (zi )) ,
                    p2 (zi )      2p(zi ) − r (zi )                                  p2 (zi )

       where p(zi ) :=            α(zi , y ) q(y |zi ) dy . and r (zi ) :=          α2 (zi , y ) q(y |zi ) dy .



                                                                                                                   15 / 24
Introduction         Some properties of the HM algorithm     Rao–Blackwellisation      Illustrations   Conclusion


Variance reduction




       Proposition
       If (yj )j is an iid sequence with distribution q(y |zi ) and (uj )j is an iid
       uniform sequence, for any k ≥ 0, the quantity
                               ∞
                ˆ
                ξik = 1 +                       {1 − α(zi , yj )}              I {uℓ ≥ α(zi , yℓ )}    (2)
                               j=1 1≤ℓ≤k ∧j                         k +1≤ℓ≤j


       is an unbiased estimator of 1/p(zi ) with an almost sure finite number
       of terms. Therefore, we have

                            ˆ         ˆ          ˆ
                          V ξi zi ≤ V ξik zi ≤ V ξi0 zi = V [ni | zi ] .




                                                                                                             15 / 24
Introduction         Some properties of the HM algorithm    Rao–Blackwellisation       Illustrations   Conclusion


Variance reduction



                                      zi−1




                             ∞
               ˆ
               ξik = 1 +                      {1 − α(zi , yj )}              I {uℓ ≥ α(zi , yℓ )}      (3)
                            j=1 1≤ℓ≤k ∧j                          k +1≤ℓ≤j




                                                                                                             16 / 24
Introduction         Some properties of the HM algorithm    Rao–Blackwellisation       Illustrations   Conclusion


Variance reduction

                                              not indep
                                     zi−1                   zi


                                          not indep


                                     ˆk
                                     ξi−1




                             ∞
               ˆ
               ξik = 1 +                      {1 − α(zi , yj )}              I {uℓ ≥ α(zi , yℓ )}      (3)
                            j=1 1≤ℓ≤k ∧j                          k +1≤ℓ≤j




                                                                                                             16 / 24
Introduction         Some properties of the HM algorithm         Rao–Blackwellisation      Illustrations   Conclusion


Variance reduction

                                           not indep              not indep
                                  zi−1                     zi                       zi+1


                                       not indep                 not indep


                                  ˆk
                                  ξi−1                     ˆ
                                                           ξik




                             ∞
               ˆ
               ξik = 1 +                      {1 − α(zi , yj )}                 I {uℓ ≥ α(zi , yℓ )}       (3)
                            j=1 1≤ℓ≤k ∧j                            k +1≤ℓ≤j




                                                                                                                 16 / 24
Introduction         Some properties of the HM algorithm         Rao–Blackwellisation      Illustrations   Conclusion


Variance reduction



                                           not indep              not indep
                                  zi−1                      zi                      zi+1


                                       not indep                 not indep


                                  ˆk
                                  ξi−1                     ˆ
                                                           ξik




                                                           M ˆk
                                               k           i=1 ξi h(zi )
                                              δM =            M ˆk
                                                                            .
                                                              i=1 ξi




                                                                                                                16 / 24
Introduction         Some properties of the HM algorithm         Rao–Blackwellisation      Illustrations   Conclusion


Variance reduction



                                           not indep              not indep
                                  zi−1                      zi                      zi+1


                                       not indep                 not indep


                                  ˆk
                                  ξi−1                     ˆ
                                                           ξik




                                                           M ˆk
                                               k           i=1 ξi h(zi )
                                              δM =            M ˆk
                                                                            .
                                                              i=1 ξi




                                                                                                                16 / 24
Introduction         Some properties of the HM algorithm       Rao–Blackwellisation   Illustrations   Conclusion


Asymptotic results




       Let
                                                           M ˆk
                                               k           i=1 ξi h(zi )
                                              δM =            M ˆk
                                                                           .
                                                              i=1 ξi

       For any positive function ϕ, we denote Cϕ = {h; |h/ϕ|∞ < ∞}.




                                                                                                           17 / 24
Introduction         Some properties of the HM algorithm       Rao–Blackwellisation   Illustrations   Conclusion


Asymptotic results

       Let
                                                           M ˆk
                                                 k         i=1 ξi h(zi )
                                                δM =          M ˆk
                                                                           .
                                                              i=1 ξi

       For any positive function ϕ, we denote Cϕ = {h; |h/ϕ|∞ < ∞}.
       Assume that there exist a positive function ϕ ≥ 1 such that
                                                       M
                                                       i=1 h(zi )/p(zi )       P
                                 ∀h ∈ Cϕ ,               M
                                                                           −→ π(h)                    (3)
                                                         i=1 1/p(zi )



       Theorem
       Under the assumption that π(p) > 0, the following convergence
       property holds:
           i) If h is in Cϕ , then

                                      k     P
                                     δM −→M→∞ π(h) (◮C ONSISTENCY)
                                                                                                            17 / 24
Introduction         Some properties of the HM algorithm       Rao–Blackwellisation       Illustrations   Conclusion


Asymptotic results

       Let
                                                           M ˆk
                                               k           i=1 ξi h(zi )
                                              δM =            M ˆk
                                                                           .
                                                              i=1 ξi

       For any positive function ϕ, we denote Cϕ = {h; |h/ϕ|∞ < ∞}.
       Assume that there exist a positive function ψ such that
                                  √            M
                                               i=1 h(zi )/p(zi )                      L
                ∀h ∈ Cψ ,          M             M
                                                                    − π(h)        −→ N (0, Γ(h))
                                                 i=1 1/p(zi )


       Theorem

       Under the assumption that π(p) > 0, the following convergence
       property holds:
          ii) If, in addition, h2 /p ∈ Cϕ and h ∈ Cψ , then
                       √   k          L
                        M(δM − π(h)) −→M→∞ N (0, Vk [h − π(h)]) , (◮C LT)

               where Vk (h) := π(p)                        ˆ
                                                    π(dz)V ξik z h2 (z)p(z) + Γ(h) .
                                                                                                               17 / 24
Introduction         Some properties of the HM algorithm    Rao–Blackwellisation   Illustrations   Conclusion


Asymptotic results

       We will need some additional assumptions. Assume a maximal
       inequality for the Markov chain (zi )i : there exists a measurable
       function ζ such that for any starting point x ,
                                                             
                                       i
                                                                  NCh (x )
              ∀h ∈ Cζ , Px  sup          [h(zi ) − π (h)] > ǫ ≤
                                                    ˜
                               0≤i≤N                                 ǫ2
                                                     j=0


       Theorem
       Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume
       moreover that
                              √   0         L
                               M δM − π(h) −→ N (0, V0 [h − π(h)]) .

       Then, for any starting point x ,

                                N
                                t=1 h(x (t) )               L
                     MN                       − π(h)       −→N→∞ N (0, V0 [h − π(h)]) ,
                                    N
                                                                                                        18 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion


Asymptotic results

       We will need some additional assumptions. Assume a maximal
       inequality for the Markov chain (zi )i : there exists a measurable
       function ζ such that for any starting point x ,
                                                             
                                       i
                                                                  NCh (x )
              ∀h ∈ Cζ , Px  sup          [h(zi ) − π (h)] > ǫ ≤
                                                    ˜
                               0≤i≤N                                 ǫ2
                                                     j=0

       Moreover, assume that ∃φ ≥ 1 such that for any starting point x ,
                           ∀h ∈ Cφ ,        ˜            P
                                            Q n (x , h) −→ π (h) = π(ph)/π(p) ,
                                                           ˜


       Theorem
       Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume
       moreover that
                              √   0         L
                               M δM − π(h) −→ N (0, V0 [h − π(h)]) .

       Then, for any starting point x ,
                                                                                                       18 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion


Asymptotic results

       We will need some additional assumptions. Assume a maximal
       inequality for the Markov chain (zi )i : there exists a measurable
       function ζ such that for any starting point x ,
                                                             
                                       i
                                                                  NCh (x )
              ∀h ∈ Cζ , Px  sup          [h(zi ) − π (h)] > ǫ ≤
                                                    ˜
                               0≤i≤N                                 ǫ2
                                                     j=0

       Moreover, assume that ∃φ ≥ 1 such that for any starting point x ,
                           ∀h ∈ Cφ ,        ˜            P
                                            Q n (x , h) −→ π (h) = π(ph)/π(p) ,
                                                           ˜


       Theorem
       Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume
       moreover that
                              √   0         L
                               M δM − π(h) −→ N (0, V0 [h − π(h)]) .

       Then, for any starting point x ,
                                                                                                       18 / 24
Introduction         Some properties of the HM algorithm       Rao–Blackwellisation       Illustrations   Conclusion


Asymptotic results

                                                                                     
                                                      i
                                                                                          NCh (x )
                 ∀h ∈ Cζ ,        Px  sup                 [h(zi ) − π (h)] > ǫ ≤
                                                                     ˜
                                           0≤i≤N j=0                                        ǫ2

                           ∀h ∈ Cφ ,        ˜            P
                                            Q n (x , h) −→ π (h) = π(ph)/π(p) ,
                                                           ˜


       Theorem
       Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume
       moreover that
                              √   0         L
                               M δM − π(h) −→ N (0, V0 [h − π(h)]) .

       Then, for any starting point x ,

                                N
                                t=1 h(x (t) )                  L
                     MN                       − π(h)          −→N→∞ N (0, V0 [h − π(h)]) ,
                                    N

                                                                                                               18 / 24
Introduction         Some properties of the HM algorithm    Rao–Blackwellisation   Illustrations   Conclusion


Asymptotic results



       Theorem
       Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume
       moreover that
                              √   0         L
                               M δM − π(h) −→ N (0, V0 [h − π(h)]) .

       Then, for any starting point x ,

                                N
                                t=1 h(x (t) )               L
                     MN                       − π(h)       −→N→∞ N (0, V0 [h − π(h)]) ,
                                    N

       where MN is defined by
                                             MN                MN +1
                                                   ˆ
                                                   ξi0 ≤ N <           ˆ
                                                                       ξi0 .                       (3)
                                             i=1                i=1


                                                                                                         18 / 24
Introduction         Some properties of the HM algorithm    Rao–Blackwellisation   Illustrations   Conclusion


Asymptotic results



       Theorem
       Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume
       moreover that
                              √   0         L
                               M δM − π(h) −→ N (0, V0 [h − π(h)]) .

       Then, for any starting point x ,

                                N
                                t=1 h(x (t) )               L
                     MN                       − π(h)       −→N→∞ N (0, V0 [h − π(h)]) ,
                                    N

       where MN is defined by
                                             MN                MN +1
                                                   ˆ
                                                   ξi0 ≤ N <           ˆ
                                                                       ξi0 .                       (3)
                                             i=1                i=1


                                                                                                         18 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                       19 / 24
Introduction    Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




       Figure: Overlay of the variations of 250 iid realisations of the
       estimates δ (gold) and δ ∞ (grey) of E[X ] = 0 for 1000 iterations, along
       with the 90% interquantile range for the estimates δ (brown) and δ ∞
       (pink), in the setting of a random walk Gaussian proposal with scale
       τ = 10.

                                                                                                  20 / 24
Introduction   Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




       Figure: Overlay of the variations of 500 iid realisations of the
       estimates δ (deep grey), δ ∞ (medium grey) and of the importance
       sampling version (light grey) of E[X ] = 10 when X ∼ Exp(.1) for 100
       iterations, along with the 90% interquantile ranges (same colour
       code), in the setting of an independent exponential proposal with
       scale µ = 0.02.
                                                                                                 21 / 24
Introduction   Some properties of the HM algorithm   Rao–Blackwellisation     Illustrations   Conclusion




                                                               I|x−y |=1    if x > 0 ,
               π(x ) = β(1 − β)x and 2q(y |x ) =
                                                               I|y |≤1      if x = 0 .
       For this problem,

                       p(x ) = 1 − β/2 and r (x ) = 1 − β + β 2 /2 .

       We can therefore compute the gain in variance

                      p(x ) − r (x ) 2 − p(x )    β(1 − β)(2 + β)
                                               =2
                     2p(x ) − r (x ) p2 (x )      (2 − β 2 )(2 − β)2

       which is optimal for β = 0.174, leading to a gain of 0.578 while the
       relative gain in variance is

                        p(x ) − r (x ) 2 − p(x )    (1 − β)(2 + β)
                                                  =
                        2p(x ) − r (x ) 1 − p(x )      (2 − β 2 )

       which is decreasing in β.
                                                                                                   22 / 24
Introduction         Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




Outline


       1       Introduction

       2       Some properties of the HM algorithm

       3       Rao–Blackwellisation
                 Variance reduction
                 Asymptotic results

       4       Illustrations

       5       Conclusion



                                                                                                       23 / 24
Introduction        Some properties of the HM algorithm   Rao–Blackwellisation   Illustrations   Conclusion




               a) Rao Blackwellisation of any HM algorithm with a controled
                  amount of additional calculation.
               b) Link with the importance sampling of Markov chains.
               c) Analysis with asymptotic results on triangular arrays.




                                                                                                      24 / 24

Weitere ähnliche Inhalte

Was ist angesagt?

Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Scienceresearchinventy
 
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation ModelDesign of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation ModelIDES Editor
 
Branch and bound technique
Branch and bound techniqueBranch and bound technique
Branch and bound techniqueishmecse13
 
CPSC 125 Ch 1 sec 4
CPSC 125 Ch 1 sec 4CPSC 125 Ch 1 sec 4
CPSC 125 Ch 1 sec 4David Wood
 
A brief introduction to Hartree-Fock and TDDFT
A brief introduction to Hartree-Fock and TDDFTA brief introduction to Hartree-Fock and TDDFT
A brief introduction to Hartree-Fock and TDDFTJiahao Chen
 
Algorithm review
Algorithm reviewAlgorithm review
Algorithm reviewchidabdu
 
Parellelism in spectral methods
Parellelism in spectral methodsParellelism in spectral methods
Parellelism in spectral methodsRamona Corman
 
Mathematics and applications of the Hartley and Fourier Transforms
Mathematics and applications of the Hartley and Fourier TransformsMathematics and applications of the Hartley and Fourier Transforms
Mathematics and applications of the Hartley and Fourier Transformsmmjalbiaty
 
Particle filtering
Particle filteringParticle filtering
Particle filteringWei Wang
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Jyokyo-kai-20120605
Jyokyo-kai-20120605Jyokyo-kai-20120605
Jyokyo-kai-20120605ketanaka
 
General relativity 2010
General relativity 2010General relativity 2010
General relativity 20101physics4me
 

Was ist angesagt? (20)

Beamerpresentation
BeamerpresentationBeamerpresentation
Beamerpresentation
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation ModelDesign of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
Design of a Pseudo-Random Binary Code Generator via a Developed Simulation Model
 
Branch and bound technique
Branch and bound techniqueBranch and bound technique
Branch and bound technique
 
CPSC 125 Ch 1 sec 4
CPSC 125 Ch 1 sec 4CPSC 125 Ch 1 sec 4
CPSC 125 Ch 1 sec 4
 
A brief introduction to Hartree-Fock and TDDFT
A brief introduction to Hartree-Fock and TDDFTA brief introduction to Hartree-Fock and TDDFT
A brief introduction to Hartree-Fock and TDDFT
 
Algorithm review
Algorithm reviewAlgorithm review
Algorithm review
 
Parellelism in spectral methods
Parellelism in spectral methodsParellelism in spectral methods
Parellelism in spectral methods
 
Mathematics and applications of the Hartley and Fourier Transforms
Mathematics and applications of the Hartley and Fourier TransformsMathematics and applications of the Hartley and Fourier Transforms
Mathematics and applications of the Hartley and Fourier Transforms
 
Particle filtering
Particle filteringParticle filtering
Particle filtering
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
CVC Seminar
CVC SeminarCVC Seminar
CVC Seminar
 
Jyokyo-kai-20120605
Jyokyo-kai-20120605Jyokyo-kai-20120605
Jyokyo-kai-20120605
 
Bq25399403
Bq25399403Bq25399403
Bq25399403
 
Daubechies wavelets
Daubechies waveletsDaubechies wavelets
Daubechies wavelets
 
Introduction to DFT Part 1
Introduction to DFT Part 1 Introduction to DFT Part 1
Introduction to DFT Part 1
 
paper
paperpaper
paper
 
Vertex frequency
Vertex frequencyVertex frequency
Vertex frequency
 
General relativity 2010
General relativity 2010General relativity 2010
General relativity 2010
 
www.ijerd.com
www.ijerd.comwww.ijerd.com
www.ijerd.com
 

Andere mochten auch

MCMC and likelihood-free methods
MCMC and likelihood-free methodsMCMC and likelihood-free methods
MCMC and likelihood-free methodsChristian Robert
 
Optimal scaling in MTM algorithms
Optimal scaling in MTM algorithmsOptimal scaling in MTM algorithms
Optimal scaling in MTM algorithmsrandaldouc
 
Rao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithms
Rao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithmsRao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithms
Rao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithmsChristian Robert
 
Introduction to MCMC methods
Introduction to MCMC methodsIntroduction to MCMC methods
Introduction to MCMC methodsChristian Robert
 
OPASS - Microsoft Test Manager
OPASS - Microsoft Test ManagerOPASS - Microsoft Test Manager
OPASS - Microsoft Test ManagerEsteban Garcia
 
solution manual of goldsmith wireless communication
solution manual of goldsmith wireless communicationsolution manual of goldsmith wireless communication
solution manual of goldsmith wireless communicationNIT Raipur
 
Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)maditabalnco
 
The Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post FormatsThe Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post FormatsBarry Feldman
 
The Outcome Economy
The Outcome EconomyThe Outcome Economy
The Outcome EconomyHelge Tennø
 

Andere mochten auch (11)

MCMC and likelihood-free methods
MCMC and likelihood-free methodsMCMC and likelihood-free methods
MCMC and likelihood-free methods
 
By BIRASA FABRICE
By BIRASA FABRICEBy BIRASA FABRICE
By BIRASA FABRICE
 
Deep Dive Modern Apps Lifecycle with Visual Studio 2012: Software Testing wit...
Deep Dive Modern Apps Lifecycle with Visual Studio 2012: Software Testing wit...Deep Dive Modern Apps Lifecycle with Visual Studio 2012: Software Testing wit...
Deep Dive Modern Apps Lifecycle with Visual Studio 2012: Software Testing wit...
 
Optimal scaling in MTM algorithms
Optimal scaling in MTM algorithmsOptimal scaling in MTM algorithms
Optimal scaling in MTM algorithms
 
Rao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithms
Rao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithmsRao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithms
Rao-Blackwellisation schemes for accelerating Metropolis-Hastings algorithms
 
Introduction to MCMC methods
Introduction to MCMC methodsIntroduction to MCMC methods
Introduction to MCMC methods
 
OPASS - Microsoft Test Manager
OPASS - Microsoft Test ManagerOPASS - Microsoft Test Manager
OPASS - Microsoft Test Manager
 
solution manual of goldsmith wireless communication
solution manual of goldsmith wireless communicationsolution manual of goldsmith wireless communication
solution manual of goldsmith wireless communication
 
Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)Reuters: Pictures of the Year 2016 (Part 2)
Reuters: Pictures of the Year 2016 (Part 2)
 
The Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post FormatsThe Six Highest Performing B2B Blog Post Formats
The Six Highest Performing B2B Blog Post Formats
 
The Outcome Economy
The Outcome EconomyThe Outcome Economy
The Outcome Economy
 

Ähnlich wie Dr09 Slide

Ähnlich wie Dr09 Slide (8)

Vanilla rao blackwellisation
Vanilla rao blackwellisationVanilla rao blackwellisation
Vanilla rao blackwellisation
 
Monte Carlo Statistical Methods
Monte Carlo Statistical MethodsMonte Carlo Statistical Methods
Monte Carlo Statistical Methods
 
Quasi Lie systems and applications
Quasi Lie systems and applicationsQuasi Lie systems and applications
Quasi Lie systems and applications
 
Hastings 1970
Hastings 1970Hastings 1970
Hastings 1970
 
Demo
DemoDemo
Demo
 
Demo
DemoDemo
Demo
 
Demo
DemoDemo
Demo
 
Bathymetry smoothing in ROMS: A new approach
Bathymetry smoothing in ROMS: A new approachBathymetry smoothing in ROMS: A new approach
Bathymetry smoothing in ROMS: A new approach
 

Kürzlich hochgeladen

What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 

Kürzlich hochgeladen (20)

What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 

Dr09 Slide

  • 1. A vanilla Rao–Blackwellisation of Metropolis–Hastings algorithms Randal DOUC and Christian ROBERT Telecom SudParis, France randal.douc@it-sudparis.eu April 2009 1 / 24
  • 2. Main themes 1 Rao–Blackwellisation on MCMC. 2 Can be performed in any Hastings Metropolis algorithm. 3 Asymptotically more efficient to usual MCMC with a controlled amount of calculations. 2 / 24
  • 3. Main themes 1 Rao–Blackwellisation on MCMC. 2 Can be performed in any Hastings Metropolis algorithm. 3 Asymptotically more efficient to usual MCMC with a controlled amount of calculations. 2 / 24
  • 4. Main themes 1 Rao–Blackwellisation on MCMC. 2 Can be performed in any Hastings Metropolis algorithm. 3 Asymptotically more efficient to usual MCMC with a controlled amount of calculations. 2 / 24
  • 5. Main themes 1 Rao–Blackwellisation on MCMC. 2 Can be performed in any Hastings Metropolis algorithm. 3 Asymptotically more efficient to usual MCMC with a controlled amount of calculations. 2 / 24
  • 6. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 3 / 24
  • 7. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 3 / 24
  • 8. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 3 / 24
  • 9. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 3 / 24
  • 10. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 3 / 24
  • 11. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 4 / 24
  • 12. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hastings algorithm 1 We wish to approximate h(x )π(x )dx I= = h(x )¯ (x )dx π π(x )dx 2 x → π(x ) is known but not π(x )dx . 1 n 3 Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov chain with limiting distribution π . ¯ 4 Convergence obtained from Law of Large Numbers or CLT for Markov chains. 5 / 24
  • 13. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hastings algorithm 1 We wish to approximate h(x )π(x )dx I= = h(x )¯ (x )dx π π(x )dx 2 x → π(x ) is known but not π(x )dx . 1 n 3 Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov chain with limiting distribution π . ¯ 4 Convergence obtained from Law of Large Numbers or CLT for Markov chains. 5 / 24
  • 14. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hastings algorithm 1 We wish to approximate h(x )π(x )dx I= = h(x )¯ (x )dx π π(x )dx 2 x → π(x ) is known but not π(x )dx . 1 n 3 Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov chain with limiting distribution π . ¯ 4 Convergence obtained from Law of Large Numbers or CLT for Markov chains. 5 / 24
  • 15. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hastings algorithm 1 We wish to approximate h(x )π(x )dx I= = h(x )¯ (x )dx π π(x )dx 2 x → π(x ) is known but not π(x )dx . 1 n 3 Approximate I with δ = n t=1 h(x (t) ) where (x (t) ) is a Markov chain with limiting distribution π . ¯ 4 Convergence obtained from Law of Large Numbers or CLT for Markov chains. 5 / 24
  • 16. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hasting Algorithm Suppose that x (t) is drawn. 1 Simulate yt ∼ q(·|x (t) ). 2 Set x (t+1) = yt with probability π(yt ) q(x (t) |yt ) α(x (t) , yt ) = min 1, π(x (t) ) q(yt |x (t) ) Otherwise, set x (t+1) = x (t) . 3 α is such that the detailed balance equation is satisfied: ⊲ π is ¯ the stationary distribution of (x (t) ). ◮ The accepted candidates are simulated with the rejection algorithm. 6 / 24
  • 17. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hasting Algorithm Suppose that x (t) is drawn. 1 Simulate yt ∼ q(·|x (t) ). 2 Set x (t+1) = yt with probability π(yt ) q(x (t) |yt ) α(x (t) , yt ) = min 1, π(x (t) ) q(yt |x (t) ) Otherwise, set x (t+1) = x (t) . 3 α is such that the detailed balance equation is satisfied: ⊲ π is ¯ the stationary distribution of (x (t) ). ◮ The accepted candidates are simulated with the rejection algorithm. 6 / 24
  • 18. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hasting Algorithm Suppose that x (t) is drawn. 1 Simulate yt ∼ q(·|x (t) ). 2 Set x (t+1) = yt with probability π(yt ) q(x (t) |yt ) α(x (t) , yt ) = min 1, π(x (t) ) q(yt |x (t) ) Otherwise, set x (t+1) = x (t) . 3 α is such that the detailed balance equation is satisfied: π(x )q(y |x )α(x , y ) = π(y )q(x |y )α(y , x ). ⊲ π is the stationary distribution of (x (t) ). ¯ ◮ The accepted candidates are simulated with the rejection algorithm. 6 / 24
  • 19. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Metropolis Hasting Algorithm Suppose that x (t) is drawn. 1 Simulate yt ∼ q(·|x (t) ). 2 Set x (t+1) = yt with probability π(yt ) q(x (t) |yt ) α(x (t) , yt ) = min 1, π(x (t) ) q(yt |x (t) ) Otherwise, set x (t+1) = x (t) . 3 α is such that the detailed balance equation is satisfied: π(x )q(y |x )α(x , y ) = π(y )q(x |y )α(y , x ). ⊲ π is the stationary distribution of (x (t) ). ¯ ◮ The accepted candidates are simulated with the rejection algorithm. 6 / 24
  • 20. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 7 / 24
  • 21. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion 1 Alternative representation of the estimator δ is n MN 1 (t) 1 δ= h(x ) = ni h(zi ) , n N t=1 i=1 where zi ’s are the accepted yj ’s, MN is the number of accepted yj ’s till time N, ni is the number of times zi appears in the sequence (x (t) )t . 8 / 24
  • 22. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion α(zi , ·) q(·|zi ) q(·|zi ) ˜ q (·|zi ) = ≤ , p(zi ) p(zi ) where p(zi ) = ˜ α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ): 1 Propose a candidate y ∼ q(·|zi ) 2 Accept with probability q(y |zi ) ˜ q (y |zi )/ = α(zi , y ) p(zi ) Otherwise, reject it and starts again. 3 ◮ this is the transition of the HM algorithm. ˜ The transition kernel q admits π as a stationary distribution: ˜ ˜ ˜ π (x )q (y |x ) = 9 / 24
  • 23. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion α(zi , ·) q(·|zi ) q(·|zi ) ˜ q (·|zi ) = ≤ , p(zi ) p(zi ) where p(zi ) = ˜ α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ): 1 Propose a candidate y ∼ q(·|zi ) 2 Accept with probability q(y |zi ) ˜ q (y |zi )/ = α(zi , y ) p(zi ) Otherwise, reject it and starts again. 3 ◮ this is the transition of the HM algorithm. ˜ The transition kernel q admits π as a stationary distribution: ˜ ˜ ˜ π (x )q (y |x ) = 9 / 24
  • 24. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion α(zi , ·) q(·|zi ) q(·|zi ) ˜ q (·|zi ) = ≤ , p(zi ) p(zi ) where p(zi ) = ˜ α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ): 1 Propose a candidate y ∼ q(·|zi ) 2 Accept with probability q(y |zi ) ˜ q (y |zi )/ = α(zi , y ) p(zi ) Otherwise, reject it and starts again. 3 ◮ this is the transition of the HM algorithm. ˜ The transition kernel q admits π as a stationary distribution: ˜ ˜ ˜ π (x )q (y |x ) = 9 / 24
  • 25. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion α(zi , ·) q(·|zi ) q(·|zi ) ˜ q (·|zi ) = ≤ , p(zi ) p(zi ) where p(zi ) = ˜ α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ): 1 Propose a candidate y ∼ q(·|zi ) 2 Accept with probability q(y |zi ) ˜ q (y |zi )/ = α(zi , y ) p(zi ) Otherwise, reject it and starts again. 3 ◮ this is the transition of the HM algorithm. ˜ The transition kernel q admits π as a stationary distribution: ˜ π(x )p(x ) α(x , y )q(y |x ) ˜ ˜ π (x )q (y |x ) = π(u)p(u)du p(x ) π (x) ˜ ˜ q (y |x) 9 / 24
  • 26. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion α(zi , ·) q(·|zi ) q(·|zi ) ˜ q (·|zi ) = ≤ , p(zi ) p(zi ) where p(zi ) = ˜ α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ): 1 Propose a candidate y ∼ q(·|zi ) 2 Accept with probability q(y |zi ) ˜ q (y |zi )/ = α(zi , y ) p(zi ) Otherwise, reject it and starts again. 3 ◮ this is the transition of the HM algorithm. ˜ The transition kernel q admits π as a stationary distribution: ˜ π(x )α(x , y )q(y |x ) ˜ ˜ π (x )q (y |x ) = π(u)p(u)du 9 / 24
  • 27. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion α(zi , ·) q(·|zi ) q(·|zi ) ˜ q (·|zi ) = ≤ , p(zi ) p(zi ) where p(zi ) = ˜ α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ): 1 Propose a candidate y ∼ q(·|zi ) 2 Accept with probability q(y |zi ) ˜ q (y |zi )/ = α(zi , y ) p(zi ) Otherwise, reject it and starts again. 3 ◮ this is the transition of the HM algorithm. ˜ The transition kernel q admits π as a stationary distribution: ˜ π(y )α(y , x )q(x |y ) ˜ ˜ π (x )q (y |x ) = π(u)p(u)du 9 / 24
  • 28. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion α(zi , ·) q(·|zi ) q(·|zi ) ˜ q (·|zi ) = ≤ , p(zi ) p(zi ) where p(zi ) = ˜ α(zi , y ) q(y |zi )dy . To simulate according to q (·|zi ): 1 Propose a candidate y ∼ q(·|zi ) 2 Accept with probability q(y |zi ) ˜ q (y |zi )/ = α(zi , y ) p(zi ) Otherwise, reject it and starts again. 3 ◮ this is the transition of the HM algorithm. ˜ The transition kernel q admits π as a stationary distribution: ˜ ˜ ˜ ˜ ˜ π (x )q (y |x ) = π (y )q (x |y ) , 9 / 24
  • 29. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Lemme The sequence (zi , ni ) satisfies 1 (zi , ni )i is a Markov chain; 2 zi+1 and ni are independent given zi ; 3 ni is distributed as a geometric random variable with probability parameter p(zi ) := α(zi , y ) q(y |zi ) dy ; (1) 4 (zi )i is a Markov chain with transition kernel ˜ ˜ Q(z, dy ) = q (y |z)dy and stationary distribution π such that ˜ ˜ q (·|z) ∝ α(z, ·) q(·|z) and π (·) ∝ π(·)p(·) . ˜ 10 / 24
  • 30. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Lemme The sequence (zi , ni ) satisfies 1 (zi , ni )i is a Markov chain; 2 zi+1 and ni are independent given zi ; 3 ni is distributed as a geometric random variable with probability parameter p(zi ) := α(zi , y ) q(y |zi ) dy ; (1) 4 (zi )i is a Markov chain with transition kernel ˜ ˜ Q(z, dy ) = q (y |z)dy and stationary distribution π such that ˜ ˜ q (·|z) ∝ α(z, ·) q(·|z) and π (·) ∝ π(·)p(·) . ˜ 10 / 24
  • 31. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Lemme The sequence (zi , ni ) satisfies 1 (zi , ni )i is a Markov chain; 2 zi+1 and ni are independent given zi ; 3 ni is distributed as a geometric random variable with probability parameter p(zi ) := α(zi , y ) q(y |zi ) dy ; (1) 4 (zi )i is a Markov chain with transition kernel ˜ ˜ Q(z, dy ) = q (y |z)dy and stationary distribution π such that ˜ ˜ q (·|z) ∝ α(z, ·) q(·|z) and π (·) ∝ π(·)p(·) . ˜ 10 / 24
  • 32. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Lemme The sequence (zi , ni ) satisfies 1 (zi , ni )i is a Markov chain; 2 zi+1 and ni are independent given zi ; 3 ni is distributed as a geometric random variable with probability parameter p(zi ) := α(zi , y ) q(y |zi ) dy ; (1) 4 (zi )i is a Markov chain with transition kernel ˜ ˜ Q(z, dy ) = q (y |z)dy and stationary distribution π such that ˜ ˜ q (·|z) ∝ α(z, ·) q(·|z) and π (·) ∝ π(·)p(·) . ˜ 10 / 24
  • 33. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion zi−1 11 / 24
  • 34. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion indep zi−1 zi indep ni−1 11 / 24
  • 35. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion indep indep zi−1 zi zi+1 indep indep ni−1 ni 11 / 24
  • 36. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion indep indep zi−1 zi zi+1 indep indep ni−1 ni n MN 1 1 δ= h(x (t) ) = ni h(zi ) . n N t=1 i=1 11 / 24
  • 37. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion indep indep zi−1 zi zi+1 indep indep ni−1 ni n MN 1 1 δ= h(x (t) ) = ni h(zi ) . n N t=1 i=1 11 / 24
  • 38. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 12 / 24
  • 39. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion 1 A natural idea: MN 1 h(zi ) δ∗ = , N p(zi ) i=1 13 / 24
  • 40. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion 1 A natural idea: MN h(zi ) MN π(zi ) i=1 i=1 h(zi ) p(zi ) π (zi ) ˜ δ∗ ≃ = . MN 1 MN π(zi ) i=1 i=1 p(zi ) π (zi ) ˜ 13 / 24
  • 41. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion 1 A natural idea: MN h(zi ) MN π(zi ) i=1 i=1 h(zi ) ∗ p(zi ) π (zi ) ˜ δ ≃ = . MN 1 MN π(zi ) i=1 i=1 p(zi ) π (zi ) ˜ 2 But p not available in closed form. 13 / 24
  • 42. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion 1 A natural idea: MN h(zi ) MN π(zi ) i=1 i=1 h(zi ) ∗ p(zi ) π (zi ) ˜ δ ≃ = . MN 1 MN π(zi ) i=1 i=1 p(zi ) π (zi ) ˜ 2 But p not available in closed form. 3 The geometric ni is the obvious solution that is used in the original Metropolis–Hastings estimate. 13 / 24
  • 43. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion 1 A natural idea: MN h(zi ) MN π(zi ) i=1 i=1 h(zi ) p(zi ) π (zi ) ˜ δ∗ ≃ = . MN 1 MN π(zi ) i=1 i=1 p(zi ) π (zi ) ˜ 2 But p not available in closed form. 3 The geometric ni is the obvious solution that is used in the original Metropolis–Hastings estimate. ∞ ni = 1 + I {uℓ ≥ α(zi , yℓ )} , j=1 ℓ≤j 13 / 24
  • 44. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion ∞ ni = 1 + I {uℓ ≥ α(zi , yℓ )} , j=1 ℓ≤j Lemma If (yj )j is an iid sequence with distribution q(y |zi ), the quantity ∞ ˆ ξi = 1 + {1 − α(zi , yℓ )} j=1 ℓ≤j is an unbiased estimator of 1/p(zi ) which variance, conditional on zi , is lower than the conditional variance of ni , {1 − p(zi )}/p2 (zi ). 13 / 24
  • 45. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion ∞ ˆ ξi = 1 + {1 − α(zi , yℓ )} j=1 ℓ≤j 1 Infinite sum but sometimes finite: π(yt ) q(x (t) |yt ) α(x (t) , yt ) = min 1, π(x (t) ) q(yt |x (t) ) For example: take a symetric random walk as a proposal. 2 What if we wish to be sure that the sum is finite? 14 / 24
  • 46. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction Proposition If (yj )j is an iid sequence with distribution q(y |zi ) and (uj )j is an iid uniform sequence, for any k ≥ 0, the quantity ∞ ˆ ξik = 1 + {1 − α(zi , yj )} I {uℓ ≥ α(zi , yℓ )} (2) j=1 1≤ℓ≤k ∧j k +1≤ℓ≤j is an unbiased estimator of 1/p(zi ) with an almost sure finite number of terms. 15 / 24
  • 47. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction Proposition If (yj )j is an iid sequence with distribution q(y |zi ) and (uj )j is an iid uniform sequence, for any k ≥ 0, the quantity ∞ ˆ ξik = 1 + {1 − α(zi , yj )} I {uℓ ≥ α(zi , yℓ )} (2) j=1 1≤ℓ≤k ∧j k +1≤ℓ≤j is an unbiased estimator of 1/p(zi ) with an almost sure finite number of terms. Moreover, for k ≥ 1, ˆ 1 − p(zi ) 1 − (1 − 2p(zi ) + r (zi ))k 2 − p(zi ) V ξik zi = − (p(zi )−r (zi )) , p2 (zi ) 2p(zi ) − r (zi ) p2 (zi ) where p(zi ) := α(zi , y ) q(y |zi ) dy . and r (zi ) := α2 (zi , y ) q(y |zi ) dy . 15 / 24
  • 48. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction Proposition If (yj )j is an iid sequence with distribution q(y |zi ) and (uj )j is an iid uniform sequence, for any k ≥ 0, the quantity ∞ ˆ ξik = 1 + {1 − α(zi , yj )} I {uℓ ≥ α(zi , yℓ )} (2) j=1 1≤ℓ≤k ∧j k +1≤ℓ≤j is an unbiased estimator of 1/p(zi ) with an almost sure finite number of terms. Therefore, we have ˆ ˆ ˆ V ξi zi ≤ V ξik zi ≤ V ξi0 zi = V [ni | zi ] . 15 / 24
  • 49. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction zi−1 ∞ ˆ ξik = 1 + {1 − α(zi , yj )} I {uℓ ≥ α(zi , yℓ )} (3) j=1 1≤ℓ≤k ∧j k +1≤ℓ≤j 16 / 24
  • 50. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction not indep zi−1 zi not indep ˆk ξi−1 ∞ ˆ ξik = 1 + {1 − α(zi , yj )} I {uℓ ≥ α(zi , yℓ )} (3) j=1 1≤ℓ≤k ∧j k +1≤ℓ≤j 16 / 24
  • 51. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction not indep not indep zi−1 zi zi+1 not indep not indep ˆk ξi−1 ˆ ξik ∞ ˆ ξik = 1 + {1 − α(zi , yj )} I {uℓ ≥ α(zi , yℓ )} (3) j=1 1≤ℓ≤k ∧j k +1≤ℓ≤j 16 / 24
  • 52. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction not indep not indep zi−1 zi zi+1 not indep not indep ˆk ξi−1 ˆ ξik M ˆk k i=1 ξi h(zi ) δM = M ˆk . i=1 ξi 16 / 24
  • 53. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Variance reduction not indep not indep zi−1 zi zi+1 not indep not indep ˆk ξi−1 ˆ ξik M ˆk k i=1 ξi h(zi ) δM = M ˆk . i=1 ξi 16 / 24
  • 54. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results Let M ˆk k i=1 ξi h(zi ) δM = M ˆk . i=1 ξi For any positive function ϕ, we denote Cϕ = {h; |h/ϕ|∞ < ∞}. 17 / 24
  • 55. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results Let M ˆk k i=1 ξi h(zi ) δM = M ˆk . i=1 ξi For any positive function ϕ, we denote Cϕ = {h; |h/ϕ|∞ < ∞}. Assume that there exist a positive function ϕ ≥ 1 such that M i=1 h(zi )/p(zi ) P ∀h ∈ Cϕ , M −→ π(h) (3) i=1 1/p(zi ) Theorem Under the assumption that π(p) > 0, the following convergence property holds: i) If h is in Cϕ , then k P δM −→M→∞ π(h) (◮C ONSISTENCY) 17 / 24
  • 56. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results Let M ˆk k i=1 ξi h(zi ) δM = M ˆk . i=1 ξi For any positive function ϕ, we denote Cϕ = {h; |h/ϕ|∞ < ∞}. Assume that there exist a positive function ψ such that √ M i=1 h(zi )/p(zi ) L ∀h ∈ Cψ , M M − π(h) −→ N (0, Γ(h)) i=1 1/p(zi ) Theorem Under the assumption that π(p) > 0, the following convergence property holds: ii) If, in addition, h2 /p ∈ Cϕ and h ∈ Cψ , then √ k L M(δM − π(h)) −→M→∞ N (0, Vk [h − π(h)]) , (◮C LT) where Vk (h) := π(p) ˆ π(dz)V ξik z h2 (z)p(z) + Γ(h) . 17 / 24
  • 57. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results We will need some additional assumptions. Assume a maximal inequality for the Markov chain (zi )i : there exists a measurable function ζ such that for any starting point x ,   i NCh (x ) ∀h ∈ Cζ , Px  sup [h(zi ) − π (h)] > ǫ ≤ ˜ 0≤i≤N ǫ2 j=0 Theorem Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume moreover that √ 0 L M δM − π(h) −→ N (0, V0 [h − π(h)]) . Then, for any starting point x , N t=1 h(x (t) ) L MN − π(h) −→N→∞ N (0, V0 [h − π(h)]) , N 18 / 24
  • 58. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results We will need some additional assumptions. Assume a maximal inequality for the Markov chain (zi )i : there exists a measurable function ζ such that for any starting point x ,   i NCh (x ) ∀h ∈ Cζ , Px  sup [h(zi ) − π (h)] > ǫ ≤ ˜ 0≤i≤N ǫ2 j=0 Moreover, assume that ∃φ ≥ 1 such that for any starting point x , ∀h ∈ Cφ , ˜ P Q n (x , h) −→ π (h) = π(ph)/π(p) , ˜ Theorem Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume moreover that √ 0 L M δM − π(h) −→ N (0, V0 [h − π(h)]) . Then, for any starting point x , 18 / 24
  • 59. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results We will need some additional assumptions. Assume a maximal inequality for the Markov chain (zi )i : there exists a measurable function ζ such that for any starting point x ,   i NCh (x ) ∀h ∈ Cζ , Px  sup [h(zi ) − π (h)] > ǫ ≤ ˜ 0≤i≤N ǫ2 j=0 Moreover, assume that ∃φ ≥ 1 such that for any starting point x , ∀h ∈ Cφ , ˜ P Q n (x , h) −→ π (h) = π(ph)/π(p) , ˜ Theorem Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume moreover that √ 0 L M δM − π(h) −→ N (0, V0 [h − π(h)]) . Then, for any starting point x , 18 / 24
  • 60. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results   i NCh (x ) ∀h ∈ Cζ , Px  sup [h(zi ) − π (h)] > ǫ ≤ ˜ 0≤i≤N j=0 ǫ2 ∀h ∈ Cφ , ˜ P Q n (x , h) −→ π (h) = π(ph)/π(p) , ˜ Theorem Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume moreover that √ 0 L M δM − π(h) −→ N (0, V0 [h − π(h)]) . Then, for any starting point x , N t=1 h(x (t) ) L MN − π(h) −→N→∞ N (0, V0 [h − π(h)]) , N 18 / 24
  • 61. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results Theorem Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume moreover that √ 0 L M δM − π(h) −→ N (0, V0 [h − π(h)]) . Then, for any starting point x , N t=1 h(x (t) ) L MN − π(h) −→N→∞ N (0, V0 [h − π(h)]) , N where MN is defined by MN MN +1 ˆ ξi0 ≤ N < ˆ ξi0 . (3) i=1 i=1 18 / 24
  • 62. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Asymptotic results Theorem Assume that h is such that h/p ∈ Cζ and {Ch/p , h2 /p2 } ⊂ Cφ . Assume moreover that √ 0 L M δM − π(h) −→ N (0, V0 [h − π(h)]) . Then, for any starting point x , N t=1 h(x (t) ) L MN − π(h) −→N→∞ N (0, V0 [h − π(h)]) , N where MN is defined by MN MN +1 ˆ ξi0 ≤ N < ˆ ξi0 . (3) i=1 i=1 18 / 24
  • 63. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 19 / 24
  • 64. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Figure: Overlay of the variations of 250 iid realisations of the estimates δ (gold) and δ ∞ (grey) of E[X ] = 0 for 1000 iterations, along with the 90% interquantile range for the estimates δ (brown) and δ ∞ (pink), in the setting of a random walk Gaussian proposal with scale τ = 10. 20 / 24
  • 65. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Figure: Overlay of the variations of 500 iid realisations of the estimates δ (deep grey), δ ∞ (medium grey) and of the importance sampling version (light grey) of E[X ] = 10 when X ∼ Exp(.1) for 100 iterations, along with the 90% interquantile ranges (same colour code), in the setting of an independent exponential proposal with scale µ = 0.02. 21 / 24
  • 66. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion I|x−y |=1 if x > 0 , π(x ) = β(1 − β)x and 2q(y |x ) = I|y |≤1 if x = 0 . For this problem, p(x ) = 1 − β/2 and r (x ) = 1 − β + β 2 /2 . We can therefore compute the gain in variance p(x ) − r (x ) 2 − p(x ) β(1 − β)(2 + β) =2 2p(x ) − r (x ) p2 (x ) (2 − β 2 )(2 − β)2 which is optimal for β = 0.174, leading to a gain of 0.578 while the relative gain in variance is p(x ) − r (x ) 2 − p(x ) (1 − β)(2 + β) = 2p(x ) − r (x ) 1 − p(x ) (2 − β 2 ) which is decreasing in β. 22 / 24
  • 67. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion Outline 1 Introduction 2 Some properties of the HM algorithm 3 Rao–Blackwellisation Variance reduction Asymptotic results 4 Illustrations 5 Conclusion 23 / 24
  • 68. Introduction Some properties of the HM algorithm Rao–Blackwellisation Illustrations Conclusion a) Rao Blackwellisation of any HM algorithm with a controled amount of additional calculation. b) Link with the importance sampling of Markov chains. c) Analysis with asymptotic results on triangular arrays. 24 / 24