SlideShare ist ein Scribd-Unternehmen logo
1 von 7
Subproblem-Tree Calibration: A Unified
Approach to Max-Product Message Passing∗
- A
Review†
Prolok Sundaresan, Varad Meru
University of California, Irvine
{sunderap,vmeru}@uci.edu
Abstract
Max-product (max-sum) message passing algorithms are widely used
for MAP inference in MRFs. It has many variants and they share the
common theme of passing “messages” over some graph-object. Recent
advances revealed that its convergent versions can be viewed as perform-
ing block coordinate descent (BCD) in a dual objective. However, most
existing algorithms are limited to updating blocks selected restricted ways.
In this report, we review a unifying technique proposed by [1] that sub-
sumes MPLP, MSD, and TRW-S as special cases when applied to their
respective choices of dual objective and blocks, and is able to perform
BCD under much more flexible choices of blocks.
1 Introduction
The task of finsing MAP-MRF - Finding the most probable assignments for
MRFs (MPE), is know to be NP-Hard. And to overcome that, a large family
of methods based on solving a dual problem of an LP relaxation have come up.
But recently, convergent version of these algorithms were seen to be interpreted
as block coordinate descent (BCD) in the dual. Variants methods operate on
small blocks - Max-product linear programming algorithm (MPLP), max-sum
diffusion (MSD) and Tree-weighted max-product message passing (TRW-S), and
enforce some consistency constraint over the block of duals. It was observed that
these menthods count not be generalised easily due to strong consistency con-
straint requirement put forth by the methods themselves - which are sufficient
but not necessary.
The work in [1] aims to show that dual-optimality can be established on
a much broader choices of the dual objective and derives a “unified” message
passing algorithms in an arbitrary dual-decomposition. In the process, they
create a new representation called as graph-object (subproblem multi-graph, or
∗Wang, Huayan, and Koller Daphne. Subproblem-tree calibration: A unified approach
to max-product message passing. In Proceedings of the 30th International Conference on
Machine Learning (ICML-13), pp. 190-198. 2013.
†This work was does as a part of the project for CS 276: Belief Networks, Fall 2014, taught
by Prof. Rina Dechter.
1
SMG) and present a method which subsumes MPLP, MSD, and TRW-S while
achieving dual-optimality on blocks with flexible choices.
2 Background
2.1 MAP Inference
The MAP Inference problem over X and graph strcuture G = {V, E} can be
formulated as:
maximize
X
Θ(X) (1)
Where Θ(X) = α∈A θα(Xα); A is the set of MRF cliques. Without loss of
generality we choose A = V ∪ E for a parametrization with unary and pairwise
potentials. The authors use lowercase xi ∈ V al(Xi) and x = x1:N to denote
assignments to the variables.
2.2 LP relaxation and Dual decomposition
We now see MAP building on by solving a Linear programming(LP) relaxation
maximize
µ∈M
Θ · µ (2)
Where µ = {µi(xi), µij(xi, xj)|∀i, xi, (i, j), (xi, xj)}; Θ is all MRF parameters
{θi, θij} concatenated in same ordering as µ. Choosing different polytopes M
(such as Linear programs, max-product, and variational relaxation, etc.) results
in different LP relaxations. One choice is marginal polytope:
MG = {µ|∃p(X),
x∈V al(X)
p(x) = 1, p(X) ≥ 0, p(Xi) = µi, p(Xi, Xj) = µij, ∀i, (i, j)}
(3)
where p(Xi) and p(Xi, Xj) denote marginal distributions of p(X). When M =
MG, the LP relaxation 2 is equivalent to the original problem 1. Let us consider
a decomposition of Θ(X) into subproblems c ∈ C, parameterized by {Θc
}
∀x,
c∈C
Θc
(x|c) = Θ(x) (4)
Where x|c denotes restricting the joint assignment to the scope of subproblem c.
The subproblems could be single nodes, edges, trees, or cycles that are assumed
to be tractable. Note that 4 has exponentially many constraints. Enforcing
constraint by expressing reparameterization in terms of messages
Θc
= Θc
 +
c :Xc∩X c=∅
δc →c(Xc ∩ X c) (5)
where the messages satisfy δc →c = −δc→c . Each subproblem has its own copy
of variables Xc
2
2.3 Bethe cluster (region) Graph
It contains bipartite structure with one layer of “factor” nodes and one layer of
small (usually unary) nodes. Specifically, consider using the set of subproblem
C = i ∪ F consisting of unary subproblems i and larger subproblems (factors)
F. It has a restricted design due to historical concern of satisfying the ’running
intersection property’.
D( δf→i ) =
i
max
Xi
θi
(Xi
) +
f
max
Xf
Θf
(Xf
) (6)
Figure 1 shows the structure of Cluster and Bethe Cluster graphs.
(a) Markov Random
Field
(b) Cluster Graph (not Bethe
Cluster)
(c) Bethe Cluster Graph
Figure 1: Cluster and Bethe Graph
3 STC Framework
3.1 Subproblem multi-graph and subproblem tree
Given set of subproblems C, the subproblem multi-graph (SMG) G = (V, E)
has one node for each c ∈ C and one edge between c and c for each tuple (c,c ,ϕ),
where ϕ ∈ V ∪ E is shared by c and c . A subproblem multi-graph (SMG)
is a tree T ⊂ G
If we include all unary subproblems into the decomposition, we would get a
SMG similar to Fig: 2(b) but with extra edges among the non-unary subprob-
lems. So a tree in the Bethe cluster graph (which we call a Bethe tree) is also
a subproblem tree by definition.
For each SMG edge (c, c , ϕ) ∈ E, we have messages δc →c = −δc→c . There-
fore the block (of dual variables) associated with subproblem tree T is given
by:
BT
= {δc →c(Xϕ) : (c, c , ϕ) ∈ T }. (7)
3.2 Max-consistency and dual-optimal trees
Let us first define the dual-optimal with respect to a given block. Given a block
BT
associated with some subproblem tree T , we want to achieve dual-optimal
3
with respect to that block. The subproblem potentials Θc
are dual-optimal
on T if we can not further decrease the dual objective by changing messages
in BT
. Message passing algorithm achieves dual-optimality by enforcing some
Consistency Constraint. We now identify constraint that is equivalent to dual-
optimal on T : Assignments to all subproblems {xc
}c∈T agree on T , denoted as
xc
∼ T , if for ∀(c, c , ϕ) ∈ T , we have xc
ϕ = xc
ϕ .
{Θc
}c∈T satisfies weak max-consistency if
c∈T
max
Xc
Θc
(Xc
) = max
{Xc}∼T
c∈T
Θc
(Xc
) (8)
Thus, maximizing each subproblem independently gets to the same optimal
value as maximizing them while requiring the assignments to agree on the tree.
Now, let Mc
ϕ be the (log)-max-marginal of c on ϕ, then
Mc
ϕ(xϕ) = max
Xc|ϕ=xϕ
Θc
(Xc
) (9)
if ϕ = (i, j) ∈ E, Xc
|ϕ = xϕ means Xc
i = xi and Xc
j = xj.
{Θc
}c∈T satisfies strong max-consistency if Mc
ϕ = Mc
ϕ ∀(c, c , ϕ) ∈ T
The relations among these consistency constraints are (Proposition 1):
• For any Bethe tree T ,
MPLP max-consistency =⇒ Weak max-consistency
• For any subproblem tree T (including Bethe trees),
Strong max-consistency =⇒ Weak max-consistency.
Weak max-consistency ⇐⇒ Dual-optimal on T .
3.3 Subproblem tree calibration algorithm
The algorithm calibrates a subproblem-tree by an upstream pass and a down-
stream pass. Both update subproblem potentials “in place” without storing any
message.
Algorithm 1 STC ALgorithm
1: procedure STC
2: Given MRF
3: Split into subproblems (dual decomposition)
4: Build a multi-graph with a node for each subproblem (middle figure)
5: while t do
6: Randomly choose a subproblem-tree (right figure)
7: “Calibrate” the tree by max-product / min-sum message passing
8: end while
9: end procedure
4
(a) MRF (b) SMG (c) Spanning Tree of
SMG
Figure 2: Flow of the Algorithm: Start with (a) to generate (b) and randomly
selected (c) and ”Calibrate”
Figure 3: Flow of the Algorithm from [1]
3.3.1 Properties
Some of the observed properties of the algorithm
1. Each tree calibration is a block coordinate descent step for the dual prob-
lem.
2. The “block” corresponds to all edges in the subproblem-tree.
3. Subsumes MPLP, TRW-S, and max-sum diffusion as special cases.
4. Handles larger and more flexible “blocks” than these methods.
3.3.2 Choosing allocation weights
In the algorithm, while the upstream pass uses the BCD to optimize the values
across sub-problem.The downstream pass allocates ”energy” to all the subprob-
lems, based on their allocation weights. That is , the downstream pass essen-
tially moves around in a plateau of the dual, preparing for the next Therefore
the allocation weights determine where to settle on that plateau. With the help
of appropriate allocation weights, we can subsume both the MPLP and MSD
using STC algorithm.
5
3.3.3 General Primal solution
We can generate the primal solution in many different ways. One way is to
assign the values of the unary variables available as subproblem.We visit the
variables (in the original MRF) in someordering, for example, X1, X2, · · ·
XN.And when visiting each Xi, we choose its assignment to maximize the sum
of all max-marginals from all subproblems covering Xi . And then we fix Xi =
xi in all subproblem
4 Experimental Results
4.1 Experimental map inference tasks
1. The protein design benchmark
• 20 largest problems from that dataset
• Number of Variables - 101 to 180
• Number of Edges - 1973 to 3005
• Variable Cardinality - 154
2. Synthetic 20-by-20 grid
• Potentials from N(0, 1)
• Variable Cardinality - 100
3. ”Object detection” task from PIC-2011
• 37 problem instances
• Number of Variables - 60 / problem instance
• Number of Edges - 1770 / problem instance
• Variable Cardinality - 11 - 21
We observe that different methods tend to “converge” to different dual ob-
jectives, Even though the dual objectives in each plot should have exactly the
same optimal value.
5 Conclusion
The framework revealed two dimensions of flexibility in designing a message
passing algorithm for MAP inference: choosing blocks to update, and choosing
a dual state on a plateau in each BCD step. The STC algorithm can be ap-
plied with extreme flexibility in these choices. Finding principled and adaptive
strategies in making these choices will help us to design much more powerful
message passing algorithms.
6
Figure 4: Experimental Results
References
[1] Wang, Huayan, and Koller Daphne. Subproblem-tree calibration: A unified
approach to max-product message passing. In Proceedings of the 30th Inter-
national Conference on Machine Learning (ICML-13), pp. 190-198. 2013.
[2] Globerson, Amir and Jaakkola, Tommi. Fixing maxproduct: Convergent
message passing algorithms for MAP LP-relaxations. In NIPS, 2007
[3] Sontag, David and Jaakkola, Tommi. Tree block coordinate descent for MAP
in graphical models. AISTATS, 5:544–551, 2009.
7

Weitere ähnliche Inhalte

Was ist angesagt?

Machine learning in science and industry — day 1
Machine learning in science and industry — day 1Machine learning in science and industry — day 1
Machine learning in science and industry — day 1arogozhnikov
 
Clustering: Large Databases in data mining
Clustering: Large Databases in data miningClustering: Large Databases in data mining
Clustering: Large Databases in data miningZHAO Sam
 
Fuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering AlgorithmsFuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering AlgorithmsJustin Cletus
 
(DL hacks輪読)Bayesian Neural Network
(DL hacks輪読)Bayesian Neural Network(DL hacks輪読)Bayesian Neural Network
(DL hacks輪読)Bayesian Neural NetworkMasahiro Suzuki
 
Machine learning in science and industry — day 2
Machine learning in science and industry — day 2Machine learning in science and industry — day 2
Machine learning in science and industry — day 2arogozhnikov
 
DSP IEEE paper
DSP IEEE paperDSP IEEE paper
DSP IEEE paperprreiya
 
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural NetworksMasahiro Suzuki
 
(研究会輪読) Weight Uncertainty in Neural Networks
(研究会輪読) Weight Uncertainty in Neural Networks(研究会輪読) Weight Uncertainty in Neural Networks
(研究会輪読) Weight Uncertainty in Neural NetworksMasahiro Suzuki
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderCSCJournals
 
Fuzzieee-98-final
Fuzzieee-98-finalFuzzieee-98-final
Fuzzieee-98-finalSumit Sen
 
Machine learning in science and industry — day 3
Machine learning in science and industry — day 3Machine learning in science and industry — day 3
Machine learning in science and industry — day 3arogozhnikov
 
Optics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureOptics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureRajesh Piryani
 
"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20
"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20
"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20Yuta Kashino
 
Density Based Clustering
Density Based ClusteringDensity Based Clustering
Density Based ClusteringSSA KPI
 
Convolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernelsConvolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernelstuxette
 

Was ist angesagt? (20)

Machine learning in science and industry — day 1
Machine learning in science and industry — day 1Machine learning in science and industry — day 1
Machine learning in science and industry — day 1
 
Clustering lect
Clustering lectClustering lect
Clustering lect
 
Clustering: Large Databases in data mining
Clustering: Large Databases in data miningClustering: Large Databases in data mining
Clustering: Large Databases in data mining
 
Fuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering AlgorithmsFuzzy c-Means Clustering Algorithms
Fuzzy c-Means Clustering Algorithms
 
(DL hacks輪読)Bayesian Neural Network
(DL hacks輪読)Bayesian Neural Network(DL hacks輪読)Bayesian Neural Network
(DL hacks輪読)Bayesian Neural Network
 
Analysis_molf
Analysis_molfAnalysis_molf
Analysis_molf
 
Machine learning in science and industry — day 2
Machine learning in science and industry — day 2Machine learning in science and industry — day 2
Machine learning in science and industry — day 2
 
DSP IEEE paper
DSP IEEE paperDSP IEEE paper
DSP IEEE paper
 
2012 cvpr gtw
2012 cvpr gtw2012 cvpr gtw
2012 cvpr gtw
 
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
 
(研究会輪読) Weight Uncertainty in Neural Networks
(研究会輪読) Weight Uncertainty in Neural Networks(研究会輪読) Weight Uncertainty in Neural Networks
(研究会輪読) Weight Uncertainty in Neural Networks
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO Decoder
 
Fuzzieee-98-final
Fuzzieee-98-finalFuzzieee-98-final
Fuzzieee-98-final
 
Machine learning in science and industry — day 3
Machine learning in science and industry — day 3Machine learning in science and industry — day 3
Machine learning in science and industry — day 3
 
widely-linear-minimum (1)
widely-linear-minimum (1)widely-linear-minimum (1)
widely-linear-minimum (1)
 
Optics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structureOptics ordering points to identify the clustering structure
Optics ordering points to identify the clustering structure
 
"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20
"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20
"Automatic Variational Inference in Stan" NIPS2015_yomi2016-01-20
 
Density Based Clustering
Density Based ClusteringDensity Based Clustering
Density Based Clustering
 
Convolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernelsConvolutional networks and graph networks through kernels
Convolutional networks and graph networks through kernels
 
Clique and sting
Clique and stingClique and sting
Clique and sting
 

Ähnlich wie Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passing- A Review

A Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT ProblemsA Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT ProblemsKarla Adamson
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzerbutest
 
CHN and Swap Heuristic to Solve the Maximum Independent Set Problem
CHN and Swap Heuristic to Solve the Maximum Independent Set ProblemCHN and Swap Heuristic to Solve the Maximum Independent Set Problem
CHN and Swap Heuristic to Solve the Maximum Independent Set ProblemIJECEIAES
 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Zihui Li
 
2014 vulnerability assesment of spatial network - models and solutions
2014   vulnerability assesment of spatial network - models and solutions2014   vulnerability assesment of spatial network - models and solutions
2014 vulnerability assesment of spatial network - models and solutionsFrancisco Pérez
 
Pattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical ModelsPattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical Modelsbutest
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier홍배 김
 
Parallel Computing 2007: Bring your own parallel application
Parallel Computing 2007: Bring your own parallel applicationParallel Computing 2007: Bring your own parallel application
Parallel Computing 2007: Bring your own parallel applicationGeoffrey Fox
 
Introduction to Support Vector Machines
Introduction to Support Vector MachinesIntroduction to Support Vector Machines
Introduction to Support Vector MachinesSilicon Mentor
 
SURF 2012 Final Report(1)
SURF 2012 Final Report(1)SURF 2012 Final Report(1)
SURF 2012 Final Report(1)Eric Zhang
 
A comparison of efficient algorithms for scheduling parallel data redistribution
A comparison of efficient algorithms for scheduling parallel data redistributionA comparison of efficient algorithms for scheduling parallel data redistribution
A comparison of efficient algorithms for scheduling parallel data redistributionIJCNCJournal
 
Firefly exact MCMC for Big Data
Firefly exact MCMC for Big DataFirefly exact MCMC for Big Data
Firefly exact MCMC for Big DataGianvito Siciliano
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Derryck Lamptey, MPhil, CISSP
 
Multi-Layer Perceptrons
Multi-Layer PerceptronsMulti-Layer Perceptrons
Multi-Layer PerceptronsESCOM
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderCSCJournals
 
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMSIMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMScsandit
 

Ähnlich wie Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passing- A Review (20)

A Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT ProblemsA Comparison Of Methods For Solving MAX-SAT Problems
A Comparison Of Methods For Solving MAX-SAT Problems
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
 
CHN and Swap Heuristic to Solve the Maximum Independent Set Problem
CHN and Swap Heuristic to Solve the Maximum Independent Set ProblemCHN and Swap Heuristic to Solve the Maximum Independent Set Problem
CHN and Swap Heuristic to Solve the Maximum Independent Set Problem
 
Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)
 
2014 vulnerability assesment of spatial network - models and solutions
2014   vulnerability assesment of spatial network - models and solutions2014   vulnerability assesment of spatial network - models and solutions
2014 vulnerability assesment of spatial network - models and solutions
 
Pattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical ModelsPattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical Models
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
ECE 565 FInal Project
ECE 565 FInal ProjectECE 565 FInal Project
ECE 565 FInal Project
 
Parallel Computing 2007: Bring your own parallel application
Parallel Computing 2007: Bring your own parallel applicationParallel Computing 2007: Bring your own parallel application
Parallel Computing 2007: Bring your own parallel application
 
Introduction to Support Vector Machines
Introduction to Support Vector MachinesIntroduction to Support Vector Machines
Introduction to Support Vector Machines
 
SURF 2012 Final Report(1)
SURF 2012 Final Report(1)SURF 2012 Final Report(1)
SURF 2012 Final Report(1)
 
A comparison of efficient algorithms for scheduling parallel data redistribution
A comparison of efficient algorithms for scheduling parallel data redistributionA comparison of efficient algorithms for scheduling parallel data redistribution
A comparison of efficient algorithms for scheduling parallel data redistribution
 
Firefly exact MCMC for Big Data
Firefly exact MCMC for Big DataFirefly exact MCMC for Big Data
Firefly exact MCMC for Big Data
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
 
APPLICATION OF NUMERICAL METHODS IN SMALL SIZE
APPLICATION OF NUMERICAL METHODS IN SMALL SIZEAPPLICATION OF NUMERICAL METHODS IN SMALL SIZE
APPLICATION OF NUMERICAL METHODS IN SMALL SIZE
 
Multi-Layer Perceptrons
Multi-Layer PerceptronsMulti-Layer Perceptrons
Multi-Layer Perceptrons
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO Decoder
 
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMSIMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
IMPROVING SCHEDULING OF DATA TRANSMISSION IN TDMA SYSTEMS
 

Mehr von Varad Meru

Predicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesPredicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesVarad Meru
 
Generating Musical Notes and Transcription using Deep Learning
Generating Musical Notes and Transcription using Deep LearningGenerating Musical Notes and Transcription using Deep Learning
Generating Musical Notes and Transcription using Deep LearningVarad Meru
 
Kakuro: Solving the Constraint Satisfaction Problem
Kakuro: Solving the Constraint Satisfaction ProblemKakuro: Solving the Constraint Satisfaction Problem
Kakuro: Solving the Constraint Satisfaction ProblemVarad Meru
 
CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...
CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...
CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...Varad Meru
 
Cassandra - A Decentralized Structured Storage System
Cassandra - A Decentralized Structured Storage SystemCassandra - A Decentralized Structured Storage System
Cassandra - A Decentralized Structured Storage SystemVarad Meru
 
Cloud Computing: An Overview
Cloud Computing: An OverviewCloud Computing: An Overview
Cloud Computing: An OverviewVarad Meru
 
Live Wide-Area Migration of Virtual Machines including Local Persistent State.
Live Wide-Area Migration of Virtual Machines including Local Persistent State.Live Wide-Area Migration of Virtual Machines including Local Persistent State.
Live Wide-Area Migration of Virtual Machines including Local Persistent State.Varad Meru
 
Machine Learning and Apache Mahout : An Introduction
Machine Learning and Apache Mahout : An IntroductionMachine Learning and Apache Mahout : An Introduction
Machine Learning and Apache Mahout : An IntroductionVarad Meru
 
K-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsK-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsVarad Meru
 
Introduction to Mahout and Machine Learning
Introduction to Mahout and Machine LearningIntroduction to Mahout and Machine Learning
Introduction to Mahout and Machine LearningVarad Meru
 
Data clustering using map reduce
Data clustering using map reduceData clustering using map reduce
Data clustering using map reduceVarad Meru
 
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...Varad Meru
 
Big Data, Hadoop, NoSQL and more ...
Big Data, Hadoop, NoSQL and more ...Big Data, Hadoop, NoSQL and more ...
Big Data, Hadoop, NoSQL and more ...Varad Meru
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project GuidanceVarad Meru
 
OpenSourceEducation
OpenSourceEducationOpenSourceEducation
OpenSourceEducationVarad Meru
 

Mehr von Varad Meru (15)

Predicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesPredicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensembles
 
Generating Musical Notes and Transcription using Deep Learning
Generating Musical Notes and Transcription using Deep LearningGenerating Musical Notes and Transcription using Deep Learning
Generating Musical Notes and Transcription using Deep Learning
 
Kakuro: Solving the Constraint Satisfaction Problem
Kakuro: Solving the Constraint Satisfaction ProblemKakuro: Solving the Constraint Satisfaction Problem
Kakuro: Solving the Constraint Satisfaction Problem
 
CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...
CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...
CS295 Week5: Megastore - Providing Scalable, Highly Available Storage for Int...
 
Cassandra - A Decentralized Structured Storage System
Cassandra - A Decentralized Structured Storage SystemCassandra - A Decentralized Structured Storage System
Cassandra - A Decentralized Structured Storage System
 
Cloud Computing: An Overview
Cloud Computing: An OverviewCloud Computing: An Overview
Cloud Computing: An Overview
 
Live Wide-Area Migration of Virtual Machines including Local Persistent State.
Live Wide-Area Migration of Virtual Machines including Local Persistent State.Live Wide-Area Migration of Virtual Machines including Local Persistent State.
Live Wide-Area Migration of Virtual Machines including Local Persistent State.
 
Machine Learning and Apache Mahout : An Introduction
Machine Learning and Apache Mahout : An IntroductionMachine Learning and Apache Mahout : An Introduction
Machine Learning and Apache Mahout : An Introduction
 
K-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsK-Means, its Variants and its Applications
K-Means, its Variants and its Applications
 
Introduction to Mahout and Machine Learning
Introduction to Mahout and Machine LearningIntroduction to Mahout and Machine Learning
Introduction to Mahout and Machine Learning
 
Data clustering using map reduce
Data clustering using map reduceData clustering using map reduce
Data clustering using map reduce
 
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
 
Big Data, Hadoop, NoSQL and more ...
Big Data, Hadoop, NoSQL and more ...Big Data, Hadoop, NoSQL and more ...
Big Data, Hadoop, NoSQL and more ...
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project Guidance
 
OpenSourceEducation
OpenSourceEducationOpenSourceEducation
OpenSourceEducation
 

Kürzlich hochgeladen

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 

Kürzlich hochgeladen (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passing- A Review

  • 1. Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passing∗ - A Review† Prolok Sundaresan, Varad Meru University of California, Irvine {sunderap,vmeru}@uci.edu Abstract Max-product (max-sum) message passing algorithms are widely used for MAP inference in MRFs. It has many variants and they share the common theme of passing “messages” over some graph-object. Recent advances revealed that its convergent versions can be viewed as perform- ing block coordinate descent (BCD) in a dual objective. However, most existing algorithms are limited to updating blocks selected restricted ways. In this report, we review a unifying technique proposed by [1] that sub- sumes MPLP, MSD, and TRW-S as special cases when applied to their respective choices of dual objective and blocks, and is able to perform BCD under much more flexible choices of blocks. 1 Introduction The task of finsing MAP-MRF - Finding the most probable assignments for MRFs (MPE), is know to be NP-Hard. And to overcome that, a large family of methods based on solving a dual problem of an LP relaxation have come up. But recently, convergent version of these algorithms were seen to be interpreted as block coordinate descent (BCD) in the dual. Variants methods operate on small blocks - Max-product linear programming algorithm (MPLP), max-sum diffusion (MSD) and Tree-weighted max-product message passing (TRW-S), and enforce some consistency constraint over the block of duals. It was observed that these menthods count not be generalised easily due to strong consistency con- straint requirement put forth by the methods themselves - which are sufficient but not necessary. The work in [1] aims to show that dual-optimality can be established on a much broader choices of the dual objective and derives a “unified” message passing algorithms in an arbitrary dual-decomposition. In the process, they create a new representation called as graph-object (subproblem multi-graph, or ∗Wang, Huayan, and Koller Daphne. Subproblem-tree calibration: A unified approach to max-product message passing. In Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp. 190-198. 2013. †This work was does as a part of the project for CS 276: Belief Networks, Fall 2014, taught by Prof. Rina Dechter. 1
  • 2. SMG) and present a method which subsumes MPLP, MSD, and TRW-S while achieving dual-optimality on blocks with flexible choices. 2 Background 2.1 MAP Inference The MAP Inference problem over X and graph strcuture G = {V, E} can be formulated as: maximize X Θ(X) (1) Where Θ(X) = α∈A θα(Xα); A is the set of MRF cliques. Without loss of generality we choose A = V ∪ E for a parametrization with unary and pairwise potentials. The authors use lowercase xi ∈ V al(Xi) and x = x1:N to denote assignments to the variables. 2.2 LP relaxation and Dual decomposition We now see MAP building on by solving a Linear programming(LP) relaxation maximize µ∈M Θ · µ (2) Where µ = {µi(xi), µij(xi, xj)|∀i, xi, (i, j), (xi, xj)}; Θ is all MRF parameters {θi, θij} concatenated in same ordering as µ. Choosing different polytopes M (such as Linear programs, max-product, and variational relaxation, etc.) results in different LP relaxations. One choice is marginal polytope: MG = {µ|∃p(X), x∈V al(X) p(x) = 1, p(X) ≥ 0, p(Xi) = µi, p(Xi, Xj) = µij, ∀i, (i, j)} (3) where p(Xi) and p(Xi, Xj) denote marginal distributions of p(X). When M = MG, the LP relaxation 2 is equivalent to the original problem 1. Let us consider a decomposition of Θ(X) into subproblems c ∈ C, parameterized by {Θc } ∀x, c∈C Θc (x|c) = Θ(x) (4) Where x|c denotes restricting the joint assignment to the scope of subproblem c. The subproblems could be single nodes, edges, trees, or cycles that are assumed to be tractable. Note that 4 has exponentially many constraints. Enforcing constraint by expressing reparameterization in terms of messages Θc = Θc  + c :Xc∩X c=∅ δc →c(Xc ∩ X c) (5) where the messages satisfy δc →c = −δc→c . Each subproblem has its own copy of variables Xc 2
  • 3. 2.3 Bethe cluster (region) Graph It contains bipartite structure with one layer of “factor” nodes and one layer of small (usually unary) nodes. Specifically, consider using the set of subproblem C = i ∪ F consisting of unary subproblems i and larger subproblems (factors) F. It has a restricted design due to historical concern of satisfying the ’running intersection property’. D( δf→i ) = i max Xi θi (Xi ) + f max Xf Θf (Xf ) (6) Figure 1 shows the structure of Cluster and Bethe Cluster graphs. (a) Markov Random Field (b) Cluster Graph (not Bethe Cluster) (c) Bethe Cluster Graph Figure 1: Cluster and Bethe Graph 3 STC Framework 3.1 Subproblem multi-graph and subproblem tree Given set of subproblems C, the subproblem multi-graph (SMG) G = (V, E) has one node for each c ∈ C and one edge between c and c for each tuple (c,c ,ϕ), where ϕ ∈ V ∪ E is shared by c and c . A subproblem multi-graph (SMG) is a tree T ⊂ G If we include all unary subproblems into the decomposition, we would get a SMG similar to Fig: 2(b) but with extra edges among the non-unary subprob- lems. So a tree in the Bethe cluster graph (which we call a Bethe tree) is also a subproblem tree by definition. For each SMG edge (c, c , ϕ) ∈ E, we have messages δc →c = −δc→c . There- fore the block (of dual variables) associated with subproblem tree T is given by: BT = {δc →c(Xϕ) : (c, c , ϕ) ∈ T }. (7) 3.2 Max-consistency and dual-optimal trees Let us first define the dual-optimal with respect to a given block. Given a block BT associated with some subproblem tree T , we want to achieve dual-optimal 3
  • 4. with respect to that block. The subproblem potentials Θc are dual-optimal on T if we can not further decrease the dual objective by changing messages in BT . Message passing algorithm achieves dual-optimality by enforcing some Consistency Constraint. We now identify constraint that is equivalent to dual- optimal on T : Assignments to all subproblems {xc }c∈T agree on T , denoted as xc ∼ T , if for ∀(c, c , ϕ) ∈ T , we have xc ϕ = xc ϕ . {Θc }c∈T satisfies weak max-consistency if c∈T max Xc Θc (Xc ) = max {Xc}∼T c∈T Θc (Xc ) (8) Thus, maximizing each subproblem independently gets to the same optimal value as maximizing them while requiring the assignments to agree on the tree. Now, let Mc ϕ be the (log)-max-marginal of c on ϕ, then Mc ϕ(xϕ) = max Xc|ϕ=xϕ Θc (Xc ) (9) if ϕ = (i, j) ∈ E, Xc |ϕ = xϕ means Xc i = xi and Xc j = xj. {Θc }c∈T satisfies strong max-consistency if Mc ϕ = Mc ϕ ∀(c, c , ϕ) ∈ T The relations among these consistency constraints are (Proposition 1): • For any Bethe tree T , MPLP max-consistency =⇒ Weak max-consistency • For any subproblem tree T (including Bethe trees), Strong max-consistency =⇒ Weak max-consistency. Weak max-consistency ⇐⇒ Dual-optimal on T . 3.3 Subproblem tree calibration algorithm The algorithm calibrates a subproblem-tree by an upstream pass and a down- stream pass. Both update subproblem potentials “in place” without storing any message. Algorithm 1 STC ALgorithm 1: procedure STC 2: Given MRF 3: Split into subproblems (dual decomposition) 4: Build a multi-graph with a node for each subproblem (middle figure) 5: while t do 6: Randomly choose a subproblem-tree (right figure) 7: “Calibrate” the tree by max-product / min-sum message passing 8: end while 9: end procedure 4
  • 5. (a) MRF (b) SMG (c) Spanning Tree of SMG Figure 2: Flow of the Algorithm: Start with (a) to generate (b) and randomly selected (c) and ”Calibrate” Figure 3: Flow of the Algorithm from [1] 3.3.1 Properties Some of the observed properties of the algorithm 1. Each tree calibration is a block coordinate descent step for the dual prob- lem. 2. The “block” corresponds to all edges in the subproblem-tree. 3. Subsumes MPLP, TRW-S, and max-sum diffusion as special cases. 4. Handles larger and more flexible “blocks” than these methods. 3.3.2 Choosing allocation weights In the algorithm, while the upstream pass uses the BCD to optimize the values across sub-problem.The downstream pass allocates ”energy” to all the subprob- lems, based on their allocation weights. That is , the downstream pass essen- tially moves around in a plateau of the dual, preparing for the next Therefore the allocation weights determine where to settle on that plateau. With the help of appropriate allocation weights, we can subsume both the MPLP and MSD using STC algorithm. 5
  • 6. 3.3.3 General Primal solution We can generate the primal solution in many different ways. One way is to assign the values of the unary variables available as subproblem.We visit the variables (in the original MRF) in someordering, for example, X1, X2, · · · XN.And when visiting each Xi, we choose its assignment to maximize the sum of all max-marginals from all subproblems covering Xi . And then we fix Xi = xi in all subproblem 4 Experimental Results 4.1 Experimental map inference tasks 1. The protein design benchmark • 20 largest problems from that dataset • Number of Variables - 101 to 180 • Number of Edges - 1973 to 3005 • Variable Cardinality - 154 2. Synthetic 20-by-20 grid • Potentials from N(0, 1) • Variable Cardinality - 100 3. ”Object detection” task from PIC-2011 • 37 problem instances • Number of Variables - 60 / problem instance • Number of Edges - 1770 / problem instance • Variable Cardinality - 11 - 21 We observe that different methods tend to “converge” to different dual ob- jectives, Even though the dual objectives in each plot should have exactly the same optimal value. 5 Conclusion The framework revealed two dimensions of flexibility in designing a message passing algorithm for MAP inference: choosing blocks to update, and choosing a dual state on a plateau in each BCD step. The STC algorithm can be ap- plied with extreme flexibility in these choices. Finding principled and adaptive strategies in making these choices will help us to design much more powerful message passing algorithms. 6
  • 7. Figure 4: Experimental Results References [1] Wang, Huayan, and Koller Daphne. Subproblem-tree calibration: A unified approach to max-product message passing. In Proceedings of the 30th Inter- national Conference on Machine Learning (ICML-13), pp. 190-198. 2013. [2] Globerson, Amir and Jaakkola, Tommi. Fixing maxproduct: Convergent message passing algorithms for MAP LP-relaxations. In NIPS, 2007 [3] Sontag, David and Jaakkola, Tommi. Tree block coordinate descent for MAP in graphical models. AISTATS, 5:544–551, 2009. 7