SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Downloaden Sie, um offline zu lesen
.
......
Bayes Independence Test
Joe Suzuki
Osaka University
GABA 2014
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 1 / 20
Road Map
...1 Problem
...2 Discrete Case
...3 Continuous Case
...4 HSIC
...5 Experiments
...6 Concluding Remarks
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 2 / 20
Problem
Problem: Decide X ⊥⊥ Y given (x1, y1), · · · , (xn, yn)
Mutual Information: I(X, Y ) :=
∑
x
∑
y
PXY (x, y) log
PXY (x, y)
PX (x)PY (y)
I(X, Y ) = 0 ⇐⇒ X ⊥⊥ Y
Hilbert Schmidt independent criterion: Non-linear Correlation
Correlation Coefficient (X, Y ) = 0
⇐=
̸=⇒
X ⊥⊥ Y
HSIC(X, Y ) = 0 ⇐⇒ X ⊥⊥ Y
.
Independence Test (Whether X ⊥⊥ Y or not)
..
......Given (x1, y1), · · · , (xn, yn), estimate I(X, Y ), HSIC(X, Y ), etc.
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 3 / 20
Discrete Case
Estimating MI (Maximum Likelihood)
X, Y : discrete
In(xn
, yn
) :=
∑
x
∑
y
ˆPn(x, y) log
ˆPn(x, y)
ˆPn(x)ˆPn(y)
ˆPn(x, y): relative occurency of (X, Y ) = (x, y) in (x1, y1), · · · , (xn, yn)
ˆPn(x): relative occurency of X = x in x1, · · · , xn
ˆPn(y): relative occurrency of Y = y in y1, · · · , yn
In(x, y) → I(X, Y ) (n → ∞)
even if X ⊥⊥ Y , In(xn, yn) > 0 occurs infnitely many times
constructing Independent Test requires thresholds {ϵn} s.t.
In(xn
, yn
) < ϵn ⇐⇒ X ⊥⊥ Y
cannot be extended into the case when X, Y are continuous
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 4 / 20
Discrete Case
Bayesian Estimation of MI (Proposal)
.
Lempel-Ziv (lzh, gzip etc.)
..
......
Compressing xn = (x1, · · · , xn) into zm = (z1, · · · , zm) ∈ {0, 1}m
...1 The compression ratio
m
n
converges to its Entropy H(X) for any PX .
...2
∑
2−m
≤ 1 (Kraft’s inequality)
for Qn
X (xn) := 2−m, m = − log Qn
X (xn) will be lenth after compression
for Qn
Y (yn), Qn
XY (xn, yn), and prior p of X ⊥⊥ Y ,
Jn(xn
, yn
) :=
1
n
log
(1 − p)Qn
XY (xn, yn)
pQn
X (xn)Qn
Y (yn)
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 5 / 20
Discrete Case
MDL(minimum description length) Principle
From examples, a model s.t. the total length of
description of the model
description of the examples given the model
is minimized should be chosen (Rissanen, 1976)
MDL(X ⊥⊥ Y ) := − log p −
1
n
log Qn
X (xn
) −
1
n
log Qn
Y (yn
)
MDL(X ̸⊥⊥ Y ) := − log(1 − p) −
1
n
log Qn
XY (xn
, yn
)
.
Consistency
..
......The MDL model coincides with the true model with. prob.1 as n → ∞.
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 6 / 20
Discrete Case
Bayesian Estimation of MI (Proposal, cont’d)
Consistency of MDL implies that of Independence Test:
Jn(xn
, yn
) ≤ 0 ⇐⇒ MDL(X ⊥⊥ Y ) ≤ MDL(X ̸⊥⊥ Y )
for α := |X|, β := |Y |
Jn(xn
, yn
) ≈ In(xn
.yn
) −
(α − 1)(β − 1)
2n
log n
Jn(xn
, yn
) ≤ 0 ⇐⇒ In(xn
, yn
) ≤ ϵn :=
(α − 1)(β − 1)
2n
log n
Jn(xn, yn) → I(X, Y ) (n → ∞)
O(n) computation
p =
1
2
was assumed in Suzuki 2012.
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 7 / 20
Discrete Case
Universality: Discrete
For any PX ,
m
n
= −
1
n
log Qn
X (xn
) → H(X)
From i.i.d. and the law of large numbers, for any PX ,
−
1
n
log Pn
X (xn
) = −
1
n
n∑
i=1
log PX (xi ) → E[− log PX (X)] = H(X)
For any PX ,
1
n
log
Pn
X (xn)
Qn
X (xn)
→ 0 .
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 8 / 20
Continuous Case
Universality: Continuous
Under regularity, there exists gn
X s.t. for any fX ,
1
n
log
f n
X (xn)
gn
X (xn)
→ 0
∫ ∞
−∞
gn
(xn
)dx ≤ 1
(Ryabko 2009)
removing regularity
even for more than one variables
either discrete, continuous, or none of them
(Suzuki 2013)
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 9 / 20
Continuous Case
Construcion of gn
X
Quantization at level k: xn = (x1, · · · , xn) → (a
(k)
1 , · · · , a
(k)
n )
...
...
...
...
E
E
E
Level 1
Level 2
Level k
Qn
1 (a
(1)
1 , · · · , a
(1)
n )
λ(a
(1)
1 ) · · · λ(a
(1)
n )
Qn
2 (a
(2)
1 , · · · , a
(2)
n )
λ(a
(2)
1 ) · · · λ(a
(2)
n )
Qn
k (a
(k)
1 , · · · , a
(k)
n )
λ(a
(k)
1 ) · · · λ(a
(k)
n )
wi > 0 ,
∑
i
wi = 1 , gn
X (xn
) =
∑
i
wi
Qn
i (a
(i)
1 , · · · , a
(i)
n )
λ(a
(i)
1 ) · · · λ(a
(i)
n )
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 10 / 20
Continuous Case
Bayesian Estimation of MI: General Case
.
Bayesian Estimation of MI
..
......
Jn(xn
, yn
) :=
1
n
log
(1 − p)gn
XY (xn, yn)
pgn
X (xn)gn
Y (yn)
 
Generalization of MDL:
MDL(X ⊥⊥ Y ) := − log p −
1
n
log gn
X (xn
) −
1
n
log gn
Y (yn
)
MDL(X ̸⊥⊥ Y ) := − log(1 − p) −
1
n
log gn
XY (xn
, yn
)
.
Consistency
..
......
The MDL model coincides with the true model with prob. 1 as n → ∞:
X ⊥⊥ Y ⇐⇒ MDL(X ⊥⊥ Y ) ≤ MDL(X ̸⊥⊥ Y )
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 11 / 20
Continuous Case
Jn(xn
, yn
) → I(X, Y ) (n → ∞)
Proof: since xn, yn are i.i.d., from the law of large numbers, for any fX ,
1
n
log
f n
XY (xn, yn)
f n
X (xn)f n
Y (xn)
=
1
n
n∑
i=1
log
f n
XY (xn, yn)
f n
X (xn)f n
Y (xn)
→ E[log
fXY (XY )
fX (X)fY (Y )
] = I(X, Y )
Jn(xn
, yn
) − I(X, Y )
= −
1
n
log
f n
XY (xn, yn)
gn
XY (xn, yn)
+
1
n
log
f n
X (xn)
gn
X (xn)
+
1
n
log
f n
Y (yn)
gn
Y (yn)
+
1
n
log
f n
XY (xn, yn)
f n
X (xn)f n
Y (xn)
− I(X, Y ) +
1
n
log
1 − p
p
→ 0
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 12 / 20
HSIC
HSIC
A nonlinear corralation coefficient cov(X, Y )
Random Variable X Y
Hilbert Space X Y
RKHS F: Basis {fi } G: Basis {gj }
kernel k : X × X → R l : Y × Y → R
HSIC(PXY , F, G) =
∑
i,j
cov(fi (X), gj (Y ))2
For the universal kernels, HSIC(PXY , F, G) = 0 ⇐⇒ X ⊥⊥ Y
ex: the Gaussian kernel is known to be universal:
k(x, y) = exp{−(x − y)2
/2}
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 13 / 20
HSIC
Limitions of HSIC
.
Unbiased Estimator of HSIC(PXY , F, G)
..
......
For K = (k(xi , xj )), L = (l(yi , yj )), H = (δi,j − 1
n )
HSIC(xn
, yn
) =
1
(n − 1)2
tr(KHLH)
HSIC(PXY , F, G) → HSIC(PXY , F, G) as n → ∞
has been proved only for weak consistncy.
Computation of HSIC(xn, yn, F, G): O(n3)
Computation of the asymptotic distribution of H0:
is O(n3
) w.r.t. n based on U-statistics (Bunlphone, et. al, 2014).
may not give correct estimaton based on permutation test.
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 14 / 20
Experiments
Experiments
...1
E¨¨
¨¨¨¨¨B
Errr
rrrrj
X Y
0
1
0
1
1
2
1
2
p
1 − p
I(X, Y ) = HSIC(X, Y ) = 0
⇐⇒ p =
1
2
⇐⇒ X ⊥⊥ Y
...2 (X, Y ) ∼ N(0, Σ), Σ =
[
1 ρ
ρ 1
]
, −1 < ρ < 1
I(X, Y ) = HSIC(X, Y ) = 0 ⇐⇒ ρ = 0 ⇐⇒ X ⊥⊥ Y
...3 P(X = 0) = P(X = 1) = 1
2 , Y ∼ N(aX, 1), a ≥ 0
I(X, Y ) = HSIC(X, Y ) = 0 ⇐⇒ a = 0 ⇐⇒ X ⊥⊥ Y
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 15 / 20
Experiments
Experiment 1
The Error Probabilities for n = 100
True p Proposal HSIC
→Estimated p Threshold (×10−4)
4 8 12 16 20
p = 0.5 → p ̸= 0.5 0.084 0.306 0.135 0.077 0.043 0.022
p = 0.4 → p = 0.5 0.758 0.507 0.694 0.787 0.860 0.908
p = 0.3 → p = 0.5 0.333 0.139 0.251 0.396 0.505 0.610
p = 0.2 → p = 0.5 0.048 0.018 0.035 0.083 0.135 0.201
p = 0.1 → p = 0.5 0.001 0.000 0.001 0.005 0.010 0.021
↑
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 16 / 20
Experiments
Experiments 2
The Error Probabilities for n = 100
ρ Proposal HSIC
Threshold (×10−3)
2 4 6 8
0.0 0.095 0.338 0.036 0.006 0.00
0.2 0.628 0.298 0.676 0.884 0.97
0.4 0.168 0.008 0.088 0.300 0.512
0.6 0.008 0.000 0.000 0.002 0.006
0.8 0.000 0.000 0.000 0.000 0.000
↑
For the Gaussian kernel and Gauss distributions, HSIC performs very well.
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 17 / 20
Experiments
HSIC shows poor performance in cases such as
ˆˆˆˆˆˆˆˆˆz







~
$$$$$$$$$X
ˆˆˆˆˆˆˆˆˆz$$$$$$$$$X
ˆˆˆˆˆˆˆˆˆz&
&
&
&
&
&
&
&&b
$$$$$$$$$X
X Y
0
ϵ
1
1 − ϵ
0
ϵ
1
1 − ϵif ϵ > 0 small.
HSIC(xn
.yn
)
=
1
(n − 1)2
∑
i
∑
j
{k(xi , xj ) −
1
n
∑
h
k(xi , xh)}{l(yi , yj ) −
1
n
∑
h
l(yi , yh)}
k(u, v) = l(u, v) = exp{−(u − v)2
}
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 18 / 20
Experiments
Execution Time
Execution (sec)
n 100 500 1000 2000
Proposal 0.30 0.33 0.62 1.05
HSIC 0.50 9.51 40.28 185.53
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 19 / 20
Concluding Remarks
Concluding Remarks
.
Contribution
..
......Independence Test based on MDL/Bayes
Proposal HSIC
Principle Bayes Detection will be maximied
Strong Discrete Continuous
Threshold Not Necessay Necessary
Prior Necessary Not Necessary
Computation O(n) O(n3)
Consistency Strong Weak
Future Works
The Border for which either Bays/MDL or HSIC outperforms
R Package
Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 20 / 20

Weitere ähnliche Inhalte

Was ist angesagt?

Simple Comparison of Convergence of GeneralIterations and Effect of Variation...
Simple Comparison of Convergence of GeneralIterations and Effect of Variation...Simple Comparison of Convergence of GeneralIterations and Effect of Variation...
Simple Comparison of Convergence of GeneralIterations and Effect of Variation...
Komal Goyal
 
A unique common fixed point theorem for four
A unique common fixed point theorem for fourA unique common fixed point theorem for four
A unique common fixed point theorem for four
Alexander Decker
 

Was ist angesagt? (15)

2013 IEEE International Symposium on Information Theory
2013 IEEE International Symposium on Information Theory2013 IEEE International Symposium on Information Theory
2013 IEEE International Symposium on Information Theory
 
Existence, Uniqueness and Stability Solution of Differential Equations with B...
Existence, Uniqueness and Stability Solution of Differential Equations with B...Existence, Uniqueness and Stability Solution of Differential Equations with B...
Existence, Uniqueness and Stability Solution of Differential Equations with B...
 
RuFiDiM
RuFiDiMRuFiDiM
RuFiDiM
 
WITMSE 2013
WITMSE 2013WITMSE 2013
WITMSE 2013
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Simple Comparison of Convergence of GeneralIterations and Effect of Variation...
Simple Comparison of Convergence of GeneralIterations and Effect of Variation...Simple Comparison of Convergence of GeneralIterations and Effect of Variation...
Simple Comparison of Convergence of GeneralIterations and Effect of Variation...
 
NONLINEAR DIFFERENCE EQUATIONS WITH SMALL PARAMETERS OF MULTIPLE SCALES
NONLINEAR DIFFERENCE EQUATIONS WITH SMALL PARAMETERS OF MULTIPLE SCALESNONLINEAR DIFFERENCE EQUATIONS WITH SMALL PARAMETERS OF MULTIPLE SCALES
NONLINEAR DIFFERENCE EQUATIONS WITH SMALL PARAMETERS OF MULTIPLE SCALES
 
A unique common fixed point theorem for four
A unique common fixed point theorem for fourA unique common fixed point theorem for four
A unique common fixed point theorem for four
 
Common fixed theorems for weakly compatible mappings via an
Common fixed theorems for weakly compatible mappings via anCommon fixed theorems for weakly compatible mappings via an
Common fixed theorems for weakly compatible mappings via an
 
STUDIES ON INTUTIONISTIC FUZZY INFORMATION MEASURE
STUDIES ON INTUTIONISTIC FUZZY INFORMATION MEASURESTUDIES ON INTUTIONISTIC FUZZY INFORMATION MEASURE
STUDIES ON INTUTIONISTIC FUZZY INFORMATION MEASURE
 
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
Catalan Tau Collocation for Numerical Solution of 2-Dimentional Nonlinear Par...
 
DISCRETIZATION OF A MATHEMATICAL MODEL FOR TUMOR-IMMUNE SYSTEM INTERACTION WI...
DISCRETIZATION OF A MATHEMATICAL MODEL FOR TUMOR-IMMUNE SYSTEM INTERACTION WI...DISCRETIZATION OF A MATHEMATICAL MODEL FOR TUMOR-IMMUNE SYSTEM INTERACTION WI...
DISCRETIZATION OF A MATHEMATICAL MODEL FOR TUMOR-IMMUNE SYSTEM INTERACTION WI...
 
R180304110115
R180304110115R180304110115
R180304110115
 
Holographic Cotton Tensor
Holographic Cotton TensorHolographic Cotton Tensor
Holographic Cotton Tensor
 
Image denoising
Image denoisingImage denoising
Image denoising
 

Ähnlich wie Bayes Independence Test

Properties of field induced Josephson junction(s)
Properties of field induced Josephson junction(s)Properties of field induced Josephson junction(s)
Properties of field induced Josephson junction(s)
Krzysztof Pomorski
 

Ähnlich wie Bayes Independence Test (20)

Bayesian network structure estimation based on the Bayesian/MDL criteria when...
Bayesian network structure estimation based on the Bayesian/MDL criteria when...Bayesian network structure estimation based on the Bayesian/MDL criteria when...
Bayesian network structure estimation based on the Bayesian/MDL criteria when...
 
2014 9-16
2014 9-162014 9-16
2014 9-16
 
MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...
MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...
MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...
 
2014 9-22
2014 9-222014 9-22
2014 9-22
 
Finance Enginering from Columbia.pdf
Finance Enginering from Columbia.pdfFinance Enginering from Columbia.pdf
Finance Enginering from Columbia.pdf
 
comm_ch02_random_en.pdf
comm_ch02_random_en.pdfcomm_ch02_random_en.pdf
comm_ch02_random_en.pdf
 
HPWFcorePRES--FUR2016
HPWFcorePRES--FUR2016HPWFcorePRES--FUR2016
HPWFcorePRES--FUR2016
 
Generative models : VAE and GAN
Generative models : VAE and GANGenerative models : VAE and GAN
Generative models : VAE and GAN
 
Proba stats-r1-2017
Proba stats-r1-2017Proba stats-r1-2017
Proba stats-r1-2017
 
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRAa - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
 
α -ANTI FUZZY NEW IDEAL OF PUALGEBRA
α -ANTI FUZZY NEW IDEAL OF PUALGEBRAα -ANTI FUZZY NEW IDEAL OF PUALGEBRA
α -ANTI FUZZY NEW IDEAL OF PUALGEBRA
 
Properties of field induced Josephson junction(s)
Properties of field induced Josephson junction(s)Properties of field induced Josephson junction(s)
Properties of field induced Josephson junction(s)
 
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Lecture2 xing
Lecture2 xingLecture2 xing
Lecture2 xing
 
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
 
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
 
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
 
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
COMMON FIXED POINT THEOREMS IN COMPATIBLE MAPPINGS OF TYPE (P*) OF GENERALIZE...
 
Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...
 

Mehr von Joe Suzuki

分枝限定法でモデル選択の計算量を低減する
分枝限定法でモデル選択の計算量を低減する分枝限定法でモデル選択の計算量を低減する
分枝限定法でモデル選択の計算量を低減する
Joe Suzuki
 
離散と連続の入り混じった相互情報量を推定して、 SNP と遺伝子発現量の因果関係をさぐる
離散と連続の入り混じった相互情報量を推定して、SNP と遺伝子発現量の因果関係をさぐる離散と連続の入り混じった相互情報量を推定して、SNP と遺伝子発現量の因果関係をさぐる
離散と連続の入り混じった相互情報量を推定して、 SNP と遺伝子発現量の因果関係をさぐる
Joe Suzuki
 

Mehr von Joe Suzuki (20)

RとPythonを比較する
RとPythonを比較するRとPythonを比較する
RとPythonを比較する
 
R集会@統数研
R集会@統数研R集会@統数研
R集会@統数研
 
E-learning Development of Statistics and in Duex: Practical Approaches and Th...
E-learning Development of Statistics and in Duex: Practical Approaches and Th...E-learning Development of Statistics and in Duex: Practical Approaches and Th...
E-learning Development of Statistics and in Duex: Practical Approaches and Th...
 
分枝限定法でモデル選択の計算量を低減する
分枝限定法でモデル選択の計算量を低減する分枝限定法でモデル選択の計算量を低減する
分枝限定法でモデル選択の計算量を低減する
 
連続変量を含む条件付相互情報量の推定
連続変量を含む条件付相互情報量の推定連続変量を含む条件付相互情報量の推定
連続変量を含む条件付相互情報量の推定
 
E-learning Design and Development for Data Science in Osaka University
E-learning Design and Development for Data Science in Osaka UniversityE-learning Design and Development for Data Science in Osaka University
E-learning Design and Development for Data Science in Osaka University
 
UAI 2017
UAI 2017UAI 2017
UAI 2017
 
AMBN2017 サテライトワークショップ
AMBN2017 サテライトワークショップAMBN2017 サテライトワークショップ
AMBN2017 サテライトワークショップ
 
CRAN Rパッケージ BNSLの概要
CRAN Rパッケージ BNSLの概要CRAN Rパッケージ BNSLの概要
CRAN Rパッケージ BNSLの概要
 
Forest Learning from Data
Forest Learning from DataForest Learning from Data
Forest Learning from Data
 
A Bayesian Approach to Data Compression
A Bayesian Approach to Data CompressionA Bayesian Approach to Data Compression
A Bayesian Approach to Data Compression
 
研究紹介(学生向け)
研究紹介(学生向け)研究紹介(学生向け)
研究紹介(学生向け)
 
Efficietly Learning Bayesian Network Structures based on the B&B Strategy: A ...
Efficietly Learning Bayesian Network Structuresbased on the B&B Strategy: A ...Efficietly Learning Bayesian Network Structuresbased on the B&B Strategy: A ...
Efficietly Learning Bayesian Network Structures based on the B&B Strategy: A ...
 
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
 
2016 7-13
2016 7-132016 7-13
2016 7-13
 
Structure Learning of Bayesian Networks with p Nodes from n Samples when n&lt...
Structure Learning of Bayesian Networks with p Nodes from n Samples when n&lt...Structure Learning of Bayesian Networks with p Nodes from n Samples when n&lt...
Structure Learning of Bayesian Networks with p Nodes from n Samples when n&lt...
 
連続変量を含む相互情報量の推定
連続変量を含む相互情報量の推定連続変量を含む相互情報量の推定
連続変量を含む相互情報量の推定
 
Jeffreys' and BDeu Priors for Model Selection
Jeffreys' and BDeu Priors for Model SelectionJeffreys' and BDeu Priors for Model Selection
Jeffreys' and BDeu Priors for Model Selection
 
離散と連続の入り混じった相互情報量を推定して、 SNP と遺伝子発現量の因果関係をさぐる
離散と連続の入り混じった相互情報量を推定して、SNP と遺伝子発現量の因果関係をさぐる離散と連続の入り混じった相互情報量を推定して、SNP と遺伝子発現量の因果関係をさぐる
離散と連続の入り混じった相互情報量を推定して、 SNP と遺伝子発現量の因果関係をさぐる
 
MaCaulay2 Miuraパッケージの開発と今後
MaCaulay2 Miuraパッケージの開発と今後MaCaulay2 Miuraパッケージの開発と今後
MaCaulay2 Miuraパッケージの開発と今後
 

Kürzlich hochgeladen

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
University of Hertfordshire
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 

Kürzlich hochgeladen (20)

Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 

Bayes Independence Test

  • 1. . ...... Bayes Independence Test Joe Suzuki Osaka University GABA 2014 Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 1 / 20
  • 2. Road Map ...1 Problem ...2 Discrete Case ...3 Continuous Case ...4 HSIC ...5 Experiments ...6 Concluding Remarks Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 2 / 20
  • 3. Problem Problem: Decide X ⊥⊥ Y given (x1, y1), · · · , (xn, yn) Mutual Information: I(X, Y ) := ∑ x ∑ y PXY (x, y) log PXY (x, y) PX (x)PY (y) I(X, Y ) = 0 ⇐⇒ X ⊥⊥ Y Hilbert Schmidt independent criterion: Non-linear Correlation Correlation Coefficient (X, Y ) = 0 ⇐= ̸=⇒ X ⊥⊥ Y HSIC(X, Y ) = 0 ⇐⇒ X ⊥⊥ Y . Independence Test (Whether X ⊥⊥ Y or not) .. ......Given (x1, y1), · · · , (xn, yn), estimate I(X, Y ), HSIC(X, Y ), etc. Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 3 / 20
  • 4. Discrete Case Estimating MI (Maximum Likelihood) X, Y : discrete In(xn , yn ) := ∑ x ∑ y ˆPn(x, y) log ˆPn(x, y) ˆPn(x)ˆPn(y) ˆPn(x, y): relative occurency of (X, Y ) = (x, y) in (x1, y1), · · · , (xn, yn) ˆPn(x): relative occurency of X = x in x1, · · · , xn ˆPn(y): relative occurrency of Y = y in y1, · · · , yn In(x, y) → I(X, Y ) (n → ∞) even if X ⊥⊥ Y , In(xn, yn) > 0 occurs infnitely many times constructing Independent Test requires thresholds {ϵn} s.t. In(xn , yn ) < ϵn ⇐⇒ X ⊥⊥ Y cannot be extended into the case when X, Y are continuous Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 4 / 20
  • 5. Discrete Case Bayesian Estimation of MI (Proposal) . Lempel-Ziv (lzh, gzip etc.) .. ...... Compressing xn = (x1, · · · , xn) into zm = (z1, · · · , zm) ∈ {0, 1}m ...1 The compression ratio m n converges to its Entropy H(X) for any PX . ...2 ∑ 2−m ≤ 1 (Kraft’s inequality) for Qn X (xn) := 2−m, m = − log Qn X (xn) will be lenth after compression for Qn Y (yn), Qn XY (xn, yn), and prior p of X ⊥⊥ Y , Jn(xn , yn ) := 1 n log (1 − p)Qn XY (xn, yn) pQn X (xn)Qn Y (yn) Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 5 / 20
  • 6. Discrete Case MDL(minimum description length) Principle From examples, a model s.t. the total length of description of the model description of the examples given the model is minimized should be chosen (Rissanen, 1976) MDL(X ⊥⊥ Y ) := − log p − 1 n log Qn X (xn ) − 1 n log Qn Y (yn ) MDL(X ̸⊥⊥ Y ) := − log(1 − p) − 1 n log Qn XY (xn , yn ) . Consistency .. ......The MDL model coincides with the true model with. prob.1 as n → ∞. Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 6 / 20
  • 7. Discrete Case Bayesian Estimation of MI (Proposal, cont’d) Consistency of MDL implies that of Independence Test: Jn(xn , yn ) ≤ 0 ⇐⇒ MDL(X ⊥⊥ Y ) ≤ MDL(X ̸⊥⊥ Y ) for α := |X|, β := |Y | Jn(xn , yn ) ≈ In(xn .yn ) − (α − 1)(β − 1) 2n log n Jn(xn , yn ) ≤ 0 ⇐⇒ In(xn , yn ) ≤ ϵn := (α − 1)(β − 1) 2n log n Jn(xn, yn) → I(X, Y ) (n → ∞) O(n) computation p = 1 2 was assumed in Suzuki 2012. Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 7 / 20
  • 8. Discrete Case Universality: Discrete For any PX , m n = − 1 n log Qn X (xn ) → H(X) From i.i.d. and the law of large numbers, for any PX , − 1 n log Pn X (xn ) = − 1 n n∑ i=1 log PX (xi ) → E[− log PX (X)] = H(X) For any PX , 1 n log Pn X (xn) Qn X (xn) → 0 . Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 8 / 20
  • 9. Continuous Case Universality: Continuous Under regularity, there exists gn X s.t. for any fX , 1 n log f n X (xn) gn X (xn) → 0 ∫ ∞ −∞ gn (xn )dx ≤ 1 (Ryabko 2009) removing regularity even for more than one variables either discrete, continuous, or none of them (Suzuki 2013) Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 9 / 20
  • 10. Continuous Case Construcion of gn X Quantization at level k: xn = (x1, · · · , xn) → (a (k) 1 , · · · , a (k) n ) ... ... ... ... E E E Level 1 Level 2 Level k Qn 1 (a (1) 1 , · · · , a (1) n ) λ(a (1) 1 ) · · · λ(a (1) n ) Qn 2 (a (2) 1 , · · · , a (2) n ) λ(a (2) 1 ) · · · λ(a (2) n ) Qn k (a (k) 1 , · · · , a (k) n ) λ(a (k) 1 ) · · · λ(a (k) n ) wi > 0 , ∑ i wi = 1 , gn X (xn ) = ∑ i wi Qn i (a (i) 1 , · · · , a (i) n ) λ(a (i) 1 ) · · · λ(a (i) n ) Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 10 / 20
  • 11. Continuous Case Bayesian Estimation of MI: General Case . Bayesian Estimation of MI .. ...... Jn(xn , yn ) := 1 n log (1 − p)gn XY (xn, yn) pgn X (xn)gn Y (yn)   Generalization of MDL: MDL(X ⊥⊥ Y ) := − log p − 1 n log gn X (xn ) − 1 n log gn Y (yn ) MDL(X ̸⊥⊥ Y ) := − log(1 − p) − 1 n log gn XY (xn , yn ) . Consistency .. ...... The MDL model coincides with the true model with prob. 1 as n → ∞: X ⊥⊥ Y ⇐⇒ MDL(X ⊥⊥ Y ) ≤ MDL(X ̸⊥⊥ Y ) Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 11 / 20
  • 12. Continuous Case Jn(xn , yn ) → I(X, Y ) (n → ∞) Proof: since xn, yn are i.i.d., from the law of large numbers, for any fX , 1 n log f n XY (xn, yn) f n X (xn)f n Y (xn) = 1 n n∑ i=1 log f n XY (xn, yn) f n X (xn)f n Y (xn) → E[log fXY (XY ) fX (X)fY (Y ) ] = I(X, Y ) Jn(xn , yn ) − I(X, Y ) = − 1 n log f n XY (xn, yn) gn XY (xn, yn) + 1 n log f n X (xn) gn X (xn) + 1 n log f n Y (yn) gn Y (yn) + 1 n log f n XY (xn, yn) f n X (xn)f n Y (xn) − I(X, Y ) + 1 n log 1 − p p → 0 Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 12 / 20
  • 13. HSIC HSIC A nonlinear corralation coefficient cov(X, Y ) Random Variable X Y Hilbert Space X Y RKHS F: Basis {fi } G: Basis {gj } kernel k : X × X → R l : Y × Y → R HSIC(PXY , F, G) = ∑ i,j cov(fi (X), gj (Y ))2 For the universal kernels, HSIC(PXY , F, G) = 0 ⇐⇒ X ⊥⊥ Y ex: the Gaussian kernel is known to be universal: k(x, y) = exp{−(x − y)2 /2} Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 13 / 20
  • 14. HSIC Limitions of HSIC . Unbiased Estimator of HSIC(PXY , F, G) .. ...... For K = (k(xi , xj )), L = (l(yi , yj )), H = (δi,j − 1 n ) HSIC(xn , yn ) = 1 (n − 1)2 tr(KHLH) HSIC(PXY , F, G) → HSIC(PXY , F, G) as n → ∞ has been proved only for weak consistncy. Computation of HSIC(xn, yn, F, G): O(n3) Computation of the asymptotic distribution of H0: is O(n3 ) w.r.t. n based on U-statistics (Bunlphone, et. al, 2014). may not give correct estimaton based on permutation test. Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 14 / 20
  • 15. Experiments Experiments ...1 E¨¨ ¨¨¨¨¨B Errr rrrrj X Y 0 1 0 1 1 2 1 2 p 1 − p I(X, Y ) = HSIC(X, Y ) = 0 ⇐⇒ p = 1 2 ⇐⇒ X ⊥⊥ Y ...2 (X, Y ) ∼ N(0, Σ), Σ = [ 1 ρ ρ 1 ] , −1 < ρ < 1 I(X, Y ) = HSIC(X, Y ) = 0 ⇐⇒ ρ = 0 ⇐⇒ X ⊥⊥ Y ...3 P(X = 0) = P(X = 1) = 1 2 , Y ∼ N(aX, 1), a ≥ 0 I(X, Y ) = HSIC(X, Y ) = 0 ⇐⇒ a = 0 ⇐⇒ X ⊥⊥ Y Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 15 / 20
  • 16. Experiments Experiment 1 The Error Probabilities for n = 100 True p Proposal HSIC →Estimated p Threshold (×10−4) 4 8 12 16 20 p = 0.5 → p ̸= 0.5 0.084 0.306 0.135 0.077 0.043 0.022 p = 0.4 → p = 0.5 0.758 0.507 0.694 0.787 0.860 0.908 p = 0.3 → p = 0.5 0.333 0.139 0.251 0.396 0.505 0.610 p = 0.2 → p = 0.5 0.048 0.018 0.035 0.083 0.135 0.201 p = 0.1 → p = 0.5 0.001 0.000 0.001 0.005 0.010 0.021 ↑ Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 16 / 20
  • 17. Experiments Experiments 2 The Error Probabilities for n = 100 ρ Proposal HSIC Threshold (×10−3) 2 4 6 8 0.0 0.095 0.338 0.036 0.006 0.00 0.2 0.628 0.298 0.676 0.884 0.97 0.4 0.168 0.008 0.088 0.300 0.512 0.6 0.008 0.000 0.000 0.002 0.006 0.8 0.000 0.000 0.000 0.000 0.000 ↑ For the Gaussian kernel and Gauss distributions, HSIC performs very well. Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 17 / 20
  • 18. Experiments HSIC shows poor performance in cases such as ˆˆˆˆˆˆˆˆˆz        ~ $$$$$$$$$X ˆˆˆˆˆˆˆˆˆz$$$$$$$$$X ˆˆˆˆˆˆˆˆˆz& & & & & & & &&b $$$$$$$$$X X Y 0 ϵ 1 1 − ϵ 0 ϵ 1 1 − ϵif ϵ > 0 small. HSIC(xn .yn ) = 1 (n − 1)2 ∑ i ∑ j {k(xi , xj ) − 1 n ∑ h k(xi , xh)}{l(yi , yj ) − 1 n ∑ h l(yi , yh)} k(u, v) = l(u, v) = exp{−(u − v)2 } Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 18 / 20
  • 19. Experiments Execution Time Execution (sec) n 100 500 1000 2000 Proposal 0.30 0.33 0.62 1.05 HSIC 0.50 9.51 40.28 185.53 Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 19 / 20
  • 20. Concluding Remarks Concluding Remarks . Contribution .. ......Independence Test based on MDL/Bayes Proposal HSIC Principle Bayes Detection will be maximied Strong Discrete Continuous Threshold Not Necessay Necessary Prior Necessary Not Necessary Computation O(n) O(n3) Consistency Strong Weak Future Works The Border for which either Bays/MDL or HSIC outperforms R Package Joe Suzuki (Osaka University) Bayes Independence Test GABA 2014 20 / 20