SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Downloaden Sie, um offline zu lesen
1
Shohei Taniguchi, Matsuo Lab (M1)
• (EBM) (?)
• 2
- Flow Contrastive Estimation of Energy-Based Models
‣ 2
- Your Classifier is Secretly an Energy Based Model and You Should Treat it
Like One
‣
2
Outline
1. Energy Based Model (EBM)
- EBM
‣ Contrastive Divergence Learning (CD )
‣ Noice Contrastive Estimation ( )
2. EBM
- Restricted Boltzmann Machine (RBM)
3. Flow Contrastive Estimation of Energy-Based Models
4. Your Classifier is Secretly an Energy Based Model and You Should Treat it
Like One 3
Energy Based Model
4
EBM
•
-
x pθ (x) x
Eθ (x)
pθ (x) =
exp (−Eθ (x))
Z (θ) (
Z (θ) =
∫
exp (−Eθ (x)) dx
)
Z (θ)
5
EBM
• ( )
-
‣
‣ NCE
- (?)
•
- HMC
‣ MCMC
6
EBM
•
( )
-
➡
- EBM
log pθ (x)
Z (θ)
7
Contrastive Divergence Learning (CD )
• SGD
-
‣
‣ MCMC
∂ log pθ (x)
∂θ
∂ log pθ (x)
∂θ
= 𝔼pθ(x) [
∂Eθ (x)
∂θ ]
− 𝔼pdata(x) [
∂Eθ (x)
∂θ ]
pθ (x) 8
CD
•
- MCMC
- MCMC
➡
pθ (x)
9
Noise Contrastive Estimation (NCE, )
•
-
‣ ( )
‣
‣ GAN ( )
Z (θ) c
log pθ (x) = − Eθ (x) − c
θ
c Z (θ)
J (θ) = 𝔼pdata(x)
[
log
pθ(x)
pθ(x) + q(x)]
+ 𝔼q(x)
[
log
q(x)
pθ(x) + q(x) ]
q (x)
10
NCE
•
-
①
②
③
- ①, ② ③
‣
EBM
q (x)
q (x)
pdata (x)
11
EBM
12
EBM
•
• 2
- :
e.g. Autoencoder, Denoising AE
- EBM : 2
e.g. Restricted Boltzmann Machine, Deep Boltzmann Machine
13
Restricted Boltzmann Machine (RBM)
• 2
•
( 2 )
• CD
• Deep Boltzmann Machine
hi
P (hi = 1|v) = σ (v⊤
W:,i + bi)
E(v, h) = − b⊤
v − c⊤
h − v⊤
Wh
h
p (v) =
∑
i
p(v, h)
(
p(v, h) =
1
Z
exp(−E(v, h))
)
hi
14
RBM
RBM EBM
• RBM DBM EBM
ReLU
• VAE, GAN
•
•
➡
15
EBM
EBM (RBM )
• 2
•
EBM
• NN
(NN 1
)
•
16
E (v) = NN (v)
= w(n)
(
⋯φ (W(2)
φ (W(1)
v + b(1)
) + b(2)
))
+ b(n)
E (v, h(1)
, h(2)
, h(3)
)
= − v⊤
W(1)
h(1)
− h(1)⊤
W(2)
h(2)
− h(2)⊤
W(3)
h(3)
EBM
• Implicit Generation and Modeling with Energy-Based Models (NeurIPS
2019)
- EBM
- CD
-
-
17
32x32 Imagenet
Flow Contrastive Estimation of Energy-Based Models
18
•
- Ruiqi Gao, Erik Nijkamp, Diederik P. Kingma, Zhen Xu, Andrew M.
Dai, Ying Nian Wu
• NeurIPS 2019 Bayesian Deep Learning Workshop
• Kingma
• NCE EBM flow
• 19
(flow)
Noise Contrastive Estimation ( )
•
-
‣ ( )
‣
‣ GAN ( )
Z (θ) c
log pθ (x) = − Eθ (x) − c
θ
c Z (θ)
J (θ) = 𝔼pdata(x)
[
log
pθ(x)
pθ(x) + q(x)]
+ 𝔼q(x)
[
log
q(x)
pθ(x) + q(x) ]
q (x)
20
NCE ( )
•
-
①
②
③
- ①, ② ③
‣
EBM
q (x)
q (x)
pdata (x)
21
Flow Contrastive Estimation (FCE)
• flow
- flow
https://www.slideshare.net/DeepLearningJP2016/dlflowbased-deep-
generative-models
• flow
FCE NCE EBM
- EBM flow
q (x)
qα (x)
V(θ, α) = 𝔼pdata(x)
[
log
pθ(x)
pθ(x) + qα(x)]
+ 𝔼p(z)
[
log
qα (gα(z))
pθ (gα(z)) + qα (gα(z)) ]
22
FCE flow
• GAN
EBM
flow
• EBM flow
V(θ, α) = 𝔼pdata(x)
[
log
pθ(x)
pθ(x) + qα(x) ]
+ 𝔼p(z)
[
log
qα (gα(z))
pθ (gα(z)) + qα (gα(z)) ]
pθ(x)
pθ(x) + qα(x)
x
qα (gα(z))
pθ (gα(z)) + qα (gα(z))
gα(z)
23
= JSDV
• EBM EBM
flow
➡ GAN
• GAN EBM
Jensen-Shannon Divergence (JSD)V
JSD (qα∥pdata) = KL (pdata∥ (pdata + qα)/2) + KL (qα∥ (pdata + qα)/2)
24
FCE
• EBM flow
- flow
- EBM MCMC
➡
‣ EBM flow
25
1 2D
• 1
- Glow-MLE: Glow
- Glow-FCE: FCE Glow
- EBM-FCE: FCE EBM
• FCE EBM 1
26
1 2D
• EBM
• Glow FCE (trained)
FCE (rand)
27
2
FCE Glow
FID
28
FCE
• NCE EBM flow
Flow Contrastive Estimation (FCE)
• flow EBM
• flow GAN generator (JSD)
discriminator generator
GAN
• EBM
29
Your Classifier is Secretly an Energy Based Model and You
Should Treat it Like One
30
•
- Will Grathwohl, Kuan-Chieh Wang, Jörn-Henrik Jacobsen, David
Duvenaud, Mohammad Norouzi, Kevin Swersky
• ICLR 2020 accepted (8, 8, 6)
•
•
•
x y
p (y|x) p (x)
31
Joint Energy based Model (JEM)
• softmax
•
pθ(y|x) =
exp (fθ(x)[y])
∑y′
exp (fθ(x)[y′])
x y
pθ(x, y) =
exp (fθ(x)[y])
Z(θ)
, Z (θ) =
∫ ∑
y′
exp (fθ(x)[y′]) dx
32
Joint Energy based Model (JEM)
•
• 2
• 1
CD ( NCE )
log pθ(x, y) = log pθ(x) + log pθ(y|x)
x Eθ (x)
Eθ(x) = − LogSumExpy (fθ(x)[y]) = − log
∑
y
exp (fθ(x)[y])
33
JEM
•
- class-conditional
•
-
34
•
• class-conditional
35
CIFAR10
JEM
• CD EBM
-
- MCMC
‣
‣ FCE (?)
36
•
• RBM
• EBM FCE EBM
- JEM FCE
• NCE
• EBM (?)
37

Weitere ähnliche Inhalte

Was ist angesagt?

グラフィカルモデル入門
グラフィカルモデル入門グラフィカルモデル入門
グラフィカルモデル入門
Kawamoto_Kazuhiko
 

Was ist angesagt? (20)

Control as Inference (強化学習とベイズ統計)
Control as Inference (強化学習とベイズ統計)Control as Inference (強化学習とベイズ統計)
Control as Inference (強化学習とベイズ統計)
 
グラフィカルモデル入門
グラフィカルモデル入門グラフィカルモデル入門
グラフィカルモデル入門
 
ELBO型VAEのダメなところ
ELBO型VAEのダメなところELBO型VAEのダメなところ
ELBO型VAEのダメなところ
 
[DL輪読会]ICLR2020の分布外検知速報
[DL輪読会]ICLR2020の分布外検知速報[DL輪読会]ICLR2020の分布外検知速報
[DL輪読会]ICLR2020の分布外検知速報
 
backbone としての timm 入門
backbone としての timm 入門backbone としての timm 入門
backbone としての timm 入門
 
【DL輪読会】GPT-4Technical Report
【DL輪読会】GPT-4Technical Report【DL輪読会】GPT-4Technical Report
【DL輪読会】GPT-4Technical Report
 
[DL輪読会]Recent Advances in Autoencoder-Based Representation Learning
[DL輪読会]Recent Advances in Autoencoder-Based Representation Learning[DL輪読会]Recent Advances in Autoencoder-Based Representation Learning
[DL輪読会]Recent Advances in Autoencoder-Based Representation Learning
 
【DL輪読会】ViT + Self Supervised Learningまとめ
【DL輪読会】ViT + Self Supervised Learningまとめ【DL輪読会】ViT + Self Supervised Learningまとめ
【DL輪読会】ViT + Self Supervised Learningまとめ
 
Optimizer入門&最新動向
Optimizer入門&最新動向Optimizer入門&最新動向
Optimizer入門&最新動向
 
[DL輪読会]相互情報量最大化による表現学習
[DL輪読会]相互情報量最大化による表現学習[DL輪読会]相互情報量最大化による表現学習
[DL輪読会]相互情報量最大化による表現学習
 
Curriculum Learning (関東CV勉強会)
Curriculum Learning (関東CV勉強会)Curriculum Learning (関東CV勉強会)
Curriculum Learning (関東CV勉強会)
 
[DL輪読会]GQNと関連研究,世界モデルとの関係について
[DL輪読会]GQNと関連研究,世界モデルとの関係について[DL輪読会]GQNと関連研究,世界モデルとの関係について
[DL輪読会]GQNと関連研究,世界モデルとの関係について
 
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
Swin Transformer (ICCV'21 Best Paper) を完璧に理解する資料
 
SSII2022 [SS1] ニューラル3D表現の最新動向〜 ニューラルネットでなんでも表せる?? 〜​
SSII2022 [SS1] ニューラル3D表現の最新動向〜 ニューラルネットでなんでも表せる?? 〜​SSII2022 [SS1] ニューラル3D表現の最新動向〜 ニューラルネットでなんでも表せる?? 〜​
SSII2022 [SS1] ニューラル3D表現の最新動向〜 ニューラルネットでなんでも表せる?? 〜​
 
[DL輪読会]1次近似系MAMLとその理論的背景
[DL輪読会]1次近似系MAMLとその理論的背景[DL輪読会]1次近似系MAMLとその理論的背景
[DL輪読会]1次近似系MAMLとその理論的背景
 
【DL輪読会】言語以外でのTransformerのまとめ (ViT, Perceiver, Frozen Pretrained Transformer etc)
【DL輪読会】言語以外でのTransformerのまとめ (ViT, Perceiver, Frozen Pretrained Transformer etc)【DL輪読会】言語以外でのTransformerのまとめ (ViT, Perceiver, Frozen Pretrained Transformer etc)
【DL輪読会】言語以外でのTransformerのまとめ (ViT, Perceiver, Frozen Pretrained Transformer etc)
 
[DL輪読会]Deep Learning 第15章 表現学習
[DL輪読会]Deep Learning 第15章 表現学習[DL輪読会]Deep Learning 第15章 表現学習
[DL輪読会]Deep Learning 第15章 表現学習
 
[DL輪読会]Control as Inferenceと発展
[DL輪読会]Control as Inferenceと発展[DL輪読会]Control as Inferenceと発展
[DL輪読会]Control as Inferenceと発展
 
[DL輪読会] Spectral Norm Regularization for Improving the Generalizability of De...
[DL輪読会] Spectral Norm Regularization for Improving the Generalizability of De...[DL輪読会] Spectral Norm Regularization for Improving the Generalizability of De...
[DL輪読会] Spectral Norm Regularization for Improving the Generalizability of De...
 
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
[DL輪読会]Learning Transferable Visual Models From Natural Language Supervision
 

Ähnlich wie [DL輪読会]近年のエネルギーベースモデルの進展

Stochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of MultipliersStochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of Multipliers
Taiji Suzuki
 
A Note on the Derivation of the Variational Inference Updates for DILN
A Note on the Derivation of the Variational Inference Updates for DILNA Note on the Derivation of the Variational Inference Updates for DILN
A Note on the Derivation of the Variational Inference Updates for DILN
Tomonari Masada
 
グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討
グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討
グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討
Tomoki Koriyama
 
Phase Transitions of Best-of-Two and Best-of-Three on Stochastic Block Models
Phase Transitions of Best-of-Two and Best-of-Three on Stochastic Block ModelsPhase Transitions of Best-of-Two and Best-of-Three on Stochastic Block Models
Phase Transitions of Best-of-Two and Best-of-Three on Stochastic Block Models
Nobutaka Shimizu
 

Ähnlich wie [DL輪読会]近年のエネルギーベースモデルの進展 (20)

Phase diagram at finite T & Mu in strong coupling limit of lattice QCD
Phase diagram at finite T & Mu in strong coupling limit of lattice QCDPhase diagram at finite T & Mu in strong coupling limit of lattice QCD
Phase diagram at finite T & Mu in strong coupling limit of lattice QCD
 
Modeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential EquationModeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential Equation
 
【DL輪読会】Unbiased Gradient Estimation for Marginal Log-likelihood
【DL輪読会】Unbiased Gradient Estimation for Marginal Log-likelihood【DL輪読会】Unbiased Gradient Estimation for Marginal Log-likelihood
【DL輪読会】Unbiased Gradient Estimation for Marginal Log-likelihood
 
Stochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of MultipliersStochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of Multipliers
 
Using blurred images to assess damage in bridge structures?
Using blurred images to assess damage in bridge structures?Using blurred images to assess damage in bridge structures?
Using blurred images to assess damage in bridge structures?
 
RF Module Design - [Chapter 1] From Basics to RF Transceivers
RF Module Design - [Chapter 1] From Basics to RF TransceiversRF Module Design - [Chapter 1] From Basics to RF Transceivers
RF Module Design - [Chapter 1] From Basics to RF Transceivers
 
Multiband Transceivers - [Chapter 1]
Multiband Transceivers - [Chapter 1] Multiband Transceivers - [Chapter 1]
Multiband Transceivers - [Chapter 1]
 
Modeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential EquationModeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential Equation
 
Pseudo Random Number Generators
Pseudo Random Number GeneratorsPseudo Random Number Generators
Pseudo Random Number Generators
 
射頻電子 - [第一章] 知識回顧與通訊系統簡介
射頻電子 - [第一章] 知識回顧與通訊系統簡介射頻電子 - [第一章] 知識回顧與通訊系統簡介
射頻電子 - [第一章] 知識回顧與通訊系統簡介
 
Formations Near The Libration Points: Design Strategies Using Natural And Non...
Formations Near The Libration Points: Design Strategies Using Natural And Non...Formations Near The Libration Points: Design Strategies Using Natural And Non...
Formations Near The Libration Points: Design Strategies Using Natural And Non...
 
Improving EV Lateral Dynamics Control Using Infinity Norm Approach with Close...
Improving EV Lateral Dynamics Control Using Infinity Norm Approach with Close...Improving EV Lateral Dynamics Control Using Infinity Norm Approach with Close...
Improving EV Lateral Dynamics Control Using Infinity Norm Approach with Close...
 
Doubly Accelerated Stochastic Variance Reduced Gradient Methods for Regulariz...
Doubly Accelerated Stochastic Variance Reduced Gradient Methods for Regulariz...Doubly Accelerated Stochastic Variance Reduced Gradient Methods for Regulariz...
Doubly Accelerated Stochastic Variance Reduced Gradient Methods for Regulariz...
 
Ph ddefence
Ph ddefencePh ddefence
Ph ddefence
 
A Note on the Derivation of the Variational Inference Updates for DILN
A Note on the Derivation of the Variational Inference Updates for DILNA Note on the Derivation of the Variational Inference Updates for DILN
A Note on the Derivation of the Variational Inference Updates for DILN
 
digital logic design Chapter 2 boolean_algebra_&_logic_gates
digital logic design Chapter 2 boolean_algebra_&_logic_gatesdigital logic design Chapter 2 boolean_algebra_&_logic_gates
digital logic design Chapter 2 boolean_algebra_&_logic_gates
 
グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討
グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討
グラム行列のスパース近似を用いた生成的モーメントマッチングネットに基づく音声合成の検討
 
Introduction To Lisp
Introduction To LispIntroduction To Lisp
Introduction To Lisp
 
Agilent ADS 模擬手冊 [實習2] 放大器設計
Agilent ADS 模擬手冊 [實習2]  放大器設計Agilent ADS 模擬手冊 [實習2]  放大器設計
Agilent ADS 模擬手冊 [實習2] 放大器設計
 
Phase Transitions of Best-of-Two and Best-of-Three on Stochastic Block Models
Phase Transitions of Best-of-Two and Best-of-Three on Stochastic Block ModelsPhase Transitions of Best-of-Two and Best-of-Three on Stochastic Block Models
Phase Transitions of Best-of-Two and Best-of-Three on Stochastic Block Models
 

Mehr von Deep Learning JP

Mehr von Deep Learning JP (20)

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
 
【DL輪読会】事前学習用データセットについて
【DL輪読会】事前学習用データセットについて【DL輪読会】事前学習用データセットについて
【DL輪読会】事前学習用データセットについて
 
【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...
【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...
【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...
 
【DL輪読会】Zero-Shot Dual-Lens Super-Resolution
【DL輪読会】Zero-Shot Dual-Lens Super-Resolution【DL輪読会】Zero-Shot Dual-Lens Super-Resolution
【DL輪読会】Zero-Shot Dual-Lens Super-Resolution
 
【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv
【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv
【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv
 
【DL輪読会】マルチモーダル LLM
【DL輪読会】マルチモーダル LLM【DL輪読会】マルチモーダル LLM
【DL輪読会】マルチモーダル LLM
 
【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...
 【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo... 【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...
【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...
 
【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition
【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition
【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition
 
【DL輪読会】Can Neural Network Memorization Be Localized?
【DL輪読会】Can Neural Network Memorization Be Localized?【DL輪読会】Can Neural Network Memorization Be Localized?
【DL輪読会】Can Neural Network Memorization Be Localized?
 
【DL輪読会】Hopfield network 関連研究について
【DL輪読会】Hopfield network 関連研究について【DL輪読会】Hopfield network 関連研究について
【DL輪読会】Hopfield network 関連研究について
 
【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )
【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )
【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )
 
【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...
【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...
【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...
 
【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"
【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"
【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"
 
【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "
【DL輪読会】"Language Instructed Reinforcement Learning  for Human-AI Coordination "【DL輪読会】"Language Instructed Reinforcement Learning  for Human-AI Coordination "
【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "
 
【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models
【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models
【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models
 
【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"
【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"
【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"
 
【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...
【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...
【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...
 
【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...
【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...
【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...
 
【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...
【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...
【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...
 
【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...
【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...
【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

[DL輪読会]近年のエネルギーベースモデルの進展

  • 2. • (EBM) (?) • 2 - Flow Contrastive Estimation of Energy-Based Models ‣ 2 - Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One ‣ 2
  • 3. Outline 1. Energy Based Model (EBM) - EBM ‣ Contrastive Divergence Learning (CD ) ‣ Noice Contrastive Estimation ( ) 2. EBM - Restricted Boltzmann Machine (RBM) 3. Flow Contrastive Estimation of Energy-Based Models 4. Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One 3
  • 5. EBM • - x pθ (x) x Eθ (x) pθ (x) = exp (−Eθ (x)) Z (θ) ( Z (θ) = ∫ exp (−Eθ (x)) dx ) Z (θ) 5
  • 6. EBM • ( ) - ‣ ‣ NCE - (?) • - HMC ‣ MCMC 6
  • 7. EBM • ( ) - ➡ - EBM log pθ (x) Z (θ) 7
  • 8. Contrastive Divergence Learning (CD ) • SGD - ‣ ‣ MCMC ∂ log pθ (x) ∂θ ∂ log pθ (x) ∂θ = 𝔼pθ(x) [ ∂Eθ (x) ∂θ ] − 𝔼pdata(x) [ ∂Eθ (x) ∂θ ] pθ (x) 8
  • 10. Noise Contrastive Estimation (NCE, ) • - ‣ ( ) ‣ ‣ GAN ( ) Z (θ) c log pθ (x) = − Eθ (x) − c θ c Z (θ) J (θ) = 𝔼pdata(x) [ log pθ(x) pθ(x) + q(x)] + 𝔼q(x) [ log q(x) pθ(x) + q(x) ] q (x) 10
  • 11. NCE • - ① ② ③ - ①, ② ③ ‣ EBM q (x) q (x) pdata (x) 11
  • 13. EBM • • 2 - : e.g. Autoencoder, Denoising AE - EBM : 2 e.g. Restricted Boltzmann Machine, Deep Boltzmann Machine 13
  • 14. Restricted Boltzmann Machine (RBM) • 2 • ( 2 ) • CD • Deep Boltzmann Machine hi P (hi = 1|v) = σ (v⊤ W:,i + bi) E(v, h) = − b⊤ v − c⊤ h − v⊤ Wh h p (v) = ∑ i p(v, h) ( p(v, h) = 1 Z exp(−E(v, h)) ) hi 14 RBM
  • 15. RBM EBM • RBM DBM EBM ReLU • VAE, GAN • • ➡ 15
  • 16. EBM EBM (RBM ) • 2 • EBM • NN (NN 1 ) • 16 E (v) = NN (v) = w(n) ( ⋯φ (W(2) φ (W(1) v + b(1) ) + b(2) )) + b(n) E (v, h(1) , h(2) , h(3) ) = − v⊤ W(1) h(1) − h(1)⊤ W(2) h(2) − h(2)⊤ W(3) h(3)
  • 17. EBM • Implicit Generation and Modeling with Energy-Based Models (NeurIPS 2019) - EBM - CD - - 17 32x32 Imagenet
  • 18. Flow Contrastive Estimation of Energy-Based Models 18
  • 19. • - Ruiqi Gao, Erik Nijkamp, Diederik P. Kingma, Zhen Xu, Andrew M. Dai, Ying Nian Wu • NeurIPS 2019 Bayesian Deep Learning Workshop • Kingma • NCE EBM flow • 19 (flow)
  • 20. Noise Contrastive Estimation ( ) • - ‣ ( ) ‣ ‣ GAN ( ) Z (θ) c log pθ (x) = − Eθ (x) − c θ c Z (θ) J (θ) = 𝔼pdata(x) [ log pθ(x) pθ(x) + q(x)] + 𝔼q(x) [ log q(x) pθ(x) + q(x) ] q (x) 20
  • 21. NCE ( ) • - ① ② ③ - ①, ② ③ ‣ EBM q (x) q (x) pdata (x) 21
  • 22. Flow Contrastive Estimation (FCE) • flow - flow https://www.slideshare.net/DeepLearningJP2016/dlflowbased-deep- generative-models • flow FCE NCE EBM - EBM flow q (x) qα (x) V(θ, α) = 𝔼pdata(x) [ log pθ(x) pθ(x) + qα(x)] + 𝔼p(z) [ log qα (gα(z)) pθ (gα(z)) + qα (gα(z)) ] 22
  • 23. FCE flow • GAN EBM flow • EBM flow V(θ, α) = 𝔼pdata(x) [ log pθ(x) pθ(x) + qα(x) ] + 𝔼p(z) [ log qα (gα(z)) pθ (gα(z)) + qα (gα(z)) ] pθ(x) pθ(x) + qα(x) x qα (gα(z)) pθ (gα(z)) + qα (gα(z)) gα(z) 23
  • 24. = JSDV • EBM EBM flow ➡ GAN • GAN EBM Jensen-Shannon Divergence (JSD)V JSD (qα∥pdata) = KL (pdata∥ (pdata + qα)/2) + KL (qα∥ (pdata + qα)/2) 24
  • 25. FCE • EBM flow - flow - EBM MCMC ➡ ‣ EBM flow 25
  • 26. 1 2D • 1 - Glow-MLE: Glow - Glow-FCE: FCE Glow - EBM-FCE: FCE EBM • FCE EBM 1 26
  • 27. 1 2D • EBM • Glow FCE (trained) FCE (rand) 27
  • 29. FCE • NCE EBM flow Flow Contrastive Estimation (FCE) • flow EBM • flow GAN generator (JSD) discriminator generator GAN • EBM 29
  • 30. Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One 30
  • 31. • - Will Grathwohl, Kuan-Chieh Wang, Jörn-Henrik Jacobsen, David Duvenaud, Mohammad Norouzi, Kevin Swersky • ICLR 2020 accepted (8, 8, 6) • • • x y p (y|x) p (x) 31
  • 32. Joint Energy based Model (JEM) • softmax • pθ(y|x) = exp (fθ(x)[y]) ∑y′ exp (fθ(x)[y′]) x y pθ(x, y) = exp (fθ(x)[y]) Z(θ) , Z (θ) = ∫ ∑ y′ exp (fθ(x)[y′]) dx 32
  • 33. Joint Energy based Model (JEM) • • 2 • 1 CD ( NCE ) log pθ(x, y) = log pθ(x) + log pθ(y|x) x Eθ (x) Eθ(x) = − LogSumExpy (fθ(x)[y]) = − log ∑ y exp (fθ(x)[y]) 33
  • 36. JEM • CD EBM - - MCMC ‣ ‣ FCE (?) 36
  • 37. • • RBM • EBM FCE EBM - JEM FCE • NCE • EBM (?) 37