SlideShare a Scribd company logo
1 of 17
Download to read offline
When	
  Does	
  Label	
  Propaga1on	
  Fail?	
  
A	
  View	
  from	
  a	
  Network	
  Genera1ve	
  Model	
Yuto	
  Yamaguchi	
  and	
  Kohei	
  Hayashi	
17/08/22	
 IJCAI@Melbourne	
 1
Node Classification	
Given	
 Find	
Partially labeled
undirected graph	
Labels of all nodes	
17/08/22	
 IJCAI@Melbourne	
 2
Example:
User profile inference	
Friends	
Soccer	
 Soccer	
Soccer	
Tennis	
Baseball	
???	
What’s his hobby?
Node Classification	
17/08/22	
 IJCAI@Melbourne	
 3
Label Propagation (1/2)	
Propagate neighbors’ labels
Friends	
Soccer	
 Soccer	
Soccer	
Tennis	
Baseline	
???	
Soccer	
 Soccer	
Soccer	
Tennis	
Baseline	
Soccer	
[Zhu+, 03], [Zhou+, 03], etc.	
17/08/22	
 IJCAI@Melbourne	
 4
Label Propagation (2/2)	
Q F;X,Y,λ( )=
1
2
fi − yi 2
2
i=1
N
∑ +
λ
2
xij fi − fj 2
2
j=1
N
∑
i=1
N
∑
Given: adjacency matrix X and labels Y
Find: F = { fi } that minimizes Q
17/08/22	
 IJCAI@Melbourne	
 5	
F ∈ RN x K
Y ∈ {0, 1}N x K	
X ∈ {0, 1}N x N
N: # of nodes
K: # of labels
λ ∈ R+ : user parameter
[Zhu+, 03], [Zhou+, 03], etc.
Cases	
  when	
  LP	
  fails	
  (prac1cally	
  known)	
Different labels
are connected	
 Label ratio is not uniform	
Q. So, do we know why LP fails in these cases?

A. No. Since it’s not a probabilistic model, we
don’t know the assumptions behind the model.
17/08/22	
 IJCAI@Melbourne	
 6	
Edge probability is not uniform
What	
  we	
  do	
  in	
  this	
  work	
1.  Prove	
  a	
  theore1cal	
  rela1onship	
  between	
  LP	
  
and	
  Stochas(c	
  Block	
  Model,	
  which	
  is	
  a	
  well-­‐
studied	
  probabilis1c	
  genera1ve	
  model	
  
2.  Find	
  the	
  assump(ons	
  behind	
  LP	
  through	
  the	
  
assump1ons	
  behind	
  SBM	
  
3.  Show	
  when	
  and	
  why	
  LP	
  fails	
17/08/22	
 IJCAI@Melbourne	
 7
NETWORK	
  GENERATIVE	
  MODELS	
17/08/22	
 IJCAI@Melbourne	
 8
Stochastic Block Model	
Generative process	
 Multinomial	
Bernoulli	
①	
②	
①: Generate cluster assignment for each node

 
(which can be thought of labels)
②: Generate adjacency matrix	
17/08/22	
 IJCAI@Melbourne	
 9	
γ ∈ RK
Π ∈ RKxK
Parameters:
Proposed:
Partially Labeled SBM (PLSBM) 	
Generative process	
①	
②	
③	
②:Generate labels for “labeled nodes” 

 
(α large à yi is more likely to be the same as zi)
Depends on
parameter α	
17/08/22	
 IJCAI@Melbourne	
 10	
γ ∈ RK
Π ∈ RKxK
α ∈ 0,1[ ]
Parameters:
Rela1onships	
  between	
  models	
17/08/22	
 IJCAI@Melbourne	
 11	
SBM	
 PLSBM	
LP	
 Discre1zed	
  LP	
Main	
  result	
  
(next	
  slide)	
No	
  labels	
Con1nuous	
  
relaxa1on
Main Result	
Map estimator Z of PLSBM is identical to the solution of
(discretized) LP when the following conditions hold 
Condition 1: 
Condition 2:
Condition 3: 
Condition 4: (omitted)
17/08/22	
 IJCAI@Melbourne	
 12
Condition 1	
Implication (implicit assumption of LP)

•  Label ratio is uniform	
17/08/22	
 IJCAI@Melbourne	
 13	
Violates this assumption L
Condition 2	
Implication (Implicit assumptions of LP)	

•  Edge probs between the same labels are all the same (μ)
•  Edge probs between different labels are all the same (ν)	
17/08/22	
 IJCAI@Melbourne	
 14	
Violates this assumption L
Condition 3	
Implication (Implicit assumption of LP)

•  Assortative (same labels tend to be connected)
17/08/22	
 IJCAI@Melbourne	
 15	
Violates this assumption L
Experimental results	
17/08/22	
 IJCAI@Melbourne	
 16	
… Come see full results at the poster session J	
Better	
Setups:
1.  Generate datasets by PLSBM
2.  infer labels (Z) by PLSBM, SBM, and LP
3.  Report mean accuracy of 20 trials	
Assortative	
 Disassortative	
Agree with
theoretical results
Summary	
•  Proposed	
  Par1ally-­‐Labeled	
  SBM	
  (PLSBM)	
  
•  Proved	
  the	
  rela1onship	
  between	
  LP	
  and	
  SBM	
  via	
  
PLSBM	
  
•  Showed	
  cases	
  when	
  LP	
  fails	
  
•  Experimental	
  and	
  Theore1cal	
  results	
  agree	
17/08/22	
 IJCAI@Melbourne	
 17	
Github: yamaguchiyuto/plsbm

More Related Content

More from Yuto Yamaguchi

OMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label CorrelationOMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label CorrelationYuto Yamaguchi
 
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...Yuto Yamaguchi
 
SIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social MediaSIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social MediaYuto Yamaguchi
 
Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...Yuto Yamaguchi
 
The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...Yuto Yamaguchi
 
WWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social NetworksWWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social NetworksYuto Yamaguchi
 
ICDE2012勉強会:Social Media
ICDE2012勉強会:Social MediaICDE2012勉強会:Social Media
ICDE2012勉強会:Social MediaYuto Yamaguchi
 

More from Yuto Yamaguchi (7)

OMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label CorrelationOMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
 
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
 
SIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social MediaSIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social Media
 
Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...
 
The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...
 
WWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social NetworksWWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social Networks
 
ICDE2012勉強会:Social Media
ICDE2012勉強会:Social MediaICDE2012勉強会:Social Media
ICDE2012勉強会:Social Media
 

Recently uploaded

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

When Does Label Propagation Fail? Theoretical Analysis Using a Network Generative Model

  • 1. When  Does  Label  Propaga1on  Fail?   A  View  from  a  Network  Genera1ve  Model Yuto  Yamaguchi  and  Kohei  Hayashi 17/08/22 IJCAI@Melbourne 1
  • 2. Node Classification Given Find Partially labeled undirected graph Labels of all nodes 17/08/22 IJCAI@Melbourne 2
  • 3. Example: User profile inference Friends Soccer Soccer Soccer Tennis Baseball ??? What’s his hobby? Node Classification 17/08/22 IJCAI@Melbourne 3
  • 4. Label Propagation (1/2) Propagate neighbors’ labels Friends Soccer Soccer Soccer Tennis Baseline ??? Soccer Soccer Soccer Tennis Baseline Soccer [Zhu+, 03], [Zhou+, 03], etc. 17/08/22 IJCAI@Melbourne 4
  • 5. Label Propagation (2/2) Q F;X,Y,λ( )= 1 2 fi − yi 2 2 i=1 N ∑ + λ 2 xij fi − fj 2 2 j=1 N ∑ i=1 N ∑ Given: adjacency matrix X and labels Y Find: F = { fi } that minimizes Q 17/08/22 IJCAI@Melbourne 5 F ∈ RN x K Y ∈ {0, 1}N x K X ∈ {0, 1}N x N N: # of nodes K: # of labels λ ∈ R+ : user parameter [Zhu+, 03], [Zhou+, 03], etc.
  • 6. Cases  when  LP  fails  (prac1cally  known) Different labels are connected Label ratio is not uniform Q. So, do we know why LP fails in these cases? A. No. Since it’s not a probabilistic model, we don’t know the assumptions behind the model. 17/08/22 IJCAI@Melbourne 6 Edge probability is not uniform
  • 7. What  we  do  in  this  work 1.  Prove  a  theore1cal  rela1onship  between  LP   and  Stochas(c  Block  Model,  which  is  a  well-­‐ studied  probabilis1c  genera1ve  model   2.  Find  the  assump(ons  behind  LP  through  the   assump1ons  behind  SBM   3.  Show  when  and  why  LP  fails 17/08/22 IJCAI@Melbourne 7
  • 9. Stochastic Block Model Generative process Multinomial Bernoulli ① ② ①: Generate cluster assignment for each node (which can be thought of labels) ②: Generate adjacency matrix 17/08/22 IJCAI@Melbourne 9 γ ∈ RK Π ∈ RKxK Parameters:
  • 10. Proposed: Partially Labeled SBM (PLSBM) Generative process ① ② ③ ②:Generate labels for “labeled nodes” (α large à yi is more likely to be the same as zi) Depends on parameter α 17/08/22 IJCAI@Melbourne 10 γ ∈ RK Π ∈ RKxK α ∈ 0,1[ ] Parameters:
  • 11. Rela1onships  between  models 17/08/22 IJCAI@Melbourne 11 SBM PLSBM LP Discre1zed  LP Main  result   (next  slide) No  labels Con1nuous   relaxa1on
  • 12. Main Result Map estimator Z of PLSBM is identical to the solution of (discretized) LP when the following conditions hold Condition 1: Condition 2: Condition 3: Condition 4: (omitted) 17/08/22 IJCAI@Melbourne 12
  • 13. Condition 1 Implication (implicit assumption of LP) •  Label ratio is uniform 17/08/22 IJCAI@Melbourne 13 Violates this assumption L
  • 14. Condition 2 Implication (Implicit assumptions of LP) •  Edge probs between the same labels are all the same (μ) •  Edge probs between different labels are all the same (ν) 17/08/22 IJCAI@Melbourne 14 Violates this assumption L
  • 15. Condition 3 Implication (Implicit assumption of LP) •  Assortative (same labels tend to be connected) 17/08/22 IJCAI@Melbourne 15 Violates this assumption L
  • 16. Experimental results 17/08/22 IJCAI@Melbourne 16 … Come see full results at the poster session J Better Setups: 1.  Generate datasets by PLSBM 2.  infer labels (Z) by PLSBM, SBM, and LP 3.  Report mean accuracy of 20 trials Assortative Disassortative Agree with theoretical results
  • 17. Summary •  Proposed  Par1ally-­‐Labeled  SBM  (PLSBM)   •  Proved  the  rela1onship  between  LP  and  SBM  via   PLSBM   •  Showed  cases  when  LP  fails   •  Experimental  and  Theore1cal  results  agree 17/08/22 IJCAI@Melbourne 17 Github: yamaguchiyuto/plsbm