Graph Kernelpdf

Graph Kernel
CS15MTECH11011
THESIS(STAGE I)

Outline
oKernel
oGraph Kernel
oShortest Path Kernel
oGraphlet Kernel
oWL Kernel
oExperimentation
oResults
oConclusion

Kernels
 A kernel is a function of two objects that quantiﬁes their similarity.
Any kernel methods solution comprises two parts: a module that performs the mapping into the
embedding or feature space and a learning algorithm designed to discover linear patterns in that
space.
K(X,X’)=(Φ(X),Φ(X’))
Where Φ is the feature space transformation.
Property: Symmetric Positive-Definite
C’KC≥0;
Graph Kernel

Graph Kernel
Graph kernels are used to find similarity between two graphs.
It exploits graph topology but, restricts itself to comparing substructure of graph that are
computable in polynomial time.
Property: Symmetric positive-Definite.
Many Graph Kernels have been defined, we will focus on mainly following:
1. Shortest Path Kernel
2. Graphlet Kernel
3. Weisfeiler-Lehman Kernel.
GRAPH KERNEL

Shortest Path Kernel
Task: Find the shortest path of all lengths among the graphs.
Working:
1. Floyd Transformation of graph.
2. Apply shortest path kernel over that transformed graph.
Complexity: O(n4)
GRAPH KERNEL

SP Kernel(Cont.)
Floyd Transformation:
 Contains same number of nodes as input graph.
 Unlike, there exist an edge between nodes in transformed graph if there exist a walk
between the nodes in input graph.
 Edge are marked as shortest distance
between nodes
 Complexity O(n3) Floyd-Warshall algorithm
GRAPH KERNEL

SP Kernel(Cont.)
GRAPH KERNEL
Kernel: Let S1(E1,V1), S2(E2,V2) be two graphs

SP Kernel (Cont.)
GRAPH KERNEL

Graphlet Kernel
Task: Count number of graphlet because isomorphic graphs have same graphlet distribution.
Count the number of graphlets of specific size, for linear computation.
There are possible graphlet of size k.
Let G and G’ be 2 graphs and S and S’ be their respective subgraphs.
Complexity: O(ndk-1)
 d=maximum degree of node
 k= size of graphlet
GRAPH KERNEL

Graphlet Kernel
 computing 3-graphlet kernel for connected component (triangle, V shapes).
GRAPH KERNEL

Weisfeiler-Lehman Kernel
WL Algorithm:
GRAPH KERNEL

WL Algorithm :iteration 1
Each Iteration of WL test comprises of following steps:-
1.Multiset label determination and sorting
◦ O(m) via Bucket Sort
GRAPH KERNEL

2.label compression
◦ O(m) via Radix Sort
GRAPH KERNEL

2.label compression
3. Relabeling
◦ O(n)
GRAPH KERNEL

2.label compression
3. Relabeling
◦ O(n)
4.Are the labels of G and G’ identical?
Yes, continue.
GRAPH KERNEL

GRAPH KERNEL

Each Iteration of WL test comprises of following steps:
GRAPH KERNEL

Each Iteration of WL test comprises of following steps:
2.label compression
GRAPH KERNEL

2.label compression
3. Relabeling
◦ O(n)
GRAPH KERNEL

2.label compression
3. Relabeling
◦ O(n)
4.Are the labels of G and G’ identical?
NO, output YES.
5.complexity O(hm) for h iteration
GRAPH KERNEL

WL Kernel
Feature{2 3 4 5 6 7 8 9 10}
Φ(G1)={3 2 1 0 1 2 2 0 1}
Φ(G2)={3 2 1 2 1 0 0 2 1 }
K{G1,G2}=16
GRAPH KERNEL

EXPERIMENT
1. INPUT: We have generated 3 different kind of input graphs.
a)Edge Removed Graphs: We took a complete graph and removed 10 edges at a time
from that graph, iteratively generate new graph.
b)Degree Reduced Graph: We took a complete graph and reduced the degree of each
node by 1,iteratively generate new graph.
c)Node Removed Graph: We took a complete graph and removed 1 node at a time from
that graph to generate new graph iteratively.
GRAPH KERNEL

EXPERIMENT
NORMALIZATION:
GRAPH KERNEL

Graphlet Kernel norm.
GRAPH KERNEL

Shortest Path Kernel norm.
GRAPH KERNEL

SVM
Graph Label Assignment:10 belong to each class, iteratively.
10 Fold SVM
Function Used: ‘rbf’
Gamma=0.00001
Accuracy: mean Accuracy output from all 10 folds.
GRAPH KERNEL

Conclusion
Accuracy:
Graphlet Kernel > WL Kernel > Shortest Path Kernel
I. Graphlet Kernel Exploits the topology very effectively by considering graphlet distribution.
II. Shortest Path Kernel shows least accuracy because it computes the number of shortest path
while not considering the topology involved in graphs.
III. WL Kernel shows moderate performance on our dataset.
IV. As we increase number of nodes the accuracy decreases because number of false
classification increases.
GRAPH KERNEL

Conclusion (cont.)
Time
Graphlet Kernel > Shortest Path Kernel > WL Kernel
I. Graphlet Kernel has highest computation time O(ndk-1),since our dataset is pretty dense.
II. Shortest Path Kernel O(n4).
III. WL Kernel O(Nhm).
N=number of Graphs.
h=number of iterations.
m=number of distinct labels.
GRAPH KERNEL

Graph Kernelpdf

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (15)

Similar to Graph Kernelpdf

Similar to Graph Kernelpdf (20)

Graph Kernelpdf