Graph Gurus 32
Using Graph Algorithms for Advanced Analytics
Part 5 - Classification
© 2020 TigerGraph. All Rights Reserved
Today's Presenter
Victor Lee
Head of Product Strategy & Developer Relations
● BS in Electrical Engineering and Computer
Science from UC Berkeley, MS in Electrical
Engineering from Stanford University
● PhD in Computer Science from Kent State
University focused on graph data mining
● 20+ years in tech industry
Some Housekeeping Items
● Although your phone is muted we do want to answer your questions -
submit your questions at any time using the Q&A tab in the menu
● The webinar is being recorded and will be uploaded to our website shortly
(https://www.tigergraph.com/webinars/) and the URL will be emailed
to you
● If you have issues with Zoom please contact the panelists via chat
Move Faster with TigerGraph Cloud
Built for agile teams who would rather build innovative applications than
procure hardware or configure and manage databases
● Start for free
● Move to production with distributed data and HA replication
Today’s Outline
1 Recap of Parts 1 to 4:
Path, Centrality, Community, and
Similarity Algorithms
2 Introduction to machine learning
Use Cases for Classification
3 Training a Classifier: K-Nearest Neighbors
4 Demo
GSQL Queries for training & classifying
Review: Analytics with Graph Algorithms
● Graph algorithms answer fundamental questions about
connected data
● Each algorithm in a library is a tool in an analytics toolkit
● Building blocks for more complex business questions
Specialized functions
Combine to make something better
Example Questions/Analyses for Graph Algorithms
Which entity is most centrally
located?
● For delivery logistics or greatest visibility
● Closeness Centrality, Betweenness
Centrality algorithms
How much influence does this entity
exert over the others?
● For market penetration & buyer influence
● PageRank algorithm
Which entity has similar
relationships to this entity?
● For grouping customers, products, etc.
● Cosine Similarity, Jaccard Similarity,
SimRank, RoleSim algorithms
What are the natural community
groupings in the graph?
● For partitioning risk groups, workgroups,
product offerings, etc.
● Community Detection, MinCut algorithms
Summary for Shortest Path Algorithms
Graph Gurus 26
1 Graph Algorithms - tools and building
blocks for analyzing graph data
2 Learning To Use Algorithms - know what
problem they solve, pros and cons
3 Shortest Path Algorithms - different
algorithms for weighted and unweighted
graphs
4 GSQL Algorithm Library - runs in-database,
high-performance,
easy to read and modify
Summary for Centrality Algorithms
Graph Gurus 27
1 Centrality Algorithms - abstract
concepts of location and travel.
2 Closeness and Betweenness - use shortest
paths. Betweenness is more complex.
3 PageRank - uses directed referral edges to
find the most influential nodes. Personalized
PageRank is localized.
4 Customizing GSQL Library algorithms -
easy and familiar, like procedural SQL
Summary for Community Detection Algorithms
Graph Gurus 29
1 Community Detection Algorithms
Use connectedness to decide boundaries
2 Communities are Clusters, not Partitions
Don't have to include everyone.
Can overlap?
3 Strict vs. Lenient Community Rules
Black & white rules are not always helpful.
Louvain uses relative density.
4 Pre- or Post- step with other algorithms
Many algorithms assume you start from just
one connected community
Summary for Similarity Algorithms
Graph Gurus 30
1 Similarity is in the Eye of the Beholder
What factors matter to you? How much?
2 Graph modeling helps with Similarity
Hub-and-spoke view
3 Jaccard and Cosine Similarity
Counting matches vs. measuring numerical
alignment
4 Deeper Measures: SimRank and RoleSim
Define similarity recursively, look multiple hops
deep (globally)
Some Types of Graph Algorithms
● Search
● Path Finding & Analytics
● Centrality / Ranking
● Clustering / Community Detection
● Similarity
● Classification
Classifying 4 types of good & bad telecom users, using basic and graph features
User types: Prankster, Regular Customer, Sales, Fraudster

                            Tim               Sarah             John              Fred
Age of sim card             2 weeks           4 weeks           3 weeks           2 weeks
% of one directional calls  50%               10%               55%               60%
% rejected calls            40%               5%                28%               25%
Stable group                Yes               Yes               No                No
Many in-group connections   No                Yes               No                Yes
3-step friend relation      No                Yes               No                Yes
Prediction                  Likely Prankster  Regular Customer  Likely Fraudster  Likely Sales
(Prediction by machine learning with deep link graph features)
Download the solution brief: https://info.tigergraph.com/MachineLearning
Other Use Cases
Medical Diagnosis
● Cold, Flu, COVID-19 or other?
Understanding Natural Language
● Word recognition, part of speech
● Question or statement?
● Sarcasm or not?
● Sentiment analysis
● Expected response
Common Classifiers
1. Rule-based:
"If it walks like a duck and talks
like a duck, then it's a duck."
2. Decision Tree
Walk like duck?
  N: Not duck
  Y: Talk like duck?
    N: Not duck
    Y: DUCK
● The two methods are equivalent
● But how do you derive the rules / decisions?
○ Not talking about classification by legislation: "Because I say so"
○ Are there ways to make a "best" classifier?
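The equivalence of the rule-based classifier and the decision tree can be checked by running both on every combination of feature values. A minimal Python sketch; the function names and the boolean encoding of the two features are illustrative assumptions, not part of the talk:

```python
from itertools import product


def rule_based(walks_like_duck: bool, talks_like_duck: bool) -> str:
    """Single rule: both conditions must hold."""
    return "DUCK" if walks_like_duck and talks_like_duck else "Not duck"


def decision_tree(walks_like_duck: bool, talks_like_duck: bool) -> str:
    """Same logic as nested tests, one feature per tree level."""
    if not walks_like_duck:
        return "Not duck"
    if not talks_like_duck:
        return "Not duck"
    return "DUCK"


# Compare the two classifiers on every combination of feature values.
for walks, talks in product([False, True], repeat=2):
    assert rule_based(walks, talks) == decision_tree(walks, talks)
```

Both forms encode the same decisions; neither says where the rules come from, which is the question the next slides take up.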
Classifier Induction (i.e. Learning how to classify)
● Need a set of training instances, where you know both (1) features
and (2) the classes (labels) of each instance.
● Use some statistical method to correlate the features to the labels.
Item/Case  Feature1  Feature2  Feature3  Label: Classification
1          red       2.3       yes       A
2          blue      4.1       yes       B
3          red       4.0       yes       A
Learning a Classifier is one type of Supervised Machine Learning
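As one illustration of "some statistical method", the sketch below counts which label co-occurs most often with each value of a single feature, using the three training instances from the table. This frequency-count rule is a deliberately simple stand-in chosen for illustration, not a method named in the talk, and the function name `induce_rule` is hypothetical.

```python
from collections import Counter, defaultdict

# Training instances from the table: (Feature1, Feature2, Feature3, label).
training = [
    ("red", 2.3, "yes", "A"),
    ("blue", 4.1, "yes", "B"),
    ("red", 4.0, "yes", "A"),
]


def induce_rule(instances, feature_index):
    """Map each observed feature value to the label it co-occurs with most often."""
    counts = defaultdict(Counter)
    for instance in instances:
        counts[instance[feature_index]][instance[-1]] += 1
    return {value: labels.most_common(1)[0][0] for value, labels in counts.items()}


# Learn a rule from Feature1 (the color column).
rule = induce_rule(training, feature_index=0)
print(rule)
```

On this tiny data set Feature1 separates the labels perfectly, while Feature3 ("yes" for every instance) carries no information about the label.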
k-Nearest Neighbor Classification (kNN)
● Concept: Predict an entity's class by
looking at the classes of the
"nearest" other entities.
● Question: What is distance?
○ Physical distance? → Clustering
○ Or, some concept of similarity?
● How many neighbors to consider?
○ Within a radius?
○ Up to a certain number?
k-Nearest Neighbor Classification (kNN)
Consider the k closest neighbors, from nearest to farthest:
● Pick the class that is represented most often
● The prediction depends on the value of k:
k red yellow unlabeled Prediction
2 0 2 0 yellow
3 0 2 1 yellow
4 1 2 1 yellow
5 2 2 1 red/yellow
6 3 2 1 red
7 3 2 2 red
8 3 3 2 red/yellow
9 3 3 3 red/yellow
10 4 3 3 red
11 4 4 3 red/yellow
12 4 5 3 yellow
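The table can be reproduced with a few lines of Python. The neighbor sequence below is inferred from the cumulative counts in the table (each row adds exactly one neighbor); the function name `knn_predict` and the use of `None` for an unlabeled neighbor are assumptions made for this sketch.

```python
from collections import Counter

# Neighbor classes from nearest to farthest, inferred from the cumulative
# counts in the table (None marks an unlabeled neighbor).
NEIGHBORS = ["yellow", "yellow", None, "red", "red", "red",
             None, "yellow", None, "red", "yellow", "yellow"]


def knn_predict(neighbor_labels, k):
    """Return the set of classes with the most votes among the k nearest neighbors."""
    votes = Counter(label for label in neighbor_labels[:k] if label is not None)
    if not votes:
        return set()
    top = max(votes.values())
    return {label for label, count in votes.items() if count == top}


for k in range(2, 13):
    print(k, sorted(knn_predict(NEIGHBORS, k)))
```

A result with two classes corresponds to a "red/yellow" tie in the table; a real classifier needs some tie-breaking rule, which the slide leaves open.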
kNN as a Machine Learning Task
● As long as you have a distance/similarity function, you don't
need any additional "intelligence", except the value of k
● ML task: Find the best value of k
[Diagram: Item you want to classify → Distance calculation → Simple counting, with k (?) → Prediction]
Could also work on optimizing the distance
function, but that's out of scope of our
current discussion.
kNN for Graph Data
kNN is a general purpose classifier. How does it work with graphs?
● It's really about the distance/similarity function.
● For our kNN, we use cosine neighborhood similarity.
● It's easy to replace this with Jaccard neighborhood similarity.
[Figure: small weighted graph with vertices A, B, W, X, Y, Z]
Similarity(A,B) = f(shared neighbors)
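To make "Similarity(A,B) = f(shared neighbors)" concrete, here is a minimal Python sketch of the two similarity functions mentioned on this slide. It assumes each vertex's neighborhood is given as a dict from neighbor id to edge weight; the function names and the example weights are hypothetical and are not the values from the slide's figure, nor the GSQL library's implementation.

```python
import math


def cosine_neighborhood_similarity(a: dict, b: dict) -> float:
    """Cosine of the angle between two neighbor-weight vectors.

    Each dict maps a neighbor id to the weight of the edge to that neighbor;
    a neighbor missing from a dict counts as weight 0.
    """
    dot = sum(weight * b[n] for n, weight in a.items() if n in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def jaccard_neighborhood_similarity(a: dict, b: dict) -> float:
    """Shared neighbors divided by all distinct neighbors; weights are ignored."""
    union = a.keys() | b.keys()
    if not union:
        return 0.0
    return len(a.keys() & b.keys()) / len(union)


# Hypothetical edge weights, not the values from the slide's figure.
A = {"W": 1.0, "X": 2.0, "Y": 2.0}
B = {"X": 2.0, "Y": 2.0, "Z": 1.0}
print(cosine_neighborhood_similarity(A, B))
print(jaccard_neighborhood_similarity(A, B))
```

Because both functions take the same inputs, swapping one for the other in a kNN classifier is a one-line change, which is the point of the last bullet above.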
Cross Validation
● In Machine Learning, first train a model, and then validate
(check) the accuracy of the model. Split your labeled data
into subsets:
○ Training set (bigger part)
○ Validation set (smaller part): Use the trained model to see if it
correctly predicts the actual labels in the validation set.
n-Fold Cross Validation: split the labeled data into segments 1, 2, ..., n-1, n; hold out one segment for validation and use the rest as the training set.
Leave-one-out Cross Validation: hold out a single item instead of a segment.
Repeat, leaving out a different one (segment) each time.
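The splitting scheme can be sketched in a few lines of Python. The function name `n_fold_splits` and the round-robin assignment of items to segments are assumptions for this sketch; any assignment that partitions the labeled data into n segments would serve.

```python
def n_fold_splits(items, n):
    """Yield (training, validation) pairs, holding out one segment per round."""
    folds = [items[i::n] for i in range(n)]  # round-robin assignment to n segments
    for held_out in range(n):
        validation = folds[held_out]
        training = [x for i, fold in enumerate(folds) if i != held_out for x in fold]
        yield training, validation


labeled = list(range(10))  # stand-in ids for ten labeled items

# 5-fold: each round validates on 2 items and trains on the other 8.
for training, validation in n_fold_splits(labeled, 5):
    print(validation, training)

# Leave-one-out is the special case n == len(items).
loo = list(n_fold_splits(labeled, len(labeled)))
```

Every labeled item lands in the validation set exactly once, so the accuracy averaged over all rounds uses each label once for checking.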
TigerGraph kNN_cosine_cv algorithm
Given
● a (fully or partially) labeled graph
● a similarity measure
● a range of k values chosen by the user
For each value of k:
● For each labeled item Q in the graph:
○ Predict Q's label by looking at the classes of the k closest entities
○ Accuracy(k) = number (%) of correct predictions
Select the k which produced the highest Accuracy(k)
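The procedure above can be sketched in plain Python as leave-one-out validation over the labeled items. This is an illustration of the steps on the slide, not the GSQL implementation of kNN_cosine_cv: the function names, the dict-based graph representation, and the example data are all hypothetical, and as a simplification only labeled items are considered as neighbors.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity of two neighbor-weight dicts."""
    dot = sum(w * b[n] for n, w in a.items() if n in b)
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def predict(q, neighborhoods, labels, k):
    """Majority label among the k labeled items most similar to q (q excluded)."""
    others = [v for v in labels if v != q]
    others.sort(key=lambda v: cosine(neighborhoods[q], neighborhoods[v]), reverse=True)
    votes = Counter(labels[v] for v in others[:k])
    return votes.most_common(1)[0][0]


def loo_accuracy(neighborhoods, labels, k):
    """Fraction of labeled items whose label is predicted correctly."""
    correct = sum(predict(q, neighborhoods, labels, k) == labels[q] for q in labels)
    return correct / len(labels)


def best_k(neighborhoods, labels, k_values):
    """Return the k with the highest accuracy (first one wins on equal accuracy)."""
    return max(k_values, key=lambda k: loo_accuracy(neighborhoods, labels, k))


# Hypothetical data: class "A" items mostly link to a, b; class "B" to c, d.
neighborhoods = {
    "A1": {"a": 1, "b": 1}, "A2": {"a": 1, "b": 1}, "A3": {"a": 1, "b": 1, "c": 1},
    "B1": {"c": 1, "d": 1}, "B2": {"c": 1, "d": 1}, "B3": {"c": 1, "d": 1, "b": 1},
}
labels = {"A1": "A", "A2": "A", "A3": "A", "B1": "B", "B2": "B", "B3": "B"}

for k in (1, 3, 5):
    print(k, loo_accuracy(neighborhoods, labels, k))
print("best k:", best_k(neighborhoods, labels, [1, 3, 5]))
```

In this toy data, k = 5 forces every item to consult more neighbors from the other class than from its own, so accuracy collapses; this is the kind of effect the search over k is meant to detect.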
DEMO
GSQL Graph Algorithms in TigerGraph Cloud
Sample Data Set
GSQL Graph Algorithm Library
● Written in GSQL - high-level, parallelized
● Open-source, user-extensible
● Well-documented
docs.tigergraph.com/graph-algorithm-library
TigerGraph GSQL Graph Algorithm Library
✓ Call each algorithm as a GSQL query
or as a RESTful endpoint
✓ Run the algorithms in-database (don't
export the data)
✓ Option to update the graph with the
algorithm results
✓ Able to modify/customize the
algorithms. Turing-complete
language.
✓ Massively parallel processing to
handle big graphs
Summary
1 Graph Algorithms
Key tool for data scientists
2 Classification Algorithms
Bridge the gap to machine learning
3 k-Nearest Neighbors
Predicts the classification by looking at the
classes of the similar/nearby items.
What's the right value for k?
4 Advanced Analytics with Graph Algorithms
https://docs.tigergraph.com/graph-algorithm-library
Q&A
Please submit your questions via the Q&A tab in Zoom
More Questions?
Join our Developer Forum
https://groups.google.com/a/opengsql.org/forum/#!forum/gsql-users
Sign up for our Developer Office Hours (every Thursday at 11 AM PST)
https://info.tigergraph.com/officehours
Additional Resources
Start Free at TigerGraph Cloud Today!
https://www.tigergraph.com/cloud/
Test Drive Online Demo
https://www.tigergraph.com/demo
Download the Developer Edition
https://www.tigergraph.com/download/
Guru Scripts
https://github.com/tigergraph/ecosys/tree/master/guru_scripts
Upcoming Online Events
Graph Gurus 33: GSQL Writing Best Practices - Part 2
Wednesday, April 8, at 11am PDT
https://info.tigergraph.com/graph-gurus-33
Thank You

Weitere Àhnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
 
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
 
Graph Gurus Episode 17: Seven Key Data Science Capabilities Powered by a Nati...
Graph Gurus Episode 17: Seven Key Data Science Capabilities Powered by a Nati...Graph Gurus Episode 17: Seven Key Data Science Capabilities Powered by a Nati...
Graph Gurus Episode 17: Seven Key Data Science Capabilities Powered by a Nati...
 
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AIGraph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
 
Graph Gurus Episode 6: Community Detection
Graph Gurus Episode 6: Community DetectionGraph Gurus Episode 6: Community Detection
Graph Gurus Episode 6: Community Detection
 
Graph Databases and Machine Learning | November 2018
Graph Databases and Machine Learning | November 2018Graph Databases and Machine Learning | November 2018
Graph Databases and Machine Learning | November 2018
 
Graph Gurus Episode 35: No Code Graph Analytics to Get Insights from Petabyte...
Graph Gurus Episode 35: No Code Graph Analytics to Get Insights from Petabyte...Graph Gurus Episode 35: No Code Graph Analytics to Get Insights from Petabyte...
Graph Gurus Episode 35: No Code Graph Analytics to Get Insights from Petabyte...
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise Graph
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA Hardware
 
Graph Gurus Episode 12: Tiger Graph v2.3 Overview
Graph Gurus Episode 12: Tiger Graph v2.3 OverviewGraph Gurus Episode 12: Tiger Graph v2.3 Overview
Graph Gurus Episode 12: Tiger Graph v2.3 Overview
 
Graph Gurus Episode 9: How Visa Optimizes Network and IT Resources with a Nat...
Graph Gurus Episode 9: How Visa Optimizes Network and IT Resources with a Nat...Graph Gurus Episode 9: How Visa Optimizes Network and IT Resources with a Nat...
Graph Gurus Episode 9: How Visa Optimizes Network and IT Resources with a Nat...
 
Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4
 
Graph Gurus Episode 22: Cybersecurity
Graph Gurus Episode 22: CybersecurityGraph Gurus Episode 22: Cybersecurity
Graph Gurus Episode 22: Cybersecurity
 
Graph Gurus Episode 3: Anti Fraud and AML Part 1
Graph Gurus Episode 3: Anti Fraud and AML Part 1Graph Gurus Episode 3: Anti Fraud and AML Part 1
Graph Gurus Episode 3: Anti Fraud and AML Part 1
 
Plume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryPlume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis Library
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
 
Tales from an ip worker in consulting and software
Tales from an ip worker in consulting and softwareTales from an ip worker in consulting and software
Tales from an ip worker in consulting and software
 
Shift Remote: AI: Smarter AI with analytical graph databases - Victor Lee (Ti...
Shift Remote: AI: Smarter AI with analytical graph databases - Victor Lee (Ti...Shift Remote: AI: Smarter AI with analytical graph databases - Victor Lee (Ti...
Shift Remote: AI: Smarter AI with analytical graph databases - Victor Lee (Ti...
 
Graphs and Financial Services Analytics
Graphs and Financial Services AnalyticsGraphs and Financial Services Analytics
Graphs and Financial Services Analytics
 
Neo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExpNeo4j MeetUp - Graph Exploration with MetaExp
Neo4j MeetUp - Graph Exploration with MetaExp
 

Ähnlich wie Graph Gurus Episode 32: Using Graph Algorithms for Advanced Analytics Part 5

Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...
Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...
Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...
Databricks
 
Building A Machine Learning Platform At Quora (1)
Building A Machine Learning Platform At Quora (1)Building A Machine Learning Platform At Quora (1)
Building A Machine Learning Platform At Quora (1)
Nikhil Garg
 
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
Dataconomy Media
 

Ähnlich wie Graph Gurus Episode 32: Using Graph Algorithms for Advanced Analytics Part 5 (20)

Graph Gurus Episode 29: Using Graph Algorithms for Advanced Analytics Part 3
Graph Gurus Episode 29: Using Graph Algorithms for Advanced Analytics Part 3Graph Gurus Episode 29: Using Graph Algorithms for Advanced Analytics Part 3
Graph Gurus Episode 29: Using Graph Algorithms for Advanced Analytics Part 3
 
Graph Gurus Episode 5: Webinar PageRank
Graph Gurus Episode 5: Webinar PageRankGraph Gurus Episode 5: Webinar PageRank
Graph Gurus Episode 5: Webinar PageRank
 
Graph Gurus Episode 8: Location, Location, Location - Geospatial Analysis wit...
Graph Gurus Episode 8: Location, Location, Location - Geospatial Analysis wit...Graph Gurus Episode 8: Location, Location, Location - Geospatial Analysis wit...
Graph Gurus Episode 8: Location, Location, Location - Geospatial Analysis wit...
 
Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
Graph Gurus Episode 19: Deep Learning Implemented by GSQL on a Native Paralle...
 
Graph Gurus Episode 34: Graph Databases are Changing the Fraud Detection and ...
Graph Gurus Episode 34: Graph Databases are Changing the Fraud Detection and ...Graph Gurus Episode 34: Graph Databases are Changing the Fraud Detection and ...
Graph Gurus Episode 34: Graph Databases are Changing the Fraud Detection and ...
 
Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsScaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analytics
 
Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]
 
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
Graph Gurus Episode 7: Connecting the Dots in Real-Time: Deep Link Analysis w...
 
Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...
Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...
Real-Time Fraud Detection at Scale—Integrating Real-Time Deep-Link Graph Anal...
 
GridMAP: Next generation energy analysis tools.
GridMAP: Next generation energy analysis tools.GridMAP: Next generation energy analysis tools.
GridMAP: Next generation energy analysis tools.
 
ML Zoomcamp 2.1 - Car Price Prediction Project
ML Zoomcamp 2.1 - Car Price Prediction ProjectML Zoomcamp 2.1 - Car Price Prediction Project
ML Zoomcamp 2.1 - Car Price Prediction Project
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
 
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
 
Neo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best Practices
Neo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best PracticesNeo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best Practices
Neo4j Graph Data Science Training - June 9 & 10 - Slides #7 GDS Best Practices
 
Kaggle and data science
Kaggle and data scienceKaggle and data science
Kaggle and data science
 
K Nearest Neighbor V1.0 Supervised Machine Learning Algorithm
K Nearest Neighbor V1.0 Supervised Machine Learning AlgorithmK Nearest Neighbor V1.0 Supervised Machine Learning Algorithm
K Nearest Neighbor V1.0 Supervised Machine Learning Algorithm
 
Graph processing at scale using spark & graph frames
Graph processing at scale using spark & graph framesGraph processing at scale using spark & graph frames
Graph processing at scale using spark & graph frames
 
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
Nikhil Garg, Engineering Manager, Quora at MLconf SF 2016
 
Building A Machine Learning Platform At Quora (1)
Building A Machine Learning Platform At Quora (1)Building A Machine Learning Platform At Quora (1)
Building A Machine Learning Platform At Quora (1)
 
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
 

Mehr von TigerGraph

Mehr von TigerGraph (20)

MAXIMIZING THE VALUE OF SCIENTIFIC INFORMATION TO ACCELERATE INNOVATION
MAXIMIZING THE VALUE OF SCIENTIFIC INFORMATION TO ACCELERATE INNOVATIONMAXIMIZING THE VALUE OF SCIENTIFIC INFORMATION TO ACCELERATE INNOVATION
MAXIMIZING THE VALUE OF SCIENTIFIC INFORMATION TO ACCELERATE INNOVATION
 
Better Together: How Graph database enables easy data integration with Spark ...
Better Together: How Graph database enables easy data integration with Spark ...Better Together: How Graph database enables easy data integration with Spark ...
Better Together: How Graph database enables easy data integration with Spark ...
 
Building an accurate understanding of consumers based on real-world signals
Building an accurate understanding of consumers based on real-world signalsBuilding an accurate understanding of consumers based on real-world signals
Building an accurate understanding of consumers based on real-world signals
 
Care Intervention Assistant - Omaha Clinical Data Information System
Care Intervention Assistant - Omaha Clinical Data Information SystemCare Intervention Assistant - Omaha Clinical Data Information System
Care Intervention Assistant - Omaha Clinical Data Information System
 
Correspondent Banking Networks
Correspondent Banking NetworksCorrespondent Banking Networks
Correspondent Banking Networks
 
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
 
Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...
Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...
Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...
 
Fraud Detection and Compliance with Graph Learning
Fraud Detection and Compliance with Graph LearningFraud Detection and Compliance with Graph Learning
Fraud Detection and Compliance with Graph Learning
 
Fraudulent credit card cash-out detection On Graphs
Fraudulent credit card cash-out detection On GraphsFraudulent credit card cash-out detection On Graphs
Fraudulent credit card cash-out detection On Graphs
 
FROM DATAFRAMES TO GRAPH Data Science with pyTigerGraph
FROM DATAFRAMES TO GRAPH Data Science with pyTigerGraphFROM DATAFRAMES TO GRAPH Data Science with pyTigerGraph
FROM DATAFRAMES TO GRAPH Data Science with pyTigerGraph
 
Customer Experience Management
Customer Experience ManagementCustomer Experience Management
Customer Experience Management
 
Graph+AI for Fin. Services
Graph+AI for Fin. ServicesGraph+AI for Fin. Services
Graph+AI for Fin. Services
 
Davraz - A graph visualization and exploration software.
Davraz - A graph visualization and exploration software.Davraz - A graph visualization and exploration software.
Davraz - A graph visualization and exploration software.
 
TigerGraph.js
TigerGraph.jsTigerGraph.js
TigerGraph.js
 
GRAPHS FOR THE FUTURE ENERGY SYSTEMS
GRAPHS FOR THE FUTURE ENERGY SYSTEMSGRAPHS FOR THE FUTURE ENERGY SYSTEMS
GRAPHS FOR THE FUTURE ENERGY SYSTEMS
 
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
 
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
 
Machine Learning Feature Design with TigerGraph 3.0 No-Code GUI
Machine Learning Feature Design with TigerGraph 3.0 No-Code GUIMachine Learning Feature Design with TigerGraph 3.0 No-Code GUI
Machine Learning Feature Design with TigerGraph 3.0 No-Code GUI
 
Recommendation Engine with In-Database Machine Learning
Recommendation Engine with In-Database Machine LearningRecommendation Engine with In-Database Machine Learning
Recommendation Engine with In-Database Machine Learning
 
Supply Chain and Logistics Management with Graph & AI
Supply Chain and Logistics Management with Graph & AISupply Chain and Logistics Management with Graph & AI
Supply Chain and Logistics Management with Graph & AI
 

KĂŒrzlich hochgeladen

Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
amitlee9823
 
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎9352988975 Two shot with one girl (...
gajnagarg
 
âž„đŸ” 7737669865 đŸ”â–» Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
âž„đŸ” 7737669865 đŸ”â–» Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...âž„đŸ” 7737669865 đŸ”â–» Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
âž„đŸ” 7737669865 đŸ”â–» Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
Call Girls In Attibele ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Attibele ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 đŸ„” Book Your One night Stand
amitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 JustđŸ“Č Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 JustđŸ“Č Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 JustđŸ“Č Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 JustđŸ“Č Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Just Call Vip call girls roorkee Escorts ☎9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎9352988975 Two shot with one girl ...Just Call Vip call girls roorkee Escorts ☎9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎9352988975 Two shot with one girl ...
gajnagarg
 
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 đŸ„” Book Your One night Stand
amitlee9823
 
âž„đŸ” 7737669865 đŸ”â–» mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
âž„đŸ” 7737669865 đŸ”â–» mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...âž„đŸ” 7737669865 đŸ”â–» mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
âž„đŸ” 7737669865 đŸ”â–» mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
âž„đŸ” 7737669865 đŸ”â–» Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
âž„đŸ” 7737669865 đŸ”â–» Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...âž„đŸ” 7737669865 đŸ”â–» Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...
âž„đŸ” 7737669865 đŸ”â–» Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
Graph Gurus Episode 32: Using Graph Algorithms for Advanced Analytics Part 5

  • 1. Graph Gurus 32: Using Graph Algorithms for Advanced Analytics, Part 5 - Classification
  • 2. © 2020 TigerGraph. All Rights Reserved. Today's Presenter: Victor Lee, Head of Product Strategy & Developer Relations ● BS in Electrical Engineering and Computer Science from UC Berkeley, MS in Electrical Engineering from Stanford University ● PhD in Computer Science from Kent State University, focused on graph data mining ● 20+ years in the tech industry
  • 3. Some Housekeeping Items ● Although your phone is muted, we do want to answer your questions: submit them at any time using the Q&A tab in the menu ● The webinar is being recorded and will be uploaded to our website shortly (https://www.tigergraph.com/webinars/), and the URL will be emailed to you ● If you have issues with Zoom, please contact the panelists via chat
  • 4. Move Faster with TigerGraph Cloud: Built for agile teams who would rather build innovative applications than procure hardware or configure and manage databases ● Start for free ● Move to production with distributed data and HA replication
  • 5. Today's Outline: (1) Recap of Parts 1 to 4: Path, Centrality, Community, and Similarity Algorithms (2) Use Cases for Classification (3) Training a Classifier: K-Nearest Neighbors; Introduction to machine learning (4) Demo: GSQL Queries for training & classifying
  • 6. Review: Analytics with Graph Algorithms ● Graph algorithms answer fundamental questions about connected data ● Each algorithm in a library is a tool in an analytics toolkit ● Algorithms are building blocks for more complex business questions: specialized functions combine to make something better
  • 7. Example Questions/Analyses for Graph Algorithms. Which entity is most centrally located? ● For delivery logistics or greatest visibility ● Closeness Centrality, Betweenness Centrality algorithms. How much influence does this entity exert over the others? ● For market penetration & buyer influence ● PageRank algorithm. Which entity has similar relationships to this entity? ● For grouping customers, products, etc. ● Cosine Similarity, Jaccard Similarity, SimRank, RoleSim algorithms. What are the natural community groupings in the graph? ● For partitioning risk groups, workgroups, product offerings, etc. ● Community Detection, MinCut algorithms
  • 8. Summary for Shortest Path Algorithms (Graph Gurus 26): (1) Graph Algorithms - tools and building blocks for analyzing graph data (2) Learning To Use Algorithms - know what problem they solve, pros and cons (3) Shortest Path Algorithms - different algorithms for weighted and unweighted graphs (4) GSQL Algorithm Library - runs in-database, high-performance, easy to read and modify
  • 9. Summary for Centrality Algorithms (Graph Gurus 27): (1) Centrality Algorithms - abstract concepts of location and travel. (2) Closeness and Betweenness - use shortest paths. Betweenness is more complex. (3) PageRank - uses directed referral edges to find the most influential nodes. Personalized PageRank is localized. (4) Customizing GSQL Library algorithms - easy and familiar, like procedural SQL
  • 10. Summary for Community Detection Algorithms (Graph Gurus 29): (1) Community Detection Algorithms: Use connectedness to decide boundaries (2) Communities are Clusters, not Partitions: Don't have to include everyone. Can overlap? (3) Strict vs. Lenient Community Rules: Black & white rules are not always helpful. Louvain uses relative density. (4) Pre- or Post-step with other algorithms: Many algorithms assume you start from just one connected community
  • 11. Summary for Similarity Algorithms (Graph Gurus 30): (1) Similarity is in the Eye of the Beholder: What factors matter to you? How much? (2) Graph modeling helps with Similarity: Hub-and-spoke view (3) Jaccard and Cosine Similarity: Counting matches vs. measuring numerical alignment (4) Deeper Measures: SimRank and RoleSim: Define similarity recursively, look multiple hops deep (globally)
  • 12. Some Types of Graph Algorithms ● Search ● Path Finding & Analytics ● Centrality / Ranking ● Clustering / Community Detection ● Similarity ● Classification
  • 13. Classifying 4 types of good & bad telecom users, using basic and graph features. User types: Prankster, Regular Customer, Sales, Fraudster. Example users: Tim, Sarah, John, Fred. Feature values, listed in the slide's column order ● Age of SIM card: 2 weeks, 4 weeks, 3 weeks, 2 weeks ● % of one-directional calls: 50%, 10%, 55%, 60% ● % rejected calls: 40%, 5%, 28%, 25% ● Stable group: Yes, Yes, No, No ● Many in-group connections: No, Yes, No, Yes ● 3-step friend relation: No, Yes, No, Yes ● Prediction by machine learning with deep link graph features: Likely Prankster, Regular Customer, Likely Fraudster, Likely Sales. Download the solution brief: https://info.tigergraph.com/MachineLearning
  • 14. Other Use Cases ● Medical Diagnosis: Cold, flu, COVID-19, or other? ● Understanding Natural Language: word recognition and part of speech; question or statement?; sarcasm or not?; sentiment analysis; expected response
  • 15. Common Classifiers: (1) Rule-based: "If it walks like a duck and talks like a duck, then it's a duck." (2) Decision Tree: Walk like duck? If no, not a duck; if yes, Talk like duck? If no, not a duck; if yes, DUCK. ● The two methods are equivalent ● But how do you derive the rules / decisions? ○ Not talking about classification by legislation: "Because I say so" ○ Are there ways to make a "best" classifier?
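The equivalence claimed on this slide can be checked exhaustively for the duck example. The following is a minimal Python sketch; the function names `rule_based` and `decision_tree` are hypothetical and are not part of the presentation.

```python
from itertools import product


def rule_based(walks_like_duck: bool, talks_like_duck: bool) -> str:
    # One rule: both conditions must hold for the item to be a duck.
    return "duck" if walks_like_duck and talks_like_duck else "not duck"


def decision_tree(walks_like_duck: bool, talks_like_duck: bool) -> str:
    # Root node: "Walk like duck?"
    if not walks_like_duck:
        return "not duck"
    # Second node: "Talk like duck?"
    if not talks_like_duck:
        return "not duck"
    return "duck"


# Compare the two classifiers on every combination of feature values.
agree = all(
    rule_based(w, t) == decision_tree(w, t)
    for w, t in product([False, True], repeat=2)
)
print(agree)
```

Both functions are hand-written classifiers; neither says anything about how the rule or the tree would be learned from data, which is the question the slide goes on to raise.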
  • 16. Classifier Induction (i.e., learning how to classify) ● Need a set of training instances, where you know both (1) the features and (2) the class (label) of each instance. ● Use some statistical method to correlate the features to the labels. Example training table (Item/Case: Feature1, Feature2, Feature3 → Label): 1: red, 2.3, yes → A; 2: blue, 4.1, yes → B; 3: red, 4.0, yes → A. Learning a classifier is one type of supervised machine learning.
  • 17. k-Nearest Neighbor Classification (kNN) ● Concept: Predict an entity's class by looking at the classes of the "nearest" other entities. ● Question: What is distance? ○ Physical distance? → Clustering ○ Or, some concept of similarity? ● How many neighbors to consider? ○ Within a radius? ○ Up to a certain number?
  • 18. k-Nearest Neighbor Classification (kNN): Consider the k closest neighbors, from nearest to farthest ● Pick the class that is represented most often ● The prediction depends on the value of k. Counts among the k nearest neighbors (k: red, yellow, unlabeled → prediction): 2: 0, 2, 0 → yellow; 3: 0, 2, 1 → yellow; 4: 1, 2, 1 → yellow; 5: 2, 2, 1 → red/yellow; 6: 3, 2, 1 → red; 7: 3, 2, 2 → red; 8: 3, 3, 2 → red/yellow; 9: 3, 3, 3 → red/yellow; 10: 4, 3, 3 → red; 11: 4, 4, 3 → red/yellow; 12: 4, 5, 3 → yellow
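The table on this slide can be reproduced with a few lines of Python. This is a sketch only: the nearest-to-farthest ordering in `NEIGHBORS` is inferred from the table's cumulative counts (an assumption, since the slide's figure is not in the transcript), and `knn_predict` is a hypothetical helper name.

```python
from collections import Counter

# Neighbor labels from nearest to farthest, inferred from the cumulative
# counts in the slide's table (assumption). None marks an unlabeled neighbor.
NEIGHBORS = [
    "yellow", "yellow", None, "red", "red", "red",
    None, "yellow", None, "red", "yellow", "yellow",
]


def knn_predict(neighbors, k):
    """Return the set of most frequent labels among the k nearest neighbors.

    Unlabeled neighbors are counted toward k but cast no vote.
    A set with more than one label signals a tie.
    """
    votes = Counter(label for label in neighbors[:k] if label is not None)
    if not votes:
        return set()
    top = max(votes.values())
    return {label for label, count in votes.items() if count == top}


for k in range(2, 13):
    print(k, sorted(knn_predict(NEIGHBORS, k)))
```

Returning a set rather than a single label keeps ties such as k = 5 visible instead of silently picking one class.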
  • 19. kNN as a Machine Learning Task ● As long as you have a distance/similarity function, you don't need any additional "intelligence", with one exception ● ML task: Find the best value of k. Pipeline: item you want to classify → distance calculation → simple counting over the k (?) nearest → prediction. Could also work on optimizing the distance function, but that's out of scope of our current discussion.
  • 20. kNN for Graph Data: kNN is a general-purpose classifier. How does it work with graphs? ● It's really about the distance/similarity function. ● For our kNN, we use cosine neighborhood similarity. ● It's easy to replace this with Jaccard neighborhood similarity. [Figure: vertices A and B with weighted edges to neighbors W, X, Y, Z; Similarity(A,B) = f(shared neighbors)]
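To make the two neighborhood similarity measures concrete, here is a small Python sketch. The edge weights in `a` and `b` are hypothetical and are not the values from the slide's figure; the functions use the standard definitions of cosine and Jaccard similarity, not TigerGraph's GSQL implementation.

```python
import math


def jaccard(nbrs_a: set, nbrs_b: set) -> float:
    """Shared neighbors divided by all neighbors of either vertex."""
    union = nbrs_a | nbrs_b
    return len(nbrs_a & nbrs_b) / len(union) if union else 0.0


def cosine(w_a: dict, w_b: dict) -> float:
    """Cosine of the angle between two neighbor-weight vectors."""
    # Only shared neighbors contribute to the dot product.
    dot = sum(w_a[n] * w_b[n] for n in w_a.keys() & w_b.keys())
    norm_a = math.sqrt(sum(v * v for v in w_a.values()))
    norm_b = math.sqrt(sum(v * v for v in w_b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical edge weights from A and B to their neighbors.
a = {"W": 2.0, "X": 3.0, "Y": 1.0}
b = {"X": 1.0, "Y": 4.0, "Z": 2.0}

print(jaccard(set(a), set(b)))  # counts matches only
print(cosine(a, b))             # also uses the edge weights
```

Jaccard ignores the weights and only counts which neighbors match, while cosine measures how well the weights on shared neighbors align, which is the "counting matches vs. measuring numerical alignment" distinction from the similarity episode.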
  • 21. Cross Validation ● In machine learning, first train a model, and then validate (check) the accuracy of the model. Split your labeled data into subsets: ○ Training set (bigger part) ○ Validation set (smaller part): use the trained model to see if it correctly predicts the actual labels in the validation set. ● n-Fold Cross Validation: split the data into segments 1, 2, ..., n-1, n; hold one segment out and train on the rest ● Leave-one-out Cross Validation: hold out a single item ● Repeat, leaving out a different one (segment) each time
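The splitting scheme can be sketched in Python as follows. `n_fold_splits` is a hypothetical helper, and assigning items to folds round-robin is just one simple choice; real workflows usually shuffle the data first.

```python
def n_fold_splits(items, n):
    """Yield (training, validation) pairs, holding out each fold once."""
    # Round-robin assignment of items to n folds (an illustrative choice).
    folds = [items[i::n] for i in range(n)]
    for held_out in range(n):
        validation = folds[held_out]
        training = [
            x for j, fold in enumerate(folds) if j != held_out for x in fold
        ]
        yield training, validation


labeled = list(range(10))  # stand-in for 10 labeled items

# 5-fold cross validation: each validation set has 2 items.
splits = list(n_fold_splits(labeled, 5))
for training, validation in splits:
    print(validation, training)

# Leave-one-out is the special case where n equals the number of items.
loo = list(n_fold_splits(labeled, len(labeled)))
```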
  • 22. TigerGraph kNN_cosine_cv algorithm. Given ● a (fully or partially) labeled graph ● a similarity measure ● a range of k values chosen by the user. For each value of k: ● For each labeled item Q in the graph: ○ Predict Q's label by looking at the classes of the k closest entities ● Accuracy(k) = number (%) of correct predictions. Select the k which produced the highest Accuracy(k).
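The procedure on this slide can be sketched in plain Python. This is not the GSQL query from the TigerGraph library: `predict` and `best_k` are hypothetical helpers, the data is a toy set of points on a line, the similarity function is a stand-in for cosine neighborhood similarity, and breaking accuracy ties in favor of the first listed k is an assumption.

```python
from collections import Counter


def predict(query, labeled, similarity, k):
    """Majority label among the k items most similar to query.

    The query itself is left out, which gives leave-one-out validation.
    """
    others = [
        (similarity(query, item), label)
        for item, label in labeled.items()
        if item != query
    ]
    others.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter(label for _, label in others[:k])
    return votes.most_common(1)[0][0] if votes else None


def best_k(labeled, similarity, k_values):
    """Return the k with the highest leave-one-out accuracy, plus all scores."""
    accuracy = {}
    for k in k_values:
        correct = sum(
            predict(q, labeled, similarity, k) == label
            for q, label in labeled.items()
        )
        accuracy[k] = correct / len(labeled)
    # Ties go to the first k listed (assumption).
    return max(k_values, key=lambda k: accuracy[k]), accuracy


# Toy data: points on a line, two well-separated classes.
labeled = {1: "A", 2: "A", 3: "A", 10: "B", 11: "B", 12: "B"}
closeness = lambda x, y: -abs(x - y)  # higher value means more similar

k, accuracy = best_k(labeled, closeness, [1, 3, 5])
print(k, accuracy)
```

With k = 5 every query sees two neighbors of its own class and three of the other, so accuracy drops to zero, which shows why k has to be tuned rather than simply made large.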
  • 23. DEMO: GSQL Graph Algorithms in TigerGraph Cloud
  • 24. Sample Data Set
  • 25. GSQL Graph Algorithm Library ● Written in GSQL - high-level, parallelized ● Open-source, user-extensible ● Well-documented: docs.tigergraph.com/graph-algorithm-library
  • 26. TigerGraph GSQL Graph Algorithm Library ✓ Call each algorithm as a GSQL query or as a RESTful endpoint ✓ Run the algorithms in-database (don't export the data) ✓ Option to update the graph with the algorithm results ✓ Able to modify/customize the algorithms. Turing-complete language. ✓ Massively parallel processing to handle big graphs
  • 27. Summary: (1) Graph Algorithms: Key tool for data scientists (2) Classification Algorithms: Bridge the gap to machine learning (3) k-Nearest Neighbors: Predicts the classification by looking at the classes of the similar/nearby items. What's the right value for k? (4) Advanced Analytics with Graph Algorithms: https://docs.tigergraph.com/graph-algorithm-library
  • 28. Q&A: Please submit your questions via the Q&A tab in Zoom
  • 29. More Questions? Join our Developer Forum: https://groups.google.com/a/opengsql.org/forum/#!forum/gsql-users Sign up for our Developer Office Hours (every Thursday at 11 AM PST): https://info.tigergraph.com/officehours
  • 30. Additional Resources ● Start Free at TigerGraph Cloud Today! https://www.tigergraph.com/cloud/ ● Test Drive Online Demo: https://www.tigergraph.com/demo ● Download the Developer Edition: https://www.tigergraph.com/download/ ● Guru Scripts: https://github.com/tigergraph/ecosys/tree/master/guru_scripts
  • 31. Upcoming Online Events: Graph Gurus 33: GSQL Writing Best Practices - Part 2, Wednesday, April 8, at 11am PDT https://info.tigergraph.com/graph-gurus-33