SlideShare ist ein Scribd-Unternehmen logo
1 von 20
Neo4j, Inc. All rights reserved 2021
Neo4j, Inc. All rights reserved 2021
1
Graph Data Science:
Accelerating AI & Machine Learning
Alicia Frame, PhD
Director, Data Science @ Neo4j
Neo4j, Inc. All rights reserved 2021
2
Neo4j: The Connected Data Company
20 of the top 25 financial firms
7 of the top 10 retailers
7 of the top 10 software vendors
Neo4j is the creator of:
• The world’s leading graph database
• The first graph data science platform
• The most flexible graph data model
• The easiest-to-use graph query language
Thousands of Organizations Use Neo4j
Silicon Valley | London | Munich
Paris | Malmö
Neo4j, Inc. All rights reserved 2021
3
Node
Represents an entity in the graph
Relationship
Connect nodes to each other
Property
Describes a node or relationship:
e.g. name, age, weight etc
What’s a graph?
MICA
ANDRE
Name: “Andre”
Born: May 29, 1970
Twitter: “@dan”
Name: “Mica”
Born: Dec 5, 1975
CAR
Brand “Volvo”
Model: “V70”
Since:
Jan 10, 2011
LOVES
LOVES
LOVES
LIVES WITH
O
W
N
S
D
R
I
V
E
S
Neo4j, Inc. All rights reserved 2021
Networks of People Transaction Networks
Bought
B
ou
gh
t
V
i
e
w
e
d
R
e
t
u
r
n
e
d
Bought
Knowledge Networks
Pl
ay
s
Lives_in
In_sport
Likes
F
a
n
_
o
f
Plays_for
Risk management,
Supply chain, Orders,
Payments, etc.
Employees, Customers,
Suppliers, Partners,
Influencers, etc.
Enterprise content,
Domain specific content,
eCommerce content, etc
K
n
o
w
s
Knows
Knows
K
n
o
w
s
4
Everything is Naturally Connected
Neo4j, Inc. All rights reserved 2021
5
Higher Pay and More Promotions
• People Near Structural Holes
• Organizational Misfits
Network Structure is
Highly Predictive
Photo by Helena Lopes on Unsplash
“Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum
“Structural Holes and Good Ideas” R. Burt
Neo4j, Inc. All rights reserved 2021
Consider What Drives Your Business
It’s not the numbers, it’s the relationships behind them
Plants
Warehouses
Suppliers
Distributors
Competitors
Partners
Regulations
Employees
Citizens
Customers
Products
Parts
Services
Regions
Neo4j, Inc. All rights reserved 2021
Neo4j, Inc. All rights reserved 2021
7
Relationships
are the strongest
predictors of behavior
But You Can’t Analyse
What You Can’t See
● Most data science techniques
ignore relationships
● It’s painful to manually engineer
connected features from tabular
data
● Graphs are built on
relationships, so…
● You don’t have to guess at the
correlations: with graphs,
relationships are built in
James Fowler
Neo4j, Inc. All rights reserved 2021
8
8 Top 10 Tech Trends in Data and Analytics, 16 Feb 2021
According to Gartner, “Graphs form
the foundation of modern D&A,
with capabilities to enhance and
improve user collaboration, ML models
and explainable AI.
The recent Gartner AI in Organizations
Survey demonstrates that graph
techniques are increasingly
prevalent as AI maturity grows,
going from 13% adoption when AI
maturity is lowest to 48% when
maturity is highest.”
AI Research Papers
Featuring Graph
Source: Dimensions Knowledge System
4x
Increase in
traffic to
Neo4j GDS
page in
2H-2020
Analytics & Data Science Interest
Exploding in Neo4j Community
100k+
Practicing data
scientists
engaged with
Neo4j
+210k
downloads
Neo4j, Inc. All rights reserved 2021
9
Queries
Find the patterns you know exist.
Machine Learning
Uncover trends and make
predictions
Visualization
Explore, collaborate, and explain
Graphs & Data Science
Analytics
Feature
Engineering
Data
Exploration
Graph
Data
Science
Queries
Machine Learning Visualization
Neo4j, Inc. All rights reserved 2021
10
Graphs & Data Science
Knowledge Graphs
Graph Algorithms
Graph Native
Machine Learning
Find the patterns you’re
looking for in connected data
Use unsupervised machine
learning techniques to
identify associations,
anomalies, and trends.
Use embeddings to learn the
features in your graph that
you don’t even know are
important yet.
Train in-graph supervise ML
models to predict links,
labels, and missing data.
Neo4j, Inc. All rights reserved 2021
Better Predictions with Data You Already Have
● Traditional ML ignores network structure because it’s difficult to extract
● Uncover patterns and trends you can’t find any other way
● Easily generate predictive features to incorporate into ML pipelines
11
Machine Learning Pipeline
Neo4j, Inc. All rights reserved 2021
Neo4j’s Graph Data Science Framework
Neo4j Graph Data
Science Library
Neo4j
Database
Neo4j
Bloom
Scalable Graph Algorithms &
Analytics Workspace
Native Graph Creation &
Persistence
Visual Graph
Exploration & Prototyping
Neo4j, Inc. All rights reserved 2021
Robust Graph Algorithms & ML methods
● Compute metrics about the topology and connectivity
● Build predictive models to enhance your graph
● Highly parallelized and scale to 10’s of billions of nodes
13
The Neo4j GDS Library
Mutable In-Memory
Workspace
Computational Graph
Native Graph Store
Efficient & Flexible Analytics Workspace
● Automatically reshapes transactional graphs into
an in-memory analytics graph
● Optimized for global traversals and aggregation
● Create workflows and layer algorithms
● Store and manage predictive models in the
model catalog
Neo4j, Inc. All rights reserved 2021
Community
Detection
14
Neo4j’s Graph Data Science Library
Unsupervised Graph Algorithms
Clustering
Dimension Reduction
(generalization)
Association
Which parts of my graph are
connected to each other?
Which nodes are most
similar?
How important is each node?
Supervised Machine Learning
Node Classification
Link Prediction
Where will connections
form next?
What’s the label
for this node?
Centrality
Embeddings
Similarity
Pathfinding
More Algos than
any other vendor
ONLY in neo4j
Neo4j, Inc. All rights reserved 2021
15
Unsupervised Machine Learning
Pathfinding &
Search
• Shortest Path
• Single-Source Shortest Path
• All Pairs Shortest Path
• A* Shortest Path
• Yen’s K Shortest Path
• Minimum Weight Spanning Tree
• K-Spanning Tree (MST)
• Random Walk
• Breadth & Depth First Search
Centrality &
Importance
• Degree Centrality
• Closeness Centrality
• Harmonic Centrality
• Betweenness Centrality & Approx.
• PageRank
• Personalized PageRank
• ArticleRank
• Eigenvector Centrality
• Hyperlink Induced Topic Search (HITS)
• Influence Maximization (Greedy, CELF)
Community
Detection
• Triangle Count
• Local Clustering Coefficient
• Connected Components (Union Find)
• Strongly Connected Components
• Label Propagation
• Louvain Modularity
• K-1 Coloring
• Modularity Optimization
• Speaker Listener Label Propagation
Heuristic Link
Prediction
• Adamic Adar
• Common Neighbors
• Preferential Attachment
• Resource Allocations
• Same Community
• Total Neighbors
Similarity
• Node Similarity
• K-Nearest Neighbors (KNN)
• Jaccard Similarity
• Cosine Similarity
• Pearson Similarity
• Euclidean Distance
• Approximate Nearest Neighbors (ANN)
Graph
Embeddings
• Node2Vec
• FastRP
• FastRPExtended
• GraphSAGE
Neo4j, Inc. All rights reserved 2021
Graph Features & Graph Models for Predictions
Traditional ML problems where
relationships between your data points
are important predictive features
16
Predictions influenced by
graph structure
Predictions about
graph structure
Enhance your graph by predicting
missing data or changes to your graph
that will occur in the future
Neo4j, Inc. All rights reserved 2021
17
Neo4j’s In-Graph ML Models
Node
classification:
“What kind of node
is this?”
Link prediction:
“Should there be a
relationship between
these nodes?”
Labeled data: Pairs of nodes
that are either linked or not
Features: Pre-existing
attributes, algorithms
(pageRank), embedding
Neo4j, Inc. All rights reserved 2021
18
The Only Completely In-Graph, ML Workflow
Graph-Native
Feature
Engineering
Train
Predictive Model
Queries
Algorithms
Embeddings
1. Model Type
2. Property
Selection
3. Train & Test
4. Model
Selection
Apply Model to
Existing / New
Data
Use Predictions
for Decisions
Use Predictions
to Enhance
the Graph
Publish & Share
Store Model in
Database
Neo4j, Inc. All rights reserved 2021
What’s most important and
influential in my business?
What’s occurring that’s unusual?
What’s going to happen next?
But traditional
approaches to data make
it impossible to reveal and
effectively use those
connections as data sizes
become large
Predictive signals get lost in
big data noise
19
Graph Data Science Answers the BIG Questions
Connected Data is
Powerful
Graph Data Science uses
Connections to Answer
Critical Questions
Neo4j, Inc. All rights reserved 2021
Neo4j, Inc. All rights reserved 2021
20
Resources
Graph Resources
● Video: Advantages of Graph Technology
● Code: https://github.com/neo4j/graph-data-science/
● Whitepaper: Financial Fraud Detection with Graph Data Science
● Case Study: Meredith Corporation
Neo4j BookShelf
● Graph Databases For Dummies
● Graph Data Science For Dummies
● O’Reilly Graph Algorithms

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (11)

AI and ML for Product Management by Smartsheet Sr Dir of PM
AI and ML for Product Management by Smartsheet Sr Dir of PMAI and ML for Product Management by Smartsheet Sr Dir of PM
AI and ML for Product Management by Smartsheet Sr Dir of PM
 
Presentation-Open AI Chat GPT-4 3.pptx
Presentation-Open AI Chat GPT-4 3.pptxPresentation-Open AI Chat GPT-4 3.pptx
Presentation-Open AI Chat GPT-4 3.pptx
 
Power BI Ecosystem
Power BI EcosystemPower BI Ecosystem
Power BI Ecosystem
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
 
Power BI
Power BIPower BI
Power BI
 
Cloud Stakeholders
Cloud StakeholdersCloud Stakeholders
Cloud Stakeholders
 
Api clarity webinar
Api clarity webinarApi clarity webinar
Api clarity webinar
 
Preparing, Piloting & Paths to Success with Microsoft Copilot
Preparing, Piloting & Paths to Success with Microsoft CopilotPreparing, Piloting & Paths to Success with Microsoft Copilot
Preparing, Piloting & Paths to Success with Microsoft Copilot
 
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTsFIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
 
Generative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdfGenerative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdf
 
Skills Requirements for Future Jobs - 10 Facts
Skills Requirements for Future Jobs - 10 FactsSkills Requirements for Future Jobs - 10 Facts
Skills Requirements for Future Jobs - 10 Facts
 

Ähnlich wie Graph Data Science: The Secret to Accelerating Innovation with AI/ML

Graph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics WebinarGraph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics Webinar
Neo4j
 

Ähnlich wie Graph Data Science: The Secret to Accelerating Innovation with AI/ML (20)

Graphs for Data Science and Machine Learning
Graphs for Data Science and Machine LearningGraphs for Data Science and Machine Learning
Graphs for Data Science and Machine Learning
 
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
Using Connected Data and Graph Technology to Enhance Machine Learning and Art...
 
Graph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics WebinarGraph Data Science with Neo4j: Nordics Webinar
Graph Data Science with Neo4j: Nordics Webinar
 
Relationships Matter: Using Connected Data for Better Machine Learning
Relationships Matter: Using Connected Data for Better Machine LearningRelationships Matter: Using Connected Data for Better Machine Learning
Relationships Matter: Using Connected Data for Better Machine Learning
 
Einstieg in Neo4j Graph Data Science
Einstieg in Neo4j Graph Data ScienceEinstieg in Neo4j Graph Data Science
Einstieg in Neo4j Graph Data Science
 
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time Analytics
 
Workshop Tel Aviv - Graph Data Science
Workshop Tel Aviv - Graph Data ScienceWorkshop Tel Aviv - Graph Data Science
Workshop Tel Aviv - Graph Data Science
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
 
The Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdfThe Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdf
 
ntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technologyntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technology
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
 
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
Neo4j : la voie du succès avec les bases de données de graphes et la Graph Da...
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - Webinar
 
GraphSummit Toronto: Leveraging Graphs for AI and ML
GraphSummit Toronto: Leveraging Graphs for AI and MLGraphSummit Toronto: Leveraging Graphs for AI and ML
GraphSummit Toronto: Leveraging Graphs for AI and ML
 
How Graph Technology is Changing AI
How Graph Technology is Changing AIHow Graph Technology is Changing AI
How Graph Technology is Changing AI
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning
 
Leveraging Graphs for Better AI
Leveraging Graphs for Better AILeveraging Graphs for Better AI
Leveraging Graphs for Better AI
 
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
 
Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...
 

Mehr von Neo4j

Mehr von Neo4j (20)

From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Graph Data Science: The Secret to Accelerating Innovation with AI/ML

  • 1. Neo4j, Inc. All rights reserved 2021 Neo4j, Inc. All rights reserved 2021 1 Graph Data Science: Accelerating AI & Machine Learning Alicia Frame, PhD Director, Data Science @ Neo4j
  • 2. Neo4j, Inc. All rights reserved 2021 2 Neo4j: The Connected Data Company 20 of the top 25 financial firms 7 of the top 10 retailers 7 of the top 10 software vendors Neo4j is the creator of: • The world’s leading graph database • The first graph data science platform • The most flexible graph data model • The easiest-to-use graph query language Thousands of Organizations Use Neo4j Silicon Valley | London | Munich Paris | Malmö
  • 3. Neo4j, Inc. All rights reserved 2021 3 Node Represents an entity in the graph Relationship Connect nodes to each other Property Describes a node or relationship: e.g. name, age, weight etc What’s a graph? MICA ANDRE Name: “Andre” Born: May 29, 1970 Twitter: “@dan” Name: “Mica” Born: Dec 5, 1975 CAR Brand “Volvo” Model: “V70” Since: Jan 10, 2011 LOVES LOVES LOVES LIVES WITH O W N S D R I V E S
  • 4. Neo4j, Inc. All rights reserved 2021 Networks of People Transaction Networks Bought B ou gh t V i e w e d R e t u r n e d Bought Knowledge Networks Pl ay s Lives_in In_sport Likes F a n _ o f Plays_for Risk management, Supply chain, Orders, Payments, etc. Employees, Customers, Suppliers, Partners, Influencers, etc. Enterprise content, Domain specific content, eCommerce content, etc K n o w s Knows Knows K n o w s 4 Everything is Naturally Connected
  • 5. Neo4j, Inc. All rights reserved 2021 5 Higher Pay and More Promotions • People Near Structural Holes • Organizational Misfits Network Structure is Highly Predictive Photo by Helena Lopes on Unsplash “Organizational Misfits and the Origins of Brokerage in Intrafirm Networks” A. Kleinbaum “Structural Holes and Good Ideas” R. Burt
  • 6. Neo4j, Inc. All rights reserved 2021 Consider What Drives Your Business It’s not the numbers, it’s the relationships behind them Plants Warehouses Suppliers Distributors Competitors Partners Regulations Employees Citizens Customers Products Parts Services Regions
  • 7. Neo4j, Inc. All rights reserved 2021 Neo4j, Inc. All rights reserved 2021 7 Relationships are the strongest predictors of behavior But You Can’t Analyse What You Can’t See ● Most data science techniques ignore relationships ● It’s painful to manually engineer connected features from tabular data ● Graphs are built on relationships, so… ● You don’t have to guess at the correlations: with graphs, relationships are built in James Fowler
  • 8. Neo4j, Inc. All rights reserved 2021 8 8 Top 10 Tech Trends in Data and Analytics, 16 Feb 2021 According to Gartner, “Graphs form the foundation of modern D&A, with capabilities to enhance and improve user collaboration, ML models and explainable AI. The recent Gartner AI in Organizations Survey demonstrates that graph techniques are increasingly prevalent as AI maturity grows, going from 13% adoption when AI maturity is lowest to 48% when maturity is highest.” AI Research Papers Featuring Graph Source: Dimensions Knowledge System 4x Increase in traffic to Neo4j GDS page in 2H-2020 Analytics & Data Science Interest Exploding in Neo4j Community 100k+ Practicing data scientists engaged with Neo4j +210k downloads
  • 9. Neo4j, Inc. All rights reserved 2021 9 Queries Find the patterns you know exist. Machine Learning Uncover trends and make predictions Visualization Explore, collaborate, and explain Graphs & Data Science Analytics Feature Engineering Data Exploration Graph Data Science Queries Machine Learning Visualization
  • 10. Neo4j, Inc. All rights reserved 2021 10 Graphs & Data Science Knowledge Graphs Graph Algorithms Graph Native Machine Learning Find the patterns you’re looking for in connected data Use unsupervised machine learning techniques to identify associations, anomalies, and trends. Use embeddings to learn the features in your graph that you don’t even know are important yet. Train in-graph supervise ML models to predict links, labels, and missing data.
  • 11. Neo4j, Inc. All rights reserved 2021 Better Predictions with Data You Already Have ● Traditional ML ignores network structure because it’s difficult to extract ● Uncover patterns and trends you can’t find any other way ● Easily generate predictive features to incorporate into ML pipelines 11 Machine Learning Pipeline
  • 12. Neo4j, Inc. All rights reserved 2021 Neo4j’s Graph Data Science Framework Neo4j Graph Data Science Library Neo4j Database Neo4j Bloom Scalable Graph Algorithms & Analytics Workspace Native Graph Creation & Persistence Visual Graph Exploration & Prototyping
  • 13. Neo4j, Inc. All rights reserved 2021 Robust Graph Algorithms & ML methods ● Compute metrics about the topology and connectivity ● Build predictive models to enhance your graph ● Highly parallelized and scale to 10’s of billions of nodes 13 The Neo4j GDS Library Mutable In-Memory Workspace Computational Graph Native Graph Store Efficient & Flexible Analytics Workspace ● Automatically reshapes transactional graphs into an in-memory analytics graph ● Optimized for global traversals and aggregation ● Create workflows and layer algorithms ● Store and manage predictive models in the model catalog
  • 14. Neo4j, Inc. All rights reserved 2021 Community Detection 14 Neo4j’s Graph Data Science Library Unsupervised Graph Algorithms Clustering Dimension Reduction (generalization) Association Which parts of my graph are connected to each other? Which nodes are most similar? How important is each node? Supervised Machine Learning Node Classification Link Prediction Where will connections form next? What’s the label for this node? Centrality Embeddings Similarity Pathfinding More Algos than any other vendor ONLY in neo4j
  • 15. Neo4j, Inc. All rights reserved 2021 15 Unsupervised Machine Learning Pathfinding & Search • Shortest Path • Single-Source Shortest Path • All Pairs Shortest Path • A* Shortest Path • Yen’s K Shortest Path • Minimum Weight Spanning Tree • K-Spanning Tree (MST) • Random Walk • Breadth & Depth First Search Centrality & Importance • Degree Centrality • Closeness Centrality • Harmonic Centrality • Betweenness Centrality & Approx. • PageRank • Personalized PageRank • ArticleRank • Eigenvector Centrality • Hyperlink Induced Topic Search (HITS) • Influence Maximization (Greedy, CELF) Community Detection • Triangle Count • Local Clustering Coefficient • Connected Components (Union Find) • Strongly Connected Components • Label Propagation • Louvain Modularity • K-1 Coloring • Modularity Optimization • Speaker Listener Label Propagation Heuristic Link Prediction • Adamic Adar • Common Neighbors • Preferential Attachment • Resource Allocations • Same Community • Total Neighbors Similarity • Node Similarity • K-Nearest Neighbors (KNN) • Jaccard Similarity • Cosine Similarity • Pearson Similarity • Euclidean Distance • Approximate Nearest Neighbors (ANN) Graph Embeddings • Node2Vec • FastRP • FastRPExtended • GraphSAGE
  • 16. Neo4j, Inc. All rights reserved 2021 Graph Features & Graph Models for Predictions Traditional ML problems where relationships between your data points are important predictive features 16 Predictions influenced by graph structure Predictions about graph structure Enhance your graph by predicting missing data or changes to your graph that will occur in the future
  • 17. Neo4j, Inc. All rights reserved 2021 17 Neo4j’s In-Graph ML Models Node classification: “What kind of node is this?” Link prediction: “Should there be a relationship between these nodes?” Labeled data: Pairs of nodes that are either linked or not Features: Pre-existing attributes, algorithms (pageRank), embedding
  • 18. Neo4j, Inc. All rights reserved 2021 18 The Only Completely In-Graph, ML Workflow Graph-Native Feature Engineering Train Predictive Model Queries Algorithms Embeddings 1. Model Type 2. Property Selection 3. Train & Test 4. Model Selection Apply Model to Existing / New Data Use Predictions for Decisions Use Predictions to Enhance the Graph Publish & Share Store Model in Database
  • 19. Neo4j, Inc. All rights reserved 2021 What’s most important and influential in my business? What’s occurring that’s unusual? What’s going to happen next? But traditional approaches to data make it impossible to reveal and effectively use those connections as data sizes become large Predictive signals get lost in big data noise 19 Graph Data Science Answers the BIG Questions Connected Data is Powerful Graph Data Science uses Connections to Answer Critical Questions
  • 20. Neo4j, Inc. All rights reserved 2021 Neo4j, Inc. All rights reserved 2021 20 Resources Graph Resources ● Video: Advantages of Graph Technology ● Code: https://github.com/neo4j/graph-data-science/ ● Whitepaper: Financial Fraud Detection with Graph Data Science ● Case Study: Meredith Corporation Neo4j BookShelf ● Graph Databases For Dummies ● Graph Data Science For Dummies ● O’Reilly Graph Algorithms