Graph-based machine learning is becoming an important trend in Artificial Intelligence, cutting across many other techniques. The world's largest companies are promoting this trend. For instance, Google's Expander platform combines semi-supervised machine learning with large-scale graph-based learning: it builds a multi-graph representation of the data, with nodes corresponding to objects or concepts and edges connecting concepts that share similarities.
Using graphs as the basic representation of data for machine learning purposes has several advantages: (i) the data is already modelled for further analysis, explicitly representing connections and relationships between things and concepts; (ii) graphs can easily combine multiple sources into a single graph representation and learn over them, creating Knowledge Graphs; (iii) many machine learning algorithms exploit graph structure to improve computational performance and result quality.
The presentation illustrates the advantages above and presents applications, such as recommendation engines and natural language processing, that use machine learning over a graph. Concrete scenarios, models, and end-to-end infrastructure will be discussed.
2. “We firmly believe that it's at the intersection of machine learning
and graph technology where the next evolution lies and where new
disruptive companies are emerging,”
Ash Damle, Founder and CEO @ Lumiata
GraphAware®
3. “There are a variety of use cases where a graph database is a better
fit than other database management systems including relational or
general NoSQL database systems.”
Matthias Broecheler, Chief Technologist @ DataStax
4. “Machine learning algorithms help data scientists discover meaning
in data sets […]. Graph databases enable efficient storage and
traversal of information about relationships. Therefore, graph data
can either be the input or the output of machine learning
processing.”
Jim Webber, Chief Scientist @ Neo4j
9. Machine Learning Challenges
Data Source Issues:
- Storing large amounts of labeled and unlabeled data
- Guaranteeing data quality
- Managing several data sources
Algorithm Issues:
- Result quality
- Computational efficiency
- Real-time operation (continuous model updates)
Model Issues:
- Storing the model built
- Providing fast access to the model
14. ML and Graphs Facts
- Machine Learning enables computer systems to solve complex real-world problems
- Deep Learning models demonstrate high predictive capacity when trained on large amounts of labeled data
- Graph-based machine learning is becoming an important trend in Artificial Intelligence
- The world's largest companies are promoting this trend
- Using graphs as the basic representation of data for machine learning has several advantages
15. Graphs in Machine Learning
Some usage patterns for graphs in machine learning applications:
- Storing data sources in a suitable way
- Tensors
- Centralizing multiple data sources (raw or processed)
- Lambda Architecture
- Knowledge Graphs
- Graph-based algorithms
- Storing the models produced
16. Storing data sources: Lambda Architecture
The Lambda Architecture is a scalable, fault-tolerant data-processing architecture that combines batch and real-time stream processing, making it suitable for fast streaming data.
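A minimal sketch of the lambda pattern in Python, assuming the classic three layers: a batch view recomputed over the immutable master dataset, a speed layer over events not yet absorbed by a batch run, and a serving layer that merges the two at query time. All names and data are illustrative.

```python
from collections import Counter

# Batch layer input: immutable, append-only log of (user, event) records.
master_log = [("alice", "click"), ("bob", "click"), ("alice", "view")]

def batch_view(log):
    """Batch layer: recompute event counts over the full master dataset."""
    return Counter(event for _, event in log)

# Speed layer: incremental counts over recent events not yet in a batch run.
recent_events = [("alice", "click")]
realtime_view = Counter(event for _, event in recent_events)

def query(event):
    """Serving layer: merge the precomputed batch view with the realtime view."""
    return batch_view(master_log)[event] + realtime_view[event]

print(query("click"))  # 2 from the batch view + 1 from the speed layer = 3
```

In a real deployment the batch view would be precomputed (e.g. by a periodic job) rather than rebuilt per query; the point is only the merge of the two views.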
17. Storing data sources: Lambda Architecture (2)
Continuous Cellular Tower Data Analysis
Eagle N., Quinn J.A., Clauset A. (2009). Methodologies for Continuous Cellular Tower Data Analysis. In: Tokuda H., Beigl M., Friday A., Brush A.J.B., Tobe Y. (eds) Pervasive Computing. Pervasive 2009. Lecture Notes in Computer Science, vol 5538. Springer, Berlin, Heidelberg.
21. Storing data sources: Tensor
Simple Recommendation
f: User x Item -> Relevance Score
Context Aware Recommendation
f: User x Item x Context1 x Context2 x Context3 -> Relevance Score
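The context-aware mapping above can be held directly in a dense tensor, one axis per argument of f. A sketch with NumPy, using a single context dimension for brevity; the shapes, ids, and scores are illustrative.

```python
import numpy as np

n_users, n_items, n_contexts = 3, 4, 2

# Context-aware relevance tensor: scores[u, i, c] is the relevance of
# item i for user u in context c (e.g. c = weekday vs. weekend).
scores = np.zeros((n_users, n_items, n_contexts))

# Observed feedback fills individual cells of the tensor.
scores[0, 2, 1] = 4.5   # user 0 rated item 2 highly at the weekend
scores[0, 2, 0] = 2.0   # ...but not on weekdays

def relevance(user, item, context):
    """f: User x Item x Context -> Relevance Score."""
    return scores[user, item, context]

print(relevance(0, 2, 1))  # 4.5
```

A simple user-item recommendation is the special case of a 2-D tensor (a matrix); each extra context variable adds one axis.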
27. Graph-Based ML algorithms
Some graph-theoretical algorithms that are relevant to machine learning processes:
- Random Walk
- PageRank
- Graph Matching
- Shortest Path
- Depth-First Graph Traversal
- Breadth-First Graph Traversal
- Minimum Spanning Tree
- node2vec
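As one concrete example from the list above, PageRank reduces to a short power iteration over an adjacency list. A minimal sketch; the damping factor and toy graph are illustrative.

```python
def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank over an adjacency-list graph {node: [successors]}."""
    nodes = list(graph)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        new_rank = {n: (1.0 - damping) / len(nodes) for n in nodes}
        for node, out in graph.items():
            if not out:  # dangling node: spread its rank uniformly
                for n in nodes:
                    new_rank[n] += damping * rank[node] / len(nodes)
            else:
                for succ in out:
                    new_rank[succ] += damping * rank[node] / len(out)
        rank = new_rank
    return rank

toy = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(toy)
print(max(ranks, key=ranks.get))  # "c" collects links from both "a" and "b"
```

The ranks always sum to 1, so they can be read as the stationary distribution of a random walk with teleportation, which is what links PageRank to the Random Walk entry above.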
28. Graph-Based ML algorithms
Keywords Extraction
Rada Mihalcea, Paul Tarau. 2004. TextRank: Bringing Order into Texts. Proceedings of EMNLP 2004, pages 404–411, Barcelona, Spain. Association for Computational Linguistics. http://www.aclweb.org/anthology/W04-3252.
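The TextRank idea cited above can be approximated in a few lines: build a co-occurrence graph of words within a sliding window, then rank the nodes with an unweighted PageRank and keep the top-scoring words as keywords. The text, window size, and scoring below are an illustrative simplification, not the paper's exact setup.

```python
from collections import defaultdict

def keyword_graph(words, window=2):
    """Undirected co-occurrence graph: words within `window` positions share an edge."""
    graph = defaultdict(set)
    for i, w in enumerate(words):
        for other in words[max(0, i - window):i]:
            if other != w:
                graph[w].add(other)
                graph[other].add(w)
    return graph

def textrank(graph, damping=0.85, iterations=30):
    """Unweighted PageRank over the word graph, as in the TextRank ranking step."""
    rank = {n: 1.0 for n in graph}
    for _ in range(iterations):
        rank = {
            n: (1 - damping) + damping * sum(rank[m] / len(graph[m]) for m in graph[n])
            for n in graph
        }
    return rank

words = "graph based ranking model for text processing graph ranking".split()
scores = textrank(keyword_graph(words))
top = sorted(scores, key=scores.get, reverse=True)[:3]
print(top)
```

Repeated, well-connected words such as "graph" and "ranking" accumulate score from their many neighbours, which is the intuition behind using PageRank for keyword extraction.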
30. Storing Models
The results of a machine learning process can be stored in a graph as well. Some examples are:
- Similarity (k-Nearest Neighbors)
- Clusters
- Spanning Trees
- Decision Trees
- Random Forests
- Markov Chains
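For the first item above, a k-nearest-neighbour result stored as a graph is simply a node per data point plus an edge to each of its k closest neighbours. A sketch using plain Euclidean distance; the data, relationship, and k are illustrative.

```python
import math

points = {"a": (0.0, 0.0), "b": (0.1, 0.0), "c": (5.0, 5.0), "d": (5.1, 5.0)}

def knn_graph(points, k=1):
    """Store a k-NN model as a graph: edges[n] lists n's k nearest neighbours."""
    edges = {}
    for name, p in points.items():
        others = sorted(
            (other for other in points if other != name),
            key=lambda o: math.dist(p, points[o]),
        )
        edges[name] = others[:k]
    return edges

graph = knn_graph(points, k=1)
print(graph["a"])  # ['b'] -- b is a's nearest neighbour
```

Once materialised as edges (e.g. SIMILAR_TO relationships in a graph database), nearest-neighbour queries become constant-time traversals instead of repeated distance computations.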
36. Application: NLP and Graphs
- Natural Language Processing applications find efficient solutions within graph-theoretical frameworks.
- This idea is not new (Freud 1901; Schvaneveldt 1989).
- Text has a lot of structure; most of it just isn't explicit.
- Tokens, events, relationships, and references are extracted from the provided text.
- The extracted information can be processed further or extended by introducing new sources of knowledge, such as ontologies.
- A suitable model for representing all of this is a Graph Model.
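A minimal property-graph sketch of that model: tokens and an extracted entity as nodes, with typed edges linking them back to the sentence. The labels and schema here are illustrative, not the GraphAware NLP schema.

```python
# Nodes and edges of a tiny property graph for one sentence.
nodes = {
    1: {"label": "Sentence", "text": "Marie Curie won the Nobel Prize"},
    2: {"label": "Token", "value": "Marie"},
    3: {"label": "Token", "value": "Curie"},
    4: {"label": "Entity", "type": "PERSON", "value": "Marie Curie"},
}
edges = [
    (1, "HAS_TOKEN", 2),
    (1, "HAS_TOKEN", 3),
    (2, "PART_OF", 4),
    (3, "PART_OF", 4),
]

def neighbours(node_id, rel):
    """Follow edges of a given type from a node."""
    return [dst for src, r, dst in edges if src == node_id and r == rel]

tokens = [nodes[t]["value"] for t in neighbours(1, "HAS_TOKEN")]
print(tokens)  # ['Marie', 'Curie']
```

New knowledge sources (e.g. an ontology linking PERSON entities to external concepts) extend the graph by adding nodes and edges, without changing the existing schema.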
37. Application: NLP and Graphs
GraphAware NLP Framework
- End-to-end framework: functionality, services, and applications from low level to high level
- A suitable graph-based storage schema
- Distributed processing using Apache Spark
- Integration with other software and services