SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. |
Visual Analysis of Social Media Data from
using Graph Technologies
DATA NATIVES 2018 | Nov 22-23, 2018 | Berlin
Karin Patenge | Principal Solution Engineer | Cloud & Core Technologies
@kpatenge |  karin.patenge@oracle.com
Oracle Deutschland B.V. & Co. KG | Potsdam | Schiffbauergasse 14
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Accessing Data Entities
• Data retrieval via REST API
https://www.meetup.com/meetup_api
• Different API methods & versions
• API Key required
• Sample request
• Data returned as JSON
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Potential Questions of Interest
• Which Meetup groups are most active in terms of:
– # members
– # events
– # event attendees
• Who and where are influencers in the Meetup community?
• Where are connections between the Meetup groups in different locations?
• Which topics are “hot” and where?
• How close/similar are groups?
• …
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Approach: Modeling Data as Graphs
The more connected the data is, the better a Graph fits
Source: http://www.ateam-oracle.com/intro-to-graphs-at-oracle/
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• A set of nodes (aka vertices)
– each vertex has a unique identifier
– each vertex has a set of in/out edges
– each vertex has a collection of key-value
properties
• A set of edges
– each edge has a unique identifier
– each edge has a head/tail vertex
– each edge has a label denoting type of
relationship between two vertices
– each edge has a collection of key-value properties
• Implementations
– Oracle (Spatial and Graph/Big Data Spatial and
Graph), Neo4j, DataStax (Titan), InfiniteGraph, …
What is a Property Graph?
https://github.com/tinkerpop/blueprints/wiki/Property-Graph-Model
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• PageRank, Weighted PageRank
– Find influencers, critical vertices
• Personalized PageRank
– Find important people/products/...
with respect to a given starting point
• Sparsification
– Trim down the graph to make it more
fragmented
• Clustering
– Find communities which can be the
basis of segmentation, and/or
recommendation/anomaly detection,
churn analysis
• Centrality
– Find critical people/devices/...
• Shortest path
– Discover links, find suspect‘s close
collaborators, transportation routing
• Breadth-First-Search (BFS)
– Impact analysis, link analysis
• Matric factorization
– Recommendation
• Reachability
– Connectivity test
• ...
Graph Algorithms and their Applications
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Pathfinding
– fattestPath
– shortestPathBellmanFord
– shortestPathBellmanFordReverse
– shortestPathDijkstra
– shortestPathDijkstraBidirectional
– shortestPathFilteredDijkstra
– shortestPathFilteredDijkstraBidirectional
– shortestPathHopDist
– shortestPathHopDistReverse
Ranking
– closenessCentralityUnitLength
– degreeCentrality
– eigenvectorCentrality
– Hyperlink-Induced Topic Search (HITS)
– inDegreeCentrality
– nodeBetweennessCentrality
– outDegreeCentrality
– PageRank, weighted PageRank
– approximatePagerank
– personalizedPagerank
– randomWalkWithRestart
Social Network Analysis Algorithms (1)
@kpatenge @datanativesconf #DN18
https://tinyurl.com/pgxdocs
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Structure Evaluation
– Conductance
– countTriangles
– inDegreeDistribution
– outDegreeDistribution
– partitionConductance
– partitionModularity
– sparsify
– K-Core computes
Community Detection
– communitiesLabelPropagation
Recommendation
– salsa
– personalizedSalsa
– whomToFollow
Classic - Connected Components
– sccKosaraju
– sccTarjan
– wcc
Social Network Analysis Algorithms (2)
@kpatenge @datanativesconf #DN18
https://tinyurl.com/pgxdocs
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Architecture of Oracle Property Graph Analytics
Property
Graph formats
GraphML
GML
GraphSON
Flat Files
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
PGX
• Toolkit for In-Memory, Parallel Graph
Analytics containing
– PGX shell
– Analyst API with a large collection of built-in
algorithms (45+)
– Enhance with user defined algorithms written
in Green-Marl
– Tutorials, JavaDoc, Use Cases, and more
• Developed by Oracle Labs
• https://docs.oracle.com/cd/E56133_01/latest/i
ndex.html
PGQL – Property Graph Query Language
• http://pgql-lang.org/
• Graph Pattern Matching combined with
SQL
• Developed by Oracle Labs
• Proposed for standardization
• Changes in Version 1.1:
http://pgql-lang.org/spec/1.1/#breaking-syntax-
changes-since-pgql-10
Property Graph Analytics Engine
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Data Processing and Analysis Workflow: Overview
Retrieve&Prepare
Prepare
source data
• Using R for data
retrieval via
REST API and
conversion
JSON  CSV 
OPV/OPE
Load&Build
Load
nodes and
edges data
into a graph
• Using Oracle
NoSQL DB as
Graph data
store
Analyze
Analyze
graph data
• Using Graph
Analytics Engine
(PGX) and
Property Graph
Query Language
(PGQL)
Visualize
Visualize
graph data
• Using
Cytoscape
Results
Summarize
results
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Demo
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
‚Big Data‘ Groups
in relation with
Topics and Cities
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
‚Big Data‘ Groups
in relation with
Organizers, Cities
and Events
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
‚Big Data‘ Groups
in relation with
Organizers and Cities
Weakly Connected
Components (WCC)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
Ranking via PageRank (Top 10+1)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
✓Which cities are tech hot spots?
✓Who are important people in the
Meetup landscape?
✓Which Meetup groups cover with
topics?
✓Which Meetup groups are relevant in
terms of
#Members, #Participants of events,
#Events
✓Which Meetup groups are related
and how?
✓Which topics are related and how?
• The way you model the graph has
influence on the results of executing
Graph algorithms
• The choice of edge directions does
matter depending on the algorithms
• Attaching weights to edges is useful
for certain algorithms
Some Results
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Key Takeaways
• Graph data model perfect to focus on connectivity
• Graph databases are powerful tools, complementing relational and other
databases
– Especially strong for analysis of graph topology and connectedness
• Visual analysis helps a great deal to understand how data are connected
– New insights, especially with relationships, dependencies and behavioral patterns
• Big variety of analytic tools and frameworks to answer all kind of questions
• Oracle Graph Technologies combined with Open Source or 3rd party tools
@kpatenge @datanativesconf #DN18
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Follow us @kpatenge @SpatialHannes @JeanIhm
karin.patenge@oracle.com
GitHub:
https://github.com/karinpatenge/DN2018
Blogs:
https://blogs.oracle.com/bigdataspatialgraph/
https://blogs.oracle.com/oraclespatial/
AskTom Office Hours for Property Graph:
https://asktom.oracle.com/pls/apex/f?p=100:551
@kpatenge @datanativesconf #DN18
20181123 dn2018 graph_analytics_k_patenge

Weitere ähnliche Inhalte

Was ist angesagt?

Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr... Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Databricks
 
In search of database nirvana - The challenges of delivering Hybrid Transacti...
In search of database nirvana - The challenges of delivering Hybrid Transacti...In search of database nirvana - The challenges of delivering Hybrid Transacti...
In search of database nirvana - The challenges of delivering Hybrid Transacti...
Rohit Jain
 

Was ist angesagt? (20)

PGQL: A Language for Graphs
PGQL: A Language for GraphsPGQL: A Language for Graphs
PGQL: A Language for Graphs
 
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
 
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr... Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4jTransforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
 
Oracle Spatial Studio: Fast and Easy Spatial Analytics and Maps
Oracle Spatial Studio:  Fast and Easy Spatial Analytics and MapsOracle Spatial Studio:  Fast and Easy Spatial Analytics and Maps
Oracle Spatial Studio: Fast and Easy Spatial Analytics and Maps
 
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperative
 
Connected datalondon metadata-driven apps
Connected datalondon metadata-driven appsConnected datalondon metadata-driven apps
Connected datalondon metadata-driven apps
 
Strata sf - Amundsen presentation
Strata sf - Amundsen presentationStrata sf - Amundsen presentation
Strata sf - Amundsen presentation
 
How Lyft Drives Data Discovery
How Lyft Drives Data DiscoveryHow Lyft Drives Data Discovery
How Lyft Drives Data Discovery
 
Data council sf amundsen presentation
Data council sf    amundsen presentationData council sf    amundsen presentation
Data council sf amundsen presentation
 
Neo4j: What's Under the Hood
Neo4j: What's Under the HoodNeo4j: What's Under the Hood
Neo4j: What's Under the Hood
 
Neo4j GraphTour Santa Monica 2019 - Amundsen Presentation
Neo4j GraphTour Santa Monica 2019 - Amundsen PresentationNeo4j GraphTour Santa Monica 2019 - Amundsen Presentation
Neo4j GraphTour Santa Monica 2019 - Amundsen Presentation
 
SHACL-based data life cycle management
SHACL-based data life cycle managementSHACL-based data life cycle management
SHACL-based data life cycle management
 
Neo4j GraphTalk Basel - Building intelligent Software with Graphs
Neo4j GraphTalk Basel - Building intelligent Software with GraphsNeo4j GraphTalk Basel - Building intelligent Software with Graphs
Neo4j GraphTalk Basel - Building intelligent Software with Graphs
 
Meetup SF - Amundsen
Meetup SF  -  AmundsenMeetup SF  -  Amundsen
Meetup SF - Amundsen
 
Graph Algorithms for Developers
Graph Algorithms for DevelopersGraph Algorithms for Developers
Graph Algorithms for Developers
 
Neo4j GraphDay Seattle- Sept19- in the enterprise
Neo4j GraphDay Seattle- Sept19-  in the enterpriseNeo4j GraphDay Seattle- Sept19-  in the enterprise
Neo4j GraphDay Seattle- Sept19- in the enterprise
 
In search of database nirvana - The challenges of delivering Hybrid Transacti...
In search of database nirvana - The challenges of delivering Hybrid Transacti...In search of database nirvana - The challenges of delivering Hybrid Transacti...
In search of database nirvana - The challenges of delivering Hybrid Transacti...
 
Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15
Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15
Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15
 
GraphTour - Neo4j Platform Overview
GraphTour - Neo4j Platform OverviewGraphTour - Neo4j Platform Overview
GraphTour - Neo4j Platform Overview
 

Ähnlich wie 20181123 dn2018 graph_analytics_k_patenge

Ähnlich wie 20181123 dn2018 graph_analytics_k_patenge (20)

Graph Analytics on Data from Meetup.com
Graph Analytics on Data from Meetup.comGraph Analytics on Data from Meetup.com
Graph Analytics on Data from Meetup.com
 
An Introduction to Graph: Database, Analytics, and Cloud Services
An Introduction to Graph:  Database, Analytics, and Cloud ServicesAn Introduction to Graph:  Database, Analytics, and Cloud Services
An Introduction to Graph: Database, Analytics, and Cloud Services
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
 
Oracle Data Science Platform
Oracle Data Science PlatformOracle Data Science Platform
Oracle Data Science Platform
 
Neo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
Neo4j GraphTalk Oslo - Building Intelligent Solutions with GraphsNeo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
Neo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
 
OracleCode_Berlin_Jun2018_AnalyzeBitcoinTransactionDataUsingAsGraph
OracleCode_Berlin_Jun2018_AnalyzeBitcoinTransactionDataUsingAsGraphOracleCode_Berlin_Jun2018_AnalyzeBitcoinTransactionDataUsingAsGraph
OracleCode_Berlin_Jun2018_AnalyzeBitcoinTransactionDataUsingAsGraph
 
Applying large scale text analytics with graph databases
Applying large scale text analytics with graph databasesApplying large scale text analytics with graph databases
Applying large scale text analytics with graph databases
 
Domain Specific Languages for Parallel Graph AnalytiX (PGX)
Domain Specific Languages for Parallel Graph AnalytiX (PGX)Domain Specific Languages for Parallel Graph AnalytiX (PGX)
Domain Specific Languages for Parallel Graph AnalytiX (PGX)
 
GraphGen: Conducting Graph Analytics over Relational Databases
GraphGen: Conducting Graph Analytics over Relational DatabasesGraphGen: Conducting Graph Analytics over Relational Databases
GraphGen: Conducting Graph Analytics over Relational Databases
 
GraphGen: Conducting Graph Analytics over Relational Databases
GraphGen: Conducting Graph Analytics over Relational DatabasesGraphGen: Conducting Graph Analytics over Relational Databases
GraphGen: Conducting Graph Analytics over Relational Databases
 
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J ConferenceNodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
 
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with GraphsNeo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
 
Data meets AI - AICUG - Santa Clara
Data meets AI  - AICUG - Santa ClaraData meets AI  - AICUG - Santa Clara
Data meets AI - AICUG - Santa Clara
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise Graph
 
Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)
Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)
Build Deep Learning Applications for Big Data Platforms (CVPR 2018 tutorial)
 
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUGIntroducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
 
From Developer to Data Scientist - Gaines Kergosien
From Developer to Data Scientist - Gaines KergosienFrom Developer to Data Scientist - Gaines Kergosien
From Developer to Data Scientist - Gaines Kergosien
 
Using graphs for recommendations
Using graphs for recommendationsUsing graphs for recommendations
Using graphs for recommendations
 

Mehr von Karin Patenge

Mehr von Karin Patenge (17)

20190704_AGIT_Georaster_ImageryData_KPatenge
20190704_AGIT_Georaster_ImageryData_KPatenge20190704_AGIT_Georaster_ImageryData_KPatenge
20190704_AGIT_Georaster_ImageryData_KPatenge
 
20190703_AGIT_GeoRasterWorkshop_GriddedData_KPatenge
20190703_AGIT_GeoRasterWorkshop_GriddedData_KPatenge20190703_AGIT_GeoRasterWorkshop_GriddedData_KPatenge
20190703_AGIT_GeoRasterWorkshop_GriddedData_KPatenge
 
20190604_DOAGDatabase2019_OracleNoSQLDB_for_DBAs
20190604_DOAGDatabase2019_OracleNoSQLDB_for_DBAs20190604_DOAGDatabase2019_OracleNoSQLDB_for_DBAs
20190604_DOAGDatabase2019_OracleNoSQLDB_for_DBAs
 
Big Data Community Webinar vom 16. Mai 2019: Oracle NoSQL DB im Überblick
Big Data Community Webinar vom 16. Mai 2019: Oracle NoSQL DB im ÜberblickBig Data Community Webinar vom 16. Mai 2019: Oracle NoSQL DB im Überblick
Big Data Community Webinar vom 16. Mai 2019: Oracle NoSQL DB im Überblick
 
20181210_ITTage2018_OracleNoSQLDB_KPatenge
20181210_ITTage2018_OracleNoSQLDB_KPatenge20181210_ITTage2018_OracleNoSQLDB_KPatenge
20181210_ITTage2018_OracleNoSQLDB_KPatenge
 
20181120_DOAG_OracleNoSQLDB_KPatenge
20181120_DOAG_OracleNoSQLDB_KPatenge20181120_DOAG_OracleNoSQLDB_KPatenge
20181120_DOAG_OracleNoSQLDB_KPatenge
 
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
 
5 Gründe für Oracle Spatial Technologies
5 Gründe für Oracle Spatial Technologies5 Gründe für Oracle Spatial Technologies
5 Gründe für Oracle Spatial Technologies
 
IT-Tage 2017: Visuelle Analyse komplexer Datenbestände am Beispiel der Panama...
IT-Tage 2017: Visuelle Analyse komplexer Datenbestände am Beispiel der Panama...IT-Tage 2017: Visuelle Analyse komplexer Datenbestände am Beispiel der Panama...
IT-Tage 2017: Visuelle Analyse komplexer Datenbestände am Beispiel der Panama...
 
20171106_OracleWebcast_ITTrends_EFavuzzi_KPatenge
20171106_OracleWebcast_ITTrends_EFavuzzi_KPatenge20171106_OracleWebcast_ITTrends_EFavuzzi_KPatenge
20171106_OracleWebcast_ITTrends_EFavuzzi_KPatenge
 
20171121_DOAGKonferenz_JSON_OracleNoSQL_KPatenge
20171121_DOAGKonferenz_JSON_OracleNoSQL_KPatenge20171121_DOAGKonferenz_JSON_OracleNoSQL_KPatenge
20171121_DOAGKonferenz_JSON_OracleNoSQL_KPatenge
 
Oracle NoSQL Database: Integration in den Oracle Enterprise Manager 12
Oracle NoSQL Database: Integration in den Oracle Enterprise Manager 12Oracle NoSQL Database: Integration in den Oracle Enterprise Manager 12
Oracle NoSQL Database: Integration in den Oracle Enterprise Manager 12
 
20160310_ModernApplicationDevelopment_NoSQL_KPatenge
20160310_ModernApplicationDevelopment_NoSQL_KPatenge20160310_ModernApplicationDevelopment_NoSQL_KPatenge
20160310_ModernApplicationDevelopment_NoSQL_KPatenge
 
20160229_ModernApplicationDevelopment_Python_KPatenge
20160229_ModernApplicationDevelopment_Python_KPatenge20160229_ModernApplicationDevelopment_Python_KPatenge
20160229_ModernApplicationDevelopment_Python_KPatenge
 
Datenbank-gestützte Validierung und Geokodierung von Adressdatenbeständen
Datenbank-gestützte Validierung und Geokodierung von AdressdatenbeständenDatenbank-gestützte Validierung und Geokodierung von Adressdatenbeständen
Datenbank-gestützte Validierung und Geokodierung von Adressdatenbeständen
 
Raster Algebra mit Oracle Spatial und uDig
Raster Algebra mit Oracle Spatial und uDigRaster Algebra mit Oracle Spatial und uDig
Raster Algebra mit Oracle Spatial und uDig
 
Geodatenmanagement und -Visualisierung mit Oracle Spatial Technologies
Geodatenmanagement und -Visualisierung mit Oracle Spatial TechnologiesGeodatenmanagement und -Visualisierung mit Oracle Spatial Technologies
Geodatenmanagement und -Visualisierung mit Oracle Spatial Technologies
 

Kürzlich hochgeladen

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
HyderabadDolls
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 

Kürzlich hochgeladen (20)

RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 

20181123 dn2018 graph_analytics_k_patenge

  • 1. Copyright © 2017, Oracle and/or its affiliates. All rights reserved. | Visual Analysis of Social Media Data from using Graph Technologies DATA NATIVES 2018 | Nov 22-23, 2018 | Berlin Karin Patenge | Principal Solution Engineer | Cloud & Core Technologies @kpatenge |  karin.patenge@oracle.com Oracle Deutschland B.V. & Co. KG | Potsdam | Schiffbauergasse 14
  • 2. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18
  • 3. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Accessing Data Entities • Data retrieval via REST API https://www.meetup.com/meetup_api • Different API methods & versions • API Key required • Sample request • Data returned as JSON @kpatenge @datanativesconf #DN18
  • 4. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Potential Questions of Interest • Which Meetup groups are most active in terms of: – # members – # events – # event attendees • Who and where are influencers in the Meetup community? • Where are connections between the Meetup groups in different locations? • Which topics are “hot” and where? • How close/similar are groups? • … @kpatenge @datanativesconf #DN18
  • 5. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Approach: Modeling Data as Graphs The more connected the data is, the better a Graph fits Source: http://www.ateam-oracle.com/intro-to-graphs-at-oracle/ @kpatenge @datanativesconf #DN18
  • 6. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | • A set of nodes (aka vertices) – each vertex has a unique identifier – each vertex has a set of in/out edges – each vertex has a collection of key-value properties • A set of edges – each edge has a unique identifier – each edge has a head/tail vertex – each edge has a label denoting type of relationship between two vertices – each edge has a collection of key-value properties • Implementations – Oracle (Spatial and Graph/Big Data Spatial and Graph), Neo4j, DataStax (Titan), InfiniteGraph, … What is a Property Graph? https://github.com/tinkerpop/blueprints/wiki/Property-Graph-Model @kpatenge @datanativesconf #DN18
  • 7. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | • PageRank, Weighted PageRank – Find influencers, critical vertices • Personalized PageRank – Find important people/products/... with respect to a given starting point • Sparsification – Trim down the graph to make it more fragmented • Clustering – Find communities which can be the basis of segmentation, and/or recommendation/anomaly detection, churn analysis • Centrality – Find critical people/devices/... • Shortest path – Discover links, find suspect‘s close collaborators, transportation routing • Breadth-First-Search (BFS) – Impact analysis, link analysis • Matric factorization – Recommendation • Reachability – Connectivity test • ... Graph Algorithms and their Applications @kpatenge @datanativesconf #DN18
  • 8. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Pathfinding – fattestPath – shortestPathBellmanFord – shortestPathBellmanFordReverse – shortestPathDijkstra – shortestPathDijkstraBidirectional – shortestPathFilteredDijkstra – shortestPathFilteredDijkstraBidirectional – shortestPathHopDist – shortestPathHopDistReverse Ranking – closenessCentralityUnitLength – degreeCentrality – eigenvectorCentrality – Hyperlink-Induced Topic Search (HITS) – inDegreeCentrality – nodeBetweennessCentrality – outDegreeCentrality – PageRank, weighted PageRank – approximatePagerank – personalizedPagerank – randomWalkWithRestart Social Network Analysis Algorithms (1) @kpatenge @datanativesconf #DN18 https://tinyurl.com/pgxdocs
  • 9. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Structure Evaluation – Conductance – countTriangles – inDegreeDistribution – outDegreeDistribution – partitionConductance – partitionModularity – sparsify – K-Core computes Community Detection – communitiesLabelPropagation Recommendation – salsa – personalizedSalsa – whomToFollow Classic - Connected Components – sccKosaraju – sccTarjan – wcc Social Network Analysis Algorithms (2) @kpatenge @datanativesconf #DN18 https://tinyurl.com/pgxdocs
  • 10. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Architecture of Oracle Property Graph Analytics Property Graph formats GraphML GML GraphSON Flat Files @kpatenge @datanativesconf #DN18
  • 11. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | PGX • Toolkit for In-Memory, Parallel Graph Analytics containing – PGX shell – Analyst API with a large collection of built-in algorithms (45+) – Enhance with user defined algorithms written in Green-Marl – Tutorials, JavaDoc, Use Cases, and more • Developed by Oracle Labs • https://docs.oracle.com/cd/E56133_01/latest/i ndex.html PGQL – Property Graph Query Language • http://pgql-lang.org/ • Graph Pattern Matching combined with SQL • Developed by Oracle Labs • Proposed for standardization • Changes in Version 1.1: http://pgql-lang.org/spec/1.1/#breaking-syntax- changes-since-pgql-10 Property Graph Analytics Engine @kpatenge @datanativesconf #DN18
  • 12. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Data Processing and Analysis Workflow: Overview Retrieve&Prepare Prepare source data • Using R for data retrieval via REST API and conversion JSON  CSV  OPV/OPE Load&Build Load nodes and edges data into a graph • Using Oracle NoSQL DB as Graph data store Analyze Analyze graph data • Using Graph Analytics Engine (PGX) and Property Graph Query Language (PGQL) Visualize Visualize graph data • Using Cytoscape Results Summarize results @kpatenge @datanativesconf #DN18
  • 13. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Demo @kpatenge @datanativesconf #DN18
  • 14. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 ‚Big Data‘ Groups in relation with Topics and Cities
  • 15. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 ‚Big Data‘ Groups in relation with Organizers, Cities and Events
  • 16. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 ‚Big Data‘ Groups in relation with Organizers and Cities Weakly Connected Components (WCC)
  • 17. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 18. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 19. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 20. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | @kpatenge @datanativesconf #DN18 Ranking via PageRank (Top 10+1)
  • 21. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | ✓Which cities are tech hot spots? ✓Who are important people in the Meetup landscape? ✓Which Meetup groups cover with topics? ✓Which Meetup groups are relevant in terms of #Members, #Participants of events, #Events ✓Which Meetup groups are related and how? ✓Which topics are related and how? • The way you model the graph has influence on the results of executing Graph algorithms • The choice of edge directions does matter depending on the algorithms • Attaching weights to edges is useful for certain algorithms Some Results @kpatenge @datanativesconf #DN18
  • 22. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Key Takeaways • Graph data model perfect to focus on connectivity • Graph databases are powerful tools, complementing relational and other databases – Especially strong for analysis of graph topology and connectedness • Visual analysis helps a great deal to understand how data are connected – New insights, especially with relationships, dependencies and behavioral patterns • Big variety of analytic tools and frameworks to answer all kind of questions • Oracle Graph Technologies combined with Open Source or 3rd party tools @kpatenge @datanativesconf #DN18
  • 23. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Follow us @kpatenge @SpatialHannes @JeanIhm karin.patenge@oracle.com GitHub: https://github.com/karinpatenge/DN2018 Blogs: https://blogs.oracle.com/bigdataspatialgraph/ https://blogs.oracle.com/oraclespatial/ AskTom Office Hours for Property Graph: https://asktom.oracle.com/pls/apex/f?p=100:551 @kpatenge @datanativesconf #DN18