A view of graph data usage by Cerved
With a data science perspective
25 September 2017
Stefano Gatti, Head of Innovation and Data Sources
Nunzio Pellegrino, Senior Data Scientist, Innovation Team
Cerved and its graphs in a nutshell
Cerved, in a nutshell
The Italian data-driven company
CREDIT INFORMATION
Protection against credit risk
MARKETING SOLUTIONS
New business opportunities
CREDIT MANAGEMENT
Manage and collect performing and
non-performing loans.
	
•  Documents: over 1,000 a minute
•  Lines of code: over 40 million
•  Customers: over 30,000
•  Data sources: over 50 different
•  API calls: over 10 million a day
•  People: over 1,900
•  Revenue: 377 million EUR (2016)
Our big data
[Figure: data sources arranged along a "Complexity" axis: Web data, Open data, Proprietary data, Official data, Chamber of Commerce official data]
Cerved, in a tech view
Data
Algorithms
Solutions
Towards the algorithmic economy …
Cerved Graph Story
•  2011-12: we started from an IT problem, re-engineering the beneficial owner algorithm
•  2014-15: we moved on to a more algorithmic problem, the corporate linkages algorithm
•  2015-16: we went with a “full stack” solution
Cerved Graph thoughts
We strongly believe in …
•  The power of linking data
•  The power of analyzing data with network analysis
•  The power of visualizing data in a different way
To understand a little better the increasing complexity of the modern world … also from an economic point of view
Why a Graph Database?
What is a Graph?
Key Concepts: Graph database
•  NoSQL database
•  Manages highly connected data and complex queries
•  Flexible data model
•  Declarative or imperative query language
•  Horizontal scaling
•  Native graph storage and processing
Where can a graph DB be useful?
“Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
Maybe in the future…
“Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
Frame the Problem
Data Model
1. Simple
2. Expressive
3. Additive
RDBMS vs Graph Data Model
[Figure: side-by-side panels labeled "RDBMS" and "Graph"]
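To make the comparison concrete, here is a minimal sketch of the same two-hop ownership question answered over a relational layout (a join table) and over a graph layout (nodes holding their own relationships). All entity names, shares and function names are hypothetical and are not taken from Cerved's data or code.

```python
# Hypothetical toy data: names and shares are invented for illustration.

# Relational view: entities in one table, relationships in a join table.
entities = {1: "Alice", 2: "HoldCo", 3: "OpCo"}
owns_rows = [(1, 2, 0.60), (2, 3, 0.80)]  # (owner_id, owned_id, share)


def owned_two_hops_relational(owner_id):
    """Self-join of the owns table: owner -> x -> y."""
    return sorted(
        entities[second[1]]
        for first in owns_rows if first[0] == owner_id
        for second in owns_rows if second[0] == first[1]
    )


# Graph view: each node keeps its outgoing relationships directly.
graph = {
    "Alice": [("OWNS", "HoldCo", 0.60)],
    "HoldCo": [("OWNS", "OpCo", 0.80)],
    "OpCo": [],
}


def owned_two_hops_graph(owner):
    """Follow OWNS relationships twice, starting from the owner node."""
    return sorted(
        far
        for rel1, near, _ in graph[owner] if rel1 == "OWNS"
        for rel2, far, _ in graph[near] if rel2 == "OWNS"
    )


relational_result = owned_two_hops_relational(1)
graph_result = owned_two_hops_graph("Alice")
print(relational_result, graph_result)
```

Both functions return the same answer. The difference is that the relational version scans the join table once per hop, while the graph version only touches the relationships attached to the nodes it visits.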
Store & Get Data
Store data
•  Native graph storage
•  Fast write performance
•  Easy data integration: CSV, JDBC, REST API
Get data
•  Native graph processing → index-free adjacency
•  Cypher, a declarative language
•  Drivers: Python, py2neo (unofficial), R (unofficial), Java
•  APOC
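Index-free adjacency means that each node holds direct references to its neighbours, so a traversal follows pointers instead of consulting a global index at every hop. The sketch below illustrates the idea in plain Python; the `Node` class and the `connect` and `reachable` helpers are hypothetical names, not part of any graph database API.

```python
class Node:
    """A node that holds direct references to its neighbours."""

    def __init__(self, name):
        self.name = name
        self.out = []  # direct pointers to neighbouring Node objects


def connect(a, b):
    """Add a directed relationship a -> b by storing a reference on a."""
    a.out.append(b)


def reachable(start, max_hops):
    """Breadth-first expansion that only follows stored references."""
    seen = {start.name}
    frontier = [start]
    for _ in range(max_hops):
        next_frontier = []
        for node in frontier:
            for neighbour in node.out:
                if neighbour.name not in seen:
                    seen.add(neighbour.name)
                    next_frontier.append(neighbour)
        frontier = next_frontier
    seen.discard(start.name)
    return sorted(seen)


a, b, c, d = (Node(n) for n in "ABCD")
connect(a, b)
connect(b, c)
connect(c, d)
one_hop = reachable(a, 1)
two_hops = reachable(a, 2)
print(one_hop, two_hops)
```

The cost of each hop depends on how many relationships the visited nodes have, not on the total size of the graph.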
Explore Data
•  Transform implicit to explicit
•  Cypher (access points, patterns)
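One way to read "transform implicit to explicit" is materializing a relationship that is only implied by a pattern. The sketch below assumes a hypothetical schema in which people are directors of companies, and derives explicit company-to-company links from shared directors. The names, the data and the Cypher string are illustrative assumptions, not Cerved's model; the Cypher text is shown only for comparison and is not executed.

```python
from itertools import combinations

# Roughly the same idea as a Cypher pattern (hypothetical labels and types).
CYPHER_SKETCH = """
MATCH (a:Company)<-[:DIRECTOR_OF]-(p:Person)-[:DIRECTOR_OF]->(b:Company)
MERGE (a)-[:SHARES_DIRECTOR]-(b)
"""

# Hypothetical input: (person, company) pairs for a DIRECTOR_OF relationship.
director_of = [
    ("Rossi", "Alpha"),
    ("Rossi", "Beta"),
    ("Bianchi", "Beta"),
    ("Bianchi", "Gamma"),
]


def shared_director_links(pairs):
    """Return explicit company-company links implied by a common director."""
    companies_by_person = {}
    for person, company in pairs:
        companies_by_person.setdefault(person, set()).add(company)
    links = set()
    for companies in companies_by_person.values():
        # Every pair of companies with the same director becomes one link.
        for a, b in combinations(sorted(companies), 2):
            links.add((a, b))
    return sorted(links)


explicit_links = shared_director_links(director_of)
print(explicit_links)
```

Once the link is stored explicitly, later queries and algorithms can use it directly instead of re-evaluating the two-step pattern.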
Prepare Data
Feature creation with parallel graph algorithms
Centralities
•  PageRank
•  Betweenness Centrality
•  Closeness Centrality
Graph Partitioning
•  Label Propagation
•  Connected Components
•  Strongly Connected Components
Path Finding
•  Minimum Weight Spanning Tree
•  All Pairs and Single Source Shortest Path
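As an illustration of how such algorithms turn into per-node features, here is a minimal single-threaded sketch of two of them: PageRank by power iteration and (weakly) connected components by union-find. It is not the parallel implementation the slide refers to; the edge list, the damping factor of 0.85 and the iteration count are assumptions chosen for the example.

```python
def pagerank(edges, damping=0.85, iterations=50):
    """PageRank by power iteration over a directed edge list."""
    nodes = sorted({n for edge in edges for n in edge})
    out_links = {n: [] for n in nodes}
    for src, dst in edges:
        out_links[src].append(dst)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        new_rank = {n: (1.0 - damping) / len(nodes) for n in nodes}
        for src in nodes:
            # A node without outgoing links spreads its rank over all nodes.
            targets = out_links[src] or nodes
            share = damping * rank[src] / len(targets)
            for dst in targets:
                new_rank[dst] += share
        rank = new_rank
    return rank


def connected_components(edges):
    """Weakly connected components via union-find (edge direction ignored)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in edges:
        parent[find(a)] = find(b)
    groups = {}
    for node in parent:
        groups.setdefault(find(node), set()).add(node)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical toy graph.
edges = [("A", "B"), ("B", "C"), ("C", "A"), ("D", "C"), ("E", "F")]
rank = pagerank(edges)
components = connected_components(edges)

# One feature row per node: its PageRank score and its component index.
component_of = {n: i for i, comp in enumerate(components) for n in comp}
features = {n: {"pagerank": rank[n], "component": component_of[n]} for n in rank}
print(features)
```

The resulting `features` dictionary is the kind of table that can be joined to other company-level attributes before model training.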
Performance

Graph             Size (GB)  Nodes (M)  Rels (M)  PageRank (s)  ConCom (s)  LabelPropag (s)  StrongConCom (s)
Pokec                   7.3          2        31            10          24               12                12
DBPedia                  15         11       117            46          91               51                65
Graphs500-23            7.9          5       129            19          29               18                25
Twitter-2010             49         42      1468           349         353              405               339
soc-LifeJournal1        6.3          5        69            30          34               25                23
Friendster               62         66      1806           611         619              296               483
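To compare the runs across graphs of different sizes, the figures can be normalized to relationships processed per second. The sketch below does this for the PageRank column, assuming that "(s)" denotes wall-clock seconds and "rels (M)" millions of relationships; the numbers are copied from the performance table, and the variable names are illustrative.

```python
# (graph, relationships in millions, PageRank time in seconds), from the table.
rows = [
    ("Pokec", 31, 10),
    ("DBPedia", 117, 46),
    ("Graphs500-23", 129, 19),
    ("Twitter-2010", 1468, 349),
    ("soc-LifeJournal1", 69, 30),
    ("Friendster", 1806, 611),
]

# Millions of relationships processed per second of PageRank runtime.
throughput = {name: rels / secs for name, rels, secs in rows}
fastest = max(throughput, key=throughput.get)

for name, value in throughput.items():
    print(f"{name}: {value:.2f} M rels/s")
print("highest throughput:", fastest)
```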
Present & Launch your solution
•  Real-time Recommendation
•  Fraud Detection
•  Social Network Analysis
•  Search & Link Analysis
•  Knowledge Graph
•  Natural Language Processing
Nunzio Pellegrino
Senior Data Scientist, Innovation Team
nunzio.pellegrino@cerved.com
Stefano Gatti
Head of Innovation & Data Sources
stefano.gatti@cerved.com

Cerved Datascience Milan
