SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
Neo4j for Bacterial Genomes
Sixing Huang
1
Talk Overview
• How to do Neo4j bioinformatics on a local computer with public data
• Three use cases:
ofor genome analyses
ofor Carbohydrate-active enzymes
ofor antibiotic resistance in CARD
• Conclusions
2
Genes
Genomes Phenotypes
Degrade cellulose
Resist cephalosporin
Transport Fe2+
About me – Sixing Huang
• Studied biology and bioinformatics in Bremen.
• Worked as a bioinformatic data scientist in DSZM Braunschweig.
• Now as a bioinformatic scientist in MGI Shenzhen.
• First contact with Neo4j in 2019.
• Now use Neo4j for knowledge management, genome browser and
database and write about my Neo4j journey in medium.com.
3
Genomes have structures
• Genes are more than a bag of words.
• Neo4j can model genome structures.
Gene1 Gene2 Gene3
4
Neo4j as a genome browser
5
EMBL file Neo4j genome browser
Import
Gene CAZy clusters and annotations
6
MATCH p=(f0:Gene) -[:NEXT*5] -> (f1:Gene) -[:NEXT*5] ->(f2:Gene)
WHERE f1.name =~ '.+GH16[^a-zA-Zds:]*'
Neo4j Commander
Bacterial genome data
Data in biological studies
Genome1 3 4 1 0 1 0 1 3 2 1 3 2 1
Genome2 2 1 3 1 3 0 1 3 2 1 4 1 2
Genes
Neo4j can compare core and pan genomes effectively.
7
Heavy use of orthology
Taxonomy
Phylum
Class1 Class2
Order1 Order2 Order3
Genome1 Genome2 Genome3 Genome4
KEGG Gene annotation
Metabolism
Carbohydrate
metabolism
Glycolysis /
Gluconeogenesis
alcohol
dehydrogenase
K00001
Lipid metabolism
Fatty acid
degradation
acyl-CoA
dehydrogenase
K06445
hexokinase
K00844
Neo4j can model orthology intuitively.
8
Neo4j for genome analyses
9
Taxonomy
Gene
annotation
Chromobacterium sp. ATCC 53434
10
KO in sisters as filter
KO in 53434 - filter
Unique KO in 53434
Unique KO in ATCC 53434
11
KO Annotation
K04783 yersiniabactin salicyl-AMP ligase [EC:6.3.2.-]
K04784 yersiniabactin nonribosomal peptide synthetase
K12241 pyochelin biosynthesis protein PchG
K12242 pyochelin biosynthesis protein PchC
K13255 ferric iron reductase protein FhuF
K23227
ferric hydroxamate transport system substrate-
binding protein
K23228
ferric hydroxamate transport system permease
protein
K10829
ferric hydroxamate transport system ATP-binding
protein [EC:7.2.2.16]
siderophore
siderophore
ferric hydroxamate transport
Hypothesis: ATCC 53434 has a unique repertoire of iron related transport proteins.
Phylogeny made easy
12
Order by numbers of shared KO
Compute the shared KO
Name Shared KO
Chromobacterium vaccinii 1869
Chromobacterium sp. IIBBL 112-1 1867
Chromobacterium rhizoryzae 1777
Chromobacterium haemolyticum 1776
Chromobacterium sp. 257-1 1723
Neo4j for Carbohydrate-active EnZYmes:
13
Reannotation of Formosa agariphila KMM 3901
CAZy Annotation
PL28 ulvan lyase
PL37 ulvan lyase
GH28 polygalacturonase
GH78 alpha-L-rhamnosidase
GH105 unsaturated rhamnogalacturonyl hydrolase
GH86 beta-agarase
Gh168 endo-alpha-(1,3)-L-fucanase
Unique CAZy not in sister genomes
Visualization
degrade
ulvan
degrade
pectin
degrade sulfated
polysaccharide
unique agarolytic life strategy
Node classification: Predict cellulose degradation
14
CV: Training: 0.67; Testing: 0.65
CAZy annotated genomes
annotated with cellulose degradability
Holdout
Embedding
Deploy on holdout genomes
15
Predicted cellulose degrader Confirmed
Micromonospora carbonacea africana ✓
Micromonospora sp. HM134 X
Streptacidiphilus sp. P02-A3a
Streptomyces prasinus ATCC 13879 ?
Streptomyces sp. CB01881
Streptomyces sp. SirexAA-E ✓
Streptomyces venezuelae ATCC 15439
Streptomyces venezuelae ATCC 21113
Xanthomonas citri pv. citri UI7 ✓
Xenorhabdus budapestensis C72
Kitasatospora sp. MMS16-BH015
CAZy annotated genomes
Holdout
Prediction
Results
GraphQL for non-Neo4j users
16
Antibiotic resistance with CARD
17
Fluoroquinolone resistance Cephalosporin resistance
Conclusions
• Neo4j can serve as an all-in-one genome browser, a biodata
warehouse and a data mining tool.
• It can deliver insights more quickly than relational database + SQL
• Built-in machine learning can predict new connections and properties
• GraphQL serves data for non-Neo4j users
18
Acknowledgement
• Hanno Teeling (MPI Bremen)
• The Neo4j Community
• MGI (BGI)
19

Weitere ähnliche Inhalte

Was ist angesagt?

BT Group: Use of Graph in VENA (a smart broadcast network)
BT Group: Use of Graph in VENA (a smart broadcast network)BT Group: Use of Graph in VENA (a smart broadcast network)
BT Group: Use of Graph in VENA (a smart broadcast network)Neo4j
 
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...Neo4j
 
How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...Neo4j
 
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...Neo4j
 
Elsevier: Empowering Knowledge Discovery in Research with Graphs
Elsevier: Empowering Knowledge Discovery in Research with GraphsElsevier: Empowering Knowledge Discovery in Research with Graphs
Elsevier: Empowering Knowledge Discovery in Research with GraphsNeo4j
 
Intro to Cypher
Intro to CypherIntro to Cypher
Intro to CypherNeo4j
 
The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...Neo4j
 
Fraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business AuthorityFraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business AuthorityNeo4j
 
Technip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matterTechnip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matterNeo4j
 
Training Week: Introduction to Neo4j
Training Week: Introduction to Neo4jTraining Week: Introduction to Neo4j
Training Week: Introduction to Neo4jNeo4j
 
Big Pharma Problems. Big Graphs: Creating the Merck Manufacturing Mesh
Big Pharma Problems. Big Graphs: Creating the Merck Manufacturing MeshBig Pharma Problems. Big Graphs: Creating the Merck Manufacturing Mesh
Big Pharma Problems. Big Graphs: Creating the Merck Manufacturing MeshNeo4j
 
Neanex - Semantic Construction with Graphs
Neanex - Semantic Construction with GraphsNeanex - Semantic Construction with Graphs
Neanex - Semantic Construction with GraphsNeo4j
 
Training Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryTraining Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryNeo4j
 
Training Series - Intro to Neo4j
Training Series - Intro to Neo4jTraining Series - Intro to Neo4j
Training Series - Intro to Neo4jNeo4j
 
AstraZeneca - The promise of graphs & graph-based learning in drug discovery
AstraZeneca - The promise of graphs & graph-based learning in drug discoveryAstraZeneca - The promise of graphs & graph-based learning in drug discovery
AstraZeneca - The promise of graphs & graph-based learning in drug discoveryNeo4j
 
Better Together: Delivering Graph Value with AWS & Neo4j - Antony Prasad The...
Better Together:  Delivering Graph Value with AWS & Neo4j - Antony Prasad The...Better Together:  Delivering Graph Value with AWS & Neo4j - Antony Prasad The...
Better Together: Delivering Graph Value with AWS & Neo4j - Antony Prasad The...Neo4j
 
Full Stack Graph in the Cloud
Full Stack Graph in the CloudFull Stack Graph in the Cloud
Full Stack Graph in the CloudNeo4j
 
Workshop Introduction to Neo4j
Workshop Introduction to Neo4jWorkshop Introduction to Neo4j
Workshop Introduction to Neo4jNeo4j
 
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j
 
SITA WorldTracer - Lost & Found Property
SITA WorldTracer -  Lost & Found PropertySITA WorldTracer -  Lost & Found Property
SITA WorldTracer - Lost & Found PropertyNeo4j
 

Was ist angesagt? (20)

BT Group: Use of Graph in VENA (a smart broadcast network)
BT Group: Use of Graph in VENA (a smart broadcast network)BT Group: Use of Graph in VENA (a smart broadcast network)
BT Group: Use of Graph in VENA (a smart broadcast network)
 
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
 
How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...
 
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
SERVIER Pegasus - Graphe de connaissances pour les phases primaires de recher...
 
Elsevier: Empowering Knowledge Discovery in Research with Graphs
Elsevier: Empowering Knowledge Discovery in Research with GraphsElsevier: Empowering Knowledge Discovery in Research with Graphs
Elsevier: Empowering Knowledge Discovery in Research with Graphs
 
Intro to Cypher
Intro to CypherIntro to Cypher
Intro to Cypher
 
The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...The path to success with graph database and graph data science_ Neo4j GraphSu...
The path to success with graph database and graph data science_ Neo4j GraphSu...
 
Fraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business AuthorityFraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business Authority
 
Technip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matterTechnip Energies Italy: Planning is a graph matter
Technip Energies Italy: Planning is a graph matter
 
Training Week: Introduction to Neo4j
Training Week: Introduction to Neo4jTraining Week: Introduction to Neo4j
Training Week: Introduction to Neo4j
 
Big Pharma Problems. Big Graphs: Creating the Merck Manufacturing Mesh
Big Pharma Problems. Big Graphs: Creating the Merck Manufacturing MeshBig Pharma Problems. Big Graphs: Creating the Merck Manufacturing Mesh
Big Pharma Problems. Big Graphs: Creating the Merck Manufacturing Mesh
 
Neanex - Semantic Construction with Graphs
Neanex - Semantic Construction with GraphsNeanex - Semantic Construction with Graphs
Neanex - Semantic Construction with Graphs
 
Training Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryTraining Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL Library
 
Training Series - Intro to Neo4j
Training Series - Intro to Neo4jTraining Series - Intro to Neo4j
Training Series - Intro to Neo4j
 
AstraZeneca - The promise of graphs & graph-based learning in drug discovery
AstraZeneca - The promise of graphs & graph-based learning in drug discoveryAstraZeneca - The promise of graphs & graph-based learning in drug discovery
AstraZeneca - The promise of graphs & graph-based learning in drug discovery
 
Better Together: Delivering Graph Value with AWS & Neo4j - Antony Prasad The...
Better Together:  Delivering Graph Value with AWS & Neo4j - Antony Prasad The...Better Together:  Delivering Graph Value with AWS & Neo4j - Antony Prasad The...
Better Together: Delivering Graph Value with AWS & Neo4j - Antony Prasad The...
 
Full Stack Graph in the Cloud
Full Stack Graph in the CloudFull Stack Graph in the Cloud
Full Stack Graph in the Cloud
 
Workshop Introduction to Neo4j
Workshop Introduction to Neo4jWorkshop Introduction to Neo4j
Workshop Introduction to Neo4j
 
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time Analytics
 
SITA WorldTracer - Lost & Found Property
SITA WorldTracer -  Lost & Found PropertySITA WorldTracer -  Lost & Found Property
SITA WorldTracer - Lost & Found Property
 

Ähnlich wie Neo4j for Bacterial Genomes

20081216 05袁國芳 紅麴菌基因體計畫及基因研究
20081216 05袁國芳 紅麴菌基因體計畫及基因研究20081216 05袁國芳 紅麴菌基因體計畫及基因研究
20081216 05袁國芳 紅麴菌基因體計畫及基因研究Monascus2008
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycleRNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycleJennifer Shelton
 
Daily changes in the phosphoproteome of the dinoflagellate Lingulodinium
Daily changes in the phosphoproteome of the dinoflagellate LingulodiniumDaily changes in the phosphoproteome of the dinoflagellate Lingulodinium
Daily changes in the phosphoproteome of the dinoflagellate LingulodiniumCIMMYT
 
Published Article in PPT.pptx
Published Article in PPT.pptxPublished Article in PPT.pptx
Published Article in PPT.pptxCEMB & online
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.Jennifer Shelton
 
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyLAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyChayanika Das
 
2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekingeProf. Wim Van Criekinge
 
The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...
The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...
The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...Ivan Wang
 
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...Mick Watson
 
Next generation-sequencing.ppt-converted
Next generation-sequencing.ppt-convertedNext generation-sequencing.ppt-converted
Next generation-sequencing.ppt-convertedShweta Tiwari
 
JBEI highlights September 2019
JBEI highlights September 2019JBEI highlights September 2019
JBEI highlights September 2019LeahFreemanSloan
 
Cytoscape: Integrating biological networks
Cytoscape: Integrating biological networksCytoscape: Integrating biological networks
Cytoscape: Integrating biological networksBITS
 
recombinant Dna technology.pptx
recombinant Dna technology.pptxrecombinant Dna technology.pptx
recombinant Dna technology.pptxGyanchandSaini1
 
JBEI Highlights May 2015
JBEI Highlights May 2015JBEI Highlights May 2015
JBEI Highlights May 2015Irina Silva
 
Rational Design of Phosphorylation Sites into the Erbin-PDZ Domain
Rational Design of Phosphorylation Sites into the Erbin-PDZ DomainRational Design of Phosphorylation Sites into the Erbin-PDZ Domain
Rational Design of Phosphorylation Sites into the Erbin-PDZ Domaincashi10
 
2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekinge2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekingeProf. Wim Van Criekinge
 

Ähnlich wie Neo4j for Bacterial Genomes (20)

20081216 05袁國芳 紅麴菌基因體計畫及基因研究
20081216 05袁國芳 紅麴菌基因體計畫及基因研究20081216 05袁國芳 紅麴菌基因體計畫及基因研究
20081216 05袁國芳 紅麴菌基因體計畫及基因研究
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycleRNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle
 
Daily changes in the phosphoproteome of the dinoflagellate Lingulodinium
Daily changes in the phosphoproteome of the dinoflagellate LingulodiniumDaily changes in the phosphoproteome of the dinoflagellate Lingulodinium
Daily changes in the phosphoproteome of the dinoflagellate Lingulodinium
 
Published Article in PPT.pptx
Published Article in PPT.pptxPublished Article in PPT.pptx
Published Article in PPT.pptx
 
CcP2APX_Biochem_2008
CcP2APX_Biochem_2008CcP2APX_Biochem_2008
CcP2APX_Biochem_2008
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
 
Presentation.ppt
Presentation.pptPresentation.ppt
Presentation.ppt
 
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary MicrobiologyLAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
 
2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge
 
The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...
The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...
The effect of thioredoxin on the solubility of proteinase inhibitor 2 in an E...
 
Gene translation
Gene translationGene translation
Gene translation
 
CHEM3204_PRAC_Manual_2016
CHEM3204_PRAC_Manual_2016CHEM3204_PRAC_Manual_2016
CHEM3204_PRAC_Manual_2016
 
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
 
Next generation-sequencing.ppt-converted
Next generation-sequencing.ppt-convertedNext generation-sequencing.ppt-converted
Next generation-sequencing.ppt-converted
 
JBEI highlights September 2019
JBEI highlights September 2019JBEI highlights September 2019
JBEI highlights September 2019
 
Cytoscape: Integrating biological networks
Cytoscape: Integrating biological networksCytoscape: Integrating biological networks
Cytoscape: Integrating biological networks
 
recombinant Dna technology.pptx
recombinant Dna technology.pptxrecombinant Dna technology.pptx
recombinant Dna technology.pptx
 
JBEI Highlights May 2015
JBEI Highlights May 2015JBEI Highlights May 2015
JBEI Highlights May 2015
 
Rational Design of Phosphorylation Sites into the Erbin-PDZ Domain
Rational Design of Phosphorylation Sites into the Erbin-PDZ DomainRational Design of Phosphorylation Sites into the Erbin-PDZ Domain
Rational Design of Phosphorylation Sites into the Erbin-PDZ Domain
 
2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekinge2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekinge
 

Mehr von Neo4j

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansNeo4j
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...Neo4j
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosNeo4j
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Neo4j
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Neo4j
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeNeo4j
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsNeo4j
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j
 

Mehr von Neo4j (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 

Kürzlich hochgeladen

What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationJuha-Pekka Tolvanen
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
 
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburgmasabamasaba
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benonimasabamasaba
 
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...chiefasafspells
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareJim McKeeth
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastPapp Krisztián
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxAnnaArtyushina1
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in sowetomasabamasaba
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...masabamasaba
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfonteinmasabamasaba
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 

Kürzlich hochgeladen (20)

What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the Situation
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
 
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptx
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 

Neo4j for Bacterial Genomes

  • 1. Neo4j for Bacterial Genomes Sixing Huang 1
  • 2. Talk Overview • How to do Neo4j bioinformatics on a local computer with public data • Three use cases: ofor genome analyses ofor Carbohydrate-active enzymes ofor antibiotic resistance in CARD • Conclusions 2 Genes Genomes Phenotypes Degrade cellulose Resist cephalosporin Transport Fe2+
  • 3. About me – Sixing Huang • Studied biology and bioinformatics in Bremen. • Worked as a bioinformatic data scientist in DSZM Braunschweig. • Now as a bioinformatic scientist in MGI Shenzhen. • First contact with Neo4j in 2019. • Now use Neo4j for knowledge management, genome browser and database and write about my Neo4j journey in medium.com. 3
  • 4. Genomes have structures • Genes are more than a bag of words. • Neo4j can model genome structures. Gene1 Gene2 Gene3 4
  • 5. Neo4j as a genome browser 5 EMBL file Neo4j genome browser Import
  • 6. Gene CAZy clusters and annotations 6 MATCH p=(f0:Gene) -[:NEXT*5] -> (f1:Gene) -[:NEXT*5] ->(f2:Gene) WHERE f1.name =~ '.+GH16[^a-zA-Zds:]*' Neo4j Commander
  • 7. Bacterial genome data Data in biological studies Genome1 3 4 1 0 1 0 1 3 2 1 3 2 1 Genome2 2 1 3 1 3 0 1 3 2 1 4 1 2 Genes Neo4j can compare core and pan genomes effectively. 7
  • 8. Heavy use of orthology Taxonomy Phylum Class1 Class2 Order1 Order2 Order3 Genome1 Genome2 Genome3 Genome4 KEGG Gene annotation Metabolism Carbohydrate metabolism Glycolysis / Gluconeogenesis alcohol dehydrogenase K00001 Lipid metabolism Fatty acid degradation acyl-CoA dehydrogenase K06445 hexokinase K00844 Neo4j can model orthology intuitively. 8
  • 9. Neo4j for genome analyses 9 Taxonomy Gene annotation
  • 10. Chromobacterium sp. ATCC 53434 10 KO in sisters as filter KO in 53434 - filter Unique KO in 53434
  • 11. Unique KO in ATCC 53434 11 KO Annotation K04783 yersiniabactin salicyl-AMP ligase [EC:6.3.2.-] K04784 yersiniabactin nonribosomal peptide synthetase K12241 pyochelin biosynthesis protein PchG K12242 pyochelin biosynthesis protein PchC K13255 ferric iron reductase protein FhuF K23227 ferric hydroxamate transport system substrate- binding protein K23228 ferric hydroxamate transport system permease protein K10829 ferric hydroxamate transport system ATP-binding protein [EC:7.2.2.16] siderophore siderophore ferric hydroxamate transport Hypothesis: ATCC 53434 has a unique repertoire of iron related transport proteins.
  • 12. Phylogeny made easy 12 Order by numbers of shared KO Compute the shared KO Name Shared KO Chromobacterium vaccinii 1869 Chromobacterium sp. IIBBL 112-1 1867 Chromobacterium rhizoryzae 1777 Chromobacterium haemolyticum 1776 Chromobacterium sp. 257-1 1723
  • 13. Neo4j for Carbohydrate-active EnZYmes: 13 Reannotation of Formosa agariphila KMM 3901 CAZy Annotation PL28 ulvan lyase PL37 ulvan lyase GH28 polygalacturonase GH78 alpha-L-rhamnosidase GH105 unsaturated rhamnogalacturonyl hydrolase GH86 beta-agarase Gh168 endo-alpha-(1,3)-L-fucanase Unique CAZy not in sister genomes Visualization degrade ulvan degrade pectin degrade sulfated polysaccharide unique agarolytic life strategy
  • 14. Node classification: Predict cellulose degradation 14 CV: Training: 0.67; Testing: 0.65 CAZy annotated genomes annotated with cellulose degradability Holdout Embedding
  • 15. Deploy on holdout genomes 15 Predicted cellulose degrader Confirmed Micromonospora carbonacea africana ✓ Micromonospora sp. HM134 X Streptacidiphilus sp. P02-A3a Streptomyces prasinus ATCC 13879 ? Streptomyces sp. CB01881 Streptomyces sp. SirexAA-E ✓ Streptomyces venezuelae ATCC 15439 Streptomyces venezuelae ATCC 21113 Xanthomonas citri pv. citri UI7 ✓ Xenorhabdus budapestensis C72 Kitasatospora sp. MMS16-BH015 CAZy annotated genomes Holdout Prediction Results
  • 17. Antibiotic resistance with CARD 17 Fluoroquinolone resistance Cephalosporin resistance
  • 18. Conclusions • Neo4j can serve as an all-in-one genome browser, a biodata warehouse and a data mining tool. • It can deliver insights more quickly than relational database + SQL • Built-in machine learning can predict new connections and properties • GraphQL serves data for non-Neo4j users 18
  • 19. Acknowledgement • Hanno Teeling (MPI Bremen) • The Neo4j Community • MGI (BGI) 19