SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Mining biological knowledge networks for
gene-phenotype discovery
Keywan Hassani-Pak
EBI course: Introduction to Omics data integration
March 2017
http://knetminer.rothamsted.ac.uk/
@KnetMiner
• Rothamsted is the longest running
agricultural research station in the world,
providing cutting-edge science and
innovation for more than 170 years.
• Over 450 staff
• Bioinformatics group
• Bioinformatics Analysts
• Software Developers
About us…
Agenda for today
Kevin Dialdestoro
Stephanie Brunet
Part I Part II
Keywan Hassani-Pak
Ajit Singh
Monika Mistry
• To understand why linking genotype to phenotype is complex
• To learn which information types are useful for candidate gene prioritization
• To understand the concept of knowledge networks/graphs
• To use KnetMiner for the interpretation of your RNA-seq, QTL, GWAS results
• To learn a little bit about neurodegenerative diseases
Learning Objectives for Part I
The Genotype to Phenotype Challenge
Genotype
QTL and GWAS
Omics
Includes any ‘omics
Phenotype
Disease
Intelligence
Flowering
Stress tolerance
Biological Knowledge Discovery
Data selection, processing, transformation, integration and
interpretation
The approach is generic and works similarly for other species
• Free and open source
• Data warehousing using a graph-
database
• Platform to integrate public and private
datasets in various formats
• Provides a GUI, CLI, APIs and workflows
for reproducible data integration
Ondex – Data Integration Platform
Ondex
www.ondex.org Not covered in this training course!
Let’s start with some GWAS data…
http://plants.ensembl.org/biomart
Example Arabidopsis
#SNP=66,816 | #Gene=27,502 | #Phenotype=107
… transform into a network
(SNP)
(Phenotype)
associated
Biological interaction datasets
http://thebiogrid.org
(SNP)
(Phenotype)
associated
… add biological interactions
• Gene-GO
• Gene-Phenotype
Gene knock-out or overexpression
Text mining publications
• Gene-Publication
• Gene-Pathway
• Gene-Expression
• Protein-Small Molecule
• Homology to other species
… add other open linked data
>800,000 nodes
>3,000,000 edges
Genome-scale knowledge network
• Progressive loss of structure or function of
neurons, including death of neurons
• Many types including Alzheimer, Parkinson,
Huntington…
• Many similarities between these diseases on
a sub-cellular level
• Discovering these similarities offers hope for
therapeutic advances that could ameliorate
many diseases simultaneously
Neurodegenerative Diseases
• Use OMIM advanced search
Query: alzheimer parkinson huntington
Tick: Search in Title
Tick: MIM Number Prefix: “# phenotype”
• Download results as Tab-delimited file
• Copy MIM ids without the prefix “#”
• Use UniProt Retrieve/ID mapping
Provide your MIM identifiers
Select option: From MIM to UniProtKB
Press Go and download all proteins in
XML format (compressed)
Tutorial data – based on 33 human genes
Integration of public datasets
Public Databases
Quantitative data
Interaction data
Omics Data
Datasets and workflows: https://github.com/Rothamsted/ondex-knet-builder
Relationships in Biological Knowledge Networks
Genes Homology AnnotationsGenetics Interactions Phenotype
• Methods needed to evaluate millions of
relationships in knowledge network, prioritize
genes and extract relevant subnetworks
• Interactive and exploratory tools needed to
enable knowledge discovery and decision
making
• Interpretation should be the task of domain
experts i.e. biologists!
How to search and interpret too much information?
Web Browser
Server
Servlets and JSP Page
Java Socket
Knowledge
Graph DBOndex API
DHTML
JavaScript
Apache Tomcat
Multithreaded
Java Server
HTML, JSON, XML and images
over HTTP via Ajax
Views
Java Socket
KnetMiner System OverviewClient
KnetMiner UI Overview
Search
Gene View
Map View
Evidence View
Network View
1
2
3
http://knetminer.rothamsted.ac.uk/HumanDisease
KnetMiner search interface (1)
Ontology-based
term suggestions
Concept Types
e.g. GO, Trait
OR, NOT, Replace
KnetMiner search interface (2)
User provided
QTL region
Supports gene
IDs and names
Example 1
1. Search terms: "cell death" OR apoptosis
2. Open Query Suggestor
3. Click on cell death tab
4. Replace with neuron death
5. Does it change the number of
documents and genes that can be found?
Exercise 1 – Search Interface
Video 1
Gene View - Ranked genes and evidence summaries
1. Uses TF*IDF to rank documents by their relevance to a search term
2. Uses the properties of gene-evidence networks such as
 the specificity of documents to a gene
 the frequency of evidence concepts
3. Calculates Knet-Score for every gene
Smart pre-indexing of the knowledge network makes the computation of
the score very fast
Gene Ranking
Network View – Interactive network visualization
• Enlarge
• Show all
• Re-layout
• Info Box
Add hidden nodes
and edges
Example 2
Exercise 2: Gene View  Network
Search terms: Alzheimer OR Parkinson OR Heparin OR "cell death“
Gene List: APP, MAPT, PRNP
1. Click on the APP gene which loads the Network View
2. Open the Info Box
3. Click on different Concept and Relation types
4. Check their attributes and click on links to external databases
5. Explore all direct and indirect paths from APP to Alzheimer and
Parkinson
6. Hide Publication concepts
7. Show all drugs that can target the APP interaction network
8. Go back to Gene View and select Known targets
9. Click View Network and find out if APP, MAPT and PRNP
interact, are differentially expressed and have GWAS data
Video 2
Example 3
Exercise 3: Evidence View  Network
Search terms: Alzheimer "cell death"
1. Go to Evidence View
2. Sort table by column GENES
3. Find GO concept downregulation of neuron death
4. Find GO concept upregulation of apoptosis
5. Click on number of genes linked to these terms
6. In Network View, show labels for Gene and GO concepts
7. What’s the evidence linking genes to selected GO terms?
Video 3
Map View – Interactive map of chromosome, gene, SNP and QTL data
• Show
network
• Enlarge
• Reset
• Settings
GWAS studies
Example 4
Exercise 4: Map View  Network
Search terms: Parkinson "cell death“
QTL: Chromosome 12 :: 35000000 - 44000000
1. Go to Map View
2. Toggle Full Screen
3. Zoom into Chromosome 12 and find you QTL
4. Find one or several genes that are in close proximity to
GWAS SNPs
5. Select one ore more genes, e.g. LRRK2, PRNP and EIF4G1
6. Launch Network View
7. Study the network and how the genes are connected
Video 4
• Web application for very fast search of
large genome-scale knowledge graphs
• Ranking of candidate genes based on
knowledge mining
• Interactive visualisation of genome
and knowledge maps
• Facilitates hypothesis validation and
generation
KnetMiner – Making Gene Discovery Efficient & Fun
http://knetminer.rothamsted.ac.uk/
• You like KnetMiner but you might be asking…
What if I’m interested in a different disease?
What if I’m interested in a different species?
What if I want to integrate my own private data?
What if I don’t have a server to run KnetMiner?
• As part of a Innovate UK project we are working with Genestack to address these
qestions by integrating KnetMiner tools into the Genestack Bioinformatics Platform.
• Next: We will teach you how to use Genestack to build your own networks and deploy
your own KnetMiner application
Objectives for Part II
Acknowledgements
John Doonan
Sergio Feingold
Martin Castellote
Uwe Scholz
Matthias Lange
Andy Law
Keywan Hassani-Pak
Ajit Singh
Marco Brandizi
Monika Mistry
Lisa Lill
Chris Rawlings
Dave Edwards
Philipp Bayer
Misha Kapushesky
Kevin Dialdestoro
@KnetMiner

Weitere ähnliche Inhalte

Was ist angesagt?

Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
Shruthi Choudary
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
biinoida
 
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Fokhruz Zaman
 

Was ist angesagt? (20)

Careers in bioinformatics
Careers in bioinformaticsCareers in bioinformatics
Careers in bioinformatics
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of NatureBioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics - Discovering the Bio Logic Of Nature
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
 
Bioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and AdvantagesBioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and Advantages
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
Bioinformatics ppt
Bioinformatics pptBioinformatics ppt
Bioinformatics ppt
 

Andere mochten auch

Ruby HTTP clients comparison
Ruby HTTP clients comparisonRuby HTTP clients comparison
Ruby HTTP clients comparison
Hiroshi Nakamura
 

Andere mochten auch (19)

ScholarChip intro
ScholarChip introScholarChip intro
ScholarChip intro
 
Human toxicity, environmental impact and legal implications of water fluorida...
Human toxicity, environmental impact and legal implications of water fluorida...Human toxicity, environmental impact and legal implications of water fluorida...
Human toxicity, environmental impact and legal implications of water fluorida...
 
These words I share, written from despair, read them, speak them, but do so w...
These words I share, written from despair, read them, speak them, but do so w...These words I share, written from despair, read them, speak them, but do so w...
These words I share, written from despair, read them, speak them, but do so w...
 
Isu isu trenda terkini dalam teknologi pendidikan
Isu isu trenda terkini dalam teknologi pendidikanIsu isu trenda terkini dalam teknologi pendidikan
Isu isu trenda terkini dalam teknologi pendidikan
 
正規言語について
正規言語について正規言語について
正規言語について
 
C# & AWS Lambda
C# & AWS LambdaC# & AWS Lambda
C# & AWS Lambda
 
Hazop gijutsushikai chubu koukuukai
Hazop gijutsushikai chubu koukuukai Hazop gijutsushikai chubu koukuukai
Hazop gijutsushikai chubu koukuukai
 
The New Framework for Information Literacy for Higher Education
The New Framework for Information Literacy for Higher EducationThe New Framework for Information Literacy for Higher Education
The New Framework for Information Literacy for Higher Education
 
Ruby HTTP clients comparison
Ruby HTTP clients comparisonRuby HTTP clients comparison
Ruby HTTP clients comparison
 
Yapc Asia 2009 ペパボでのPerlの使い方
Yapc Asia 2009 ペパボでのPerlの使い方Yapc Asia 2009 ペパボでのPerlの使い方
Yapc Asia 2009 ペパボでのPerlの使い方
 
GBM Group Based Marketing: Marketing to Groups
GBM Group Based Marketing: Marketing to GroupsGBM Group Based Marketing: Marketing to Groups
GBM Group Based Marketing: Marketing to Groups
 
How to Kill a Word
How to Kill a WordHow to Kill a Word
How to Kill a Word
 
Ui qa tools
Ui qa toolsUi qa tools
Ui qa tools
 
好みや多数決で決めない、デザインとの正しい付き合い方
好みや多数決で決めない、デザインとの正しい付き合い方好みや多数決で決めない、デザインとの正しい付き合い方
好みや多数決で決めない、デザインとの正しい付き合い方
 
Corso storytelling a Gemona
Corso storytelling a GemonaCorso storytelling a Gemona
Corso storytelling a Gemona
 
Bundesliga Report - 10 years of academies - Talent pools of top-level German ...
Bundesliga Report - 10 years of academies - Talent pools of top-level German ...Bundesliga Report - 10 years of academies - Talent pools of top-level German ...
Bundesliga Report - 10 years of academies - Talent pools of top-level German ...
 
Escaneado 09 03-2017 10.02
Escaneado 09 03-2017 10.02Escaneado 09 03-2017 10.02
Escaneado 09 03-2017 10.02
 
ブレンダーをDisってみる
ブレンダーをDisってみるブレンダーをDisってみる
ブレンダーをDisってみる
 
4 questions to help you secure ePHI today
4 questions to help you secure ePHI today4 questions to help you secure ePHI today
4 questions to help you secure ePHI today
 

Ähnlich wie KnetMiner - EBI Workshop 2017

bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
nadeem akhter
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Adam Ford
 
Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3
Jillian Aurisano
 
DNA Sequence Data in Big Data Perspective
DNA Sequence Data in Big Data PerspectiveDNA Sequence Data in Big Data Perspective
DNA Sequence Data in Big Data Perspective
Palaniappan SP
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Amit Sheth
 

Ähnlich wie KnetMiner - EBI Workshop 2017 (20)

KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017
 
Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Friend NRNB 2012-12-13
Friend NRNB 2012-12-13Friend NRNB 2012-12-13
Friend NRNB 2012-12-13
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Discovery on Target 2014 - The Industry's Preeminent Event on Novel Drug Targets
Discovery on Target 2014 - The Industry's Preeminent Event on Novel Drug TargetsDiscovery on Target 2014 - The Industry's Preeminent Event on Novel Drug Targets
Discovery on Target 2014 - The Industry's Preeminent Event on Novel Drug Targets
 
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical InformaticsSemantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical Informatics
 
Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3Jillian ms defense-4-14-14-ja-novid3
Jillian ms defense-4-14-14-ja-novid3
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
Data sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK StoryData sharing - Data management - The SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK Story
 
Data management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK Story
 
DNA Sequence Data in Big Data Perspective
DNA Sequence Data in Big Data PerspectiveDNA Sequence Data in Big Data Perspective
DNA Sequence Data in Big Data Perspective
 
Biocuration 2013 - Fiona Brinkman - From genes, to genomes to networks, with ...
Biocuration 2013 - Fiona Brinkman - From genes, to genomes to networks, with ...Biocuration 2013 - Fiona Brinkman - From genes, to genomes to networks, with ...
Biocuration 2013 - Fiona Brinkman - From genes, to genomes to networks, with ...
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
 

Kürzlich hochgeladen

AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 

Kürzlich hochgeladen (20)

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 

KnetMiner - EBI Workshop 2017

  • 1. Mining biological knowledge networks for gene-phenotype discovery Keywan Hassani-Pak EBI course: Introduction to Omics data integration March 2017 http://knetminer.rothamsted.ac.uk/ @KnetMiner
  • 2. • Rothamsted is the longest running agricultural research station in the world, providing cutting-edge science and innovation for more than 170 years. • Over 450 staff • Bioinformatics group • Bioinformatics Analysts • Software Developers About us…
  • 3. Agenda for today Kevin Dialdestoro Stephanie Brunet Part I Part II Keywan Hassani-Pak Ajit Singh Monika Mistry
  • 4. • To understand why linking genotype to phenotype is complex • To learn which information types are useful for candidate gene prioritization • To understand the concept of knowledge networks/graphs • To use KnetMiner for the interpretation of your RNA-seq, QTL, GWAS results • To learn a little bit about neurodegenerative diseases Learning Objectives for Part I
  • 5. The Genotype to Phenotype Challenge Genotype QTL and GWAS Omics Includes any ‘omics Phenotype Disease Intelligence Flowering Stress tolerance Biological Knowledge Discovery Data selection, processing, transformation, integration and interpretation
  • 6. The approach is generic and works similarly for other species
  • 7. • Free and open source • Data warehousing using a graph- database • Platform to integrate public and private datasets in various formats • Provides a GUI, CLI, APIs and workflows for reproducible data integration Ondex – Data Integration Platform Ondex www.ondex.org Not covered in this training course!
  • 8. Let’s start with some GWAS data… http://plants.ensembl.org/biomart Example Arabidopsis #SNP=66,816 | #Gene=27,502 | #Phenotype=107
  • 9. … transform into a network (SNP) (Phenotype) associated
  • 12. • Gene-GO • Gene-Phenotype Gene knock-out or overexpression Text mining publications • Gene-Publication • Gene-Pathway • Gene-Expression • Protein-Small Molecule • Homology to other species … add other open linked data >800,000 nodes >3,000,000 edges Genome-scale knowledge network
  • 13. • Progressive loss of structure or function of neurons, including death of neurons • Many types including Alzheimer, Parkinson, Huntington… • Many similarities between these diseases on a sub-cellular level • Discovering these similarities offers hope for therapeutic advances that could ameliorate many diseases simultaneously Neurodegenerative Diseases
  • 14. • Use OMIM advanced search Query: alzheimer parkinson huntington Tick: Search in Title Tick: MIM Number Prefix: “# phenotype” • Download results as Tab-delimited file • Copy MIM ids without the prefix “#” • Use UniProt Retrieve/ID mapping Provide your MIM identifiers Select option: From MIM to UniProtKB Press Go and download all proteins in XML format (compressed) Tutorial data – based on 33 human genes
  • 15. Integration of public datasets Public Databases Quantitative data Interaction data Omics Data Datasets and workflows: https://github.com/Rothamsted/ondex-knet-builder
  • 16. Relationships in Biological Knowledge Networks Genes Homology AnnotationsGenetics Interactions Phenotype
  • 17. • Methods needed to evaluate millions of relationships in knowledge network, prioritize genes and extract relevant subnetworks • Interactive and exploratory tools needed to enable knowledge discovery and decision making • Interpretation should be the task of domain experts i.e. biologists! How to search and interpret too much information?
  • 18. Web Browser Server Servlets and JSP Page Java Socket Knowledge Graph DBOndex API DHTML JavaScript Apache Tomcat Multithreaded Java Server HTML, JSON, XML and images over HTTP via Ajax Views Java Socket KnetMiner System OverviewClient
  • 19. KnetMiner UI Overview Search Gene View Map View Evidence View Network View 1 2 3
  • 21. KnetMiner search interface (1) Ontology-based term suggestions Concept Types e.g. GO, Trait OR, NOT, Replace
  • 22. KnetMiner search interface (2) User provided QTL region Supports gene IDs and names
  • 23. Example 1 1. Search terms: "cell death" OR apoptosis 2. Open Query Suggestor 3. Click on cell death tab 4. Replace with neuron death 5. Does it change the number of documents and genes that can be found? Exercise 1 – Search Interface Video 1
  • 24. Gene View - Ranked genes and evidence summaries
  • 25. 1. Uses TF*IDF to rank documents by their relevance to a search term 2. Uses the properties of gene-evidence networks such as  the specificity of documents to a gene  the frequency of evidence concepts 3. Calculates Knet-Score for every gene Smart pre-indexing of the knowledge network makes the computation of the score very fast Gene Ranking
  • 26. Network View – Interactive network visualization • Enlarge • Show all • Re-layout • Info Box Add hidden nodes and edges
  • 27. Example 2 Exercise 2: Gene View  Network Search terms: Alzheimer OR Parkinson OR Heparin OR "cell death“ Gene List: APP, MAPT, PRNP 1. Click on the APP gene which loads the Network View 2. Open the Info Box 3. Click on different Concept and Relation types 4. Check their attributes and click on links to external databases 5. Explore all direct and indirect paths from APP to Alzheimer and Parkinson 6. Hide Publication concepts 7. Show all drugs that can target the APP interaction network 8. Go back to Gene View and select Known targets 9. Click View Network and find out if APP, MAPT and PRNP interact, are differentially expressed and have GWAS data Video 2
  • 28. Example 3 Exercise 3: Evidence View  Network Search terms: Alzheimer "cell death" 1. Go to Evidence View 2. Sort table by column GENES 3. Find GO concept downregulation of neuron death 4. Find GO concept upregulation of apoptosis 5. Click on number of genes linked to these terms 6. In Network View, show labels for Gene and GO concepts 7. What’s the evidence linking genes to selected GO terms? Video 3
  • 29. Map View – Interactive map of chromosome, gene, SNP and QTL data • Show network • Enlarge • Reset • Settings GWAS studies
  • 30. Example 4 Exercise 4: Map View  Network Search terms: Parkinson "cell death“ QTL: Chromosome 12 :: 35000000 - 44000000 1. Go to Map View 2. Toggle Full Screen 3. Zoom into Chromosome 12 and find you QTL 4. Find one or several genes that are in close proximity to GWAS SNPs 5. Select one ore more genes, e.g. LRRK2, PRNP and EIF4G1 6. Launch Network View 7. Study the network and how the genes are connected Video 4
  • 31. • Web application for very fast search of large genome-scale knowledge graphs • Ranking of candidate genes based on knowledge mining • Interactive visualisation of genome and knowledge maps • Facilitates hypothesis validation and generation KnetMiner – Making Gene Discovery Efficient & Fun http://knetminer.rothamsted.ac.uk/
  • 32. • You like KnetMiner but you might be asking… What if I’m interested in a different disease? What if I’m interested in a different species? What if I want to integrate my own private data? What if I don’t have a server to run KnetMiner? • As part of a Innovate UK project we are working with Genestack to address these qestions by integrating KnetMiner tools into the Genestack Bioinformatics Platform. • Next: We will teach you how to use Genestack to build your own networks and deploy your own KnetMiner application Objectives for Part II
  • 33. Acknowledgements John Doonan Sergio Feingold Martin Castellote Uwe Scholz Matthias Lange Andy Law Keywan Hassani-Pak Ajit Singh Marco Brandizi Monika Mistry Lisa Lill Chris Rawlings Dave Edwards Philipp Bayer Misha Kapushesky Kevin Dialdestoro @KnetMiner

Hinweis der Redaktion

  1. Many phenotypes are complex, polygenic and the result of complex interactions on cellular level Linking genotype and phenotype is one of the greatest challenges in biology
  2. SNP-Phenotype relations (122,919 relations) of significant SNPs (as defined by Ensembl, p-value<0.05?) linked to 107 phenotypes; on average 1,150 SNPs per phenotype. SNP-Gene relations are based on genes in close proximity to SNPs <1000bp (96,047 relations) How to integrate GWAS and biological interaction data
  3. Using Ondex
  4. http://www.sciencedirect.com/science/article/pii/S2212066116300308
  5. Highlight text-mining Add gene expression Mention xrefs and that they cn be collapsed
  6. Scale…