SlideShare a Scribd company logo
1 of 18
Download to read offline
RDF/SPARQL:
a UniProtKB/Swiss-Prot
practical perspective
Jerven Bolleman
Developer
Swiss-Prot Group
Our Goals
• Provide	core	Bioinformatics	resources	
– UniProtKB/	
– 		
– …	
• Provide	services	and	infrastructure	
– 							Vital-IT	:	HPC	for	the	life-sciences	
– …
Genetic	Variations	and	Diseases	in	
UniProtKB/Swiss-Prot:		
The	Ins	and	Outs	of	Expert	Manual	Curation	
Famiglietti, et al.
We annotate a lot of disease/variants!
http://europepmc.org/abstract/MED/24848695
Why provide a public SPARQL endpoint
• A	10	man	wet	laboratory	can	not	afford:
Why provide a public SPARQL endpoint
• A	10	man	wet	laboratory	can	not	afford:	
– to	host	their	own	database	in	house	holding	
all	or	even	a	bit	of	all	life	science	data.
Why provide a public SPARQL endpoint
• A	10	man	wet	laboratory	can	not	afford:	
– to	host	their	own	database	in	house	holding	
all	or	even	a	bit	of	all	life	science	data.		
– not	to	have	access,	and	use,	existing	life	
science	information.
← Not CPU Time...
But Brain Time
↓
The right kind of optimisation
Why provide a public SPARQL endpoint
• Classical	SQL	can	be	provided	on	the	web	
–Is	not	practical	
–No	federation	
–Poor	standards	conformance	
• Local SQL is expensive
• Local	JSON	is	no	better	
• Nor	is	local	XML
Data Integration Traditional
Pathway.txt
UniProt.txt
Pathway
Parser
UniProt
Parser
Pathway
Schema
UniProt
Schema
Own Lab Data
Data
warehouse
SQL
queries
$
$
$
$
$
$
Data Integration RDF/SPARQL
Pathway.rdf
UniProt.rdf
Own Lab Data
Triple Store
SPARQL
Queries
$
$?
Why not some other graph database?
Ecosystem
RDF enables sharing and reuse of data at low cost
Identity Precision Standards
Why provide a public SPARQL endpoint
• Document	centric	REST	is	not	enough	
–Swiss-Prot	available	as	REST		
–(over e-mail !!) since 1986
–expasy.ch since 1993
–www.uniprot.org	since	2002	
• Most user use a GUI not a CLI
• developers	build	GUI	on	a	CLI
13
© 2015 SIB
100
10'000
1'000'000
2015-012015-022015-032015-042015-052015-062015-072015-082015-09
queries ask select
construct describe
Queries per month in 2015
peak: 4 million per month
Real users
Mix between hard analytics and super specific
Estimate somewhere between:
400 - 1200 real humans per month
We know they are real because they take
holidays ;)
Questions?
18
© 2015 SIB
help@uniprot.org

More Related Content

Similar to Why sparql tohu

How to invent strong a framework navneet bhushan
How to invent strong a framework  navneet bhushanHow to invent strong a framework  navneet bhushan
How to invent strong a framework navneet bhushan
Navneet Bhushan
 
E2 life as_a_scientific_database_curator_(sandra_orchard)
E2 life as_a_scientific_database_curator_(sandra_orchard)E2 life as_a_scientific_database_curator_(sandra_orchard)
E2 life as_a_scientific_database_curator_(sandra_orchard)
phdcareers
 
Smb 30012014 martijn wilmer incedo bioscience
Smb 30012014 martijn wilmer   incedo bioscienceSmb 30012014 martijn wilmer   incedo bioscience
Smb 30012014 martijn wilmer incedo bioscience
SMBBV
 
How to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliantHow to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliant
Pedro Príncipe
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
Alex Hardisty
 

Similar to Why sparql tohu (20)

The case for cloud computing in Life Sciences
The case for cloud computing in Life SciencesThe case for cloud computing in Life Sciences
The case for cloud computing in Life Sciences
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
HPC at NIBR
HPC at NIBRHPC at NIBR
HPC at NIBR
 
UKSG Meeting April 4, 2011
UKSG Meeting April 4, 2011UKSG Meeting April 4, 2011
UKSG Meeting April 4, 2011
 
The BHL Infrastructure
The BHL InfrastructureThe BHL Infrastructure
The BHL Infrastructure
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
Towards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experienceTowards Reproducible Science: a few building blocks from my personal experience
Towards Reproducible Science: a few building blocks from my personal experience
 
Open Source Generic Lab Equipement - GaudiLabs
Open Source Generic Lab Equipement - GaudiLabsOpen Source Generic Lab Equipement - GaudiLabs
Open Source Generic Lab Equipement - GaudiLabs
 
April retreat wet lab
April retreat wet labApril retreat wet lab
April retreat wet lab
 
How to invent strong a framework navneet bhushan
How to invent strong a framework  navneet bhushanHow to invent strong a framework  navneet bhushan
How to invent strong a framework navneet bhushan
 
SCOOTER UKOER V Rolfe Oct10
SCOOTER UKOER V Rolfe Oct10SCOOTER UKOER V Rolfe Oct10
SCOOTER UKOER V Rolfe Oct10
 
E2 life as_a_scientific_database_curator_(sandra_orchard)
E2 life as_a_scientific_database_curator_(sandra_orchard)E2 life as_a_scientific_database_curator_(sandra_orchard)
E2 life as_a_scientific_database_curator_(sandra_orchard)
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
When you are given Open Science, what will you do with it?
When you are given Open Science, what will you do with it?When you are given Open Science, what will you do with it?
When you are given Open Science, what will you do with it?
 
Smb 30012014 martijn wilmer incedo bioscience
Smb 30012014 martijn wilmer   incedo bioscienceSmb 30012014 martijn wilmer   incedo bioscience
Smb 30012014 martijn wilmer incedo bioscience
 
How to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliantHow to make your repository OpenAIRE compliant
How to make your repository OpenAIRE compliant
 
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
Constructing bottomup
Constructing bottomupConstructing bottomup
Constructing bottomup
 

More from Jerven Bolleman

More from Jerven Bolleman (6)

Semantic Variation Graphs the case for RDF & SPARQL
Semantic Variation Graphs the case for RDF & SPARQLSemantic Variation Graphs the case for RDF & SPARQL
Semantic Variation Graphs the case for RDF & SPARQL
 
RDF: what and why plus a SPARQL tutorial
RDF: what and why plus a SPARQL tutorialRDF: what and why plus a SPARQL tutorial
RDF: what and why plus a SPARQL tutorial
 
The UniProt SPARQL endpoint: 20 billion quads in production
The UniProt SPARQL endpoint: 20 billion quads in productionThe UniProt SPARQL endpoint: 20 billion quads in production
The UniProt SPARQL endpoint: 20 billion quads in production
 
Biohackathon2013: Tripling Bioinformatics Productivity
Biohackathon2013: Tripling Bioinformatics ProductivityBiohackathon2013: Tripling Bioinformatics Productivity
Biohackathon2013: Tripling Bioinformatics Productivity
 
Learning sparql 2012 12
Learning sparql 2012 12Learning sparql 2012 12
Learning sparql 2012 12
 
Uni protsparqlcloud
Uni protsparqlcloudUni protsparqlcloud
Uni protsparqlcloud
 

Recently uploaded

POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
Silpa
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 

Recently uploaded (20)

Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 

Why sparql tohu