Organic farming with special reference to vermiculture
Chunlei Wu BD2K 201601 MyGene.info and MyVariant.info
1. Chunlei Wu, Ph.D.
cwu@scripps.edu
@chunleiwu
Associate Professor of Molecular Medicine
Dept. of Molecular Experimental Medicine
The Scripps Research Institute
La Jolla, CA, USA
01/22/2016
From MyGene.info and MyVariant.info towards BioThings API
2. As a
MyGene.info and MyVariant.info recap
Annotations
Gene
Variant
(Aggregated)
(high-performance)
(real-time) Web Service
3. So many variant annotation resources
dbNSFP
The Exome Aggregation
Consortium (ExAC)
6. Keep data always up-to-date
Each data source is updated individually. Colors
indicate their different updating schedules.
Schematic view of MyVariant.info architecture
8. MyVariant.info for the end users:
http://MyVariant.info
(currently v1 API, two endpoints)
http://MyVariant.info/v1/query?q=<query>
any query term(s)
matching variant hits
http://MyVariant.info/v1/variant/<variantid>
hgvs id(s)
matching variant object(s)
Both supports batch-mode via POST
Simple API. No sign-up. No API key.
Try our live API , and documentations
9. MyGene.info for the end users:
http://MyGene.info
(currently v2 API, two endpoints)
http://MyGene.info/v2/query?q=<query>
any query term(s)
matching gene hits
http://MyGene.info/v2/gene/<geneid>
gene id(s)
matching gene object(s)
Both supports batch-mode via POST
Simple API. No sign-up. No API key.
Try our live API , and documentations
16. MyVariant.info official Python/R Clients
myvariant Python client hosted in PyPI
(initial release in Aug 2015)
myvariant R client hosted in Bioconductor
(initial release in Oct 2015)
27. A Python library for processing JSON-LD data
In [1]: fetch_value_source_for_variant("chr6:g.26093141G>A","http://identifiers.org/dbsnp/")
Out[1]:
['rs1800562 http://schema.myvarint.info/datasource/dbnsfp',
'rs1800562 http://schema.myvarint.info/datasource/clinvar',
'rs1800562 http://schema.myvarint.info/datasource/dbsnp',
'rs1800562 http://schema.myvarint.info/datasource/evs',
'rs1800562 http://schema.myvarint.info/datasource/gwassnps',
'rs1800562 http://schema.myvarint.info/datasource/mutdb']
By Kevin Xin
28. Need to define an API specs
• Output as a JSON object with a defined _id.
• "jsonld=true/false" toggle for the inclusion of JSON-LD
context.
• Support the retrieval of a single entity via GET
(use case: individual data aggregation on the fly)
• Support the retrieval of a list of entities via POST
(use case: routine data aggregation in batches)
• Output should indicate the entity existence:
GET /variant/<unknown_id> 404
POST /variant/ id1, <unknown_id>, id3
[id1: {…},
<unknown_id>: "notfound",
id3: {…}]
to enable data exchange via JSON-LD
31. Data-sharing via Web API is trending
Making a single web service is trivial,
but making a sustainable/scalable
web API is non-trivial.
We would like to help other groups to
create their own hosted web API for
sharing their data.
32. Action item 1: BioThings API whitepaper
Also the action item from last BD2K CA
consortium meeting and the API working
group from last year's NIH BD2K AHM
33. Action item 2: BioThings API framework
NIH commons
Infrastructure as a Service:
Software as a Service:
BioThings API
34. Action item 3: expansion to other "BioThings"
D
Disease
D
Drugs
MyDrug.info MyDisease.info
need an alt. name here
35. Acknowledgement
Funding and Support
U54GM114833
U01HG008473
Washtington U:
Ben Ainscough
Obi Griffith
TSRI:
Andrew Su
Jiwen Xin
Cyrus Afrasiabi
Ginger Tsueng
Adam Mark
Greg Stupp
Tim Putman
STSI:
Eric Topol
Ali Torkamani
Galina Erikson
U. Washington:
Sean Mooney
Moritz Juchler
Nikhil Gopal
OICR:
Robin Haw
UC Berkeley:
Chris Mungall
UCSD:
Trish Whetzel
MyVariant.info MyGene.info
Hinweis der Redaktion
A high-performance query engine for aggregated variant annotations.
A high-performance query engine for aggregated variant annotations.
Annotation data are fundamental
Gene anno: no need a slide to explain, everyone need them
Var anno: relatively new, more and more trending due to the booming of NGS