SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Downloaden Sie, um offline zu lesen
MyGene.Info:
   Gene Annotation as a Service - GAaaS


              Chunlei Wu
               cwu@scripps.edu
                 2011/07/16
A migration story for BioGPS
         http://biogps.org
A migration story for BioGPS
         http://biogps.org
A migration story for BioGPS
         http://biogps.org
Gene-centric annotation data
                   http://biogps.org

A simple view:

   Gene 1017 →Symbol:    CDK2
            →Ensembl:    ENSG00000123374
            →RefSeq:     NM_001798
                         NM_052827
            →Reporter:   →U95A:            1792_g_at
                                           1833_at
                         →U133A:           211804_s_at
                                           204252_at
                                           211803_at
A real example:
                  Symbol
                  Name
                  Alias
                  Summary
                  Ensembl
                  Refseq
                  UniGene
                  Homologene
                  GO
                  UniProt
                  InterPro
                  PDB
                  Prosite
                  IPI

                  And many more…
Relational database solutions
         Solution 1: “star” schema

                                                                     Ensembl Table
                                                           GeneID        EnsemblID

         Reporter Table                                    1017          ENSG00000250560

GeneID     Platform   Reporter
1017       U95A       1792_s_at             Master Table

1017       U95A       1833_at        GeneID      Symbol

1017       U133A      211804_s_at    1017        CDK2

1017       U133A      204252_at                                   Refseq Table
1017       U133A      211803_at                            GeneID      RefseqID
                                                           1017        NM_001798
                                                           1017        NM_052827
Relational database solutions
Solution 2: “weakly-typed” schema
                      Generic Data Table

   ID   Type         Value                 Parent   Root
   1    GeneID       1017                  NULL     1017
   2    Symbol       CDK2                  1017     1017
   3    Ensembl      ENSG00000123374       1017     1017
   4    RefSeq       NM_001798             1017     1017
   5    RefSeq       NM_052827             1017     1017
   6    Platform     U95A                  1017     1017
   7    Platform     U133A                 1017     1017
   8    Reporter     1792_g_at             U95A     1017
   9    Reporter     1833_at               U95A     1017
   10   Reporter     211804_s_at           U133A    1017
   11   Reporter     204252_at             U133A    1017
   12   Reporter     211803_at             U133A    1017
“Document”-based database solution
                      CDK2

    1017:   {
                “Symbol”: “CDK2”,
                “Ensembl”: “ENSG00000123374”,
                “RefSeq”:    [
                       “NM_001798”,
                       “NM_052827”
                ],
                “Reporter”: {
                    “U95A”: [
                        “1792_g_at”,
                        “1833_at”
                    ],
                    “U133A”:[
                        “211804_s_at”,
                        “2045252_at”,
                        “211803_at”
                    ]
                 }
            }
What’s CouchDB



 Document-based (“schema-free”) database
 Index and query data in MapReduce fashion
  using Javascript
 RESTful JSON API
 Bi-directional replicator
 Distributed
Load data into “CouchDB”
                                   
                                           NCBI “gene_info” file

                                                        Create bare-bone
                                                        document for each gene


    Appending more annotation to
    “document”                           Gene “document”
          “gene2refseq”

          “gene2ensembl”

          “u95a_annot”
          “u133a_annot”

          And more …




Easy to add append data type           Easy to update incrementally
What’s behind Mygene.info
Gene Annotation as a Service

          http://MyGene.Info

   Gene annotation services go PUBLIC



   Gene query service
       http://mygene.info/query?q=<query>

   Gene annotation service
       http://mygene.info/gene/<geneid>
Gene Query Service
   user query  matching gene IDs/symbols/names
                (JSON output)


  http://mygene.info/query?q=<query>
Examples:
   http://mygene.info/query?q=cdk2
   http://mygene.info/query?q=cdk2+AND+species:human
   http://mygene.info/query?q=cdk?
   http://mygene.info/query?q=p*
   http://mygene.info/query?q=entrezgene:1017
   http://mygene.info/query?q=ensemblgene:ENSG00000123374
Gene Annotation Service
      gene id  full or filtered gene annotation object
                (JSON output)


     http://mygene.info/gene/<geneid>
Examples:
      http://mygene.info/gene/1017
      http://mygene.info/gene/ENSG00000123374
      http://mygene.info/gene/1017?filter=name,symbol,summary
      http://mygene.info/gene/1017?filter=name,symbol,refseq.rna

Species supported:
       human
       mouse
       rat
       fruitfly
       nematode
       zebrafish
       thale cress
       frog
Targeted use case:
    Quickly build a gene-centric online resource without the need of maintaining
    a local gene annotation database

Use it in a web application:
    Server side
            Making direct HTTP calls
    Client side
             Setup a server-side proxy
            JSONP calls
            Cross-domain AJAX calls via CORS (Cross-Origin Resource Sharing)



          Demo and full documentation at
             http://mygene.info
          Source code: https://bitbucket.org/newgene/genedoc/src
Acknowledgement

Group members:          GNF collaborators:

Andrew Su               Camilo Orozco
                        Jon Huss
Ian MacLeod
Benjamin Good           Past contributor:
Eric Clarke
                        Marc Leglise         ISMB travel support




                  Funding and Support



                  (NIH grant: R01GM083924)

                 http://mygene.info

Weitere ähnliche Inhalte

Andere mochten auch

Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
Dayananda Salam
 
A Comparison of NGS Platforms.
A Comparison of NGS Platforms.A Comparison of NGS Platforms.
A Comparison of NGS Platforms.
mkim8
 

Andere mochten auch (9)

2015 functional genomics variant annotation and interpretation- tools and p...
2015 functional genomics   variant annotation and interpretation- tools and p...2015 functional genomics   variant annotation and interpretation- tools and p...
2015 functional genomics variant annotation and interpretation- tools and p...
 
New Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overviewNew Generation Sequencing Technologies: an overview
New Generation Sequencing Technologies: an overview
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Next Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewNext Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology Overview
 
NGS technologies - platforms and applications
NGS technologies - platforms and applicationsNGS technologies - platforms and applications
NGS technologies - platforms and applications
 
Ngs ppt
Ngs pptNgs ppt
Ngs ppt
 
A Comparison of NGS Platforms.
A Comparison of NGS Platforms.A Comparison of NGS Platforms.
A Comparison of NGS Platforms.
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 
NGS - Basic principles and sequencing platforms
NGS - Basic principles and sequencing platformsNGS - Basic principles and sequencing platforms
NGS - Basic principles and sequencing platforms
 

Ähnlich wie F01-Cloud-Mygene.info

Swertz bosc2010 molgenis
Swertz bosc2010 molgenisSwertz bosc2010 molgenis
Swertz bosc2010 molgenis
BOSC 2010
 
Insights extraction through knowledge graph of medical booking data
Insights extraction through knowledge graph of medical booking dataInsights extraction through knowledge graph of medical booking data
Insights extraction through knowledge graph of medical booking data
Vaticle
 
E-Utilities
E-UtilitiesE-Utilities
E-Utilities
mkim8
 
2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata
Benjamin Good
 
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Martin Krallinger
 
20120220 Tri-Con Cloud Computing Symposium
20120220 Tri-Con Cloud Computing Symposium20120220 Tri-Con Cloud Computing Symposium
20120220 Tri-Con Cloud Computing Symposium
Andrew Su
 

Ähnlich wie F01-Cloud-Mygene.info (20)

Swertz bosc2010 molgenis
Swertz bosc2010 molgenisSwertz bosc2010 molgenis
Swertz bosc2010 molgenis
 
Insights extraction through knowledge graph of medical booking data
Insights extraction through knowledge graph of medical booking dataInsights extraction through knowledge graph of medical booking data
Insights extraction through knowledge graph of medical booking data
 
[Hydro]geological analysis using open source app: case Cikapundung River
[Hydro]geological analysis using open source app: case Cikapundung River[Hydro]geological analysis using open source app: case Cikapundung River
[Hydro]geological analysis using open source app: case Cikapundung River
 
The Power of Graphs to Analyze Biological Data - Davy Suvee @ GraphConnect Lo...
The Power of Graphs to Analyze Biological Data - Davy Suvee @ GraphConnect Lo...The Power of Graphs to Analyze Biological Data - Davy Suvee @ GraphConnect Lo...
The Power of Graphs to Analyze Biological Data - Davy Suvee @ GraphConnect Lo...
 
The Power of Graphs to Analyze Biological Data
The Power of Graphs to Analyze Biological DataThe Power of Graphs to Analyze Biological Data
The Power of Graphs to Analyze Biological Data
 
E-Utilities
E-UtilitiesE-Utilities
E-Utilities
 
NCBI API - Integration into analysis code
NCBI API - Integration into analysis codeNCBI API - Integration into analysis code
NCBI API - Integration into analysis code
 
Improving Stewardship of Scientific Data Through Use of a Maturity Matrix
Improving Stewardship of Scientific Data Through Use of a Maturity MatrixImproving Stewardship of Scientific Data Through Use of a Maturity Matrix
Improving Stewardship of Scientific Data Through Use of a Maturity Matrix
 
Data Driven Innovation - Interoperable Genebanks (Tech Track Session)
Data Driven Innovation - Interoperable Genebanks (Tech Track Session)Data Driven Innovation - Interoperable Genebanks (Tech Track Session)
Data Driven Innovation - Interoperable Genebanks (Tech Track Session)
 
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
EpiMOLAS: An Intuitive Web-based Framework for Genome-Wide DNA Methylation An...
 
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...Learning Biologically Relevant Features Using Convolutional Neural Networks f...
Learning Biologically Relevant Features Using Convolutional Neural Networks f...
 
Super-Encryption Cryptography with IDEA and WAKE Algorithm
Super-Encryption Cryptography with IDEA and WAKE AlgorithmSuper-Encryption Cryptography with IDEA and WAKE Algorithm
Super-Encryption Cryptography with IDEA and WAKE Algorithm
 
2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata2016 bd2k bgood_wikidata
2016 bd2k bgood_wikidata
 
ICWE2017 BigDataEurope
ICWE2017 BigDataEuropeICWE2017 BigDataEurope
ICWE2017 BigDataEurope
 
A Genome Sequence Analysis System Built with Hypertable
A Genome Sequence Analysis System Built with HypertableA Genome Sequence Analysis System Built with Hypertable
A Genome Sequence Analysis System Built with Hypertable
 
2016 03 25_group_meeting MyVariant.info
2016 03 25_group_meeting MyVariant.info2016 03 25_group_meeting MyVariant.info
2016 03 25_group_meeting MyVariant.info
 
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
 
20120220 Tri-Con Cloud Computing Symposium
20120220 Tri-Con Cloud Computing Symposium20120220 Tri-Con Cloud Computing Symposium
20120220 Tri-Con Cloud Computing Symposium
 
Chunlei Wu BD2K 201601 MyGene.info and MyVariant.info
Chunlei Wu BD2K 201601 MyGene.info and MyVariant.infoChunlei Wu BD2K 201601 MyGene.info and MyVariant.info
Chunlei Wu BD2K 201601 MyGene.info and MyVariant.info
 
Bioinformatica t2-databases
Bioinformatica t2-databasesBioinformatica t2-databases
Bioinformatica t2-databases
 

Mehr von Bioinformatics Open Source Conference

Mehr von Bioinformatics Open Source Conference (20)

Running workflows through galaxy bosc presentation
Running workflows through galaxy bosc presentationRunning workflows through galaxy bosc presentation
Running workflows through galaxy bosc presentation
 
Talk1 ben sadi for_gmod_bosc_2011
Talk1 ben sadi for_gmod_bosc_2011Talk1 ben sadi for_gmod_bosc_2011
Talk1 ben sadi for_gmod_bosc_2011
 
Bosc mercer
Bosc mercerBosc mercer
Bosc mercer
 
Mobyle 1 0_new_features_new_types_of_service
Mobyle 1 0_new_features_new_types_of_serviceMobyle 1 0_new_features_new_types_of_service
Mobyle 1 0_new_features_new_types_of_service
 
Bosc2011 arakawa
Bosc2011 arakawaBosc2011 arakawa
Bosc2011 arakawa
 
Bosc2011 isobar-fbp
Bosc2011 isobar-fbpBosc2011 isobar-fbp
Bosc2011 isobar-fbp
 
Talk6 biopython bosc2011
Talk6 biopython bosc2011Talk6 biopython bosc2011
Talk6 biopython bosc2011
 
Unipro ugene bosc 2011 update
Unipro ugene bosc 2011 updateUnipro ugene bosc 2011 update
Unipro ugene bosc 2011 update
 
Bosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-fullBosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-full
 
Bosc talk 7-15-2011x
Bosc talk 7-15-2011xBosc talk 7-15-2011x
Bosc talk 7-15-2011x
 
F02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinuxF02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinux
 
B07-GenomeContent-Biomart
B07-GenomeContent-BiomartB07-GenomeContent-Biomart
B07-GenomeContent-Biomart
 
B03-GenomeContent-Intermine
B03-GenomeContent-IntermineB03-GenomeContent-Intermine
B03-GenomeContent-Intermine
 
G03-SemanticWeb-OntoCAT
G03-SemanticWeb-OntoCATG03-SemanticWeb-OntoCAT
G03-SemanticWeb-OntoCAT
 
F06-Cloud-Enabling NGS
F06-Cloud-Enabling NGSF06-Cloud-Enabling NGS
F06-Cloud-Enabling NGS
 
D03-NextGen-Bio-NGS
D03-NextGen-Bio-NGSD03-NextGen-Bio-NGS
D03-NextGen-Bio-NGS
 
F07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAMF07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAM
 
C03-Visualization-Webapollo
C03-Visualization-WebapolloC03-Visualization-Webapollo
C03-Visualization-Webapollo
 
A01-Openness in knowledge-based systems
A01-Openness in knowledge-based systemsA01-Openness in knowledge-based systems
A01-Openness in knowledge-based systems
 
F03-Cloud-Obiwee
F03-Cloud-ObiweeF03-Cloud-Obiwee
F03-Cloud-Obiwee
 

Kürzlich hochgeladen

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

F01-Cloud-Mygene.info

  • 1. MyGene.Info: Gene Annotation as a Service - GAaaS Chunlei Wu cwu@scripps.edu 2011/07/16
  • 2. A migration story for BioGPS http://biogps.org
  • 3. A migration story for BioGPS http://biogps.org
  • 4. A migration story for BioGPS http://biogps.org
  • 5. Gene-centric annotation data http://biogps.org A simple view: Gene 1017 →Symbol: CDK2 →Ensembl: ENSG00000123374 →RefSeq: NM_001798 NM_052827 →Reporter: →U95A: 1792_g_at 1833_at →U133A: 211804_s_at 204252_at 211803_at
  • 6. A real example: Symbol Name Alias Summary Ensembl Refseq UniGene Homologene GO UniProt InterPro PDB Prosite IPI And many more…
  • 7. Relational database solutions Solution 1: “star” schema Ensembl Table GeneID EnsemblID Reporter Table 1017 ENSG00000250560 GeneID Platform Reporter 1017 U95A 1792_s_at Master Table 1017 U95A 1833_at GeneID Symbol 1017 U133A 211804_s_at 1017 CDK2 1017 U133A 204252_at Refseq Table 1017 U133A 211803_at GeneID RefseqID 1017 NM_001798 1017 NM_052827
  • 8. Relational database solutions Solution 2: “weakly-typed” schema Generic Data Table ID Type Value Parent Root 1 GeneID 1017 NULL 1017 2 Symbol CDK2 1017 1017 3 Ensembl ENSG00000123374 1017 1017 4 RefSeq NM_001798 1017 1017 5 RefSeq NM_052827 1017 1017 6 Platform U95A 1017 1017 7 Platform U133A 1017 1017 8 Reporter 1792_g_at U95A 1017 9 Reporter 1833_at U95A 1017 10 Reporter 211804_s_at U133A 1017 11 Reporter 204252_at U133A 1017 12 Reporter 211803_at U133A 1017
  • 9. “Document”-based database solution CDK2 1017: { “Symbol”: “CDK2”, “Ensembl”: “ENSG00000123374”, “RefSeq”: [ “NM_001798”, “NM_052827” ], “Reporter”: { “U95A”: [ “1792_g_at”, “1833_at” ], “U133A”:[ “211804_s_at”, “2045252_at”, “211803_at” ] } }
  • 10. What’s CouchDB  Document-based (“schema-free”) database  Index and query data in MapReduce fashion using Javascript  RESTful JSON API  Bi-directional replicator  Distributed
  • 11. Load data into “CouchDB”  NCBI “gene_info” file Create bare-bone document for each gene  Appending more annotation to “document” Gene “document” “gene2refseq” “gene2ensembl” “u95a_annot” “u133a_annot” And more … Easy to add append data type Easy to update incrementally
  • 13. Gene Annotation as a Service http://MyGene.Info Gene annotation services go PUBLIC Gene query service http://mygene.info/query?q=<query> Gene annotation service http://mygene.info/gene/<geneid>
  • 14. Gene Query Service user query  matching gene IDs/symbols/names (JSON output) http://mygene.info/query?q=<query> Examples: http://mygene.info/query?q=cdk2 http://mygene.info/query?q=cdk2+AND+species:human http://mygene.info/query?q=cdk? http://mygene.info/query?q=p* http://mygene.info/query?q=entrezgene:1017 http://mygene.info/query?q=ensemblgene:ENSG00000123374
  • 15. Gene Annotation Service gene id  full or filtered gene annotation object (JSON output) http://mygene.info/gene/<geneid> Examples: http://mygene.info/gene/1017 http://mygene.info/gene/ENSG00000123374 http://mygene.info/gene/1017?filter=name,symbol,summary http://mygene.info/gene/1017?filter=name,symbol,refseq.rna Species supported: human mouse rat fruitfly nematode zebrafish thale cress frog
  • 16. Targeted use case: Quickly build a gene-centric online resource without the need of maintaining a local gene annotation database Use it in a web application: Server side Making direct HTTP calls Client side Setup a server-side proxy JSONP calls Cross-domain AJAX calls via CORS (Cross-Origin Resource Sharing) Demo and full documentation at http://mygene.info Source code: https://bitbucket.org/newgene/genedoc/src
  • 17. Acknowledgement Group members: GNF collaborators: Andrew Su Camilo Orozco Jon Huss Ian MacLeod Benjamin Good Past contributor: Eric Clarke Marc Leglise ISMB travel support Funding and Support (NIH grant: R01GM083924) http://mygene.info