PomBase Community Curation:
A Fast Track to Capture Expert Knowledge

Antonia Lock
The S. pombe Community
•  Medium-sized research community
 •  More than 200 labs; 1300 subscribers to the mailing list
 •  Close-knit

•  GeneDB S. pombe model organism database set up in 2004
 •  Maintained by one person (V. Wood)
 •  Mainly GO annotation
 •  Problem:
   •  Needed to support additional types of data
   •  Too many publications to curate with the available manpower
The Community Curation Initiative
•  Pilot study in 2009
  •  Highly successful
      •  29/44 responded (no follow-up for non-responders)
      •  ~360 new annotations
      •  Annotations were generally of high quality – errors were easy to spot
      •  Enabled a dialogue between authors and curators
  •  Process must be simplified
      •  Need for a simple tool in which to do the curation, instead of a complicated Word document

•  2010 – Wellcome Trust grant
  •  To develop and implement a community curation tool
  •  Also to develop a new fission yeast database, ‘PomBase’, which will support a range of additional data types not previously captured in GeneDB
Data captured in GeneDB vs. PomBase
Data type                              Ontology                                  GeneDB    PomBase
Function/Process/Component             GO                                        ✔         ✔
Protein modifications                  Protein Modification Ontology             -         ✔
Phenotypes                             FYPO (Fission Yeast Phenotype Ontology)   Some      ✔
Interactions                           BioGRID                                   BioGRID   ✔
Gene expression                        In-house vocabulary                       -         ✔
Misc features (disease associations,   In-house vocabulary                       ✔         ✔
  complementation…)

The increased breadth makes community curation even more important
Phenotype Ontology
•  User survey 2007: phenotypes were identified as the single most desirable information type not supported by GeneDB S. pombe.

•  Need for a pre-composed Fission Yeast Phenotype Ontology
  •  Ease of community curation
  •  Needed greater specificity of terms than that offered by existing phenotype ontologies

•  Each term is accompanied by two types of information:
  •  Allele description – deletion, overexpression or mutation
  •  Experimental conditions where appropriate

•  Combination of different ontologies used to create formal definitions
     •  E.g. PATO, ChEBI, GO
      PATO                  FYPO                                ChEBI
      resistance to         resistance to thiabendazole         thiabendazole
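As a rough illustration of the slide above, the sketch below models a pre-composed phenotype term built from a PATO-style quality and a ChEBI-style entity, together with the two pieces of accompanying information (allele description and experimental conditions). All class and function names, the gene name `geneX` and the condition string are hypothetical; this is not PomBase's or FYPO's actual data model.

```python
from dataclasses import dataclass, field
from typing import List

# The three allele descriptions named on the slide.
ALLELE_TYPES = {"deletion", "overexpression", "mutation"}


@dataclass(frozen=True)
class PhenotypeTerm:
    name: str      # pre-composed term name, as a curator would pick it
    quality: str   # PATO-style quality, e.g. "resistance to"
    entity: str    # e.g. a ChEBI chemical name


@dataclass
class PhenotypeAnnotation:
    gene: str                      # hypothetical gene name
    term: PhenotypeTerm
    allele_type: str
    conditions: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject allele descriptions outside the controlled list.
        if self.allele_type not in ALLELE_TYPES:
            raise ValueError(f"unknown allele type: {self.allele_type!r}")


def precompose(quality: str, entity: str) -> PhenotypeTerm:
    """Build the term name from its two component terms."""
    return PhenotypeTerm(name=f"{quality} {entity}", quality=quality, entity=entity)


term = precompose("resistance to", "thiabendazole")
annotation = PhenotypeAnnotation(
    gene="geneX",                          # placeholder, not a real gene
    term=term,
    allele_type="deletion",
    conditions=["placeholder condition"],  # placeholder, not a real condition term
)
print(annotation.term.name, annotation.allele_type, annotation.conditions)
```

Because the term is composed in advance, a community curator only has to pick one name, while the component terms remain available for formal definitions.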
GO Term Extensions

GO ID        Term                                        Evidence   With/From   Source
GO:0004674   Protein serine/threonine kinase activity
               has_substrate pom1                        IDA                    Yoon HJ et al. (2006)
               has_substrate rum1                        IDA                    Noguchi E et al. (2002)
               has_substrate rbp80                       IDA                    Holig K et al. (2009)
               has_substrate sin1                        IDA                    Jang YJ et al. (1997)
Why Not a Wiki?
•  Traditionally biologists would study one gene/protein
 •  Individual text-based gene pages were an ideal format

•  Many techniques used today generate gene lists
 •  Enrichment analysis identifies patterns in the data set, e.g. are certain processes common to the group of genes?
 •  Need annotations to controlled vocabularies to make efficient, computerized comparisons
   •  A wiki, essentially free text, does not provide this

•  All annotations are supported by evidence
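To make the enrichment point concrete, here is a minimal sketch of one common way to ask whether a term is over-represented in a gene list: a one-sided hypergeometric test. The slides do not name a specific test, so this choice is an assumption, and the gene names and term labels are toy placeholders rather than real annotations. The counting step only works because every gene is annotated with terms from a shared vocabulary.

```python
from math import comb
from typing import Dict, List, Set


def enrichment_p_value(population: int, annotated: int, sample: int, hits: int) -> float:
    """P(X >= hits) when drawing `sample` genes from `population`,
    of which `annotated` carry the term (hypergeometric upper tail)."""
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


def term_enrichment(term: str, gene_list: List[str],
                    annotations: Dict[str, Set[str]]) -> float:
    population = len(annotations)
    annotated = sum(term in terms for terms in annotations.values())
    hits = sum(term in annotations[gene] for gene in gene_list)
    return enrichment_p_value(population, annotated, len(gene_list), hits)


# Toy data: hypothetical genes g1..g10 with placeholder term labels.
annotations = {
    "g1": {"cell cycle"}, "g2": {"cell cycle"}, "g3": {"cell cycle"},
    "g4": {"transport"}, "g5": {"transport"},
    "g6": {"cell cycle", "transport"},
    "g7": set(), "g8": {"transport"}, "g9": set(), "g10": set(),
}
gene_list = ["g1", "g2", "g3", "g7"]

p_cell_cycle = term_enrichment("cell cycle", gene_list, annotations)
p_transport = term_enrichment("transport", gene_list, annotations)
print(f"cell cycle: {p_cell_cycle:.3f}, transport: {p_transport:.3f}")
```

With free-text wiki pages there would be no reliable way to compute `annotated` or `hits`, since the same process could be described in many different wordings.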
What Will the Community Curate?
•  Data that can be captured by the formal vocabularies used in PomBase
 •  GO (including extensions)
 •  Protein modifications (including residue information)
 •  Phenotypes (including alleles and conditions)
 •  Interactions

•  Mostly pre-composed terms
 •  Extensions will be captured by prompting where relevant
   •  I.e. the community will not be expected to know when to use these
The Community Annotation Tool - CANTO
•  Final stages of development
 •  Developed by Kim Rutherford
 •  Already in use by the PomBase curators
 •  We are involving the community at this stage through review of curated (recent) publications

•  Provides a web-based interface
 •  Can be used as a stand-alone application (provides annotations in GAFs)
 •  Pipelines are in place for direct loading into Chado
   •  Chado (GMOD project) is a database schema for handling biological data
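For readers who have not handled GAFs, the sketch below reads GAF 2.x-style lines (tab-separated, 17 columns, comment lines starting with `!`) into dictionaries. The column keys are informal labels chosen here, and the sample row is entirely made up (`ExampleDB`, `GENE0001`, `geneX`, `geneY` and the PMID are placeholders); it says nothing about how CANTO itself writes or loads files.

```python
import io
from typing import Dict, Iterable, Iterator

# Informal labels for the 17 GAF 2.x columns, in file order.
GAF_COLUMNS = [
    "db", "db_object_id", "db_object_symbol", "qualifier", "go_id",
    "reference", "evidence", "with_from", "aspect", "db_object_name",
    "synonym", "db_object_type", "taxon", "date", "assigned_by",
    "annotation_extension", "gene_product_form_id",
]


def read_gaf(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict per annotation line, skipping comments and blank lines."""
    for line in lines:
        if line.startswith("!") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        # Pad short rows so trailing optional columns become empty strings.
        fields += [""] * (len(GAF_COLUMNS) - len(fields))
        yield dict(zip(GAF_COLUMNS, fields))


# A made-up row; every identifier below is a placeholder.
sample_row = "\t".join([
    "ExampleDB", "GENE0001", "geneX", "", "GO:0004674",
    "PMID:0000000", "IDA", "", "F", "", "", "protein",
    "taxon:4896", "20120101", "ExampleDB", "has_substrate(geneY)", "",
])
sample = "!gaf-version: 2.0\n" + sample_row + "\n"

records = list(read_gaf(io.StringIO(sample)))
for record in records:
    print(record["db_object_symbol"], record["go_id"],
          record["evidence"], record["annotation_extension"])
```

A flat, column-based file like this is what makes the stand-alone mode useful: any database that can read GAFs can take the annotations without going through Chado.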
5 Easy Steps to Broad Curation of Data - A Walk-through
Step 1: Add your genes
The main page - choose a gene to get started…
Step 2: Choose the type of annotation
Step 3: Find the correct term
Child terms are suggested…
Step 4: Add the evidence
Step 5: Review, extend and transfer
Quality Control and Consistency Checking
•  Professional curators are needed not just for curation support, but also for quality control and consistency checking.
Help?!
•  There is always a visible help button
Benefits of Community Curation

•  Researchers can curate ‘from home’ immediately following publication
 •  First-pass annotations are obtained quickly, so data appear in the database quickly
 •  Expert knowledge, coupled with quality control by curators, makes for powerful, accurate annotations
 •  Controlled annotations can be loaded from the tool directly into our database

•  The bottleneck is how quickly professional curators can check annotations, not how fast we can obtain them

•  Frees up time for us to clear the backlog of papers
Benefits to the Researcher
•  Greater visibility of publication
  •  Annotations propagated to GO, BioGRID, Ensembl, NCBI, UniProt…
  •  Increased citation index?

•  A greater understanding of ontologies
  •  Will be able to use them better to support their research
Future Directions
•  ~3 months until official launch of CANTO
 •  Multi-gene phenotypes
 •  Extensions (restricted usage for specific terms and relations)
 •  More help features and descriptive boxes

•  Longer term
 •  Making the tool easily configurable for other organisms
 •  Making the tool available to other communities
Acknowledgements
•  The PomBase team:
   •  Val Wood
   •  Midori Harris
   •  Kim Rutherford
   •  Mark McDowall
   •  Antonia Lock

•  PIs:
   •  Jurg Bahler (UCL)
   •  Steve Oliver (Cambridge)
   •  Paul Kersey (EBI Hinxton)

•  Funded by the Wellcome Trust
