Life as a scientific database curator


          Sandra Orchard




                EBI is an Outstation of the European Molecular Biology Laboratory.
What is a database curator

       Curator – OED

            - a keeper of a museum or other collection

            - from LATIN curare – take care of




What is a database curator

       The job
       • Creating a structure for unstructured biological data
       • Generating order from chaos
       • Combining literature and automated processes to provide
         biomolecules with correct sequence/structure,
         nomenclature, function and contextual information
       • Giving biological context to large experimental datasets
       The qualification
       • Attention to detail that would annoy even the
         best of housemates
       • Passion for reading and understanding literature

What is a database curator

       The Pros

       • Read about and gain understanding of all areas of
         biology

       The Cons

       • No specialisation
       • Persuading biologists that there are benefits to curation




What is a database curator

• The International Society for Biocuration (ISB) definition:
...integration of information relevant to biology into a
    database or resource that enables integration of the
    scientific literature...and large experimental data sets.
• Goals are
...accurate and comprehensive representation...
...to facilitate access to data for scientists...as a resource for
    computational analysis
What does a database curator do?
Collects, annotates, and validates information (in a
database).


Extracts & organizes data from literature


Describes data using standards, protocols and
vocabularies (enabling computational queries and data
exchange).

Communicates with researchers to ensure the accuracy
of curated information and to foster good practice in data
exchange.
What does a database curator do?

            Takes part in the development of shared
            biomedical data standards and ontologies
            and (ideally) enforces their use.

            Trains users in effectively accessing and
            using the data in the databases

            Promotes database usage through talks,
            conference attendance/posters,
            publications, etc.



What do I do?

       • Curate the molecular interaction database




What do I do?




       Custom curation tools designed by the curation team


What do I do?

                        Controlled vocabulary maintenance




Qualifications for the job

        • A biology B.Sc./M.Sc./PhD + lab experience

              or

        • A bioinformatics M.Sc.

        Plus – an enquiring mind, ability to write good English and
          the right attitude

        Training – largely database-specific and given ‘on the
          job’



Qualifications for the job

        • Do I need to be able to do programming?

        • Answer – no. It is often helpful to have some database
          query ability but it is perfectly possible to do the job
          without (in most databases)




Career Progression

        Within the EBI
        • Progress as a curator – senior curator, curation
          coordinator

        • Project management – grant coordinator, project leader

        Post-EBI
        • Curation/project leadership positions at many other
          institutes
        • Related areas – academic research, research project
          management, lectureships, journal publishing

Will I still be allowed to publish?

        Curation
        The annotation of both human and mouse kinomes in
          UniProtKB/Swiss-Prot - (MCP)
        Data Standards
        The Minimum Information required for reporting a Molecular
          Interaction Experiment (MIMIx) – (NBT)
        Data Formats
        The HUPO PSI's molecular interaction format--a community
          standard for the representation of protein interaction data.
          – (NBT)



Will I still be allowed to publish?

        Tool development
          Rintact: enabling computational analysis of molecular
          interaction data from the IntAct repository.
          (Bioinformatics)
        Ontologies
        The use of common ontologies and controlled vocabularies
          to enable data exchange and deposition for complex
          proteomic experiments (Pac Symp Biocomput)
        Training
        Submit your interaction data the IMEx way - a step by step
          guide to trouble-free deposition (Proteomics)


Curation as a profession




Curation as a profession

        • Biocuration conference every 12 months – 2102 in
          Cambridge, UK

        • Opportunities for further training – bioinformatic tools,
          programming, career development/management

        • Attendance at biological/computational biology
          conferences encouraged – the EBI often provides
          speakers




Summary

        • Curation is not for everyone – it does require a certain
          mindset

        • Exposes you to all areas of biology (and chemistry)

        • Now a recognised profession, and our numbers are
          growing

        • Many opportunities to become involved in
          “extracurricular” activities – it's not all reading papers



