SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Three’s a crowd-source:
Observations on Collaborative
Genome Annotation.
Monica Munoz-Torres, PhD via Suzanna Lewis
Biocurator & Bioinformatics Analyst | @monimunozto
Genomics Division, Lawrence Berkeley National Laboratory
08 April, 2014 | 7th International Biocuration Conference
UNIVERSITY OF
CALIFORNIA
Outline 1. Automated and Manual Annotation in
a genome sequencing project.
2. Distributed, community-based
genome curation using Apollo.
3. What we have learned so far.
Three’s a crowd-
source:
Observations on
Collaborative
Genome Annotation.
Outline 2
Assembly
Manual
annotation
Experimental
validation
Automated
Annotation
In a genome sequencing project…
Automated Genome Annotation
1. Automated and Manual Annotation.
Gene prediction
Identifies elements of the genome using empiric and ab
initio gene finding systems. Uses additional experimental
evidence to identify domains and motifs.
Nucleic Acids 2003 vol. 31 no. 13 3738-3741
Curation [manual genome annotation editing]
1. Automated and Manual Annotation.
- Identify elements that best
represent the underlying
biological truth
- Eliminate elements that reflect
the systemic errors of
automated analyses.
- Determine functional roles
comparing to well-
studied, phylogenetically similar
genome elements via literature
and public databases (and
experience!).
Experimental Evidence:
cDNAs, HMM domain
searches, alignments with
assemblies or genes from other
species.
Computational analyses
Manually-curated Consensus
Gene Structures
Curators strive to achieve precise
biological fidelity.
1. Automated and Manual Annotation. 5
But! A single curator
cannot do it all:
- unmanageable scale.
- colleagues with
expertise in other
domains and gene
families are required.
iStockPhoto.com
Bring scientists together to:
- Distribute problem solving
- Mine collective intelligence
- Access quality
- Process work in parallel
Crowd-sourcing Genome Curation
“The knowledge and talents of a group of people is
leveraged to create and solve problems”
– Josh Catone | ReadWrite.com
Footer 6
(“crowdsourcing”, FreeBase.com)
Dispersed, community-based manual
annotation efforts.
We* have trained geographically dispersed
scientific communities to perform biologically
supported manual annotations: ~80
institutions, 14 countries, hundreds of
scientists using Apollo.
Education through:
– Training workshops and geneborees.
– Tutorials.
– Personalized user support.
2. Community-based curation. 7
*with Elsik Lab. University of Missouri.
What is Apollo?
• Apollo is a genomic annotation editing platform.
To modify and refine the precise location and structure of the
genome elements that predictive algorithms cannot yet
resolve automatically.
82. Community-based curation.
Find more about Web Apollo at
http://GenomeArchitect.org
and
Genome Biol 14:R93. (2013).
Web Apollo improves the
manual annotation environment
• Allows for intuitive annotation creation and editing with
gestures and pull-down menus to create and modify
coding genes and regulatory elements, insert comments
(CV, freeform text), etc.
• Browser-based, plugin for JBrowse.
• Edits in one client are instantly
pushed to all other clients.
• Customizable rules and
appearance.
92. Community-based curation.
Has the collaborative nature of manual
annotation efforts influenced research
productivity and the quality of
downstream analyses?
3. What we have learned. 10
Working together was helpful and
automated annotations were improved.
Scientific community efforts brought
together domain-specific and natural
history expertise that would have
otherwise remain disconnected.
Example:
>100 bovine cattle researchers
~3,600 manual annotations
3. What we have learned. 11
Nature Reviews Genetics 2009 (10), 346-
347
Science. 2009 (324) 5926, 522-528
Example:
Understanding the evolution of sociality.
Compared seven ant genomes for a better
understanding of evolution and organization
of insect societies at the molecular level.
Insights drawn mainly from six core aspects of
ant biology:
1. Alternative morphological castes
2. Division of labor
3. Chemical Communication
4. Alternative social organization
5. Social immunity
6. Mutualism
3. What we have learned. 12
The work of
groups of
communities led
to new insights.
Libbrecht et al. 2012. Genome Biology 2013, 14:212
New sequencing technologies pose
additional challenges.
Lower coverage leads to
– frameshifts and indel errors
– split genes across contigs
– highly repetitive sequences
To face these challenges, we train annotators in
recovering coding sequences in agreement with all
available biological evidence.
3. What we have learned. 13
Other lessons learned
1. You must enforce strict rules and formats; it is
necessary to maintain consistency.
2. Be flexible and adaptable: study and incorporate
new data, and adapt to support new platforms to
keep pace and maintain the interest of scientific
community. Evolve with the data!
3. A little training goes a long way! With the right
tools, wet lab scientists make exceptional curators
who can easily learn to maximize the generation of
accurate, biologically supported gene models.
3. What we have learned. 14
The power behind
community-based
curation of
biological data.
3. What we have learned. 15
Thanks!
• Berkeley Bioinformatics Open-source Projects
(BBOP), Berkeley Lab: Web Apollo and Gene
Ontology teams. Suzanna Lewis (PI).
• The team at Elsik Lab. § University of Missouri.
Christine G. Elsik (PI).
• Ian Holmes (PI). * University of California Berkeley.
• Arthropod genomics community, i5K
http://www.arthropodgenomes.org/wiki/i5K (Org.
Committee, NAL (USDA), HGSC-BCM, BGI), and
1KITE http://www.1kite.org/.
• Web Apollo is supported by NIH grants
5R01GM080203 from NIGMS, and 5R01HG004483
from NHGRI, and by the Director, Office of Science,
Office of Basic Energy Sciences, of the U.S.
Department of Energy under Contract No. DE-AC02-
05CH11231.
• Insect images used with permission:
http://AlexanderWild.com
• For your attention, thank you!
Thank you. 16
Web Apollo
Ed Lee
Gregg Helt
Justin Reese §
Colin Diesh §
Deepak Unni §
Chris Childers §
Rob Buels *
Gene Ontology
Chris Mungall
Seth Carbon
Heiko Dietze
BBOP
Web Apollo: http://GenomeArchitect.org
GO: http://GeneOntology.org
i5K: http://arthropodgenomes.org/wiki/i5K
ISB: http://biocurator.org

Weitere ähnliche Inhalte

Was ist angesagt?

Munoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMunoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMonica Munoz-Torres
 
Essential Requirements for Community Annotation Tools
Essential Requirements for Community Annotation ToolsEssential Requirements for Community Annotation Tools
Essential Requirements for Community Annotation ToolsMonica Munoz-Torres
 
An introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera
An introduction to Web Apollo for i5K Pilot Species Projects - HemipteraAn introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera
An introduction to Web Apollo for i5K Pilot Species Projects - HemipteraMonica Munoz-Torres
 
System's Biology
System's Biology System's Biology
System's Biology Pritam Shil
 
Management of experimental biosamples
Management of experimental biosamplesManagement of experimental biosamples
Management of experimental biosamplesARDC
 
Visualization of insect vector-plant pathogen interactions in the citrus gree...
Visualization of insect vector-plant pathogen interactions in the citrus gree...Visualization of insect vector-plant pathogen interactions in the citrus gree...
Visualization of insect vector-plant pathogen interactions in the citrus gree...Surya Saha
 
ADARSH JOSE_Resume
ADARSH JOSE_ResumeADARSH JOSE_Resume
ADARSH JOSE_ResumeAdarsh Jose
 
Wright_Andrew_Resume
Wright_Andrew_ResumeWright_Andrew_Resume
Wright_Andrew_ResumeAndrew Wright
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to CancerRaunak Shrestha
 
Reframing Phylogenomics
Reframing PhylogenomicsReframing Phylogenomics
Reframing PhylogenomicsJoe Parker
 
Welch Wordifier Bosc2009
Welch Wordifier Bosc2009Welch Wordifier Bosc2009
Welch Wordifier Bosc2009bosc
 
Light Intro to the Gene Ontology
Light Intro to the Gene OntologyLight Intro to the Gene Ontology
Light Intro to the Gene Ontologynniiicc
 
Matthew Brown resume
Matthew Brown resumeMatthew Brown resume
Matthew Brown resumeMatthew Brown
 
Next-Gen Taxonomic Descriptions for Microbial Eukaryotes
Next-Gen Taxonomic Descriptions for Microbial EukaryotesNext-Gen Taxonomic Descriptions for Microbial Eukaryotes
Next-Gen Taxonomic Descriptions for Microbial EukaryotesAnne Thessen
 
Gene Ontology Project
Gene Ontology ProjectGene Ontology Project
Gene Ontology Projectvaibhavdeoda
 

Was ist angesagt? (20)

Munoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMunoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ss
 
Essential Requirements for Community Annotation Tools
Essential Requirements for Community Annotation ToolsEssential Requirements for Community Annotation Tools
Essential Requirements for Community Annotation Tools
 
An introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera
An introduction to Web Apollo for i5K Pilot Species Projects - HemipteraAn introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera
An introduction to Web Apollo for i5K Pilot Species Projects - Hemiptera
 
System's Biology
System's Biology System's Biology
System's Biology
 
Management of experimental biosamples
Management of experimental biosamplesManagement of experimental biosamples
Management of experimental biosamples
 
Visualization of insect vector-plant pathogen interactions in the citrus gree...
Visualization of insect vector-plant pathogen interactions in the citrus gree...Visualization of insect vector-plant pathogen interactions in the citrus gree...
Visualization of insect vector-plant pathogen interactions in the citrus gree...
 
ADARSH JOSE_Resume
ADARSH JOSE_ResumeADARSH JOSE_Resume
ADARSH JOSE_Resume
 
Wright_Andrew_Resume
Wright_Andrew_ResumeWright_Andrew_Resume
Wright_Andrew_Resume
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to Cancer
 
Reframing Phylogenomics
Reframing PhylogenomicsReframing Phylogenomics
Reframing Phylogenomics
 
Welch Wordifier Bosc2009
Welch Wordifier Bosc2009Welch Wordifier Bosc2009
Welch Wordifier Bosc2009
 
Matteo CV
Matteo CVMatteo CV
Matteo CV
 
MIT STAR
MIT STARMIT STAR
MIT STAR
 
MIT STAR
MIT STARMIT STAR
MIT STAR
 
Light Intro to the Gene Ontology
Light Intro to the Gene OntologyLight Intro to the Gene Ontology
Light Intro to the Gene Ontology
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
 
Matthew Brown resume
Matthew Brown resumeMatthew Brown resume
Matthew Brown resume
 
Next-Gen Taxonomic Descriptions for Microbial Eukaryotes
Next-Gen Taxonomic Descriptions for Microbial EukaryotesNext-Gen Taxonomic Descriptions for Microbial Eukaryotes
Next-Gen Taxonomic Descriptions for Microbial Eukaryotes
 
Bioinformatics ppt
Bioinformatics pptBioinformatics ppt
Bioinformatics ppt
 
Gene Ontology Project
Gene Ontology ProjectGene Ontology Project
Gene Ontology Project
 

Ähnlich wie Three's a crowd-source: Observations on Collaborative Genome Annotation

Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.Monica Munoz-Torres
 
An introduction to Web Apollo for the Biomphalaria glabatra research community.
An introduction to Web Apollo for the Biomphalaria glabatra research community.An introduction to Web Apollo for the Biomphalaria glabatra research community.
An introduction to Web Apollo for the Biomphalaria glabatra research community.Monica Munoz-Torres
 
Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Monica Munoz-Torres
 
Web Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterWeb Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterMonica Munoz-Torres
 
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Nathan Dunn
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesMonica Munoz-Torres
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionMonica Munoz-Torres
 
Curation Introduction - Apollo Workshop
Curation Introduction - Apollo WorkshopCuration Introduction - Apollo Workshop
Curation Introduction - Apollo WorkshopMonica Munoz-Torres
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchEuropean Bioinformatics Institute
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyBarry Smith
 
EVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - IntroductionEVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - IntroductionJonathan Eisen
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysisDr. Naveen Gaurav srivastava
 
Bio-ontologies in bioinformatics: Growing up challenges
Bio-ontologies in bioinformatics: Growing up challengesBio-ontologies in bioinformatics: Growing up challenges
Bio-ontologies in bioinformatics: Growing up challengesJanna Hastings
 
R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...
R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...
R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...tutorialsruby
 

Ähnlich wie Three's a crowd-source: Observations on Collaborative Genome Annotation (20)

Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.
 
An introduction to Web Apollo for the Biomphalaria glabatra research community.
An introduction to Web Apollo for the Biomphalaria glabatra research community.An introduction to Web Apollo for the Biomphalaria glabatra research community.
An introduction to Web Apollo for the Biomphalaria glabatra research community.
 
Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014
 
Web Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterWeb Apollo Workshop University of Exeter
Web Apollo Workshop University of Exeter
 
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
 
Apollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 IntroductionApollo Workshop AGS2017 Introduction
Apollo Workshop AGS2017 Introduction
 
Curation Introduction - Apollo Workshop
Curation Introduction - Apollo WorkshopCuration Introduction - Apollo Workshop
Curation Introduction - Apollo Workshop
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
Apollo Workshop at KSU 2015
Apollo Workshop at KSU 2015Apollo Workshop at KSU 2015
Apollo Workshop at KSU 2015
 
eScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-BrazileScience-School-Oct2012-Campinas-Brazil
eScience-School-Oct2012-Campinas-Brazil
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
EVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - IntroductionEVE161: Microbial Phylogenomics - Class 1 - Introduction
EVE161: Microbial Phylogenomics - Class 1 - Introduction
 
PhDc exam presentation
PhDc exam presentationPhDc exam presentation
PhDc exam presentation
 
B4OS-2012
B4OS-2012B4OS-2012
B4OS-2012
 
Web based servers and softwares for genome analysis
Web based servers and softwares for genome analysisWeb based servers and softwares for genome analysis
Web based servers and softwares for genome analysis
 
A6.3 Longchamps
A6.3 LongchampsA6.3 Longchamps
A6.3 Longchamps
 
Bio-ontologies in bioinformatics: Growing up challenges
Bio-ontologies in bioinformatics: Growing up challengesBio-ontologies in bioinformatics: Growing up challenges
Bio-ontologies in bioinformatics: Growing up challenges
 
R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...
R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...
R31%20Strong%20A%20Web-based%20Comparative%20Genomics%20tutorial%20Microbiolo...
 

Mehr von Monica Munoz-Torres

Apollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionalityApollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionalityMonica Munoz-Torres
 
Editing Functionality - Apollo Workshop
Editing Functionality - Apollo WorkshopEditing Functionality - Apollo Workshop
Editing Functionality - Apollo WorkshopMonica Munoz-Torres
 
Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Monica Munoz-Torres
 
Apollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citriApollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citriMonica Munoz-Torres
 
Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing Monica Munoz-Torres
 
JBrowse & Apollo Overview - for AGR
JBrowse & Apollo Overview - for AGRJBrowse & Apollo Overview - for AGR
JBrowse & Apollo Overview - for AGRMonica Munoz-Torres
 
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...Monica Munoz-Torres
 
Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)Monica Munoz-Torres
 
Gene Ontology Consortium: Website & COmmunity
Gene Ontology Consortium: Website & COmmunityGene Ontology Consortium: Website & COmmunity
Gene Ontology Consortium: Website & COmmunityMonica Munoz-Torres
 
Genome Curation using Apollo - Workshop at UTK
Genome Curation using Apollo - Workshop at UTKGenome Curation using Apollo - Workshop at UTK
Genome Curation using Apollo - Workshop at UTKMonica Munoz-Torres
 
Introduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisIntroduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisMonica Munoz-Torres
 
Introduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research CommunityIntroduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research CommunityMonica Munoz-Torres
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Monica Munoz-Torres
 
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcionalCONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcionalMonica Munoz-Torres
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityMonica Munoz-Torres
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
 

Mehr von Monica Munoz-Torres (20)

Apollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionalityApollo Workshop AGS2017 Editing functionality
Apollo Workshop AGS2017 Editing functionality
 
Editing Functionality - Apollo Workshop
Editing Functionality - Apollo WorkshopEditing Functionality - Apollo Workshop
Editing Functionality - Apollo Workshop
 
Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015
 
Apollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citriApollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citri
 
Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing Apollo Collaborative genome annotation editing
Apollo Collaborative genome annotation editing
 
JBrowse & Apollo Overview - for AGR
JBrowse & Apollo Overview - for AGRJBrowse & Apollo Overview - for AGR
JBrowse & Apollo Overview - for AGR
 
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
Apollo Genome Annotation Editor: Latest Updates, Including New Galaxy Integra...
 
Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)Introduction to Apollo - i5k Research Community – Calanoida (copepod)
Introduction to Apollo - i5k Research Community – Calanoida (copepod)
 
Gene Ontology Consortium: Website & COmmunity
Gene Ontology Consortium: Website & COmmunityGene Ontology Consortium: Website & COmmunity
Gene Ontology Consortium: Website & COmmunity
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
 
Genome Curation using Apollo - Workshop at UTK
Genome Curation using Apollo - Workshop at UTKGenome Curation using Apollo - Workshop at UTK
Genome Curation using Apollo - Workshop at UTK
 
Introduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinisIntroduction to Apollo: i5K E affinis
Introduction to Apollo: i5K E affinis
 
Introduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research CommunityIntroduction to Apollo: A webinar for the i5K Research Community
Introduction to Apollo: A webinar for the i5K Research Community
 
Genome Curation using Apollo
Genome Curation using ApolloGenome Curation using Apollo
Genome Curation using Apollo
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07
 
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcionalCONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
CONSORCIO ONTOLOGÍA DE GENES: herramientas para anotación funcional
 
Apolo Taller en BIOS
Apolo Taller en BIOS Apolo Taller en BIOS
Apolo Taller en BIOS
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research Community
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination Network
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research community
 

Kürzlich hochgeladen

Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinojohnmickonozaleda
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 

Kürzlich hochgeladen (20)

FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipino
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 

Three's a crowd-source: Observations on Collaborative Genome Annotation

  • 1. Three’s a crowd-source: Observations on Collaborative Genome Annotation. Monica Munoz-Torres, PhD via Suzanna Lewis Biocurator & Bioinformatics Analyst | @monimunozto Genomics Division, Lawrence Berkeley National Laboratory 08 April, 2014 | 7th International Biocuration Conference UNIVERSITY OF CALIFORNIA
  • 2. Outline 1. Automated and Manual Annotation in a genome sequencing project. 2. Distributed, community-based genome curation using Apollo. 3. What we have learned so far. Three’s a crowd- source: Observations on Collaborative Genome Annotation. Outline 2 Assembly Manual annotation Experimental validation Automated Annotation In a genome sequencing project…
  • 3. Automated Genome Annotation 1. Automated and Manual Annotation. Gene prediction Identifies elements of the genome using empiric and ab initio gene finding systems. Uses additional experimental evidence to identify domains and motifs. Nucleic Acids 2003 vol. 31 no. 13 3738-3741
  • 4. Curation [manual genome annotation editing] 1. Automated and Manual Annotation. - Identify elements that best represent the underlying biological truth - Eliminate elements that reflect the systemic errors of automated analyses. - Determine functional roles comparing to well- studied, phylogenetically similar genome elements via literature and public databases (and experience!). Experimental Evidence: cDNAs, HMM domain searches, alignments with assemblies or genes from other species. Computational analyses Manually-curated Consensus Gene Structures
  • 5. Curators strive to achieve precise biological fidelity. 1. Automated and Manual Annotation. 5 But! A single curator cannot do it all: - unmanageable scale. - colleagues with expertise in other domains and gene families are required. iStockPhoto.com
  • 6. Bring scientists together to: - Distribute problem solving - Mine collective intelligence - Access quality - Process work in parallel Crowd-sourcing Genome Curation “The knowledge and talents of a group of people is leveraged to create and solve problems” – Josh Catone | ReadWrite.com Footer 6 (“crowdsourcing”, FreeBase.com)
  • 7. Dispersed, community-based manual annotation efforts. We* have trained geographically dispersed scientific communities to perform biologically supported manual annotations: ~80 institutions, 14 countries, hundreds of scientists using Apollo. Education through: – Training workshops and geneborees. – Tutorials. – Personalized user support. 2. Community-based curation. 7 *with Elsik Lab. University of Missouri.
  • 8. What is Apollo? • Apollo is a genomic annotation editing platform. To modify and refine the precise location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. 82. Community-based curation. Find more about Web Apollo at http://GenomeArchitect.org and Genome Biol 14:R93. (2013).
  • 9. Web Apollo improves the manual annotation environment • Allows for intuitive annotation creation and editing with gestures and pull-down menus to create and modify coding genes and regulatory elements, insert comments (CV, freeform text), etc. • Browser-based, plugin for JBrowse. • Edits in one client are instantly pushed to all other clients. • Customizable rules and appearance. 92. Community-based curation.
  • 10. Has the collaborative nature of manual annotation efforts influenced research productivity and the quality of downstream analyses? 3. What we have learned. 10
  • 11. Working together was helpful and automated annotations were improved. Scientific community efforts brought together domain-specific and natural history expertise that would have otherwise remain disconnected. Example: >100 bovine cattle researchers ~3,600 manual annotations 3. What we have learned. 11 Nature Reviews Genetics 2009 (10), 346- 347 Science. 2009 (324) 5926, 522-528
  • 12. Example: Understanding the evolution of sociality. Compared seven ant genomes for a better understanding of evolution and organization of insect societies at the molecular level. Insights drawn mainly from six core aspects of ant biology: 1. Alternative morphological castes 2. Division of labor 3. Chemical Communication 4. Alternative social organization 5. Social immunity 6. Mutualism 3. What we have learned. 12 The work of groups of communities led to new insights. Libbrecht et al. 2012. Genome Biology 2013, 14:212
  • 13. New sequencing technologies pose additional challenges. Lower coverage leads to – frameshifts and indel errors – split genes across contigs – highly repetitive sequences To face these challenges, we train annotators in recovering coding sequences in agreement with all available biological evidence. 3. What we have learned. 13
  • 14. Other lessons learned 1. You must enforce strict rules and formats; it is necessary to maintain consistency. 2. Be flexible and adaptable: study and incorporate new data, and adapt to support new platforms to keep pace and maintain the interest of scientific community. Evolve with the data! 3. A little training goes a long way! With the right tools, wet lab scientists make exceptional curators who can easily learn to maximize the generation of accurate, biologically supported gene models. 3. What we have learned. 14
  • 15. The power behind community-based curation of biological data. 3. What we have learned. 15
  • 16. Thanks! • Berkeley Bioinformatics Open-source Projects (BBOP), Berkeley Lab: Web Apollo and Gene Ontology teams. Suzanna Lewis (PI). • The team at Elsik Lab. § University of Missouri. Christine G. Elsik (PI). • Ian Holmes (PI). * University of California Berkeley. • Arthropod genomics community, i5K http://www.arthropodgenomes.org/wiki/i5K (Org. Committee, NAL (USDA), HGSC-BCM, BGI), and 1KITE http://www.1kite.org/. • Web Apollo is supported by NIH grants 5R01GM080203 from NIGMS, and 5R01HG004483 from NHGRI, and by the Director, Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy under Contract No. DE-AC02- 05CH11231. • Insect images used with permission: http://AlexanderWild.com • For your attention, thank you! Thank you. 16 Web Apollo Ed Lee Gregg Helt Justin Reese § Colin Diesh § Deepak Unni § Chris Childers § Rob Buels * Gene Ontology Chris Mungall Seth Carbon Heiko Dietze BBOP Web Apollo: http://GenomeArchitect.org GO: http://GeneOntology.org i5K: http://arthropodgenomes.org/wiki/i5K ISB: http://biocurator.org

Hinweis der Redaktion

  1. Outline. The box at the bottom is to give a context of automated and manual annotation as it will be discussed in this talk.
  2. Gene prediction identifies elements of the genome using either empiric or ab initio gene finding systems. Additional experimental evidence is used to identify domains and motifs, both at DNA and amino acid level.
  3. Curation here is understood in the context of manual genome annotation editing. It tries to find the best biological representation of gene models, while eliminating the most systematic errors of the automated analysis. Curation also helps to determine the functional roles of these genetic elements play by comparing them to well-studied, phylogenetically similar elements using literature and public databases, to distinguish orthologs from paralogs, and classifying their membership in families and networks.
  4. Precise biological fidelity in genome annotation editing cannot be achieved by a single individual. There are too many genes, making it an unmanageable scale, and curators need insights from colleagues with other expertise.
  5. SETI@Home tapped the unused processing power of millions of individual computers. Similarly, distributed labor networks are using the internet to exploit the spare processing power of millions of human brains.We are trying to empower genome researchers around the world to harness expertise from dispersed researchers. It could be just 3 researchers working together, that’s already a crowd!
  6. Although Computational analyses and experimental evidence from genomic features were available to build manually-curated consensus gene structures, all existent applications at the time imposed communications constrains on the curators. We created the tools to facilitate real-time interactivity and allow extensive changes without server round trips: Web Apollo.
  7. Apollo is a genomic annotation editing platform, and in its latest inception it is an evolution of a popular desktop version adopted by many research groups (insects, fish, mammals, birds, etc).
  8. Web Apollo improves the manual annotation environment. (then highlight the bulleted ideas).
  9. So, what have we learned so far?
  10. Previously disconnected researchers were more productive when obtaining immediate feedback in dialogs with collaborators.Also, automated annotations were improved as exemplified by discoveries made based on revised annotations, for example, ~3600 manual annotations contributed to a better understanding of immune function, reproduction, lactation, and metabolism in cattle.
  11. This is an example of how the collaborative nature of manual annotation has brought together an enormous group of scientists with very diverse interests, for the purpose of propelling discovery and a better understanding on the evolution and organization of insect societies at the molecular level. ~2800 manually annotated genes from three species of ants granted further insight into the evolution of sociality in this group.
  12. Unlike earlier genome projects, which had the advantage of more highly polished genomes, recent projects usually have lower coverage. Therefore curators now face additional work correcting for more frequent assembly errors and annotating genes that are split across multiple contigs.
  13. Highlight that the distributed nature of these efforts still demands strong, goal-oriented (i.e. publication of findings) leadership and coordination, as these are crucial to the success of each project.
  14. This slide brings together a collection of collaborative efforts, close to the work of many of the members of ISB, and other communities.i5K is the initiative to sequence the genomes of 5,000 arthropods. It currently is collaboratively – and simultaneously! - annotating the genomes of 6 insects using Web Apollo.
  15. Thank you!