1. OPEN TREE OF LIFE
A community-assembled, continually updated evolutionary
history of all life
Karen A. Cranston
National Evolutionary Synthesis Center
Duke University
2. Tree of life
• 1.8million named
species
• Millions
more
unnamed / undiscovered
All biodiversity linked through common ancestry; tree of life is a grand challenge; scale is
huge
10. DATA AVAILABILITY
High archival rate of sequence data
~4% of all published
phylogenetic trees
Reason 2: can’t do synthesis without data; community norm to deposit sequences but not
trees
11. Most trees published
as (beautiful) figures
in PDF files
EVOLUTION
not reusable!
Weigmann et al. PNAS, 2011
Fig. 1. Combined molecular phylogenetic tree for Diptera. Partitioned ML analysis of combined taxon sets of tier 1 and tier 2 FLYTREE data samples (−lnL =
344155.6169) calculated in RAxML. Circles indicate bootstrap support >80% (black/bp = 95–100%, gray/bp = 88–94%, white/bp = 80–88%). Nodes with im-
proved bootstrap values resulting from postanalysis pruning of unstable taxa are marked by stars (black/bp = 95–100%, gray/bp = 88–94%, white/bp = 80–
88%). Colored squares on terminal branches indicate the presence, in at least one species of a family, of ecological traits as shown to lower left. The number
Most phylogenetic knowledge locked in PDFs; even best case (here): trees and alignments in
of origins of each trait was estimated with reference to the phylogeny, the distribution of each trait among genera within a family, and the known biology of
the organisms.
treebase, sequences in GenBank but so much more data in this figure!
12. 1. Build and make publicly available the first complete draft
tree of life
2. Engage the community in refinement and annotation of
the draft tree
3. Promote a culture of data sharing among systematists
through software products that target key bottlenecks in
phylogenetic workflows
4. Develop, disseminate, and evaluate novel methods for
phylogenetic synthesis
13. + NSF Assembling the Tree of Life projects
+ recent high-profile phylogenies
+ ribosomal RNA trees for Bacteria and Archaea
+ unnamed environmental sequences
+ taxonomies of living and extinct species
Draft Tree of Life as seed for
collaboration
Get all data available! Part of draft tree will be good; parts not so good
14. Tools to browse,
update, monitor the
draft tree of life
Tools that incentivize
collaboration
Tree image modified from Tree of Life Web Project page http://tolweb.org/Nymphalidae/12172 Pictures by Katja Schulz (queen butterfly; CCAttribution-NonCommercial) and Charles Lam (via Flicker;CCAttribution-ShareAlike)
publish refinement tools along with draft tree; invite comment / critique / updating
15. SMART GENERATION OF FIGURES FOR
PUBLICATION
• Semantic annotation layers
• Collaborative editing
EVOLUTION
• Integrated submission of
data and annotations to
archives
ig. 1. Combined molecular phylogenetic tree for Diptera. Partitioned ML analysis of combined taxon sets of tier 1 and tier 2 FLYTREE data samples (−lnL =
44155.6169) calculated in RAxML. Circles indicate bootstrap support >80% (black/bp = 95–100%, gray/bp = 88–94%, white/bp = 80–88%). Nodes with im-
Motivated to organize data at this point in pipeline; want to capture digitally at this point
roved bootstrap values resulting from postanalysis pruning of unstable taxa are marked by stars (black/bp = 95–100%, gray/bp = 88–94%, white/bp = 80–
8%). Colored squares on terminal branches indicate the presence, in at least one species of a family, of ecological traits as shown to lower left. The number
f origins of each trait was estimated with reference to the phylogeny, the distribution of each trait among genera within a family, and the known biology of
he organisms.
16. AUTOMATIC UPDATING
update trees
with new
sequence data
detect and synthesize newly
published trees
Can’t rely on manual refinement alone; need automate based on new data (and need new
analytical method development)
17. community
refinement
ng the major evolutionary relationships recovered in this analysis of Carnivora. Illustrations of
om top): Nandinia binotata; Felidae (Lynx rufus); Viverridae (Viverra zibetha); Hyaenidae (Crocuta
y carnivorans (Eupleres goudotii); Canidae (Canis lupus); Ursidae (Ursus americanus); Phocidae
Odobenidae (Odobenus rosmarus); Ailurus fulgens; Mephitidae (Mephitis mephitis); Procyonidae
education and
outreach
(generalized schematic representing diverse taxa [African polecat and striped marten, badger,
telidae, Mustela (Mustela frenata); Mustelidae, Lutrinae (Lontra canadensis).
the major evolutionary relationships recoveredin this analysis of Carnivora. Illustrations of
nting the major evolutionary relationships recovered in this analysis of Carnivora. Illustrations of
top): Nandinia binotata; Felidae (Lynx rufus); Viverridae (Viverra zibetha); Hyaenidae (Crocuta
(from top): Nandinia binotata; Felidae (Lynx rufus); Viverridae (Viverra zibetha); Hyaenidae (Crocuta
gasy carnivorans (Eupleres goudotii); Canidae (Canis lupus); Ursidae (Ursus americanus); Phocidae
rnivorans (Eupleres goudotii); Canidae (Canis lupus); Ursidae (Ursus americanus); Phocidae
us); Odobenidae (Odobenus rosmarus); Ailurus fulgens; Mephitidae (Mephitis mephitis); Procyonidae
dobenidae (Odobenus rosmarus); Ailurus fulgens; Mephitidae (Mephitis mephitis); badger,
ids (generalized schematic representing diverse taxa [African polecat and striped marten, Procyonidae
Dynamic,
Mustelidae, schematic representing diverse Lutrinae (Lontra canadensis).
neralized Mustela (Mustela frenata); Mustelidae,taxa [African polecat and striped marten, badger,
dae, Mustela (Mustela frenata); Mustelidae, Lutrinae (Lontra canadensis).
continually updated
cladogram representing the major evolutionary relationships recovered in this analysis of Carnivora. Illustrations of
or lineages include (from top): Nandinia binotata; Felidae (Lynx rufus); Viverridae (Viverra zibetha); Hyaenidae (Crocuta
gos mungo); Malagasy carnivorans (Eupleres goudotii); Canidae (Canis lupus); Ursidae (Ursus americanus); Phocidae
tree of all life
Zalophus californianus); Odobenidae (Odobenus rosmarus); Ailurus fulgens; Mephitidae (Mephitis mephitis); Procyonidae
asal/other mustelids (generalized schematic representing diverse taxa [African polecat and striped marten, badger,
?
roup (Gulo gulo); Mustelidae, Mustela (Mustela frenata); Mustelidae, Lutrinae (Lontra canadensis).
?
?
automatic ?
updating
downstream
analyses
The grand vision
18. opentreeoflife.com
Karen Cranston, lead PI (Duke) Laura Katz (Smith)
Gordon Burleigh (Florida) Rick Ree (FMNH)
Keith Crandall (BYU) Stephen Smith (Michigan)
Karl Gude (MSU) Doug Soltis (Florida)
David Hibbett (Clark) Tiffani Williams (TAMU)
Mark Holder (Kansas)
AVAToL: Assembling, Visualizing and Analysis of
the Tree of Life, $5.76 million
The team and funding; empirical + computational biologists + comp sci + graphics /
communication