SlideShare a Scribd company logo
1 of 1
Download to read offline
Project Description Method Conclusion
Background
Results
[1] Felsenstein, Joseph (2004). Inferring Phylogenies.
[2] Brazeau, Martin D. Problematic character coding methods in morphology and
their effect, Biological Journal of the Linnean Society, 2011, 103, p. 489-498.
[3] Swofford, David L. and Maddison, Wayne P. Reconstructing Ancestral Character
States Under Wagner Parsimony, Mathematical Biosciences 87, 199-229 (1987).
[4] De Laet, Jan E (2005). Parsimony and the problem of inapplicable in sequence
data, in: Albert, V.A. (ed.) Parsimony, phylogeny and genomics.
[5] Budd, Graham E. A paleontological solution to the arthropod head problem,
Nature, vol. 417, 16 May 2002, p. 271-275.
The literature contains many algorithms for inferring
phylogenies [1], often based on the principle of parsimony:
the most likely scenario is the one with the fewest number
of evolutionary changes. An example is the Fitch algorithm,
suited for non-additive data. However, in a morphological
study of arthropod appendages we deal with additive data in
the form of a character describing the number of
appendages. We would also like to include other data such
as appendage colour or shape, but they are dependent on
the presence of a certain appendage. This contingency leads
to complications involving inapplicable data [2].
We split the problem into two parts: (1) reconstruct the tree
with the number of appendages as single character, and (2)
apply an algorithm adapted from the Fitch algorithm to the
full data set, whereby only evolutionary changes in present
states are taken into account. The first part relies on Wagner
parsimony, and algorithms to find optimal trees have been
proposed decades ago [3]. However, these existing
algorithms are either too general or fail to maximize the
number of homologies, as desirable by the Hennig-Farris
principle [4]. Hence we have tried to find a novel algorithm
suitable for dealing with the particular problem studied.
Historical biology is primarily concerned with the inference of
evolutionary history and relationships. Whereas molecular data
can be readily interrogated under a range of models, the
reconstruction of phylogenetic trees from morphological data –
the mainstay of palaeontologists and conventional taxonomists
– has outstanding problems. This project’s objective was to
adapt existing models of morphological evolution to incorporate
peculiarities of morphological data sets.
Contact information: (1) yiteng@live.nl, (2) ms609@cam.ac.uk
Funded by the Post Master
Consultancy Scheme
A typical problem that involves morphological data is the
arthropod head problem. Arthropods form a phylum which
includes the hexapods, crustaceans, myriapods, chelicerates
and trilobites). Their heads have complex structures which
include one or more pairs of antennae. The evolutionary
relationships and between these appendages – whether
some antennae have the same origin as others – is an
ongoing topic of debate among zoologists.
Schematic
representation of head
structures of five groups
of arthropods A new algorithm has been proposed for inferring the tree for
the first part of our approach. It consists of a downpass from
the tips to the root and an uppass from the root upwards
assigning definite values to all internal nodes and tips (see
flowchart). The algorithm was devised by examining trees
locally by looking at an internal node and its immediate
ancestor and descendants. By considering the various
possibilities for their datasets exhaustively and optimizing
the internal node in each of the cases, we obtained the
algorithm displayed to the right.
This algorithm has been implemented in R and C code and
has been tested for a range of sample trees. The results
found all coincide with the best tree obtained by parsimony
and the Hennig-Farris principle (example: top-right figure).
Above: algorithm applied to arbitary tree
Left: flowchart of the algorithm.
Below: example of phylogenetic tree (source: [5])
The algorithm is a proposed solution for the first part of the
problem. Each part of the tree is optimized locally during the
uppass, but it remains to prove that the resulting full tree is
optimal. Furthermore, ongoing work has resulted in an
algorithm for the second part of the problem for single-
character trees. An extension to multi-character trees seems
possible, but has not yet been realized.
References

More Related Content

What's hot

Bioinformatics.Assignment
Bioinformatics.AssignmentBioinformatics.Assignment
Bioinformatics.Assignment
Naima Tahsin
 
Algorithm research project neighbor joining
Algorithm research project neighbor joiningAlgorithm research project neighbor joining
Algorithm research project neighbor joining
Jay Mehta
 

What's hot (20)

Phylogenetics: Tree building
Phylogenetics: Tree buildingPhylogenetics: Tree building
Phylogenetics: Tree building
 
Multiple Sequence Alignment-just glims of viewes on bioinformatics.
 Multiple Sequence Alignment-just glims of viewes on bioinformatics. Multiple Sequence Alignment-just glims of viewes on bioinformatics.
Multiple Sequence Alignment-just glims of viewes on bioinformatics.
 
Molecular Evolution and Phylogenetics (2009)
Molecular Evolution and Phylogenetics (2009)Molecular Evolution and Phylogenetics (2009)
Molecular Evolution and Phylogenetics (2009)
 
Upgma
UpgmaUpgma
Upgma
 
Phylogenetics1
Phylogenetics1Phylogenetics1
Phylogenetics1
 
UPGMA
UPGMAUPGMA
UPGMA
 
Tools in phylogeny
Tools in phylogeny Tools in phylogeny
Tools in phylogeny
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Bioinformatics.Assignment
Bioinformatics.AssignmentBioinformatics.Assignment
Bioinformatics.Assignment
 
PHYLOGENETICS WITH MEGA
PHYLOGENETICS WITH MEGAPHYLOGENETICS WITH MEGA
PHYLOGENETICS WITH MEGA
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
PPT
PPTPPT
PPT
 
Algorithm research project neighbor joining
Algorithm research project neighbor joiningAlgorithm research project neighbor joining
Algorithm research project neighbor joining
 
Propagation of data fusion
Propagation of data fusionPropagation of data fusion
Propagation of data fusion
 
Postdoctoral associate ad university of minnesota fernandes birol
Postdoctoral associate ad university of minnesota fernandes birolPostdoctoral associate ad university of minnesota fernandes birol
Postdoctoral associate ad university of minnesota fernandes birol
 
A03730108
A03730108A03730108
A03730108
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02
 
Survey of softwares for phylogenetic analysis
Survey of softwares for phylogenetic analysisSurvey of softwares for phylogenetic analysis
Survey of softwares for phylogenetic analysis
 

Viewers also liked (8)

Maxy
MaxyMaxy
Maxy
 
Internship
InternshipInternship
Internship
 
Desiderata
DesiderataDesiderata
Desiderata
 
Historiaenunbache
HistoriaenunbacheHistoriaenunbache
Historiaenunbache
 
PrimaryWater_WebPlanning_09.12.wk1
PrimaryWater_WebPlanning_09.12.wk1PrimaryWater_WebPlanning_09.12.wk1
PrimaryWater_WebPlanning_09.12.wk1
 
Presentación de servicios GIRA
Presentación de servicios GIRAPresentación de servicios GIRA
Presentación de servicios GIRA
 
Is the sweetness in the sugar sector for real
Is the sweetness in the sugar sector for realIs the sweetness in the sugar sector for real
Is the sweetness in the sugar sector for real
 
Como realizar un curriculum
Como realizar un curriculumComo realizar un curriculum
Como realizar un curriculum
 

Similar to PMC Poster - phylogenetic algorithm for morphological data

Exploring the Solution Space of Sorting by Reversals: A New Approach
Exploring the Solution Space of Sorting by Reversals: A New ApproachExploring the Solution Space of Sorting by Reversals: A New Approach
Exploring the Solution Space of Sorting by Reversals: A New Approach
IDES Editor
 

Similar to PMC Poster - phylogenetic algorithm for morphological data (20)

Comparison
ComparisonComparison
Comparison
 
Bioinformatics presentation shabir .pptx
Bioinformatics presentation shabir .pptxBioinformatics presentation shabir .pptx
Bioinformatics presentation shabir .pptx
 
Arabidopsis thaliana Inspired Genetic Restoration Strategies
Arabidopsis thaliana Inspired Genetic Restoration StrategiesArabidopsis thaliana Inspired Genetic Restoration Strategies
Arabidopsis thaliana Inspired Genetic Restoration Strategies
 
Perl for Phyloinformatics
Perl for PhyloinformaticsPerl for Phyloinformatics
Perl for Phyloinformatics
 
Considerations on the genetic equilibrium law
Considerations on the genetic equilibrium lawConsiderations on the genetic equilibrium law
Considerations on the genetic equilibrium law
 
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATAIDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
 
Identifying the semantic relations on
Identifying the semantic relations onIdentifying the semantic relations on
Identifying the semantic relations on
 
An Essay Concerning Human Understanding Of Genetic Programming
An Essay Concerning Human Understanding Of Genetic ProgrammingAn Essay Concerning Human Understanding Of Genetic Programming
An Essay Concerning Human Understanding Of Genetic Programming
 
6238578.ppt
6238578.ppt6238578.ppt
6238578.ppt
 
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURESNAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
 
Exploring the Solution Space of Sorting by Reversals: A New Approach
Exploring the Solution Space of Sorting by Reversals: A New ApproachExploring the Solution Space of Sorting by Reversals: A New Approach
Exploring the Solution Space of Sorting by Reversals: A New Approach
 
Cornell Pbsb 20090126 Nets
Cornell Pbsb 20090126 NetsCornell Pbsb 20090126 Nets
Cornell Pbsb 20090126 Nets
 
Learning to Extract Relations for Protein Annotation
Learning to Extract Relations for Protein AnnotationLearning to Extract Relations for Protein Annotation
Learning to Extract Relations for Protein Annotation
 
Tree Physiology Optimization in Constrained Optimization Problem
Tree Physiology Optimization in Constrained Optimization ProblemTree Physiology Optimization in Constrained Optimization Problem
Tree Physiology Optimization in Constrained Optimization Problem
 
Introduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisIntroduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysis
 
Piepho et-al-2003
Piepho et-al-2003Piepho et-al-2003
Piepho et-al-2003
 
Parallel evolutionary approach paper
Parallel evolutionary approach paperParallel evolutionary approach paper
Parallel evolutionary approach paper
 
Phylogenetics
PhylogeneticsPhylogenetics
Phylogenetics
 
Two-Stage Eagle Strategy with Differential Evolution
Two-Stage Eagle Strategy with Differential EvolutionTwo-Stage Eagle Strategy with Differential Evolution
Two-Stage Eagle Strategy with Differential Evolution
 
phy prAC.pptx
phy prAC.pptxphy prAC.pptx
phy prAC.pptx
 

Recently uploaded

Electricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentsElectricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 students
levieagacer
 
Heat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree daysHeat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree days
Brahmesh Reddy B R
 
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
University of Hertfordshire
 

Recently uploaded (20)

A Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert EinsteinA Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert Einstein
 
Electricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentsElectricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 students
 
Heat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree daysHeat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree days
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) Metabolism
 
Fun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdfFun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdf
 
Heads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfHeads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdf
 
RACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptxRACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptx
 
NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.
 
TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdf
 
Adaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte CarloAdaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte Carlo
 
Efficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationEfficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence acceleration
 
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
Chemistry Data Delivery from the US-EPA Center for Computational Toxicology a...
 
Lubrication System in forced feed system
Lubrication System in forced feed systemLubrication System in forced feed system
Lubrication System in forced feed system
 
Costs to heap leach gold ore tailings in Karamoja region of Uganda
Costs to heap leach gold ore tailings in Karamoja region of UgandaCosts to heap leach gold ore tailings in Karamoja region of Uganda
Costs to heap leach gold ore tailings in Karamoja region of Uganda
 
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
 
GBSN - Microbiology (Unit 6) Human and Microbial interaction
GBSN - Microbiology (Unit 6) Human and Microbial interactionGBSN - Microbiology (Unit 6) Human and Microbial interaction
GBSN - Microbiology (Unit 6) Human and Microbial interaction
 
Biochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptx
Biochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptxBiochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptx
Biochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptx
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
 
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdfFORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
 
Manganese‐RichSandstonesasanIndicatorofAncientOxic LakeWaterConditionsinGale...
Manganese‐RichSandstonesasanIndicatorofAncientOxic  LakeWaterConditionsinGale...Manganese‐RichSandstonesasanIndicatorofAncientOxic  LakeWaterConditionsinGale...
Manganese‐RichSandstonesasanIndicatorofAncientOxic LakeWaterConditionsinGale...
 

PMC Poster - phylogenetic algorithm for morphological data

  • 1. Project Description Method Conclusion Background Results [1] Felsenstein, Joseph (2004). Inferring Phylogenies. [2] Brazeau, Martin D. Problematic character coding methods in morphology and their effect, Biological Journal of the Linnean Society, 2011, 103, p. 489-498. [3] Swofford, David L. and Maddison, Wayne P. Reconstructing Ancestral Character States Under Wagner Parsimony, Mathematical Biosciences 87, 199-229 (1987). [4] De Laet, Jan E (2005). Parsimony and the problem of inapplicable in sequence data, in: Albert, V.A. (ed.) Parsimony, phylogeny and genomics. [5] Budd, Graham E. A paleontological solution to the arthropod head problem, Nature, vol. 417, 16 May 2002, p. 271-275. The literature contains many algorithms for inferring phylogenies [1], often based on the principle of parsimony: the most likely scenario is the one with the fewest number of evolutionary changes. An example is the Fitch algorithm, suited for non-additive data. However, in a morphological study of arthropod appendages we deal with additive data in the form of a character describing the number of appendages. We would also like to include other data such as appendage colour or shape, but they are dependent on the presence of a certain appendage. This contingency leads to complications involving inapplicable data [2]. We split the problem into two parts: (1) reconstruct the tree with the number of appendages as single character, and (2) apply an algorithm adapted from the Fitch algorithm to the full data set, whereby only evolutionary changes in present states are taken into account. The first part relies on Wagner parsimony, and algorithms to find optimal trees have been proposed decades ago [3]. However, these existing algorithms are either too general or fail to maximize the number of homologies, as desirable by the Hennig-Farris principle [4]. Hence we have tried to find a novel algorithm suitable for dealing with the particular problem studied. Historical biology is primarily concerned with the inference of evolutionary history and relationships. Whereas molecular data can be readily interrogated under a range of models, the reconstruction of phylogenetic trees from morphological data – the mainstay of palaeontologists and conventional taxonomists – has outstanding problems. This project’s objective was to adapt existing models of morphological evolution to incorporate peculiarities of morphological data sets. Contact information: (1) yiteng@live.nl, (2) ms609@cam.ac.uk Funded by the Post Master Consultancy Scheme A typical problem that involves morphological data is the arthropod head problem. Arthropods form a phylum which includes the hexapods, crustaceans, myriapods, chelicerates and trilobites). Their heads have complex structures which include one or more pairs of antennae. The evolutionary relationships and between these appendages – whether some antennae have the same origin as others – is an ongoing topic of debate among zoologists. Schematic representation of head structures of five groups of arthropods A new algorithm has been proposed for inferring the tree for the first part of our approach. It consists of a downpass from the tips to the root and an uppass from the root upwards assigning definite values to all internal nodes and tips (see flowchart). The algorithm was devised by examining trees locally by looking at an internal node and its immediate ancestor and descendants. By considering the various possibilities for their datasets exhaustively and optimizing the internal node in each of the cases, we obtained the algorithm displayed to the right. This algorithm has been implemented in R and C code and has been tested for a range of sample trees. The results found all coincide with the best tree obtained by parsimony and the Hennig-Farris principle (example: top-right figure). Above: algorithm applied to arbitary tree Left: flowchart of the algorithm. Below: example of phylogenetic tree (source: [5]) The algorithm is a proposed solution for the first part of the problem. Each part of the tree is optimized locally during the uppass, but it remains to prove that the resulting full tree is optimal. Furthermore, ongoing work has resulted in an algorithm for the second part of the problem for single- character trees. An extension to multi-character trees seems possible, but has not yet been realized. References