SlideShare ist ein Scribd-Unternehmen logo
1 von 28
Downloaden Sie, um offline zu lesen
Aequatus: An open-source
homology browser
ANIL THANKI
Data Infrastructure and Algorithms
@anilthanki
www.earlham.ac.uk
Co-authors
Robert Davey
Earlham Institute
Nicola Soranzo
Earlham Institute
Wilfried Haerty
Earlham Institute
Javier Herrero
University College
London
Acknowledgements
Homology
www.earlham.ac.uk
Homology
• Homology is existence of shared ancestry between a pair of structures
in different species.
–i.e. genes
• The phylogenetic information inferred from the study of homologous
genes
–helps us to understand the evolution of gene families.
www.earlham.ac.uk
Homology
• Various tools available to visualise homology
–Ensembl
–Genomicus
–SynChro
• They provide an overview of phylogeny and/or syntenic regions
evolution at the family level
• They can not provide information about structural changes within a
gene
www.earlham.ac.uk
Homology - Tool example
Genomicus
www.earlham.ac.uk
Homology - Tool example
Map view Street view
Genomicus
Aequatus
http://aequatus.earlham.ac.uk/
www.earlham.ac.uk
Aequatus
• New open-source tool for visualisation of homologous genes
• Reads data directly from Ensembl Compara and Ensembl Core
Databases
• Three main views
1. Gene tree view
2. Sankey view
3. Tabular view
www.earlham.ac.uk
Aequatus - Gene tree view
• Phylogeny on left
• Detailed view of gene structure across gene families
• Shared exons use the same colour in each representation
• Also visualises Insertions and Deletions
www.earlham.ac.uk
Aequatus - Gene tree view
• Depicts the type of interrelation events that gave rise to the family:
–speciation, duplication, and gene splits
www.earlham.ac.uk
• 1-to-1 alignments between homologous genes are important for
pairwise comparison
• On the top (A): alignment on gene structure
• On the bottom (B): pairwise sequence alignments
Aequatus - Gene tree view
www.earlham.ac.uk
Aequatus - Gene tree view
• An interactive visualisation of the protein domains.
• Connects to SMART web server via REST API and queries for domains,
motifs, internal repeats, etc.
• Can be filtered and sorted based on E-value and source.
• Can be exported in CSV or Excel file format.
www.earlham.ac.uk
Aequatus - Gene tree view
• An interactive visualisation of the protein domains.
www.earlham.ac.uk
Aequatus - Sankey view
• Visualises homology as an interactive Sankey diagram
• Homologues of a selected gene are distinguished by homology type
–paralogs, 1-to-1 orthologs, 1-to-many orthologs
• Coloured by species
• Additional details for the homologous in the info panel on the right-
hand side.
www.earlham.ac.uk
Aequatus - Sankey view
• Visualises homology as an interactive Sankey diagram
www.earlham.ac.uk
Aequatus - Tabular view
• Visualises homology as an interactive table
• Contains statistical information for the homologous relationships.
• Allows the user to
–search for any homolog using a search box
–filter results for the type of homology or one or more species
• Export from the tabular view as Excel, CSV or PDF.
www.earlham.ac.uk
Aequatus - Tabular view
D = Filter based on Species
E = Filter based on Type of homology
A = Search Box
B = Detailed statistical information
C = Detailed pairwise alignment
Aequatus.js
www.earlham.ac.uk
Aequatus.js
• Aequatus.js is a JavaScript library based on the standalone Aequatus
software package
• It preserves interactive functionality of Aequatus
• Does not require Ensembl databases for data
• It has an ability to integration with countless web based applications
• Gene Tree
–JSON / Newick
• Gene structural info
–JSON
Input
Aequatus.js
Use Case
Galaxy and GeneSeqToFamily
www.earlham.ac.uk
Galaxy and GeneSeqToFamily
• Galaxy is an open source, web-based platform for data intensive
biomedical research.
• Aequatus.js plugin configured to be used into Galaxy
–available on GitHub and integrated into usegalaxy.eu
• Can visualises results of GeneSeqToFamily workflow
–a Galaxy workflow to find gene families based on the Ensembl
Compara GeneTrees pipeline
–https://doi.org/10.1093/gigascience/giy005
www.earlham.ac.uk
Galaxy and GeneSeqToFamily
Aequatus.js plugin in Galaxy
New stuff...
www.earlham.ac.uk
New stuff...
• The main extension to the Aequatus is incorporation of Ensembl REST
API.
• Aequatus can also retrieve latest data directly from Ensembl Compara
and Core databases held at the EMBL-EBI,
– without any need for local databases
– avoids the need for local storage space
– improves the portability of Aequatus
www.earlham.ac.uk
New stuff...
• The main extension to the Aequatus is incorporation of Ensembl REST
API.
www.earlham.ac.uk
New stuff...
• The main extension to the Aequatus is incorporation of Ensembl REST
API.
www.earlham.ac.uk
• Thanki AS, Soranzo N, Haerty W, Herrero J, Davey RP. Aequatus:
An open-source homology browser. GigaScience 2018
• Demo:
– Demo: http://aequatus.earlham.ac.uk/
• Source Code:
– GitHub: https://github.com/TGAC/Aequatus
• Aequatus.js plugin
– GitHub: https://github.com/TGAC/aequatus.js
• E-mail: Anil.Thanki@earlham.ac.uk
• Twitter: @anilthanki
Thank You..
Questions…?

Weitere ähnliche Inhalte

Ähnlich wie Anil Thanki at #ICG13: Aequatus: An open-source homology browser

Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
taxonbytes
 
Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014
Monica Munoz-Torres
 
Munoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMunoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ss
Monica Munoz-Torres
 

Ähnlich wie Anil Thanki at #ICG13: Aequatus: An open-source homology browser (20)

Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
Web Apollo Tutorial for Medfly Research Community
Web Apollo Tutorial for Medfly Research CommunityWeb Apollo Tutorial for Medfly Research Community
Web Apollo Tutorial for Medfly Research Community
 
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
 
Web Apollo Workshop UIUC
Web Apollo Workshop UIUCWeb Apollo Workshop UIUC
Web Apollo Workshop UIUC
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Introduction to Web Apollo for the i5K pilot species.
Introduction to Web Apollo for the i5K pilot species.Introduction to Web Apollo for the i5K pilot species.
Introduction to Web Apollo for the i5K pilot species.
 
Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014Web Apollo at Genome Informatics 2014
Web Apollo at Genome Informatics 2014
 
Ontology repositories and case study with OntoPortal
Ontology repositories and case study with OntoPortalOntology repositories and case study with OntoPortal
Ontology repositories and case study with OntoPortal
 
Presentation on entrez as used in bioinformatics
Presentation on entrez as used in bioinformaticsPresentation on entrez as used in bioinformatics
Presentation on entrez as used in bioinformatics
 
NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...
NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...
NCBO BioPortal SPARQL Endpoint - The Quad Economy of a Semantic Web Ontology ...
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
Facilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-juppFacilitating semantic alignment.-biohackathon-jupp
Facilitating semantic alignment.-biohackathon-jupp
 
Munoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMunoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ss
 
L clarke faang_dcc_isag_2017_compress
L clarke faang_dcc_isag_2017_compressL clarke faang_dcc_isag_2017_compress
L clarke faang_dcc_isag_2017_compress
 
The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...
 
Semantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBISemantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBI
 
Variation and Assembly Resources at EMBL-EBI
Variation and Assembly Resources at EMBL-EBIVariation and Assembly Resources at EMBL-EBI
Variation and Assembly Resources at EMBL-EBI
 
Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...
 
Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.Web Apollo Tutorial for the i5K copepod research community.
Web Apollo Tutorial for the i5K copepod research community.
 

Mehr von GigaScience, BGI Hong Kong

Mehr von GigaScience, BGI Hong Kong (20)

IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...
 
Scott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteScott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByte
 
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
 
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
 
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
 
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
 
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...
 
Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10
 
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixRicardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
 
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
 
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceVenice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
 
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
 
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...
 
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
 
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
 

Kürzlich hochgeladen

THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 

Kürzlich hochgeladen (20)

Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 

Anil Thanki at #ICG13: Aequatus: An open-source homology browser

  • 1. Aequatus: An open-source homology browser ANIL THANKI Data Infrastructure and Algorithms @anilthanki
  • 2. www.earlham.ac.uk Co-authors Robert Davey Earlham Institute Nicola Soranzo Earlham Institute Wilfried Haerty Earlham Institute Javier Herrero University College London Acknowledgements
  • 4. www.earlham.ac.uk Homology • Homology is existence of shared ancestry between a pair of structures in different species. –i.e. genes • The phylogenetic information inferred from the study of homologous genes –helps us to understand the evolution of gene families.
  • 5. www.earlham.ac.uk Homology • Various tools available to visualise homology –Ensembl –Genomicus –SynChro • They provide an overview of phylogeny and/or syntenic regions evolution at the family level • They can not provide information about structural changes within a gene
  • 7. www.earlham.ac.uk Homology - Tool example Map view Street view Genomicus
  • 9. www.earlham.ac.uk Aequatus • New open-source tool for visualisation of homologous genes • Reads data directly from Ensembl Compara and Ensembl Core Databases • Three main views 1. Gene tree view 2. Sankey view 3. Tabular view
  • 10. www.earlham.ac.uk Aequatus - Gene tree view • Phylogeny on left • Detailed view of gene structure across gene families • Shared exons use the same colour in each representation • Also visualises Insertions and Deletions
  • 11. www.earlham.ac.uk Aequatus - Gene tree view • Depicts the type of interrelation events that gave rise to the family: –speciation, duplication, and gene splits
  • 12. www.earlham.ac.uk • 1-to-1 alignments between homologous genes are important for pairwise comparison • On the top (A): alignment on gene structure • On the bottom (B): pairwise sequence alignments Aequatus - Gene tree view
  • 13. www.earlham.ac.uk Aequatus - Gene tree view • An interactive visualisation of the protein domains. • Connects to SMART web server via REST API and queries for domains, motifs, internal repeats, etc. • Can be filtered and sorted based on E-value and source. • Can be exported in CSV or Excel file format.
  • 14. www.earlham.ac.uk Aequatus - Gene tree view • An interactive visualisation of the protein domains.
  • 15. www.earlham.ac.uk Aequatus - Sankey view • Visualises homology as an interactive Sankey diagram • Homologues of a selected gene are distinguished by homology type –paralogs, 1-to-1 orthologs, 1-to-many orthologs • Coloured by species • Additional details for the homologous in the info panel on the right- hand side.
  • 16. www.earlham.ac.uk Aequatus - Sankey view • Visualises homology as an interactive Sankey diagram
  • 17. www.earlham.ac.uk Aequatus - Tabular view • Visualises homology as an interactive table • Contains statistical information for the homologous relationships. • Allows the user to –search for any homolog using a search box –filter results for the type of homology or one or more species • Export from the tabular view as Excel, CSV or PDF.
  • 18. www.earlham.ac.uk Aequatus - Tabular view D = Filter based on Species E = Filter based on Type of homology A = Search Box B = Detailed statistical information C = Detailed pairwise alignment
  • 20. www.earlham.ac.uk Aequatus.js • Aequatus.js is a JavaScript library based on the standalone Aequatus software package • It preserves interactive functionality of Aequatus • Does not require Ensembl databases for data • It has an ability to integration with countless web based applications • Gene Tree –JSON / Newick • Gene structural info –JSON Input
  • 22. www.earlham.ac.uk Galaxy and GeneSeqToFamily • Galaxy is an open source, web-based platform for data intensive biomedical research. • Aequatus.js plugin configured to be used into Galaxy –available on GitHub and integrated into usegalaxy.eu • Can visualises results of GeneSeqToFamily workflow –a Galaxy workflow to find gene families based on the Ensembl Compara GeneTrees pipeline –https://doi.org/10.1093/gigascience/giy005
  • 25. www.earlham.ac.uk New stuff... • The main extension to the Aequatus is incorporation of Ensembl REST API. • Aequatus can also retrieve latest data directly from Ensembl Compara and Core databases held at the EMBL-EBI, – without any need for local databases – avoids the need for local storage space – improves the portability of Aequatus
  • 26. www.earlham.ac.uk New stuff... • The main extension to the Aequatus is incorporation of Ensembl REST API.
  • 27. www.earlham.ac.uk New stuff... • The main extension to the Aequatus is incorporation of Ensembl REST API.
  • 28. www.earlham.ac.uk • Thanki AS, Soranzo N, Haerty W, Herrero J, Davey RP. Aequatus: An open-source homology browser. GigaScience 2018 • Demo: – Demo: http://aequatus.earlham.ac.uk/ • Source Code: – GitHub: https://github.com/TGAC/Aequatus • Aequatus.js plugin – GitHub: https://github.com/TGAC/aequatus.js • E-mail: Anil.Thanki@earlham.ac.uk • Twitter: @anilthanki Thank You.. Questions…?