SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Alignment-based methods ,[object Object],[object Object],[object Object],[object Object],[object Object]
 
Local alignment Finds domains and short regions of similarity between  a pair of sequences.  The two sequences under  comparison do not necessarily need to have high levels  of similarity over their entire length in order to receive  locally high similarity scores.  This feature of local  similarity searches give them the advantage of being  useful when looking for domains within proteins or  looking for regions of genomic DNA that contain  introns.  Local similarity searches do not have the  constraint that similarity between two sequences  needs to be observed over the entire length of each  gene.
Global alignment Finds the optimal alignment over the entire length of  the two sequences under comparison.  Algorithms of  this nature are not particularly suited to the  identification of genes that have evolved by  recombination or insertion of unrelated regions of  DNA.  In instances such as this, a global similarity  score will be greatly reduced.  In cases where genes  are being aligned whose sequences are of comparable  length and also whose entire gene is homologous  (descendant from a common ancestor), global  alignment might be considered appropriate.
Terminology ,[object Object],[object Object],[object Object],[object Object]
Needleman-Wunsch Exact global alignment method. Not particularly good in many cases (database searches, looking for small regions of similarity, alignment of sequences with vastly differing lengths), but the most rigorous and thorough method if the task is to align sequences that have not evolved by exon shuffling, domain insertion/deletion etc.  In other words, it is the best method if you have sequences that are of ‘similar’ length and have evolved from a common ancestor by point processes (point mutation, small indels).
Smith-Waterman Exact local alignment There is no requirement for the alignment to extend along the entirety of the sequences.  This is a very good algorithm for database searching, multiple alignment and pairwise alignment. It is exhaustive and can be very slow (compared to the heuristics described later).  The difference between this and the N-W algorithm is that alignments starting at all possible positions must be considered, not just the ones that start at the beginning and end at the end.
 
FastA algorithm ,[object Object],[object Object],[object Object],[object Object]
 
FastA algorithm ,[object Object],[object Object],[object Object]
z-opt E() < 20  0  0 : * 22  0  0 : * 24  0  0 : * 26  0  0 : * 28  0  3 : * 30  0  18 : * 32  11  70 :=  * 34  73  190 :====  * 36  430  389 :================ * == 38  969  644 :=========================== * =============== 40  1086  898 :======================================= * ======== 42  1332  1097 :=============================================== * ========== 44  1252  1211 :==================================================== * == 46  1022  1233 :=============================================  * 48  1041  1181 :==============================================  * 50  982  1077 :===========================================  * 52  846  947 :=====================================  * 54  716  809 :================================  * 56  650  676 :============================= * 58  547  555 :======================== * 60  409  449 :==================  * 62  369  360 :=============== * = 64  289  287 :============ * 66  232  226 :========= * = 68  176  178 :======= * 70  163  140 :====== * = 72  124  109 :==== * = 74  88  85 :=== * 76  73  66 :== * = 78  73  51 :== * = 80  44  40 := * 82  32  31 := * 84  23  24 := * 86  19  19 : * 88  15  14 : * 90  8  11 : * 92  11  9 : *   :======== * == 94  3  7 : *   :===  * 96  2  5 : *   :==  * 98  6  4 : *   :=== * == 100  2  3 : *   :== * 102  4  2 : *   := * == 104  3  2 : *   := * = 106  0  1 : *   : * 108  0  1 : *   : * 110  1  1 : *   : * 112  0  1 : *   : * 114  1  1 : *   : * 116  0  0 : *   * 118  0  0 : *   * >120  1  0 : *   * = Results of a FastA search
The best scores are:  initn init1 opt  z-sc E(13127) HP0793 polypeptide deformylase (def) {Escherichia  66  66  100 126.9  0.71 AF2215 methylmalonyl-CoA mutase, subunit alpha, N  45  45  94 113.9  1.2 AF1231 hypothetical protein  50  50  86 104.9  4.4 MJ1169 tungsten formylmethanofuran dehydrogenase,  45  45  85 102.7  4.8 AF0267 hypothetical protein  71  71  84 101.2  5.5 AF1486 hypothetical protein  83  83  84 102.4  6.1 AF0262 medium-chain acyl-CoA ligase (alkK-2) {Pse  50  50  82 99.2  7.8 AF0229 conserved hypothetical protein {Methanococ  58  58  83 103.0  8.2 D09_orf125.gseg, 378 bases, 5AC53121 checksum.  50  50  85 110.0  8.5 SL251_1.UVRC  1797 residues  40  40  81 97.5  8.9 slr2049 hypothetical protein  83  83  83 105.5  9.9 AF0868 alkyldihydroxyacetonephosphate synthase {C  45  45  80 97.7  12 AF1320 GMP synthase (guaA-2) {Methanococcus janna  35  35  82 104.5  12 SL159_1.PKSK  13344 residues  99  74  74 79.2  12 slr1771  40  40  79 95.6  13 sll1018 dihydroorotase (pyrC)  60  60  79 96.6  14 slr2102 cell division protein FtsY (ftsY)  77  77  78 94.7  15 AF0946 hypothetical protein  67  67  76 88.8  16 AF1325 multidrug resistance protein {Methanococcu  55  55  77 95.0  20 SL194_2.BFMBB  1272 residues  75  75  76 93.1  22
Original BLAST ,[object Object],[object Object],[object Object]
Original BLAST ,[object Object],[object Object]
Gapped BLAST ,[object Object],[object Object],[object Object]
Gapped BLAST ,[object Object],[object Object],[object Object]
Two-Hit Method ,[object Object],[object Object],[object Object]
How does this affect the process of searching a database? ,[object Object],[object Object]
 
 
PSI(  -BLAST ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
PSI-BLAST ,[object Object],[object Object],[object Object]
PSI(  -BLAST ,[object Object],[object Object],[object Object]
PSI(  -BLAST ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 
 
 
Significance of the similarity of two sequences ,[object Object],[object Object],[object Object]
Randomisation test ,[object Object],[object Object],[object Object],[object Object]

Weitere ähnliche Inhalte

Was ist angesagt? (20)

Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Protein structure
Protein structureProtein structure
Protein structure
 
Prosite
PrositeProsite
Prosite
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Swiss PROT
Swiss PROT Swiss PROT
Swiss PROT
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
BIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And ChallengesBIOINFORMATICS Applications And Challenges
BIOINFORMATICS Applications And Challenges
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
Structural databases
Structural databases Structural databases
Structural databases
 
Rasmol
RasmolRasmol
Rasmol
 
Cath
CathCath
Cath
 

Andere mochten auch (20)

Blast
BlastBlast
Blast
 
Blast
BlastBlast
Blast
 
PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Parwati sihag
Parwati sihagParwati sihag
Parwati sihag
 
Sequence alignment belgaum
Sequence alignment belgaumSequence alignment belgaum
Sequence alignment belgaum
 
Protein database ..... of NCBI
Protein database ..... of NCBI Protein database ..... of NCBI
Protein database ..... of NCBI
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Protein Structure Prediction
Protein Structure PredictionProtein Structure Prediction
Protein Structure Prediction
 
PSC9131UG_C4_FD
PSC9131UG_C4_FDPSC9131UG_C4_FD
PSC9131UG_C4_FD
 
Presentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali ShahPresentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali Shah
 
Cool FM Profile(5)
Cool FM Profile(5)Cool FM Profile(5)
Cool FM Profile(5)
 
Radio
RadioRadio
Radio
 
Similarity
SimilaritySimilarity
Similarity
 
Bbc case study working capital
Bbc case study working capitalBbc case study working capital
Bbc case study working capital
 
Absolute Radio Engagement Metrics December 2013
Absolute Radio Engagement Metrics December 2013Absolute Radio Engagement Metrics December 2013
Absolute Radio Engagement Metrics December 2013
 
Bbc radio 1
Bbc radio 1Bbc radio 1
Bbc radio 1
 

Ähnlich wie BLAST

Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformaticsAbhishek Vatsa
 
Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013Prof. Wim Van Criekinge
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence AlignmentRavi Gandham
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema
 
Laboratory 1 sequence_alignments
Laboratory 1 sequence_alignmentsLaboratory 1 sequence_alignments
Laboratory 1 sequence_alignmentsseham15
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastRai University
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastRai University
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadfalizain9604
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...journal ijrtem
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...IJRTEMJOURNAL
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRONPrabin Shakya
 

Ähnlich wie BLAST (20)

Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
Blast fasta 4
Blast fasta 4Blast fasta 4
Blast fasta 4
 
_BLAST.ppt
_BLAST.ppt_BLAST.ppt
_BLAST.ppt
 
Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013
 
Bioinformatica t4-alignments
Bioinformatica t4-alignmentsBioinformatica t4-alignments
Bioinformatica t4-alignments
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
 
Laboratory 1 sequence_alignments
Laboratory 1 sequence_alignmentsLaboratory 1 sequence_alignments
Laboratory 1 sequence_alignments
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blast
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blast
 
Alignments
AlignmentsAlignments
Alignments
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadf
 
1 md2016 homology
1 md2016 homology1 md2016 homology
1 md2016 homology
 
Bioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmmBioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmm
 
Bioinformatics t4-alignments v2014
Bioinformatics t4-alignments v2014Bioinformatics t4-alignments v2014
Bioinformatics t4-alignments v2014
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRON
 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 

Kürzlich hochgeladen (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

BLAST

  • 1.
  • 2.  
  • 3. Local alignment Finds domains and short regions of similarity between a pair of sequences. The two sequences under comparison do not necessarily need to have high levels of similarity over their entire length in order to receive locally high similarity scores. This feature of local similarity searches give them the advantage of being useful when looking for domains within proteins or looking for regions of genomic DNA that contain introns. Local similarity searches do not have the constraint that similarity between two sequences needs to be observed over the entire length of each gene.
  • 4. Global alignment Finds the optimal alignment over the entire length of the two sequences under comparison. Algorithms of this nature are not particularly suited to the identification of genes that have evolved by recombination or insertion of unrelated regions of DNA. In instances such as this, a global similarity score will be greatly reduced. In cases where genes are being aligned whose sequences are of comparable length and also whose entire gene is homologous (descendant from a common ancestor), global alignment might be considered appropriate.
  • 5.
  • 6. Needleman-Wunsch Exact global alignment method. Not particularly good in many cases (database searches, looking for small regions of similarity, alignment of sequences with vastly differing lengths), but the most rigorous and thorough method if the task is to align sequences that have not evolved by exon shuffling, domain insertion/deletion etc. In other words, it is the best method if you have sequences that are of ‘similar’ length and have evolved from a common ancestor by point processes (point mutation, small indels).
  • 7. Smith-Waterman Exact local alignment There is no requirement for the alignment to extend along the entirety of the sequences. This is a very good algorithm for database searching, multiple alignment and pairwise alignment. It is exhaustive and can be very slow (compared to the heuristics described later). The difference between this and the N-W algorithm is that alignments starting at all possible positions must be considered, not just the ones that start at the beginning and end at the end.
  • 8.  
  • 9.
  • 10.  
  • 11.
  • 12. z-opt E() < 20 0 0 : * 22 0 0 : * 24 0 0 : * 26 0 0 : * 28 0 3 : * 30 0 18 : * 32 11 70 := * 34 73 190 :==== * 36 430 389 :================ * == 38 969 644 :=========================== * =============== 40 1086 898 :======================================= * ======== 42 1332 1097 :=============================================== * ========== 44 1252 1211 :==================================================== * == 46 1022 1233 :============================================= * 48 1041 1181 :============================================== * 50 982 1077 :=========================================== * 52 846 947 :===================================== * 54 716 809 :================================ * 56 650 676 :============================= * 58 547 555 :======================== * 60 409 449 :================== * 62 369 360 :=============== * = 64 289 287 :============ * 66 232 226 :========= * = 68 176 178 :======= * 70 163 140 :====== * = 72 124 109 :==== * = 74 88 85 :=== * 76 73 66 :== * = 78 73 51 :== * = 80 44 40 := * 82 32 31 := * 84 23 24 := * 86 19 19 : * 88 15 14 : * 90 8 11 : * 92 11 9 : * :======== * == 94 3 7 : * :=== * 96 2 5 : * :== * 98 6 4 : * :=== * == 100 2 3 : * :== * 102 4 2 : * := * == 104 3 2 : * := * = 106 0 1 : * : * 108 0 1 : * : * 110 1 1 : * : * 112 0 1 : * : * 114 1 1 : * : * 116 0 0 : * * 118 0 0 : * * >120 1 0 : * * = Results of a FastA search
  • 13. The best scores are: initn init1 opt z-sc E(13127) HP0793 polypeptide deformylase (def) {Escherichia 66 66 100 126.9 0.71 AF2215 methylmalonyl-CoA mutase, subunit alpha, N 45 45 94 113.9 1.2 AF1231 hypothetical protein 50 50 86 104.9 4.4 MJ1169 tungsten formylmethanofuran dehydrogenase, 45 45 85 102.7 4.8 AF0267 hypothetical protein 71 71 84 101.2 5.5 AF1486 hypothetical protein 83 83 84 102.4 6.1 AF0262 medium-chain acyl-CoA ligase (alkK-2) {Pse 50 50 82 99.2 7.8 AF0229 conserved hypothetical protein {Methanococ 58 58 83 103.0 8.2 D09_orf125.gseg, 378 bases, 5AC53121 checksum. 50 50 85 110.0 8.5 SL251_1.UVRC 1797 residues 40 40 81 97.5 8.9 slr2049 hypothetical protein 83 83 83 105.5 9.9 AF0868 alkyldihydroxyacetonephosphate synthase {C 45 45 80 97.7 12 AF1320 GMP synthase (guaA-2) {Methanococcus janna 35 35 82 104.5 12 SL159_1.PKSK 13344 residues 99 74 74 79.2 12 slr1771 40 40 79 95.6 13 sll1018 dihydroorotase (pyrC) 60 60 79 96.6 14 slr2102 cell division protein FtsY (ftsY) 77 77 78 94.7 15 AF0946 hypothetical protein 67 67 76 88.8 16 AF1325 multidrug resistance protein {Methanococcu 55 55 77 95.0 20 SL194_2.BFMBB 1272 residues 75 75 76 93.1 22
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.  
  • 21.  
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.  
  • 27.  
  • 28.  
  • 29.
  • 30.