SlideShare ist ein Scribd-Unternehmen logo
1 von 55
Downloaden Sie, um offline zu lesen
A CGM-Based Parallel Algorithm Using the
Four-Russians Speedup for the 1-D Sequence
Alignment Problem
By
Jerry Lacmou Zeutouo, Grace Colette Tessa Masse, Franklin Ingrid
Kamga Youmbi
at
African Conference on Reseach in Computer Science and Applied
Mathematics
CARI’2020 Polytech School of Thiès, Senegal October.
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Opening thoughts
The processing of sequences such as DNA sequencing is a variety
of problems that generate massive calculations and whose results
are difficult to achieve on simple PCs. One way to improve the
speed of solution is parallelism.
Lacmou et al. 2 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Outline
1 Introduction
2 Sequential algorithm
3 CGM algorithm
4 Experimental results
5 Conclusion
Lacmou et al. 3 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
1 Introduction
2 Sequential algorithm
3 CGM algorithm
4 Experimental results
5 Conclusion
Lacmou et al. 4 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
What bioinformatics is
Bioinformatics is the analysis of biological information.
Lacmou et al. 5 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Biological information
Definition
Biological information can be:
a sequence (DNA, RNA, proteins);
a structure (primaire, secondary);
a function (molecular function e.g. enzyme; cell component
e.g. membrane protein; biological process e.g. oxygen trans-
port);
interactions (proteins, metabolic pathways, gene networks).
Lacmou et al. 6 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
The privileged axes of bioinformatics
1 Formalization of genetic information analysis of sequences
(biomolecules) and their structure (especially 3D structure);
2 Biological interpretation of genetic information data integra-
tion (establishment of maps and networks of gene interac-
tions, protein interactions, etc.);
3 Functional prediction.
Lacmou et al. 7 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Bioinformatics methods
Comparative method: comparison of unknown sequences or
structures with databases (sequences and structures) of genes
and proteins known to establish similarities (similarities, ho-
mologies or identities);
Statistical method: software programs apply statistical anal-
yses to the data (on sequence syntax) to try to identify and
identify rules and constraints that are systematic, regular or
general in nature;
Modelling probabilistic approach.
Lacmou et al. 8 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Sequence alignment
One of the fundamental problems of bioinformatics
is the sequence alignment, crucial for molecular
prediction, molecular interactions and phylogenetic
analysis.
Lacmou et al. 9 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Sequence alignment definition
Definition
Sequence alignment is the problem of comparing biological
sequences by looking for a series of nucleotides or amino acids
that appear in the same order in the input sequences, possibly
introducing gaps. It is a means of visualizing the similarity
between sequences based on the notions of similarity or distance.
Figure: Sequences alignementLacmou et al. 10 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Types of alignments
Two types of alignments are considered:
1 global alignments, which take into account the whole of two
sequences to be compared;
2 local alignments, which make it possible to detect the seg-
ment of the first sequence which is most similar to a segment
of the other.
Lacmou et al. 11 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Dynamic programming complexity
Dynamic programming complexity
Due to dynamic programming, both types of alignments can be
run in O(n2) time and space where n is the length of two
strings. Based on the Four-Russians method, several sped-up
sequential solutions has been proposed like the one of Brubach
and Ghurye’s algorithm running in O( n2
log n).
Lacmou et al. 12 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Why parallelize?
Despite the acceleration of the sequential algorithm, the
resolution of the problem remains time and space consuming.
Lacmou et al. 13 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
The parallelization of the sequential algorithm
A PRAM algorithm runs in O(log n log m) time with nm
log nm
CREW processors have been proposed by Apostolico et al.,
1990;
Alves et al., 2002 proposed a parallel solution for a variant
of the problem under the CGM model using weighted graphs
that require log p rounds and runs in O(n2
p log m);
Recently, Kim et al., 2016 have proposed a space-efficient al-
phabetindependent Four-Russians’ lookup table and a mul-
tithreaded Four-Russians’ edit distance algorithm.
Lacmou et al. 14 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Our aim
We tackle the problem of parallelizing the Brubach and Ghurye’s
sequential algorithm for the one-dimensional sequence alignment
problem on the CGM model (Coarse-Grained Multicomputer).
The choice of the model CGM is due to the fact that it is
more suitable to the current machines and it has already
been used to solve efficiently several problems such as the
optimal binary search tree problem
Lacmou et al. 15 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
What is a CGM algorithm?
Its design is based on:
Partitioning of the problem into subproblems;
Distribution of subproblems between the different processors;
CGM algorithm itself: it is a set of rounds of calculations
and communicatons.
Lacmou et al. 16 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Parallelisation criterion on CGM algorithm
Minimize the idleness time and the latency time of proces-
sors;
Minimize the overall communication time;
Minimize the number of communications rounds;
Minimize the local computation time of each processor and;
Balance the load between the processors.
Lacmou et al. 17 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Context
Problematic
Our main focus
Our results
Our solution requires O( n2
p log n ) execution time with O(p)
communication rounds on p processors.
Lacmou et al. 18 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
1 Introduction
2 Sequential algorithm
3 CGM algorithm
4 Experimental results
5 Conclusion
Lacmou et al. 19 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Formal definition
Definition
The sequence alignment method seeks to optimize the alignment
score. This score is related to the similarity rate between the
two compared sequences. It measures the number of edit
operations it takes to convert a sequence X into another Y .
Lacmou et al. 20 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Elementary editing operation
They include:
1 insertions;
2 deletions;
3 substitutions of a single character.
The weighted variant assigns a cost to each of the mentioned
operations and through the use of a penalty matrix in case, costs
appear not to be constant.
Lacmou et al. 21 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Global alignment overview
A global alignment of two X and Y sequences can be given
by computing the distance between X and Y ;
The elementary editing operations are described as follows:
substitution of an a symbol by an b symbol;
deletion of an a symbol;
insertion of an b symbol.
There is also the ability to calculate global alignments using
similarity scores instead of distance.
Lacmou et al. 22 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Global alignment score computation
T[i, j] = max



T[i − 1, j − 1] + Sub(X[i], Y [j]),
T[i − 1, j] + Del(X[i]),
T[i, j − 1] + Ins(Y [j]).
(1)
Lacmou et al. 23 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Local alignment overview
A local alignment of two sequences X and Y consists in find-
ing the X segment that is most similar to a Y segment ;
The local edit score of two sequences X and Y is defined by
s(X, Y ), the maximum similarity between an X segment and
a Y segment.
Lacmou et al. 24 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Local alignment score computation
Ts[i, j] = max



Ts[i − 1, j − 1] + Sub(X[i], Y [j]),
Ts[i − 1, j] + Del(X[i]),
Ts[i, j − 1] + Ins(Y [j]),
0.
(2)
Lacmou et al. 25 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Task graph
Lacmou et al. 26 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Speedup techniques
Dynamic programming algorithms can sometimes be made even
faster by applying speedups such as the Knuth-Yao
quadrangle-inequality speedup or the Four-Russians speedup.
Lacmou et al. 27 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Idea behind the Four-Russians’ speedup
The idea behind the speedup is to tile the dynamic programming
table into smaller blocks whose solutions are foreseen,
precomputed and stored in a lookup table. The goal sought is
spending less time on those by merely searching for them.
Lacmou et al. 28 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
A single t-block
Lacmou et al. 29 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
This quadratic time bound has been first proposed using
the Four-Russians technique by Masek and Paterson in 1980
achieving O(n2/ log n);
Crochemore et al., 2003 later achieved the same time bound
for unrestricted scoring matrices.
Lacmou et al. 30 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
Brubach and Ghurye’s efficient lookup table
The bottleneck in the aforementioned speedup lies in the space
demanded by the table, and the time spent computing it.
Lacmou et al. 31 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
The Four-Russians lookup table can be built in O(t2 lg t), queried
in O(t) and stored in O(t2) space.
Lacmou et al. 32 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Global alignment
Local alignment
The Four-Russians’ speedup
Brubach and Ghurye’s efficient lookup table
The t-blocks within the dynamic programming table are tiled
in a way they overlap by one column/row on each side;
Given a t-block first row and column, the block function of
the lookup process has to provide its last row and column;
Running the block function time and again in a row-wise (or
column-wise) manner will eventually yield the global edit
distance score in the bottom-right cell.
Lacmou et al. 33 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
1 Introduction
2 Sequential algorithm
3 CGM algorithm
4 Experimental results
5 Conclusion
Lacmou et al. 34 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Our contribution
Our solution is divided into two parts:
1 Firstly, each processor computes the lookup table through
the Brubach and Ghurye’s sequential algorithm in O(t2 lg t);
2 Secondly, we partition the task graph into subgraphs of the
same size, and we distribute them fairly onto the processors.
Lacmou et al. 35 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Partitioning strategy
Our technique consists in partitioning the task graph in two
steps:
Lacmou et al. 36 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Partitioning steps
1 Firstly, we partition the task graph into small blocks of size
t referred to as t-blocks such that any two adjacent t-blocks
overlap by either a row or column. After this first partition-
ing, the task graph is divided into k lines and k columns
where k = n
t−1;
2 Next, we subdivide the n2
t2 t-blocks into p lines and p columns
of blocks referred to as macro-blocks and denoted by MB(i, j).
MB(i, j) is a matrix of size k
p × k
p and is identified by the
node on the lower right corner.
Lacmou et al. 37 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Scenario of this partitioning
1 2 3 4
2 3 4 5
3 4 5 6
4 5 6 7
Lacmou et al. 38 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Snake-like mapping
This distribution consists in assigning all macro-blocks of a
diagonal from the top left corner to the bottom right corner;
This process is renewed until all processors have been used,
starting with processor 0 and traveling through the blocks
with a "snake-like" path.
Lacmou et al. 39 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Illustration
Lacmou et al. 40 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Lemma
After the partitioning of the task graph and snake-like
distribution scheme, each processor evaluates exactly p
macro-blocks.
Proof.
There are p2 macro-blocks after the partitioning of the task
graph. Therefore, it is clear that each processor evaluates
exactly p blocks.
Lacmou et al. 41 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Our CGM algorithm
We present our CGM solution for the sequence alignment
problem. Brubach and Ghurye’s sequential algorithm is used for
local computations. Based on the previous partitioning strategy
and the mapping of blocks onto processors, the corresponding
CGM algorithm is presented in algorithm.
Lacmou et al. 42 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Our CGM algorithm
Lacmou et al. 43 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Local computation time
Lemma
The evaluation of a macro-block of size n
p(t−1) × n
p(t−1) requires
O n2
p2t
computation time.
Lemma
Our CGM algorithm requires O n2
pt time steps per processor.
Lacmou et al. 44 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Partitioning strategy
Mapping macro-blocks onto processors
Overview of the CGM algorithm
Algorithm complexity
Theorem
By subdividing the task graph into macro-blocks of size
n
p(t−1) × n
p(t−1) and using the snake-like distribution scheme, our
CGM algorithm requires O n2
p log n execution time with O(p)
communication rounds when t = log n.
Lacmou et al. 45 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
1 Introduction
2 Sequential algorithm
3 CGM algorithm
4 Experimental results
5 Conclusion
Lacmou et al. 46 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Environment
We measure the performance of our solution on the Matrics plat-
form backed by the University of Picardie Jules Verne.
Intel Xeon (R) CPU E5-2680 v4 @ 2.40GHz;
28 cores;
128GB of RAM;
The inter-processor communication is implemented with the
MPI library (OpenMPI version 1.10.4).
Lacmou et al. 47 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Samples
We use a real biological DNA sequences with |Σ| = 4. The
results presented here are derived from its execution for different
values of the triplet (m, n, p), where m and n are the size of
sequences, with values ranging from 105 to 106, and p is the
number of processors, with values in the set {1, 2, 4, 8, 16, 32, 48}.
Lacmou et al. 48 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Execution time for m
Lacmou et al. 49 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Execution time for p
Lacmou et al. 50 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Result comments
For m = 106:
Our algorithm runs about 11.43 times faster than
the Brubach and Ghurye’s sequential algorithm on
16 processors;
The speed-up increases up to 20.40 on 32 processors;
And 30.05 on 48 processors.
From all this, we can conclude that our algorithm is scalable to
the increase of the size of sequences and the numbers of
processors.
Lacmou et al. 51 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
1 Introduction
2 Sequential algorithm
3 CGM algorithm
4 Experimental results
5 Conclusion
Lacmou et al. 52 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Wrapping things up
We attempted in this paper to parallelize their results in ap-
plication to the one-dimensional sequence alignment prob-
lem;
Based on the CGM model, our solution clusters t-blocks into
macro-blocks and follows the snake-like distribution mapping
pattern onto p processors to achieve for O(p) rounds of com-
munication, an execution time in O(n2/p log n);
Experimental results show good agreement with theoretical
forecasts.
Lacmou et al. 53 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Future directions
Noticing the fact that single processors uselessly compute large
sets of entries in their respective lookup tables, one might be
interested in cutting off the waste. Or even extend our solution
to address the two-dimensional sequence alignment problem.
Lacmou et al. 54 / 55
Introduction
Sequential algorithm
CGM algorithm
Experimental results
Conclusion
Thanks for your keen attention :-)
Lacmou et al. 55 / 55

Weitere ähnliche Inhalte

Was ist angesagt?

Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...
Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...
Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...George Markomanolis
 
Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...
Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...
Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...IRJET Journal
 
Parallel Biological Sequence Comparison in GPU Platforms
Parallel Biological Sequence Comparison in GPU PlatformsParallel Biological Sequence Comparison in GPU Platforms
Parallel Biological Sequence Comparison in GPU PlatformsGanesan Narayanasamy
 
OPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMES
OPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMESOPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMES
OPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMESEM Legacy
 
Evaluating Mapping Repair Systems with Large Biomedical Ontologies
Evaluating Mapping Repair Systems with Large Biomedical OntologiesEvaluating Mapping Repair Systems with Large Biomedical Ontologies
Evaluating Mapping Repair Systems with Large Biomedical OntologiesErnesto Jimenez Ruiz
 
FEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKS
FEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKSFEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKS
FEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKSieijjournal
 
New controllers efficient model based design method
New controllers efficient model based design methodNew controllers efficient model based design method
New controllers efficient model based design methodAlexander Decker
 
Using the black-box approach with machine learning methods in ...
Using the black-box approach with machine learning methods in ...Using the black-box approach with machine learning methods in ...
Using the black-box approach with machine learning methods in ...butest
 
Protein structure alignment beyond spatial proximity 3 dsig_2012
Protein structure alignment beyond spatial proximity 3 dsig_2012Protein structure alignment beyond spatial proximity 3 dsig_2012
Protein structure alignment beyond spatial proximity 3 dsig_2012Sheng Wang
 
LogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology MatchingLogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology MatchingErnesto Jimenez Ruiz
 
07. 9954 14018-1-ed edit dhyan
07. 9954 14018-1-ed edit dhyan07. 9954 14018-1-ed edit dhyan
07. 9954 14018-1-ed edit dhyanIAESIJEECS
 
Swarm Intelligence Heuristics for Graph Coloring Problem
Swarm Intelligence Heuristics for Graph Coloring ProblemSwarm Intelligence Heuristics for Graph Coloring Problem
Swarm Intelligence Heuristics for Graph Coloring ProblemMario Pavone
 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAijun Zhang
 
Paper on experimental setup for verifying - "Slow Learners are Fast"
Paper  on experimental setup for verifying  - "Slow Learners are Fast"Paper  on experimental setup for verifying  - "Slow Learners are Fast"
Paper on experimental setup for verifying - "Slow Learners are Fast"Robin Srivastava
 
Time series analysis use E-views programer
Time series analysis use E-views programerTime series analysis use E-views programer
Time series analysis use E-views programerAl-Qadisiya University
 

Was ist angesagt? (20)

part A
part Apart A
part A
 
Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...
Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...
Optimizing an Earth Science Atmospheric Application with the OmpSs Programmin...
 
Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...
Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...
Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Bala...
 
S4101116121
S4101116121S4101116121
S4101116121
 
Parallel Biological Sequence Comparison in GPU Platforms
Parallel Biological Sequence Comparison in GPU PlatformsParallel Biological Sequence Comparison in GPU Platforms
Parallel Biological Sequence Comparison in GPU Platforms
 
OPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMES
OPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMESOPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMES
OPTIMIZATION OF IP NETWORKS IN VARIOUS HYBRID IGP/MPLS ROUTING SCHEMES
 
Fulltext
FulltextFulltext
Fulltext
 
Evaluating Mapping Repair Systems with Large Biomedical Ontologies
Evaluating Mapping Repair Systems with Large Biomedical OntologiesEvaluating Mapping Repair Systems with Large Biomedical Ontologies
Evaluating Mapping Repair Systems with Large Biomedical Ontologies
 
FEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKS
FEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKSFEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKS
FEEDBACK LINEARIZATION AND BACKSTEPPING CONTROLLERS FOR COUPLED TANKS
 
New controllers efficient model based design method
New controllers efficient model based design methodNew controllers efficient model based design method
New controllers efficient model based design method
 
Using the black-box approach with machine learning methods in ...
Using the black-box approach with machine learning methods in ...Using the black-box approach with machine learning methods in ...
Using the black-box approach with machine learning methods in ...
 
Protein structure alignment beyond spatial proximity 3 dsig_2012
Protein structure alignment beyond spatial proximity 3 dsig_2012Protein structure alignment beyond spatial proximity 3 dsig_2012
Protein structure alignment beyond spatial proximity 3 dsig_2012
 
LogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology MatchingLogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology Matching
 
07. 9954 14018-1-ed edit dhyan
07. 9954 14018-1-ed edit dhyan07. 9954 14018-1-ed edit dhyan
07. 9954 14018-1-ed edit dhyan
 
Swarm Intelligence Heuristics for Graph Coloring Problem
Swarm Intelligence Heuristics for Graph Coloring ProblemSwarm Intelligence Heuristics for Graph Coloring Problem
Swarm Intelligence Heuristics for Graph Coloring Problem
 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform Designs
 
Paper on experimental setup for verifying - "Slow Learners are Fast"
Paper  on experimental setup for verifying  - "Slow Learners are Fast"Paper  on experimental setup for verifying  - "Slow Learners are Fast"
Paper on experimental setup for verifying - "Slow Learners are Fast"
 
Time series analysis use E-views programer
Time series analysis use E-views programerTime series analysis use E-views programer
Time series analysis use E-views programer
 
paper
paperpaper
paper
 
NP-Completeness - II
NP-Completeness - IINP-Completeness - II
NP-Completeness - II
 

Ähnlich wie CGM-Based Parallel 1-D Sequence Alignment

The New Hybrid COAW Method for Solving Multi-Objective Problems
The New Hybrid COAW Method for Solving Multi-Objective ProblemsThe New Hybrid COAW Method for Solving Multi-Objective Problems
The New Hybrid COAW Method for Solving Multi-Objective Problemsijfcstjournal
 
THE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMS
THE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMSTHE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMS
THE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMSijfcstjournal
 
An Exact Branch And Bound Algorithm For The General Quadratic Assignment Problem
An Exact Branch And Bound Algorithm For The General Quadratic Assignment ProblemAn Exact Branch And Bound Algorithm For The General Quadratic Assignment Problem
An Exact Branch And Bound Algorithm For The General Quadratic Assignment ProblemJoe Andelija
 
Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...André Gonçalves
 
result analysis for deep leakage from gradients
result analysis for deep leakage from gradientsresult analysis for deep leakage from gradients
result analysis for deep leakage from gradients國騰 丁
 
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...inventionjournals
 
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...inventionjournals
 
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...IDES Editor
 
Scimakelatex.93126.cocoon.bobbin
Scimakelatex.93126.cocoon.bobbinScimakelatex.93126.cocoon.bobbin
Scimakelatex.93126.cocoon.bobbinAgostino_Marchetti
 
A HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MININGA HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MININGcscpconf
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...IAEME Publication
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...IAEME Publication
 
Producer consumer-problems
Producer consumer-problemsProducer consumer-problems
Producer consumer-problemsRichard Ashworth
 
A Tabu Search Heuristic For The Generalized Assignment Problem
A Tabu Search Heuristic For The Generalized Assignment ProblemA Tabu Search Heuristic For The Generalized Assignment Problem
A Tabu Search Heuristic For The Generalized Assignment ProblemSandra Long
 
Probability and random processes project based learning template.pdf
Probability and random processes project based learning template.pdfProbability and random processes project based learning template.pdf
Probability and random processes project based learning template.pdfVedant Srivastava
 
Adapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model SelectionAdapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model SelectionIJECEIAES
 

Ähnlich wie CGM-Based Parallel 1-D Sequence Alignment (20)

The New Hybrid COAW Method for Solving Multi-Objective Problems
The New Hybrid COAW Method for Solving Multi-Objective ProblemsThe New Hybrid COAW Method for Solving Multi-Objective Problems
The New Hybrid COAW Method for Solving Multi-Objective Problems
 
THE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMS
THE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMSTHE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMS
THE NEW HYBRID COAW METHOD FOR SOLVING MULTI-OBJECTIVE PROBLEMS
 
An Exact Branch And Bound Algorithm For The General Quadratic Assignment Problem
An Exact Branch And Bound Algorithm For The General Quadratic Assignment ProblemAn Exact Branch And Bound Algorithm For The General Quadratic Assignment Problem
An Exact Branch And Bound Algorithm For The General Quadratic Assignment Problem
 
Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...
 
result analysis for deep leakage from gradients
result analysis for deep leakage from gradientsresult analysis for deep leakage from gradients
result analysis for deep leakage from gradients
 
Model checking
Model checkingModel checking
Model checking
 
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
 
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
Solving Assembly Line Balancing Problem Using A Hybrid Genetic Algorithm With...
 
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
FPGA Optimized Fuzzy Controller Design for Magnetic Ball Levitation using Gen...
 
Scimakelatex.93126.cocoon.bobbin
Scimakelatex.93126.cocoon.bobbinScimakelatex.93126.cocoon.bobbin
Scimakelatex.93126.cocoon.bobbin
 
A HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MININGA HYBRID CLUSTERING ALGORITHM FOR DATA MINING
A HYBRID CLUSTERING ALGORITHM FOR DATA MINING
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...
 
Adaptive dynamic programing based optimal control for a robot manipulator
Adaptive dynamic programing based optimal control for a robot manipulatorAdaptive dynamic programing based optimal control for a robot manipulator
Adaptive dynamic programing based optimal control for a robot manipulator
 
nips2014
nips2014nips2014
nips2014
 
Producer consumer-problems
Producer consumer-problemsProducer consumer-problems
Producer consumer-problems
 
A Tabu Search Heuristic For The Generalized Assignment Problem
A Tabu Search Heuristic For The Generalized Assignment ProblemA Tabu Search Heuristic For The Generalized Assignment Problem
A Tabu Search Heuristic For The Generalized Assignment Problem
 
Probability and random processes project based learning template.pdf
Probability and random processes project based learning template.pdfProbability and random processes project based learning template.pdf
Probability and random processes project based learning template.pdf
 
Ssbse10.ppt
Ssbse10.pptSsbse10.ppt
Ssbse10.ppt
 
Adapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model SelectionAdapted Branch-and-Bound Algorithm Using SVM With Model Selection
Adapted Branch-and-Bound Algorithm Using SVM With Model Selection
 

Mehr von Mokhtar SELLAMI

A posteriori error estimation in an adaptive multidimensional parameterizatio...
A posteriori error estimation in an adaptive multidimensional parameterizatio...A posteriori error estimation in an adaptive multidimensional parameterizatio...
A posteriori error estimation in an adaptive multidimensional parameterizatio...Mokhtar SELLAMI
 
A new approximation of the Height process of a CSBP
A new approximation of the Height process of a CSBP A new approximation of the Height process of a CSBP
A new approximation of the Height process of a CSBP Mokhtar SELLAMI
 
Effects on competition induced by periodic fluctuations in environment
Effects on competition induced by periodic fluctuations in environment Effects on competition induced by periodic fluctuations in environment
Effects on competition induced by periodic fluctuations in environment Mokhtar SELLAMI
 
E ect of the mortality on a density-dependent model with a predator-prey rela...
Eect of the mortality on a density-dependent model with a predator-prey rela...Eect of the mortality on a density-dependent model with a predator-prey rela...
E ect of the mortality on a density-dependent model with a predator-prey rela...Mokhtar SELLAMI
 
Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...
Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...
Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...Mokhtar SELLAMI
 
CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...
CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...
CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...Mokhtar SELLAMI
 
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Mokhtar SELLAMI
 
Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...
Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...
Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...Mokhtar SELLAMI
 
Cari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoCari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoMokhtar SELLAMI
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...Mokhtar SELLAMI
 
Federation and Promotion of Heterogeneous Domains and Services
Federation and Promotion of Heterogeneous Domains and ServicesFederation and Promotion of Heterogeneous Domains and Services
Federation and Promotion of Heterogeneous Domains and ServicesMokhtar SELLAMI
 
Cari2020 Rodrigue Aimé Djeumen Djatcha
Cari2020 Rodrigue Aimé Djeumen DjatchaCari2020 Rodrigue Aimé Djeumen Djatcha
Cari2020 Rodrigue Aimé Djeumen DjatchaMokhtar SELLAMI
 
Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...
Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...
Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...Mokhtar SELLAMI
 

Mehr von Mokhtar SELLAMI (15)

A posteriori error estimation in an adaptive multidimensional parameterizatio...
A posteriori error estimation in an adaptive multidimensional parameterizatio...A posteriori error estimation in an adaptive multidimensional parameterizatio...
A posteriori error estimation in an adaptive multidimensional parameterizatio...
 
A new approximation of the Height process of a CSBP
A new approximation of the Height process of a CSBP A new approximation of the Height process of a CSBP
A new approximation of the Height process of a CSBP
 
Effects on competition induced by periodic fluctuations in environment
Effects on competition induced by periodic fluctuations in environment Effects on competition induced by periodic fluctuations in environment
Effects on competition induced by periodic fluctuations in environment
 
E ect of the mortality on a density-dependent model with a predator-prey rela...
Eect of the mortality on a density-dependent model with a predator-prey rela...Eect of the mortality on a density-dependent model with a predator-prey rela...
E ect of the mortality on a density-dependent model with a predator-prey rela...
 
Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...
Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...
Cari2020: ALGORITHMES DE SORTIE DU PIÈGE DE LA ZONE ENNUYEUSE EN APPRENTISSAG...
 
CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...
CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...
CARI2020: RESOLUTION D’ANAPHORES NOMINALES AVEC LES SEPARATEURS À VASTES MARG...
 
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
Cari2020 Parallel Hybridization for SAT: An Efficient Combination of Search S...
 
Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...
Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...
Cari 2020: A minimalistic model of spatial structuration of humid savanna veg...
 
Cari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoCari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufo
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
 
Federation and Promotion of Heterogeneous Domains and Services
Federation and Promotion of Heterogeneous Domains and ServicesFederation and Promotion of Heterogeneous Domains and Services
Federation and Promotion of Heterogeneous Domains and Services
 
Cari2020 Rodrigue Aimé Djeumen Djatcha
Cari2020 Rodrigue Aimé Djeumen DjatchaCari2020 Rodrigue Aimé Djeumen Djatcha
Cari2020 Rodrigue Aimé Djeumen Djatcha
 
Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...
Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...
Cari2020 Comparative Study of the Performance of Elliptic Curve Cryptography ...
 
CARI2020-Benaissa
CARI2020-BenaissaCARI2020-Benaissa
CARI2020-Benaissa
 
Cari2020 dembele
Cari2020 dembeleCari2020 dembele
Cari2020 dembele
 

Kürzlich hochgeladen

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Kürzlich hochgeladen (20)

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

CGM-Based Parallel 1-D Sequence Alignment

  • 1. A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for the 1-D Sequence Alignment Problem By Jerry Lacmou Zeutouo, Grace Colette Tessa Masse, Franklin Ingrid Kamga Youmbi at African Conference on Reseach in Computer Science and Applied Mathematics CARI’2020 Polytech School of Thiès, Senegal October.
  • 2. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Opening thoughts The processing of sequences such as DNA sequencing is a variety of problems that generate massive calculations and whose results are difficult to achieve on simple PCs. One way to improve the speed of solution is parallelism. Lacmou et al. 2 / 55
  • 3. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Outline 1 Introduction 2 Sequential algorithm 3 CGM algorithm 4 Experimental results 5 Conclusion Lacmou et al. 3 / 55
  • 4. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus 1 Introduction 2 Sequential algorithm 3 CGM algorithm 4 Experimental results 5 Conclusion Lacmou et al. 4 / 55
  • 5. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus What bioinformatics is Bioinformatics is the analysis of biological information. Lacmou et al. 5 / 55
  • 6. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Biological information Definition Biological information can be: a sequence (DNA, RNA, proteins); a structure (primaire, secondary); a function (molecular function e.g. enzyme; cell component e.g. membrane protein; biological process e.g. oxygen trans- port); interactions (proteins, metabolic pathways, gene networks). Lacmou et al. 6 / 55
  • 7. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus The privileged axes of bioinformatics 1 Formalization of genetic information analysis of sequences (biomolecules) and their structure (especially 3D structure); 2 Biological interpretation of genetic information data integra- tion (establishment of maps and networks of gene interac- tions, protein interactions, etc.); 3 Functional prediction. Lacmou et al. 7 / 55
  • 8. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Bioinformatics methods Comparative method: comparison of unknown sequences or structures with databases (sequences and structures) of genes and proteins known to establish similarities (similarities, ho- mologies or identities); Statistical method: software programs apply statistical anal- yses to the data (on sequence syntax) to try to identify and identify rules and constraints that are systematic, regular or general in nature; Modelling probabilistic approach. Lacmou et al. 8 / 55
  • 9. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Sequence alignment One of the fundamental problems of bioinformatics is the sequence alignment, crucial for molecular prediction, molecular interactions and phylogenetic analysis. Lacmou et al. 9 / 55
  • 10. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Sequence alignment definition Definition Sequence alignment is the problem of comparing biological sequences by looking for a series of nucleotides or amino acids that appear in the same order in the input sequences, possibly introducing gaps. It is a means of visualizing the similarity between sequences based on the notions of similarity or distance. Figure: Sequences alignementLacmou et al. 10 / 55
  • 11. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Types of alignments Two types of alignments are considered: 1 global alignments, which take into account the whole of two sequences to be compared; 2 local alignments, which make it possible to detect the seg- ment of the first sequence which is most similar to a segment of the other. Lacmou et al. 11 / 55
  • 12. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Dynamic programming complexity Dynamic programming complexity Due to dynamic programming, both types of alignments can be run in O(n2) time and space where n is the length of two strings. Based on the Four-Russians method, several sped-up sequential solutions has been proposed like the one of Brubach and Ghurye’s algorithm running in O( n2 log n). Lacmou et al. 12 / 55
  • 13. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Why parallelize? Despite the acceleration of the sequential algorithm, the resolution of the problem remains time and space consuming. Lacmou et al. 13 / 55
  • 14. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus The parallelization of the sequential algorithm A PRAM algorithm runs in O(log n log m) time with nm log nm CREW processors have been proposed by Apostolico et al., 1990; Alves et al., 2002 proposed a parallel solution for a variant of the problem under the CGM model using weighted graphs that require log p rounds and runs in O(n2 p log m); Recently, Kim et al., 2016 have proposed a space-efficient al- phabetindependent Four-Russians’ lookup table and a mul- tithreaded Four-Russians’ edit distance algorithm. Lacmou et al. 14 / 55
  • 15. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Our aim We tackle the problem of parallelizing the Brubach and Ghurye’s sequential algorithm for the one-dimensional sequence alignment problem on the CGM model (Coarse-Grained Multicomputer). The choice of the model CGM is due to the fact that it is more suitable to the current machines and it has already been used to solve efficiently several problems such as the optimal binary search tree problem Lacmou et al. 15 / 55
  • 16. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus What is a CGM algorithm? Its design is based on: Partitioning of the problem into subproblems; Distribution of subproblems between the different processors; CGM algorithm itself: it is a set of rounds of calculations and communicatons. Lacmou et al. 16 / 55
  • 17. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Parallelisation criterion on CGM algorithm Minimize the idleness time and the latency time of proces- sors; Minimize the overall communication time; Minimize the number of communications rounds; Minimize the local computation time of each processor and; Balance the load between the processors. Lacmou et al. 17 / 55
  • 18. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Context Problematic Our main focus Our results Our solution requires O( n2 p log n ) execution time with O(p) communication rounds on p processors. Lacmou et al. 18 / 55
  • 19. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table 1 Introduction 2 Sequential algorithm 3 CGM algorithm 4 Experimental results 5 Conclusion Lacmou et al. 19 / 55
  • 20. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Formal definition Definition The sequence alignment method seeks to optimize the alignment score. This score is related to the similarity rate between the two compared sequences. It measures the number of edit operations it takes to convert a sequence X into another Y . Lacmou et al. 20 / 55
  • 21. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Elementary editing operation They include: 1 insertions; 2 deletions; 3 substitutions of a single character. The weighted variant assigns a cost to each of the mentioned operations and through the use of a penalty matrix in case, costs appear not to be constant. Lacmou et al. 21 / 55
  • 22. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Global alignment overview A global alignment of two X and Y sequences can be given by computing the distance between X and Y ; The elementary editing operations are described as follows: substitution of an a symbol by an b symbol; deletion of an a symbol; insertion of an b symbol. There is also the ability to calculate global alignments using similarity scores instead of distance. Lacmou et al. 22 / 55
  • 23. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Global alignment score computation T[i, j] = max    T[i − 1, j − 1] + Sub(X[i], Y [j]), T[i − 1, j] + Del(X[i]), T[i, j − 1] + Ins(Y [j]). (1) Lacmou et al. 23 / 55
  • 24. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Local alignment overview A local alignment of two sequences X and Y consists in find- ing the X segment that is most similar to a Y segment ; The local edit score of two sequences X and Y is defined by s(X, Y ), the maximum similarity between an X segment and a Y segment. Lacmou et al. 24 / 55
  • 25. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Local alignment score computation Ts[i, j] = max    Ts[i − 1, j − 1] + Sub(X[i], Y [j]), Ts[i − 1, j] + Del(X[i]), Ts[i, j − 1] + Ins(Y [j]), 0. (2) Lacmou et al. 25 / 55
  • 26. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Task graph Lacmou et al. 26 / 55
  • 27. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Speedup techniques Dynamic programming algorithms can sometimes be made even faster by applying speedups such as the Knuth-Yao quadrangle-inequality speedup or the Four-Russians speedup. Lacmou et al. 27 / 55
  • 28. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Idea behind the Four-Russians’ speedup The idea behind the speedup is to tile the dynamic programming table into smaller blocks whose solutions are foreseen, precomputed and stored in a lookup table. The goal sought is spending less time on those by merely searching for them. Lacmou et al. 28 / 55
  • 29. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table A single t-block Lacmou et al. 29 / 55
  • 30. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table This quadratic time bound has been first proposed using the Four-Russians technique by Masek and Paterson in 1980 achieving O(n2/ log n); Crochemore et al., 2003 later achieved the same time bound for unrestricted scoring matrices. Lacmou et al. 30 / 55
  • 31. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table Brubach and Ghurye’s efficient lookup table The bottleneck in the aforementioned speedup lies in the space demanded by the table, and the time spent computing it. Lacmou et al. 31 / 55
  • 32. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table The Four-Russians lookup table can be built in O(t2 lg t), queried in O(t) and stored in O(t2) space. Lacmou et al. 32 / 55
  • 33. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Global alignment Local alignment The Four-Russians’ speedup Brubach and Ghurye’s efficient lookup table The t-blocks within the dynamic programming table are tiled in a way they overlap by one column/row on each side; Given a t-block first row and column, the block function of the lookup process has to provide its last row and column; Running the block function time and again in a row-wise (or column-wise) manner will eventually yield the global edit distance score in the bottom-right cell. Lacmou et al. 33 / 55
  • 34. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm 1 Introduction 2 Sequential algorithm 3 CGM algorithm 4 Experimental results 5 Conclusion Lacmou et al. 34 / 55
  • 35. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Our contribution Our solution is divided into two parts: 1 Firstly, each processor computes the lookup table through the Brubach and Ghurye’s sequential algorithm in O(t2 lg t); 2 Secondly, we partition the task graph into subgraphs of the same size, and we distribute them fairly onto the processors. Lacmou et al. 35 / 55
  • 36. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Partitioning strategy Our technique consists in partitioning the task graph in two steps: Lacmou et al. 36 / 55
  • 37. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Partitioning steps 1 Firstly, we partition the task graph into small blocks of size t referred to as t-blocks such that any two adjacent t-blocks overlap by either a row or column. After this first partition- ing, the task graph is divided into k lines and k columns where k = n t−1; 2 Next, we subdivide the n2 t2 t-blocks into p lines and p columns of blocks referred to as macro-blocks and denoted by MB(i, j). MB(i, j) is a matrix of size k p × k p and is identified by the node on the lower right corner. Lacmou et al. 37 / 55
  • 38. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Scenario of this partitioning 1 2 3 4 2 3 4 5 3 4 5 6 4 5 6 7 Lacmou et al. 38 / 55
  • 39. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Snake-like mapping This distribution consists in assigning all macro-blocks of a diagonal from the top left corner to the bottom right corner; This process is renewed until all processors have been used, starting with processor 0 and traveling through the blocks with a "snake-like" path. Lacmou et al. 39 / 55
  • 40. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Illustration Lacmou et al. 40 / 55
  • 41. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Lemma After the partitioning of the task graph and snake-like distribution scheme, each processor evaluates exactly p macro-blocks. Proof. There are p2 macro-blocks after the partitioning of the task graph. Therefore, it is clear that each processor evaluates exactly p blocks. Lacmou et al. 41 / 55
  • 42. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Our CGM algorithm We present our CGM solution for the sequence alignment problem. Brubach and Ghurye’s sequential algorithm is used for local computations. Based on the previous partitioning strategy and the mapping of blocks onto processors, the corresponding CGM algorithm is presented in algorithm. Lacmou et al. 42 / 55
  • 43. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Our CGM algorithm Lacmou et al. 43 / 55
  • 44. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Local computation time Lemma The evaluation of a macro-block of size n p(t−1) × n p(t−1) requires O n2 p2t computation time. Lemma Our CGM algorithm requires O n2 pt time steps per processor. Lacmou et al. 44 / 55
  • 45. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Partitioning strategy Mapping macro-blocks onto processors Overview of the CGM algorithm Algorithm complexity Theorem By subdividing the task graph into macro-blocks of size n p(t−1) × n p(t−1) and using the snake-like distribution scheme, our CGM algorithm requires O n2 p log n execution time with O(p) communication rounds when t = log n. Lacmou et al. 45 / 55
  • 46. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion 1 Introduction 2 Sequential algorithm 3 CGM algorithm 4 Experimental results 5 Conclusion Lacmou et al. 46 / 55
  • 47. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Environment We measure the performance of our solution on the Matrics plat- form backed by the University of Picardie Jules Verne. Intel Xeon (R) CPU E5-2680 v4 @ 2.40GHz; 28 cores; 128GB of RAM; The inter-processor communication is implemented with the MPI library (OpenMPI version 1.10.4). Lacmou et al. 47 / 55
  • 48. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Samples We use a real biological DNA sequences with |Σ| = 4. The results presented here are derived from its execution for different values of the triplet (m, n, p), where m and n are the size of sequences, with values ranging from 105 to 106, and p is the number of processors, with values in the set {1, 2, 4, 8, 16, 32, 48}. Lacmou et al. 48 / 55
  • 49. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Execution time for m Lacmou et al. 49 / 55
  • 50. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Execution time for p Lacmou et al. 50 / 55
  • 51. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Result comments For m = 106: Our algorithm runs about 11.43 times faster than the Brubach and Ghurye’s sequential algorithm on 16 processors; The speed-up increases up to 20.40 on 32 processors; And 30.05 on 48 processors. From all this, we can conclude that our algorithm is scalable to the increase of the size of sequences and the numbers of processors. Lacmou et al. 51 / 55
  • 52. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion 1 Introduction 2 Sequential algorithm 3 CGM algorithm 4 Experimental results 5 Conclusion Lacmou et al. 52 / 55
  • 53. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Wrapping things up We attempted in this paper to parallelize their results in ap- plication to the one-dimensional sequence alignment prob- lem; Based on the CGM model, our solution clusters t-blocks into macro-blocks and follows the snake-like distribution mapping pattern onto p processors to achieve for O(p) rounds of com- munication, an execution time in O(n2/p log n); Experimental results show good agreement with theoretical forecasts. Lacmou et al. 53 / 55
  • 54. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Future directions Noticing the fact that single processors uselessly compute large sets of entries in their respective lookup tables, one might be interested in cutting off the waste. Or even extend our solution to address the two-dimensional sequence alignment problem. Lacmou et al. 54 / 55
  • 55. Introduction Sequential algorithm CGM algorithm Experimental results Conclusion Thanks for your keen attention :-) Lacmou et al. 55 / 55