SlideShare ist ein Scribd-Unternehmen logo
1 von 53
Downloaden Sie, um offline zu lesen
Demand for
High Speed
Computers
2
Demand for High Speed Computers
Technological advancement has its Limits
Solution is Replication of Processing Units
It leads to parallel Computers
3
4
Observations
Theory
Physical
Experiment
Classical science is based on
observation,
theory, and
physical experimentation
Observation of a phenomenon leads
to a hypothesis.
The scientist develops a theory to
explain the phenomenon and designs
an experiment to test that theory.
5
Nature
Physical experiments are not always
feasible because:
‱ Too Expensive
‱ Time Consuming
‱ Unethical
‱ Impossible to perform
In contrast, modern science is
characterized by observation, theory,
experimentation, and numerical
simulation.
6
Observations
Theory
Numerical
Simulation
Nature
Numerical Simulation creates the experimental environment by using
Mathematical formulas. It is an increasingly important tool for
scientists, who often cannot use physical experiments to test
theories.
The modern scientist compares the behaviour of a numerical
simulation, which implements the theory, to observation of “real world”
phenomena.
Many important scientific problems are so complex that solving them
via numerical simulation requires extraordinarily powerful computers.
7
These complex problems, often called grand challenges
for science (Levin 1989):
‱ Quantum chemistry, statistical mechanics, and relativistic physics
‱ Cosmology and astrophysics
‱ Computational fluid dynamics and turbulence
‱ Materials design and superconductivity
‱ Biology, pharmacology, genome sequencing, genetic engineering,
protein folding, enzyme activity, and cell modelling
‱ Medicine, and modelling of human organs and bones
‱ Global weather and environmental modelling
8
9
Solomon: constructed by Westinghouse Electric Company in
the early 1960s.
ILLIAC IV: assembled at Burrough Corporation in the early
1970s.
At Carnegie-Mellon University, two parallel computers
C.mmp and Cm* were constructed during 1970s.
In early1980s researchers at Caltech built the parallel
computer Cosmic Cube
In the mid -1980s the parallel commercial computers were
constructed with microprocessors.
It took more than 20 years for parallel computers to move from the lab to market.
10
PP: Parallel Processing
Daniel Slotnick at University of Illinois designed two early parallel computers
11
PP: Parallel Processing
(Credit Hennessy and Patterson)
The performance growth
rate for minicomputers,
mainframes and traditional
supercomputer has been
just under 20% a year
While the performance growth
rate for microprocessors has
averaged 35% a year.
12
PP: Parallel Processing
The performance of Processor can be improved
through
13
PP: Parallel Processing
Fundamental Architectural Advances
Bit parallel
memory
Bit-parallel
arithmetic
Cache
memory
Channels
Interleaved
memory
Instruction
lookahead
Instruction
pipelining
Multiple
functional
units
Pipelined
functional
units
Data
pipelining
14
PP: Parallel Processing
Microprocessors have been able to achieve more
impressive performance gain because-
They are at the
beginning stage
They have not
incorporated all
the architectural
advances
Their clock speed
is much slower
15
PP: Parallel Processing
Microprocessors
Convergence in relative performance between
Supercomputers
Commercial
Parallel
Computers
16
PP: Parallel Processing
Single
processor
Supercomputer
Microprocessor
based parallel
computers
17
PP: Parallel Processing
Some of the organizations that delivered commercial parallel computers based on
microprocessor CPUs in the 10-year period 1984-1993 and their current status.
18
PP: Parallel Processing
Reasonable
Architecture
Operating
Systems
Programming
Languages
Parallel
Algorithms
Harnessing the power
latent in massive
parallel microprocessor
based computers,
however, requires the
development of
19
PP: Parallel Processing
This paper is about the
designing of efficient algorithms
for real parallel computers.
20
PP: Parallel Processing
21
Parallel computing is the use of a parallel computer to
reduce the time needed to solve a single
computational problem.
Parallel computing is now considered a standard way
for computational scientists and engineers to solve
problems in areas as diverse as galactic evolution,
climate modeling, aircraft design, and molecular
dynamics.
22
23
A parallel computer is a multiple processor computer
system supporting parallel programming.
24
Important categories of parallel
computers
Multicomputers Multiprocessors
A multicomputer is a parallel
computer constructed out of
multiple computers and an
interconnection network.
Each computer has its own
memory and it is accessible by
that particular processor .
The processors on different
computers interact by passing
messages to each other.
25
Multiprocessor is a
computer system with two
or more CPUs. It is highly
integrated system in which
all CPUs share access to a
single global memory.
This shared memory supports
communication &
synchronization among
processors.
26
It is information processing that emphasizes the
concurrent manipulation of data elements
belonging to one or more processes solving a
single problem.
A parallel computer capable of parallel processing.
27
‱ Sequential events or processes which seem to
occur or progress at the same time.
Concurrent Processing
‱ Events or processes which occur or progress
at the same time
Parallel Processing
28
29
Concurrency: Two or more
threads in progress at the
same time but only one
executed by single CPU.
Parallelism: Two or more
threads executing at the
same time
30
A supercomputer is a general purpose
computer capable of solving individual
problems at extremely high computational
speeds, compared with other computers
built during the same time.
31
The throughput of a devices is the number of
results it produces per unit time.
There are many ways to improve the
throughput of a device.
Speed
By reducing Instruction
Cycle Time
Concurrency
By executing more
instructions per Cycle Time
32
Speedup is the ratio between the time needed
for the most efficient sequential algorithm to
perform a computation and the time needed to
perform the same computation on a machine
incorporating pipelining and/or parallelism.
Speedup = Tsequential/ Tparallel/pipelined
33
A pipelined computation is divided into a number
of steps called segments or stages. The output of
one segment is the input of the next segment.
34
1 2 3 k


k
dn 

 d3 d2, d1
n * k
k + (n-1)
dn 

 d3 d2, d1
d1
d1d2
d1d2d3
d1dk-2dk-1dk 


Speedup = n*k/(k+n-1) lim
𝑛→∞
𝑘 + 𝑛 − 1 → 𝑛, 𝑆𝑝𝑒𝑒𝑑𝑱𝑝 = 𝑘
35
Data Parallelism is the use of multiple functional
units to apply the same operation simultaneously
to element of a data set.
THE SAME SET OF OPERATIONS TO DIFFERENT DATA
36
A k-fold increase in the number of
functional units leads to a k-fold
increase in the throughput of the
system if there is no overhead
associated with the parallelism.
A processor array is a parallel computer with set
of identical, ALUs/Processing Elements (PEs) that
can operate in parallel in a lock step fashion under
the control of one control unit and a number of
memory modules.
37
38
Three methods to assemble widgets.
a) A sequential widget assembly
machine produces one widget
every three units of time.
b) A three segment pipelined widget-
assembly machine produces the
first widget in three units of time
and successive widgets every time
unit thereafter.
c) A three-way data-parallel widget-
assembly machine produces three
widgets every three units of time.
39
No. of
Wi
Seq. Pipelined Parallel
Tsq Tpi Speedup Tpa Speedup
1 3 3 1 3 1
2 6 4 1.5 3 2
3 9 5 1.8 3 3
4 12 6 2 6 2
5 15 7 2.1 6 2.5
6 18 8 2.2 6 3
7 21 9 2.3 9 2.3
8 24 10 2.4 9 2.6
9 27 11 2.4 9 3
10 30 12 2.5 12 2.5
0
0.5
1
1.5
2
2.5
3
3.5
1 2 3 4 5 6 7 8 9 10
Speedup
Widgets Assembled
Pipelined Parallel
Control parallelism is
achieved by applying
different operations to
different data elements
simultaneously.
40
Pipelining is a special case of control parallelism.
Most realistic problems can exploit both data and
control parallelism.
Problem: Weekly maintenance of a Lawn
1. Mowing the Lawn
2. Edging the Lawn
3. Checking the Sprinkle
4. Weeding the flower beds
41
Different workers mowing the
lawn simultaneously
(Data Parallelism)
Other team of workers are
weeding the flower bed in parallel
(Control Parallelism)
42
Turn off
Security
System
Check
Sprinklers
Turn on
Security
System
Mow
Lawn
Edge
Lawn
Weed
Garden
An algorithm is scalable if the level of parallelism increases at
least linearly with the problem size.
An architecture is scalable if it continues to yield the same
performance per processor, albeit used on a larger problem
size, as the number of processors increases.
Data parallel algorithms are more scalable than control
parallel algorithms. Control parallelism is usually a constant,
independent of the problem size, while the level of data
parallelism is an increasing function of the problem size.
43
There are different ways to classify parallel computers. One of the
more widely used classifications, in use since 1966, is called
Flynn's Taxonomy.
Flynn's taxonomy distinguishes multi-processor computer
architectures according to how they can be classified along the
two independent dimensions of Instruction Stream and Data
Stream. Each of these dimensions can have only one of two
possible states: Single or Multiple.
44
45
A serial (non-parallel) computer
Single Instruction: Only one instruction stream/ clock cycle
Single Data: Only one data stream/ clock cycle
Deterministic execution
This is the oldest type of computer
Examples: older generation mainframes,
minicomputers, workstations and
single processor/core PCs.
46
SIMD: A type of parallel computer
Single Instruction: All processing units execute the
same instruction at any given clock cycle
Multiple Data: Each processing unit can operate on a
different data element
Two varieties: Processor Arrays and Vector Pipelines
47
Processor Arrays: Thinking Machines CM-2, MasPar
MP-1 & MP-2, ILLIAC IV
Vector Pipelines: IBM 9000, Cray X-MP, Y-MP & C90,
Fujitsu VP, NEC SX-2, Hitachi S820, ETA10
Most modern computers, particularly those with
graphics processor units (GPUs) employ SIMD
instructions and execution units.
48
49
50
(MISD): A type of parallel computer
Multiple Instruction: Each processing unit operates on the
data independently via separate instruction streams.
Single Data: A single data stream is fed into multiple
processing units.
Few (if any) actual examples of this class of parallel
computer have ever existed.
51
(MIMD): A type of parallel computer
Multiple Instruction: Every processor may be
executing a different instruction stream
Multiple Data: Every processor may be working with a
different data stream
The most common type of parallel computer - most
modern supercomputers fall into this category.
52
Examples:
most current
supercomputers, networked
parallel computer clusters
and "grids", multi-processor
SMP computers, multi-core
PCs.
53 Topic Covered in the Presentation
Chapter 1: Introduction
1.1 Computational Demands of Modern Science
1.2 Advent of Practical Parallel Processing
1.3 Parallel Processing Terminology

Weitere Àhnliche Inhalte

Was ist angesagt?

Architectural Development Tracks
Architectural Development TracksArchitectural Development Tracks
Architectural Development TracksANJALIG10
 
INTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSINGINTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSINGGS Kosta
 
Message passing interface
Message passing interfaceMessage passing interface
Message passing interfaceMd. Mahedi Mahfuj
 
Parallel computing persentation
Parallel computing persentationParallel computing persentation
Parallel computing persentationVIKAS SINGH BHADOURIA
 
Parallel algorithms
Parallel algorithmsParallel algorithms
Parallel algorithmsDanish Javed
 
Chapter 3 principles of parallel algorithm design
Chapter 3   principles of parallel algorithm designChapter 3   principles of parallel algorithm design
Chapter 3 principles of parallel algorithm designDenisAkbar1
 
Basic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastBasic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastRashiJoshi11
 
Research Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel ProgrammingResearch Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel ProgrammingShitalkumar Sukhdeve
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1Mr SMAK
 
Full introduction to_parallel_computing
Full introduction to_parallel_computingFull introduction to_parallel_computing
Full introduction to_parallel_computingSupasit Kajkamhaeng
 
Introduction to Parallel and Distributed Computing
Introduction to Parallel and Distributed ComputingIntroduction to Parallel and Distributed Computing
Introduction to Parallel and Distributed ComputingSayed Chhattan Shah
 
Evaluation of morden computer & system attributes in ACA
Evaluation of morden computer &  system attributes in ACAEvaluation of morden computer &  system attributes in ACA
Evaluation of morden computer & system attributes in ACAPankaj Kumar Jain
 
Applications of paralleL processing
Applications of paralleL processingApplications of paralleL processing
Applications of paralleL processingPage Maker
 
Limitations of memory system performance
Limitations of memory system performanceLimitations of memory system performance
Limitations of memory system performanceSyed Zaid Irshad
 
All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations Syed Zaid Irshad
 
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...CloudxLab
 
Parallel sorting Algorithms
Parallel  sorting AlgorithmsParallel  sorting Algorithms
Parallel sorting AlgorithmsGARIMA SHAKYA
 
program partitioning and scheduling IN Advanced Computer Architecture
program partitioning and scheduling  IN Advanced Computer Architectureprogram partitioning and scheduling  IN Advanced Computer Architecture
program partitioning and scheduling IN Advanced Computer ArchitecturePankaj Kumar Jain
 
Lecture 1 (distributed systems)
Lecture 1 (distributed systems)Lecture 1 (distributed systems)
Lecture 1 (distributed systems)Fazli Amin
 

Was ist angesagt? (20)

Architectural Development Tracks
Architectural Development TracksArchitectural Development Tracks
Architectural Development Tracks
 
INTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSINGINTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSING
 
Message passing interface
Message passing interfaceMessage passing interface
Message passing interface
 
Parallel computing persentation
Parallel computing persentationParallel computing persentation
Parallel computing persentation
 
Parallel algorithms
Parallel algorithmsParallel algorithms
Parallel algorithms
 
Chapter 3 principles of parallel algorithm design
Chapter 3   principles of parallel algorithm designChapter 3   principles of parallel algorithm design
Chapter 3 principles of parallel algorithm design
 
Basic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastBasic communication operations - One to all Broadcast
Basic communication operations - One to all Broadcast
 
Research Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel ProgrammingResearch Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel Programming
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1
 
Full introduction to_parallel_computing
Full introduction to_parallel_computingFull introduction to_parallel_computing
Full introduction to_parallel_computing
 
Introduction to Parallel and Distributed Computing
Introduction to Parallel and Distributed ComputingIntroduction to Parallel and Distributed Computing
Introduction to Parallel and Distributed Computing
 
Evaluation of morden computer & system attributes in ACA
Evaluation of morden computer &  system attributes in ACAEvaluation of morden computer &  system attributes in ACA
Evaluation of morden computer & system attributes in ACA
 
6.distributed shared memory
6.distributed shared memory6.distributed shared memory
6.distributed shared memory
 
Applications of paralleL processing
Applications of paralleL processingApplications of paralleL processing
Applications of paralleL processing
 
Limitations of memory system performance
Limitations of memory system performanceLimitations of memory system performance
Limitations of memory system performance
 
All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations All-Reduce and Prefix-Sum Operations
All-Reduce and Prefix-Sum Operations
 
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
 
Parallel sorting Algorithms
Parallel  sorting AlgorithmsParallel  sorting Algorithms
Parallel sorting Algorithms
 
program partitioning and scheduling IN Advanced Computer Architecture
program partitioning and scheduling  IN Advanced Computer Architectureprogram partitioning and scheduling  IN Advanced Computer Architecture
program partitioning and scheduling IN Advanced Computer Architecture
 
Lecture 1 (distributed systems)
Lecture 1 (distributed systems)Lecture 1 (distributed systems)
Lecture 1 (distributed systems)
 

Ähnlich wie Chapter 1 - introduction - parallel computing

intro, definitions, basic laws+.pptx
intro, definitions, basic laws+.pptxintro, definitions, basic laws+.pptx
intro, definitions, basic laws+.pptxssuser413a98
 
parallel computing.ppt
parallel computing.pptparallel computing.ppt
parallel computing.pptssuser413a98
 
parallelprogramming-130823023925-phpapp01.pptx
parallelprogramming-130823023925-phpapp01.pptxparallelprogramming-130823023925-phpapp01.pptx
parallelprogramming-130823023925-phpapp01.pptxMarlonMagtibay3
 
Module 2.pdf
Module 2.pdfModule 2.pdf
Module 2.pdfDrAnjuShukla
 
(19-23)CC Unit-1 ppt.pptx
(19-23)CC Unit-1 ppt.pptx(19-23)CC Unit-1 ppt.pptx
(19-23)CC Unit-1 ppt.pptxNithishaYadavv
 
Chap 2 classification of parralel architecture and introduction to parllel p...
Chap 2  classification of parralel architecture and introduction to parllel p...Chap 2  classification of parralel architecture and introduction to parllel p...
Chap 2 classification of parralel architecture and introduction to parllel p...Malobe Lottin Cyrille Marcel
 
Parallel Computing-Part-1.pptx
Parallel Computing-Part-1.pptxParallel Computing-Part-1.pptx
Parallel Computing-Part-1.pptxkrnaween
 
Introduction to parallel_computing
Introduction to parallel_computingIntroduction to parallel_computing
Introduction to parallel_computingMehul Patel
 
Data Parallel and Object Oriented Model
Data Parallel and Object Oriented ModelData Parallel and Object Oriented Model
Data Parallel and Object Oriented ModelNikhil Sharma
 
Parallel Processing.pptx
Parallel Processing.pptxParallel Processing.pptx
Parallel Processing.pptxSheethal Aji Mani
 
Concurrency and Parallelism, Asynchronous Programming, Network Programming
Concurrency and Parallelism, Asynchronous Programming, Network ProgrammingConcurrency and Parallelism, Asynchronous Programming, Network Programming
Concurrency and Parallelism, Asynchronous Programming, Network ProgrammingPrabu U
 
A Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High PerformanceA Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High PerformanceAM Publications
 
Parallel and Distributed Computing chapter 1
Parallel and Distributed Computing chapter 1Parallel and Distributed Computing chapter 1
Parallel and Distributed Computing chapter 1AbdullahMunir32
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architectureAjithaSomasundaram
 
Real-Time Scheduling Algorithms
Real-Time Scheduling AlgorithmsReal-Time Scheduling Algorithms
Real-Time Scheduling AlgorithmsAJAL A J
 
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...EUDAT
 

Ähnlich wie Chapter 1 - introduction - parallel computing (20)

intro, definitions, basic laws+.pptx
intro, definitions, basic laws+.pptxintro, definitions, basic laws+.pptx
intro, definitions, basic laws+.pptx
 
parallel computing.ppt
parallel computing.pptparallel computing.ppt
parallel computing.ppt
 
parallelprogramming-130823023925-phpapp01.pptx
parallelprogramming-130823023925-phpapp01.pptxparallelprogramming-130823023925-phpapp01.pptx
parallelprogramming-130823023925-phpapp01.pptx
 
Nbvtalkatjntuvizianagaram
NbvtalkatjntuvizianagaramNbvtalkatjntuvizianagaram
Nbvtalkatjntuvizianagaram
 
Module 2.pdf
Module 2.pdfModule 2.pdf
Module 2.pdf
 
Aca module 1
Aca module 1Aca module 1
Aca module 1
 
(19-23)CC Unit-1 ppt.pptx
(19-23)CC Unit-1 ppt.pptx(19-23)CC Unit-1 ppt.pptx
(19-23)CC Unit-1 ppt.pptx
 
Chap 2 classification of parralel architecture and introduction to parllel p...
Chap 2  classification of parralel architecture and introduction to parllel p...Chap 2  classification of parralel architecture and introduction to parllel p...
Chap 2 classification of parralel architecture and introduction to parllel p...
 
Parallel Computing-Part-1.pptx
Parallel Computing-Part-1.pptxParallel Computing-Part-1.pptx
Parallel Computing-Part-1.pptx
 
Introduction to parallel_computing
Introduction to parallel_computingIntroduction to parallel_computing
Introduction to parallel_computing
 
Data Parallel and Object Oriented Model
Data Parallel and Object Oriented ModelData Parallel and Object Oriented Model
Data Parallel and Object Oriented Model
 
Parallel Processing.pptx
Parallel Processing.pptxParallel Processing.pptx
Parallel Processing.pptx
 
Concurrency and Parallelism, Asynchronous Programming, Network Programming
Concurrency and Parallelism, Asynchronous Programming, Network ProgrammingConcurrency and Parallelism, Asynchronous Programming, Network Programming
Concurrency and Parallelism, Asynchronous Programming, Network Programming
 
A Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High PerformanceA Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High Performance
 
Parallel and Distributed Computing chapter 1
Parallel and Distributed Computing chapter 1Parallel and Distributed Computing chapter 1
Parallel and Distributed Computing chapter 1
 
GRID COMPUTING.ppt
GRID COMPUTING.pptGRID COMPUTING.ppt
GRID COMPUTING.ppt
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architecture
 
CC unit 1.pptx
CC unit 1.pptxCC unit 1.pptx
CC unit 1.pptx
 
Real-Time Scheduling Algorithms
Real-Time Scheduling AlgorithmsReal-Time Scheduling Algorithms
Real-Time Scheduling Algorithms
 
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
 

Mehr von Heman Pathak

Interconnection Network
Interconnection NetworkInterconnection Network
Interconnection NetworkHeman Pathak
 
Central processing unit
Central processing unitCentral processing unit
Central processing unitHeman Pathak
 
Registers and counters
Registers and countersRegisters and counters
Registers and countersHeman Pathak
 
Sequential Circuit
Sequential CircuitSequential Circuit
Sequential CircuitHeman Pathak
 
Combinational logic 2
Combinational logic 2Combinational logic 2
Combinational logic 2Heman Pathak
 
Combinational logic 1
Combinational logic 1Combinational logic 1
Combinational logic 1Heman Pathak
 
Simplification of Boolean Function
Simplification of Boolean FunctionSimplification of Boolean Function
Simplification of Boolean FunctionHeman Pathak
 
Chapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic GatesChapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic GatesHeman Pathak
 
Chapter 7: Matrix Multiplication
Chapter 7: Matrix MultiplicationChapter 7: Matrix Multiplication
Chapter 7: Matrix MultiplicationHeman Pathak
 
Elementary Parallel Algorithms
Elementary Parallel AlgorithmsElementary Parallel Algorithms
Elementary Parallel AlgorithmsHeman Pathak
 
Chapter 5: Mapping and Scheduling
Chapter  5: Mapping and SchedulingChapter  5: Mapping and Scheduling
Chapter 5: Mapping and SchedulingHeman Pathak
 
Cost optimal algorithm
Cost optimal algorithmCost optimal algorithm
Cost optimal algorithmHeman Pathak
 
Chapter 4: Parallel Programming Languages
Chapter 4: Parallel Programming LanguagesChapter 4: Parallel Programming Languages
Chapter 4: Parallel Programming LanguagesHeman Pathak
 
Parallel Algorithm for Graph Coloring
Parallel Algorithm for Graph Coloring Parallel Algorithm for Graph Coloring
Parallel Algorithm for Graph Coloring Heman Pathak
 
Parallel Algorithms
Parallel AlgorithmsParallel Algorithms
Parallel AlgorithmsHeman Pathak
 

Mehr von Heman Pathak (15)

Interconnection Network
Interconnection NetworkInterconnection Network
Interconnection Network
 
Central processing unit
Central processing unitCentral processing unit
Central processing unit
 
Registers and counters
Registers and countersRegisters and counters
Registers and counters
 
Sequential Circuit
Sequential CircuitSequential Circuit
Sequential Circuit
 
Combinational logic 2
Combinational logic 2Combinational logic 2
Combinational logic 2
 
Combinational logic 1
Combinational logic 1Combinational logic 1
Combinational logic 1
 
Simplification of Boolean Function
Simplification of Boolean FunctionSimplification of Boolean Function
Simplification of Boolean Function
 
Chapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic GatesChapter 2: Boolean Algebra and Logic Gates
Chapter 2: Boolean Algebra and Logic Gates
 
Chapter 7: Matrix Multiplication
Chapter 7: Matrix MultiplicationChapter 7: Matrix Multiplication
Chapter 7: Matrix Multiplication
 
Elementary Parallel Algorithms
Elementary Parallel AlgorithmsElementary Parallel Algorithms
Elementary Parallel Algorithms
 
Chapter 5: Mapping and Scheduling
Chapter  5: Mapping and SchedulingChapter  5: Mapping and Scheduling
Chapter 5: Mapping and Scheduling
 
Cost optimal algorithm
Cost optimal algorithmCost optimal algorithm
Cost optimal algorithm
 
Chapter 4: Parallel Programming Languages
Chapter 4: Parallel Programming LanguagesChapter 4: Parallel Programming Languages
Chapter 4: Parallel Programming Languages
 
Parallel Algorithm for Graph Coloring
Parallel Algorithm for Graph Coloring Parallel Algorithm for Graph Coloring
Parallel Algorithm for Graph Coloring
 
Parallel Algorithms
Parallel AlgorithmsParallel Algorithms
Parallel Algorithms
 

KĂŒrzlich hochgeladen

notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptMsecMca
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesRAJNEESHKUMAR341697
 
BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...
BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...
BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...Call Girls Mumbai
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projectssmsksolar
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsvanyagupta248
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwaitjaanualu31
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdfKamal Acharya
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsArindam Chakraborty, Ph.D., P.E. (CA, TX)
 
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...Health
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086anil_gaur
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfRums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfsmsksolar
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadhamedmustafa094
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxSCMS School of Architecture
 

KĂŒrzlich hochgeladen (20)

notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planes
 
BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...
BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...
BhubaneswarđŸŒčCall Girls Bhubaneswar ❀Komal 9777949614 💟 Full Trusted CALL GIRL...
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfRums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdf
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 

Chapter 1 - introduction - parallel computing

  • 1.
  • 3. Demand for High Speed Computers Technological advancement has its Limits Solution is Replication of Processing Units It leads to parallel Computers 3
  • 4. 4
  • 5. Observations Theory Physical Experiment Classical science is based on observation, theory, and physical experimentation Observation of a phenomenon leads to a hypothesis. The scientist develops a theory to explain the phenomenon and designs an experiment to test that theory. 5 Nature
  • 6. Physical experiments are not always feasible because: ‱ Too Expensive ‱ Time Consuming ‱ Unethical ‱ Impossible to perform In contrast, modern science is characterized by observation, theory, experimentation, and numerical simulation. 6 Observations Theory Numerical Simulation Nature
  • 7. Numerical Simulation creates the experimental environment by using Mathematical formulas. It is an increasingly important tool for scientists, who often cannot use physical experiments to test theories. The modern scientist compares the behaviour of a numerical simulation, which implements the theory, to observation of “real world” phenomena. Many important scientific problems are so complex that solving them via numerical simulation requires extraordinarily powerful computers. 7
  • 8. These complex problems, often called grand challenges for science (Levin 1989): ‱ Quantum chemistry, statistical mechanics, and relativistic physics ‱ Cosmology and astrophysics ‱ Computational fluid dynamics and turbulence ‱ Materials design and superconductivity ‱ Biology, pharmacology, genome sequencing, genetic engineering, protein folding, enzyme activity, and cell modelling ‱ Medicine, and modelling of human organs and bones ‱ Global weather and environmental modelling 8
  • 9. 9
  • 10. Solomon: constructed by Westinghouse Electric Company in the early 1960s. ILLIAC IV: assembled at Burrough Corporation in the early 1970s. At Carnegie-Mellon University, two parallel computers C.mmp and Cm* were constructed during 1970s. In early1980s researchers at Caltech built the parallel computer Cosmic Cube In the mid -1980s the parallel commercial computers were constructed with microprocessors. It took more than 20 years for parallel computers to move from the lab to market. 10 PP: Parallel Processing Daniel Slotnick at University of Illinois designed two early parallel computers
  • 11. 11 PP: Parallel Processing (Credit Hennessy and Patterson)
  • 12. The performance growth rate for minicomputers, mainframes and traditional supercomputer has been just under 20% a year While the performance growth rate for microprocessors has averaged 35% a year. 12 PP: Parallel Processing
  • 13. The performance of Processor can be improved through 13 PP: Parallel Processing
  • 14. Fundamental Architectural Advances Bit parallel memory Bit-parallel arithmetic Cache memory Channels Interleaved memory Instruction lookahead Instruction pipelining Multiple functional units Pipelined functional units Data pipelining 14 PP: Parallel Processing
  • 15. Microprocessors have been able to achieve more impressive performance gain because- They are at the beginning stage They have not incorporated all the architectural advances Their clock speed is much slower 15 PP: Parallel Processing
  • 16. Microprocessors Convergence in relative performance between Supercomputers Commercial Parallel Computers 16 PP: Parallel Processing
  • 18. Some of the organizations that delivered commercial parallel computers based on microprocessor CPUs in the 10-year period 1984-1993 and their current status. 18 PP: Parallel Processing
  • 19. Reasonable Architecture Operating Systems Programming Languages Parallel Algorithms Harnessing the power latent in massive parallel microprocessor based computers, however, requires the development of 19 PP: Parallel Processing
  • 20. This paper is about the designing of efficient algorithms for real parallel computers. 20 PP: Parallel Processing
  • 21. 21
  • 22. Parallel computing is the use of a parallel computer to reduce the time needed to solve a single computational problem. Parallel computing is now considered a standard way for computational scientists and engineers to solve problems in areas as diverse as galactic evolution, climate modeling, aircraft design, and molecular dynamics. 22
  • 23. 23
  • 24. A parallel computer is a multiple processor computer system supporting parallel programming. 24 Important categories of parallel computers Multicomputers Multiprocessors
  • 25. A multicomputer is a parallel computer constructed out of multiple computers and an interconnection network. Each computer has its own memory and it is accessible by that particular processor . The processors on different computers interact by passing messages to each other. 25
  • 26. Multiprocessor is a computer system with two or more CPUs. It is highly integrated system in which all CPUs share access to a single global memory. This shared memory supports communication & synchronization among processors. 26
  • 27. It is information processing that emphasizes the concurrent manipulation of data elements belonging to one or more processes solving a single problem. A parallel computer capable of parallel processing. 27
  • 28. ‱ Sequential events or processes which seem to occur or progress at the same time. Concurrent Processing ‱ Events or processes which occur or progress at the same time Parallel Processing 28
  • 29. 29 Concurrency: Two or more threads in progress at the same time but only one executed by single CPU. Parallelism: Two or more threads executing at the same time
  • 30. 30 A supercomputer is a general purpose computer capable of solving individual problems at extremely high computational speeds, compared with other computers built during the same time.
  • 31. 31 The throughput of a devices is the number of results it produces per unit time. There are many ways to improve the throughput of a device. Speed By reducing Instruction Cycle Time Concurrency By executing more instructions per Cycle Time
  • 32. 32 Speedup is the ratio between the time needed for the most efficient sequential algorithm to perform a computation and the time needed to perform the same computation on a machine incorporating pipelining and/or parallelism. Speedup = Tsequential/ Tparallel/pipelined
  • 33. 33 A pipelined computation is divided into a number of steps called segments or stages. The output of one segment is the input of the next segment.
  • 34. 34 1 2 3 k

 k dn 

 d3 d2, d1 n * k k + (n-1) dn 

 d3 d2, d1 d1 d1d2 d1d2d3 d1dk-2dk-1dk 

 Speedup = n*k/(k+n-1) lim 𝑛→∞ 𝑘 + 𝑛 − 1 → 𝑛, 𝑆𝑝𝑒𝑒𝑑𝑱𝑝 = 𝑘
  • 35. 35 Data Parallelism is the use of multiple functional units to apply the same operation simultaneously to element of a data set. THE SAME SET OF OPERATIONS TO DIFFERENT DATA
  • 36. 36 A k-fold increase in the number of functional units leads to a k-fold increase in the throughput of the system if there is no overhead associated with the parallelism.
  • 37. A processor array is a parallel computer with set of identical, ALUs/Processing Elements (PEs) that can operate in parallel in a lock step fashion under the control of one control unit and a number of memory modules. 37
  • 38. 38 Three methods to assemble widgets. a) A sequential widget assembly machine produces one widget every three units of time. b) A three segment pipelined widget- assembly machine produces the first widget in three units of time and successive widgets every time unit thereafter. c) A three-way data-parallel widget- assembly machine produces three widgets every three units of time.
  • 39. 39 No. of Wi Seq. Pipelined Parallel Tsq Tpi Speedup Tpa Speedup 1 3 3 1 3 1 2 6 4 1.5 3 2 3 9 5 1.8 3 3 4 12 6 2 6 2 5 15 7 2.1 6 2.5 6 18 8 2.2 6 3 7 21 9 2.3 9 2.3 8 24 10 2.4 9 2.6 9 27 11 2.4 9 3 10 30 12 2.5 12 2.5 0 0.5 1 1.5 2 2.5 3 3.5 1 2 3 4 5 6 7 8 9 10 Speedup Widgets Assembled Pipelined Parallel
  • 40. Control parallelism is achieved by applying different operations to different data elements simultaneously. 40 Pipelining is a special case of control parallelism.
  • 41. Most realistic problems can exploit both data and control parallelism. Problem: Weekly maintenance of a Lawn 1. Mowing the Lawn 2. Edging the Lawn 3. Checking the Sprinkle 4. Weeding the flower beds 41
  • 42. Different workers mowing the lawn simultaneously (Data Parallelism) Other team of workers are weeding the flower bed in parallel (Control Parallelism) 42 Turn off Security System Check Sprinklers Turn on Security System Mow Lawn Edge Lawn Weed Garden
  • 43. An algorithm is scalable if the level of parallelism increases at least linearly with the problem size. An architecture is scalable if it continues to yield the same performance per processor, albeit used on a larger problem size, as the number of processors increases. Data parallel algorithms are more scalable than control parallel algorithms. Control parallelism is usually a constant, independent of the problem size, while the level of data parallelism is an increasing function of the problem size. 43
  • 44. There are different ways to classify parallel computers. One of the more widely used classifications, in use since 1966, is called Flynn's Taxonomy. Flynn's taxonomy distinguishes multi-processor computer architectures according to how they can be classified along the two independent dimensions of Instruction Stream and Data Stream. Each of these dimensions can have only one of two possible states: Single or Multiple. 44
  • 45. 45
  • 46. A serial (non-parallel) computer Single Instruction: Only one instruction stream/ clock cycle Single Data: Only one data stream/ clock cycle Deterministic execution This is the oldest type of computer Examples: older generation mainframes, minicomputers, workstations and single processor/core PCs. 46
  • 47. SIMD: A type of parallel computer Single Instruction: All processing units execute the same instruction at any given clock cycle Multiple Data: Each processing unit can operate on a different data element Two varieties: Processor Arrays and Vector Pipelines 47
  • 48. Processor Arrays: Thinking Machines CM-2, MasPar MP-1 & MP-2, ILLIAC IV Vector Pipelines: IBM 9000, Cray X-MP, Y-MP & C90, Fujitsu VP, NEC SX-2, Hitachi S820, ETA10 Most modern computers, particularly those with graphics processor units (GPUs) employ SIMD instructions and execution units. 48
  • 49. 49
  • 50. 50 (MISD): A type of parallel computer Multiple Instruction: Each processing unit operates on the data independently via separate instruction streams. Single Data: A single data stream is fed into multiple processing units. Few (if any) actual examples of this class of parallel computer have ever existed.
  • 51. 51 (MIMD): A type of parallel computer Multiple Instruction: Every processor may be executing a different instruction stream Multiple Data: Every processor may be working with a different data stream The most common type of parallel computer - most modern supercomputers fall into this category.
  • 52. 52 Examples: most current supercomputers, networked parallel computer clusters and "grids", multi-processor SMP computers, multi-core PCs.
  • 53. 53 Topic Covered in the Presentation Chapter 1: Introduction 1.1 Computational Demands of Modern Science 1.2 Advent of Practical Parallel Processing 1.3 Parallel Processing Terminology