SlideShare ist ein Scribd-Unternehmen logo
1 von 18
*



Submitted to :            Submitted by:
Ms Simarpreet             Kiran, A06
Lec : Computer            Kh. Loyanganba Meitei, 42
System and Architecture   Bca Hons, D1111
*
*Types of shared memory

  *Physically shared memory
  *Virtual (or distributed) shared memory

*Scalability issues

  *Organisation of memory
  *Design of interconnection network
  *Cache coherence protocols                2
Multi-processor:
           Structure of Shared Memory MIMD
Architectures
Design space of shared memory computers

                                                  Shared memory
                                                    computers




                                                    Interconnection                    Cache coherency
          Single address space                          scheme
             memory access



                                                                                  Hardware
                       Virtual shared       Shared path                                          Software based
Physical shared                                                                    based
                          memory                                  Switching
 memory UMA                                                        network
                                   Singled bus
                                      based

                                   Multiple bus                       Crossbar     Multistage network
                   NUMA              based

                                                Bus
                                           multiplication
                  CC-NUMA
                                           Grid of buses                  Omega        Banyan           Benes


                                            Hierarchical
                   COMA
                                              system
*
*
*Also called distributed shared memory architecture
*The local memories of multi-computer are
 components of global address space:
  * any processor can access the local memory of any other
   processor
*Three approaches:
  * Non-uniform memory access (NUMA) machines
  * Cache-only memory access (COMA) machines
  * Cache-coherent non-uniform memory access
   (CC-NUMA) machines
*
*
*Logically shared memory is physically
 distributed
*Different access of local and remote memory
 blocks. Remote access takes much more time
 – latency
*Sensitive to data and program distribution
*Close to distributed memory systems, yet the
 programming paradigm is different
*Example: Cray T3D
*
*
*
*Each block of the shared memory works as
 local cache of a processor
*Continuous, dynamic migration of data
*Hit-rate decreases the traffic on the
 Interconnection Network
*Solutions for data-consistency increase the
 same traffic (see cache coherency problem
 later)
*Examples: KSR-1, DDM
*
*
*A combination of NUMA and COMA
*Initially static data distribution, then
 dynamic data migration
*Cache coherency problem is to be solved
*COMA and CC-NUMA are used in newer
 generation of parallel computers
*Examples: Convex SPP1000, Stanford DASH,
 MIT Alewife
*


+ No need to partition data or program, uniprocessor
 programming techniques can be adapted
+ Communication between processor is efficient
+ Minor modifications of tool chain and compiler


- Synchronized access to share data in memory needed.
  Synchronizing constructs (semaphores, conditional critical
  regions, monitors) result in nondeterministic behavior which
  can lead programming errors that are difficult to discover
- Lack of scalability due to (memory) contention problem
*

*Memory Access Time
  * can be a bottleneck even in a single-processor system
*Contention for Memory
  * two or more processors want to access a location in the same
   block at the same time (hot spot problem).
*Contention for Communication
  * processors should share and use exclusively elements of the
   Interconnection Network


*Result: long latency-time, idle processors,
 nonscalable system
*
*Problems of scalable computers
  1.   Tolerate and hide latency of remote loads
  2.   Tolerate and hide idling due to synchronization
*Solutions
  1.   Cache memory
       *   problem of cache coherence
  2.   Prefetching
  3.   Threads and fast context switching
*
Than
ks...

Weitere ähnliche Inhalte

Was ist angesagt?

Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel systemManish Singh
 
Processes and Processors in Distributed Systems
Processes and Processors in Distributed SystemsProcesses and Processors in Distributed Systems
Processes and Processors in Distributed SystemsDr Sandeep Kumar Poonia
 
Agreement Protocols, distributed File Systems, Distributed Shared Memory
Agreement Protocols, distributed File Systems, Distributed Shared MemoryAgreement Protocols, distributed File Systems, Distributed Shared Memory
Agreement Protocols, distributed File Systems, Distributed Shared MemorySHIKHA GAUTAM
 
Processor allocation in Distributed Systems
Processor allocation in Distributed SystemsProcessor allocation in Distributed Systems
Processor allocation in Distributed SystemsRitu Ranjan Shrivastwa
 
Round Robin Algorithm.pptx
Round Robin Algorithm.pptxRound Robin Algorithm.pptx
Round Robin Algorithm.pptxSanad Bhowmik
 
distributed Computing system model
distributed Computing system modeldistributed Computing system model
distributed Computing system modelHarshad Umredkar
 
Physical and Logical Clocks
Physical and Logical ClocksPhysical and Logical Clocks
Physical and Logical ClocksDilum Bandara
 
Parallel processing (simd and mimd)
Parallel processing (simd and mimd)Parallel processing (simd and mimd)
Parallel processing (simd and mimd)Bhavik Vashi
 
Multiprocessor architecture
Multiprocessor architectureMultiprocessor architecture
Multiprocessor architectureArpan Baishya
 
File replication
File replicationFile replication
File replicationKlawal13
 
Advanced Operating System Lecture Notes
Advanced Operating System Lecture NotesAdvanced Operating System Lecture Notes
Advanced Operating System Lecture NotesAnirudhan Guru
 
Parallel computing and its applications
Parallel computing and its applicationsParallel computing and its applications
Parallel computing and its applicationsBurhan Ahmed
 
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptxParallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptxSumalatha A
 
Data aggregation in wireless sensor network
Data aggregation in wireless sensor networkData aggregation in wireless sensor network
Data aggregation in wireless sensor networkShiwangi Thakur
 

Was ist angesagt? (20)

Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel system
 
Processes and Processors in Distributed Systems
Processes and Processors in Distributed SystemsProcesses and Processors in Distributed Systems
Processes and Processors in Distributed Systems
 
Agreement Protocols, distributed File Systems, Distributed Shared Memory
Agreement Protocols, distributed File Systems, Distributed Shared MemoryAgreement Protocols, distributed File Systems, Distributed Shared Memory
Agreement Protocols, distributed File Systems, Distributed Shared Memory
 
Processor allocation in Distributed Systems
Processor allocation in Distributed SystemsProcessor allocation in Distributed Systems
Processor allocation in Distributed Systems
 
Multi processing
Multi processingMulti processing
Multi processing
 
1.prallelism
1.prallelism1.prallelism
1.prallelism
 
Round Robin Algorithm.pptx
Round Robin Algorithm.pptxRound Robin Algorithm.pptx
Round Robin Algorithm.pptx
 
Lecture 3 threads
Lecture 3   threadsLecture 3   threads
Lecture 3 threads
 
distributed Computing system model
distributed Computing system modeldistributed Computing system model
distributed Computing system model
 
Physical and Logical Clocks
Physical and Logical ClocksPhysical and Logical Clocks
Physical and Logical Clocks
 
Parallel processing (simd and mimd)
Parallel processing (simd and mimd)Parallel processing (simd and mimd)
Parallel processing (simd and mimd)
 
Multiprocessor architecture
Multiprocessor architectureMultiprocessor architecture
Multiprocessor architecture
 
File replication
File replicationFile replication
File replication
 
Advanced Operating System Lecture Notes
Advanced Operating System Lecture NotesAdvanced Operating System Lecture Notes
Advanced Operating System Lecture Notes
 
Parallel processing
Parallel processingParallel processing
Parallel processing
 
Pram model
Pram modelPram model
Pram model
 
Parallel computing and its applications
Parallel computing and its applicationsParallel computing and its applications
Parallel computing and its applications
 
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptxParallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
 
Data aggregation in wireless sensor network
Data aggregation in wireless sensor networkData aggregation in wireless sensor network
Data aggregation in wireless sensor network
 
6.distributed shared memory
6.distributed shared memory6.distributed shared memory
6.distributed shared memory
 

Andere mochten auch

Lecture 2
Lecture 2Lecture 2
Lecture 2Mr SMAK
 
The Flynn Effect
The Flynn EffectThe Flynn Effect
The Flynn EffectBhatt83
 
Lecture 6
Lecture  6Lecture  6
Lecture 6Mr SMAK
 
message passing vs shared memory
message passing vs shared memorymessage passing vs shared memory
message passing vs shared memoryHamza Zahid
 
Computer architecture
Computer architecture Computer architecture
Computer architecture Ashish Kumar
 
Parallel computing
Parallel computingParallel computing
Parallel computingVinay Gupta
 
Flynns classification
Flynns classificationFlynns classification
Flynns classificationYasir Khan
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classificationNarayan Kandel
 

Andere mochten auch (10)

Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Aca 2
Aca 2Aca 2
Aca 2
 
The Flynn Effect
The Flynn EffectThe Flynn Effect
The Flynn Effect
 
NUMA overview
NUMA overviewNUMA overview
NUMA overview
 
Lecture 6
Lecture  6Lecture  6
Lecture 6
 
message passing vs shared memory
message passing vs shared memorymessage passing vs shared memory
message passing vs shared memory
 
Computer architecture
Computer architecture Computer architecture
Computer architecture
 
Parallel computing
Parallel computingParallel computing
Parallel computing
 
Flynns classification
Flynns classificationFlynns classification
Flynns classification
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classification
 

Ähnlich wie Mimd

Ceg4131 models
Ceg4131 modelsCeg4131 models
Ceg4131 modelsanandme07
 
Communication model of parallel platforms
Communication model of parallel platformsCommunication model of parallel platforms
Communication model of parallel platformsSyed Zaid Irshad
 
Lecture 6
Lecture  6Lecture  6
Lecture 6Mr SMAK
 
Lecture 6
Lecture  6Lecture  6
Lecture 6Mr SMAK
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1Mr SMAK
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1Mr SMAK
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1Mr SMAK
 
network ram parallel computing
network ram parallel computingnetwork ram parallel computing
network ram parallel computingNiranjana Ambadi
 
Distributed Shared Memory
Distributed Shared MemoryDistributed Shared Memory
Distributed Shared MemoryPrakhar Rastogi
 
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...langonej
 
Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)Emulex Corporation
 
Overview of Distributed Systems
Overview of Distributed SystemsOverview of Distributed Systems
Overview of Distributed Systemsvampugani
 
Roeder posterismb2010
Roeder posterismb2010Roeder posterismb2010
Roeder posterismb2010Chris Roeder
 
Oracle rac 10g best practices
Oracle rac 10g best practicesOracle rac 10g best practices
Oracle rac 10g best practicesHaseeb Alam
 
Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)Nakul Manchanda
 
multiprocessor _system _presentation.ppt
multiprocessor _system _presentation.pptmultiprocessor _system _presentation.ppt
multiprocessor _system _presentation.pptnaghamallella
 
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...peknap
 
T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21Dandi Aulia
 

Ähnlich wie Mimd (20)

Ceg4131 models
Ceg4131 modelsCeg4131 models
Ceg4131 models
 
Communication model of parallel platforms
Communication model of parallel platformsCommunication model of parallel platforms
Communication model of parallel platforms
 
Lecture 6
Lecture  6Lecture  6
Lecture 6
 
Lecture 6
Lecture  6Lecture  6
Lecture 6
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1
 
Overview on NUMA
Overview on NUMAOverview on NUMA
Overview on NUMA
 
network ram parallel computing
network ram parallel computingnetwork ram parallel computing
network ram parallel computing
 
Distributed Shared Memory
Distributed Shared MemoryDistributed Shared Memory
Distributed Shared Memory
 
Intro_ppt.pptx
Intro_ppt.pptxIntro_ppt.pptx
Intro_ppt.pptx
 
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
 
Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)
 
Overview of Distributed Systems
Overview of Distributed SystemsOverview of Distributed Systems
Overview of Distributed Systems
 
Roeder posterismb2010
Roeder posterismb2010Roeder posterismb2010
Roeder posterismb2010
 
Oracle rac 10g best practices
Oracle rac 10g best practicesOracle rac 10g best practices
Oracle rac 10g best practices
 
Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)
 
multiprocessor _system _presentation.ppt
multiprocessor _system _presentation.pptmultiprocessor _system _presentation.ppt
multiprocessor _system _presentation.ppt
 
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
 
T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21
 

Mimd

  • 1. * Submitted to : Submitted by: Ms Simarpreet Kiran, A06 Lec : Computer Kh. Loyanganba Meitei, 42 System and Architecture Bca Hons, D1111
  • 2. * *Types of shared memory *Physically shared memory *Virtual (or distributed) shared memory *Scalability issues *Organisation of memory *Design of interconnection network *Cache coherence protocols 2
  • 3. Multi-processor: Structure of Shared Memory MIMD Architectures
  • 4. Design space of shared memory computers Shared memory computers Interconnection Cache coherency Single address space scheme memory access Hardware Virtual shared Shared path Software based Physical shared based memory Switching memory UMA network Singled bus based Multiple bus Crossbar Multistage network NUMA based Bus multiplication CC-NUMA Grid of buses Omega Banyan Benes Hierarchical COMA system
  • 5. *
  • 6. * *Also called distributed shared memory architecture *The local memories of multi-computer are components of global address space: * any processor can access the local memory of any other processor *Three approaches: * Non-uniform memory access (NUMA) machines * Cache-only memory access (COMA) machines * Cache-coherent non-uniform memory access (CC-NUMA) machines
  • 7. *
  • 8. * *Logically shared memory is physically distributed *Different access of local and remote memory blocks. Remote access takes much more time – latency *Sensitive to data and program distribution *Close to distributed memory systems, yet the programming paradigm is different *Example: Cray T3D
  • 9. *
  • 10. *
  • 11. * *Each block of the shared memory works as local cache of a processor *Continuous, dynamic migration of data *Hit-rate decreases the traffic on the Interconnection Network *Solutions for data-consistency increase the same traffic (see cache coherency problem later) *Examples: KSR-1, DDM
  • 12. *
  • 13. * *A combination of NUMA and COMA *Initially static data distribution, then dynamic data migration *Cache coherency problem is to be solved *COMA and CC-NUMA are used in newer generation of parallel computers *Examples: Convex SPP1000, Stanford DASH, MIT Alewife
  • 14. * + No need to partition data or program, uniprocessor programming techniques can be adapted + Communication between processor is efficient + Minor modifications of tool chain and compiler - Synchronized access to share data in memory needed. Synchronizing constructs (semaphores, conditional critical regions, monitors) result in nondeterministic behavior which can lead programming errors that are difficult to discover - Lack of scalability due to (memory) contention problem
  • 15. * *Memory Access Time * can be a bottleneck even in a single-processor system *Contention for Memory * two or more processors want to access a location in the same block at the same time (hot spot problem). *Contention for Communication * processors should share and use exclusively elements of the Interconnection Network *Result: long latency-time, idle processors, nonscalable system
  • 16. * *Problems of scalable computers 1. Tolerate and hide latency of remote loads 2. Tolerate and hide idling due to synchronization *Solutions 1. Cache memory * problem of cache coherence 2. Prefetching 3. Threads and fast context switching
  • 17. *