SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Advance Computer Architecture
Waqas khan
MSCS
 Multi-Core Computers

Multithreading


GPUs
Introduction To Multi-Core System

 Integration of multiple processor cores on a single chip.
 Multi-core processor is a special kind of multiprocessors:
    All   processors are on the same chip also called Chip
     Multiprocessor.
    Different cores execute different codes (threads) operating on
     different data.
    A shared memory multiprocessor: all cores share the same
     memory via some cache organization
 Provides a cheap parallel computer solution.
 Increases the computation power to PC platform.
Why Multi-Core?
 Limitations of single core architectures:
     High power consumption due to high clock rates (2-3% power
       increase per 1% performance increase).
      Heat generation (cooling is expensive).
      Limited parallelism (Instruction Level Parallelism only).
      Design time and complexity increased due to complex methods to
       increase ILP.
   Many new applications are multithreaded, suitable for multi-core.
      Ex. Multimedia applications.
   General trend in computer architecture (shift towards more
    parallelism).
   Much faster cache coherency circuits, in a single chip.
   Smaller in physical size than SMP.
Single Core VS Multi Core
A Dual-Core Intel Processor
Intel Polaris
 80 cores with Teraflop (10^12)
  performance on a single chip (1st
  chip to do so).
 Mesh network-on-a-chip.
 Frequency target at 5GHz.
 Workload-aware power
  management:
    Instructions to make any core
     sleep or wake as apps
     demand.
    Chip voltage & frequency
     control.
    Peak of 1.01 Teraflops at 62
     watts.
    Peak power efficiency of 19.4
    Short design time.
Innovations on Intel’s Polaris
 Rapid design – The tiled-design approach allows designers to use
  smaller cores that can easily be repeated across the chip.

 Network-on-a-chip – The cores are connected in a 2D mesh
  network that implement message-passing. This scheme is much
  more scalable.

 Fine-grain power management - The individual compute engines
  and data routers in each core can be activated or put to sleep based
  on the performance required by the applications.
What applications benefit From Multicore
Processors


 • Database servers
 • Web servers (Web commerce)
 • Compilers
 • Multimedia applications
 • Scientific applications,
 • In general, applications with
  Thread-level parallelism(as opposed to instruction
  level parallelism)
Multi-Core Computers



Multithreading

GPUs
2. Multi-Threading
Thread-Level Parallelism (TLP)
 This is parallelism on a more coarser scale than instruction-level
  parallelism (ILP).
 Instruction stream divided into smaller streams (threads) to be
  executed in parallel.
 Thread has its own instructions and data.
    May be part of a parallel program or independent programs.
    Each thread has all state (instructions, data, PC, etc.) needed to
     execute.
 Single-core superscalar processors cannot fully exploit TLP.
 Multi-core architectures can exploit TLP efficiently.
    Use multiple instruction streams to improve the throughput of
     computers that run several programs .
 TLP are more cost-effective to exploit than ILP.
Threads vs. Processes
 Process: An instance of a program running on a computer.
    Resource ownership.
        Virtual address space to hold process image including program, data,
         stack, and attributes.
    Execution of a process follows a path though the program.
    Process switch - An expensive operation due to the need to
     save the control data and register contents.
 Thread: A dispatchable unit of work within a process.
    Interruptible: processor can turn to another thread.
    All threads within a process share code and data segments.
    Thread switch is usually much less costly than process switch.
Multithreading Approaches
 Interleaved (fine-grained)
    Processor deals with several thread contexts at a time.
    Switching thread at each clock cycle (hardware need).
    If a thread is blocked, it is skipped.
    Hide latency of both short and long pipeline stalls.
 Blocked (coarse-grained)
    Thread executed until an event causes delay (e.g., cache miss).
    Relieves the need to have very fast thread switching.
    No slow down for ready-to-go threads.
 Simultaneous (SMT)
    Instructions simultaneously issued from multiple threads to
     execution units of superscalar processor.
 Chip multiprocessing
    Each processor handles separate threads.
Multithreading Paradigms
Programming for Multi-Core (In Context Of
Multithreading)
 There must be many threads or processes:
    Multiple applications running on the same machine.
        Multi-tasking is becoming very common.
        OS software tends to run many threads as a part of its normal
         operation.

 An application may also have multiple threads.
        In most cases, it must be specifically written.

 OS scheduler should map the threads to different cores, in
  order to balance the work load or to avoid hot spots due to
  heat generation.
Multi-Core Computers


Multithreading



GPUs
Introduction To GPU
What is GPU?
• It is a processor optimized for 2D/3D graphics, video,
  visual computing, and display.
• It is highly parallel, highly multithreaded multiprocessor
  optimized for visual computing.
• It provide real-time visual interaction with computed
  objects via graphics images, and video.
• It serves as both a programmable graphics processor
  and a scalable parallel computing platform.
• Heterogeneous Systems: combine a GPU with a CPU
Graphics Processing Unit, Why?
Graphics applications are:
    Rendering of 2D or 3D images with complex optical effects;
    Highly computation intensive;
    Massively parallel;
    Data stream based.
 General purpose processors (CPU) are:
    Designed to handle huge data volumes;
    Serial, one operation at a time;
    Control flow based;
    High flexible, but not very well adapted to graphics.
 GPUs are the solutions!
GPU Design
 Process pixels in parallel
    2.3M pixels per frame
    lots of work
 All pixels are independent
    no synchronization
 Lots of spatial locality
    regular memory access
 Great speedups:
    Limited only by the amount of hardware
GPU Design
2) Focus on throughput, not latency
Each pixel can take a long time……as long as we process
  many at the same time.
 Great scalability
 Lots of simple parallel processors
 Low clock speed
CPU vs. GPU Architecture
 GPUs are throughput-optimized
    Each thread may take a long time, but thousands of threads
 CPUs are latency-optimized
    Each thread runs as fast as possible, but only a few threads

 GPUs have hundreds of simple cores
 CPUs have a few massive cores

 GPUs excel at regular math-intensive work
    Lots of ALUs for math, little hardware for control
 CPUs excel at irregular control-intensive work
    Lots of hardware for control, few ALUs
GPU Architecture Features
 Massively Parallel, 1000s of processors (today).
 Power Efficient:
    Fixed Function Hardware = area & power efficient.
    Lack of speculation. More processing, less leaky cache.
 Memory Bandwidth:
    Memory Bandwidth is limited in CPU.
    GPU is not dependent on large caches for performance.
 Computing power = Frequency * Transistors
    GPUs: 1.7X (pixels) to 2.3X (vertices) annual growth.
    CPUs: 1.4X annual growth.
Computational Power
CUDA
 CUDA = Computer Unified Device Architecture.
 A scalable parallel programming model and a software
  environment for parallel computing.
 Threads:
   GPU threads are extremely light-weight, with little creation
    overhead.
   GPU needs 1000s of threads for full efficiency.
   Multi-core CPU needed only a few.
 GPU/CUDA address all three levels of parallelism:
    Thread parallelism;
    Data parallelism;
    Task parallelism.
Summary
 All computers are now parallel computers!
 Multi-core processors represent an important new trend in
  computer architecture.
    Decreased power consumption and heat generation.
    Minimized wire lengths and interconnect latencies.
 They enable true thread-level parallelism with great energy
  efficiency and scalability.
 Graphics requires a lot of computation and a huge amount
  of bandwidth - GPUs.
 GPUs are coming to general purpose computing, since they
  deliver huge performance with small power.
THANK YOU

Weitere ähnliche Inhalte

Was ist angesagt?

GPU Computing: A brief overview
GPU Computing: A brief overviewGPU Computing: A brief overview
GPU Computing: A brief overviewRajiv Kumar
 
Multi_Core_Processor_2015_(Download it!)
Multi_Core_Processor_2015_(Download it!)Multi_Core_Processor_2015_(Download it!)
Multi_Core_Processor_2015_(Download it!)Sudip Roy
 
Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)Saksham Tanwar
 
Hyper Threading Technology
Hyper Threading TechnologyHyper Threading Technology
Hyper Threading Technologynayakslideshare
 
An introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAn introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAlessio Villardita
 
Quad Core Processors - Technology Presentation
Quad Core Processors - Technology PresentationQuad Core Processors - Technology Presentation
Quad Core Processors - Technology Presentationvinaya.hs
 
ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...
ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...
ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...Bharath Sudharsan
 
Parallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and DisadvantagesParallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and DisadvantagesMurtadha Alsabbagh
 
Parallel computing and its applications
Parallel computing and its applicationsParallel computing and its applications
Parallel computing and its applicationsBurhan Ahmed
 
Multiprocessor Architecture for Image Processing
Multiprocessor Architecture for Image ProcessingMultiprocessor Architecture for Image Processing
Multiprocessor Architecture for Image Processingmayank.grd
 
Ximea - the pc camera, 90 gflps smart camera
Ximea  - the pc camera, 90 gflps smart cameraXimea  - the pc camera, 90 gflps smart camera
Ximea - the pc camera, 90 gflps smart cameraXIMEA
 
Graphics processing unit ppt
Graphics processing unit pptGraphics processing unit ppt
Graphics processing unit pptSandeep Singh
 
GPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel ProcessingGPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel ProcessingMohammed Billoo
 

Was ist angesagt? (19)

GPU Computing: A brief overview
GPU Computing: A brief overviewGPU Computing: A brief overview
GPU Computing: A brief overview
 
Multi_Core_Processor_2015_(Download it!)
Multi_Core_Processor_2015_(Download it!)Multi_Core_Processor_2015_(Download it!)
Multi_Core_Processor_2015_(Download it!)
 
Tensor Processing Unit (TPU)
Tensor Processing Unit (TPU)Tensor Processing Unit (TPU)
Tensor Processing Unit (TPU)
 
Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)Nvidia (History, GPU Architecture and New Pascal Architecture)
Nvidia (History, GPU Architecture and New Pascal Architecture)
 
Gpu databases
Gpu databasesGpu databases
Gpu databases
 
Hyper Threading Technology
Hyper Threading TechnologyHyper Threading Technology
Hyper Threading Technology
 
An introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAn introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale Computers
 
Quad Core Processors - Technology Presentation
Quad Core Processors - Technology PresentationQuad Core Processors - Technology Presentation
Quad Core Processors - Technology Presentation
 
ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...
ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...
ECML PKDD 2021 ML meets IoT Tutorial Part II: Creating ML based Self learning...
 
Parallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and DisadvantagesParallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and Disadvantages
 
Parallel computing and its applications
Parallel computing and its applicationsParallel computing and its applications
Parallel computing and its applications
 
Multiprocessor Architecture for Image Processing
Multiprocessor Architecture for Image ProcessingMultiprocessor Architecture for Image Processing
Multiprocessor Architecture for Image Processing
 
Ximea - the pc camera, 90 gflps smart camera
Ximea  - the pc camera, 90 gflps smart cameraXimea  - the pc camera, 90 gflps smart camera
Ximea - the pc camera, 90 gflps smart camera
 
GPU Programming
GPU ProgrammingGPU Programming
GPU Programming
 
Multicore Processor Technology
Multicore Processor TechnologyMulticore Processor Technology
Multicore Processor Technology
 
Graphics processing unit ppt
Graphics processing unit pptGraphics processing unit ppt
Graphics processing unit ppt
 
GPU Computing
GPU ComputingGPU Computing
GPU Computing
 
Google TPU
Google TPUGoogle TPU
Google TPU
 
GPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel ProcessingGPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel Processing
 

Ähnlich wie processor struct

Stream Processing
Stream ProcessingStream Processing
Stream Processingarnamoy10
 
Multicore processor by Ankit Raj and Akash Prajapati
Multicore processor by Ankit Raj and Akash PrajapatiMulticore processor by Ankit Raj and Akash Prajapati
Multicore processor by Ankit Raj and Akash PrajapatiAnkit Raj
 
Intel new processors
Intel new processorsIntel new processors
Intel new processorszaid_b
 
Study of various factors affecting performance of multi core processors
Study of various factors affecting performance of multi core processorsStudy of various factors affecting performance of multi core processors
Study of various factors affecting performance of multi core processorsateeq ateeq
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialmadhuinturi
 
Computer architecture multi core processor
Computer architecture multi core processorComputer architecture multi core processor
Computer architecture multi core processorMazin Alwaaly
 
Blue gene- IBM's SuperComputer
Blue gene- IBM's SuperComputerBlue gene- IBM's SuperComputer
Blue gene- IBM's SuperComputerIsaaq Mohammed
 
The evolution of computer
The evolution of computerThe evolution of computer
The evolution of computerLolita De Leon
 
Implementation of RISC-Based Architecture for Low power applications
Implementation of RISC-Based Architecture for Low power applicationsImplementation of RISC-Based Architecture for Low power applications
Implementation of RISC-Based Architecture for Low power applicationsIOSR Journals
 
trends of microprocessor field
trends of microprocessor fieldtrends of microprocessor field
trends of microprocessor fieldRamya SK
 
Hyper threading technology
Hyper threading technologyHyper threading technology
Hyper threading technologyNikhil Venugopal
 
Parallel and Distributed Computing chapter 3
Parallel and Distributed Computing chapter 3Parallel and Distributed Computing chapter 3
Parallel and Distributed Computing chapter 3AbdullahMunir32
 
Area Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorArea Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorIOSR Journals
 

Ähnlich wie processor struct (20)

Stream Processing
Stream ProcessingStream Processing
Stream Processing
 
Multicore processor by Ankit Raj and Akash Prajapati
Multicore processor by Ankit Raj and Akash PrajapatiMulticore processor by Ankit Raj and Akash Prajapati
Multicore processor by Ankit Raj and Akash Prajapati
 
Intel new processors
Intel new processorsIntel new processors
Intel new processors
 
Distributed Computing
Distributed ComputingDistributed Computing
Distributed Computing
 
Study of various factors affecting performance of multi core processors
Study of various factors affecting performance of multi core processorsStudy of various factors affecting performance of multi core processors
Study of various factors affecting performance of multi core processors
 
Module 1 unit 3
Module 1  unit 3Module 1  unit 3
Module 1 unit 3
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorial
 
Computer architecture multi core processor
Computer architecture multi core processorComputer architecture multi core processor
Computer architecture multi core processor
 
Blue gene- IBM's SuperComputer
Blue gene- IBM's SuperComputerBlue gene- IBM's SuperComputer
Blue gene- IBM's SuperComputer
 
Lecture1
Lecture1Lecture1
Lecture1
 
The evolution of computer
The evolution of computerThe evolution of computer
The evolution of computer
 
Implementation of RISC-Based Architecture for Low power applications
Implementation of RISC-Based Architecture for Low power applicationsImplementation of RISC-Based Architecture for Low power applications
Implementation of RISC-Based Architecture for Low power applications
 
trends of microprocessor field
trends of microprocessor fieldtrends of microprocessor field
trends of microprocessor field
 
Exascale Capabl
Exascale CapablExascale Capabl
Exascale Capabl
 
Multi-Core on Chip Architecture *doc - IK
Multi-Core on Chip Architecture *doc - IKMulti-Core on Chip Architecture *doc - IK
Multi-Core on Chip Architecture *doc - IK
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
Hyper threading technology
Hyper threading technologyHyper threading technology
Hyper threading technology
 
Parallel and Distributed Computing chapter 3
Parallel and Distributed Computing chapter 3Parallel and Distributed Computing chapter 3
Parallel and Distributed Computing chapter 3
 
CPU vs GPU Comparison
CPU  vs GPU ComparisonCPU  vs GPU Comparison
CPU vs GPU Comparison
 
Area Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorArea Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips Processor
 

Kürzlich hochgeladen

AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 

Kürzlich hochgeladen (20)

AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 

processor struct

  • 3. Introduction To Multi-Core System  Integration of multiple processor cores on a single chip.  Multi-core processor is a special kind of multiprocessors:  All processors are on the same chip also called Chip Multiprocessor.  Different cores execute different codes (threads) operating on different data.  A shared memory multiprocessor: all cores share the same memory via some cache organization  Provides a cheap parallel computer solution.  Increases the computation power to PC platform.
  • 4. Why Multi-Core?  Limitations of single core architectures:  High power consumption due to high clock rates (2-3% power increase per 1% performance increase).  Heat generation (cooling is expensive).  Limited parallelism (Instruction Level Parallelism only).  Design time and complexity increased due to complex methods to increase ILP.  Many new applications are multithreaded, suitable for multi-core.  Ex. Multimedia applications.  General trend in computer architecture (shift towards more parallelism).  Much faster cache coherency circuits, in a single chip.  Smaller in physical size than SMP.
  • 5. Single Core VS Multi Core
  • 6. A Dual-Core Intel Processor
  • 7. Intel Polaris  80 cores with Teraflop (10^12) performance on a single chip (1st chip to do so).  Mesh network-on-a-chip.  Frequency target at 5GHz.  Workload-aware power management:  Instructions to make any core sleep or wake as apps demand.  Chip voltage & frequency control.  Peak of 1.01 Teraflops at 62 watts.  Peak power efficiency of 19.4  Short design time.
  • 8. Innovations on Intel’s Polaris  Rapid design – The tiled-design approach allows designers to use smaller cores that can easily be repeated across the chip.  Network-on-a-chip – The cores are connected in a 2D mesh network that implement message-passing. This scheme is much more scalable.  Fine-grain power management - The individual compute engines and data routers in each core can be activated or put to sleep based on the performance required by the applications.
  • 9. What applications benefit From Multicore Processors  • Database servers  • Web servers (Web commerce)  • Compilers  • Multimedia applications  • Scientific applications,  • In general, applications with Thread-level parallelism(as opposed to instruction level parallelism)
  • 11. 2. Multi-Threading Thread-Level Parallelism (TLP)  This is parallelism on a more coarser scale than instruction-level parallelism (ILP).  Instruction stream divided into smaller streams (threads) to be executed in parallel.  Thread has its own instructions and data.  May be part of a parallel program or independent programs.  Each thread has all state (instructions, data, PC, etc.) needed to execute.  Single-core superscalar processors cannot fully exploit TLP.  Multi-core architectures can exploit TLP efficiently.  Use multiple instruction streams to improve the throughput of computers that run several programs .  TLP are more cost-effective to exploit than ILP.
  • 12. Threads vs. Processes  Process: An instance of a program running on a computer.  Resource ownership.  Virtual address space to hold process image including program, data, stack, and attributes.  Execution of a process follows a path though the program.  Process switch - An expensive operation due to the need to save the control data and register contents.  Thread: A dispatchable unit of work within a process.  Interruptible: processor can turn to another thread.  All threads within a process share code and data segments.  Thread switch is usually much less costly than process switch.
  • 13. Multithreading Approaches  Interleaved (fine-grained)  Processor deals with several thread contexts at a time.  Switching thread at each clock cycle (hardware need).  If a thread is blocked, it is skipped.  Hide latency of both short and long pipeline stalls.  Blocked (coarse-grained)  Thread executed until an event causes delay (e.g., cache miss).  Relieves the need to have very fast thread switching.  No slow down for ready-to-go threads.  Simultaneous (SMT)  Instructions simultaneously issued from multiple threads to execution units of superscalar processor.  Chip multiprocessing  Each processor handles separate threads.
  • 15. Programming for Multi-Core (In Context Of Multithreading)  There must be many threads or processes:  Multiple applications running on the same machine.  Multi-tasking is becoming very common.  OS software tends to run many threads as a part of its normal operation.  An application may also have multiple threads.  In most cases, it must be specifically written.  OS scheduler should map the threads to different cores, in order to balance the work load or to avoid hot spots due to heat generation.
  • 17. Introduction To GPU What is GPU? • It is a processor optimized for 2D/3D graphics, video, visual computing, and display. • It is highly parallel, highly multithreaded multiprocessor optimized for visual computing. • It provide real-time visual interaction with computed objects via graphics images, and video. • It serves as both a programmable graphics processor and a scalable parallel computing platform. • Heterogeneous Systems: combine a GPU with a CPU
  • 18. Graphics Processing Unit, Why? Graphics applications are:  Rendering of 2D or 3D images with complex optical effects;  Highly computation intensive;  Massively parallel;  Data stream based.  General purpose processors (CPU) are:  Designed to handle huge data volumes;  Serial, one operation at a time;  Control flow based;  High flexible, but not very well adapted to graphics.  GPUs are the solutions!
  • 19. GPU Design  Process pixels in parallel  2.3M pixels per frame  lots of work  All pixels are independent  no synchronization  Lots of spatial locality  regular memory access  Great speedups:  Limited only by the amount of hardware
  • 20. GPU Design 2) Focus on throughput, not latency Each pixel can take a long time……as long as we process many at the same time.  Great scalability  Lots of simple parallel processors  Low clock speed
  • 21. CPU vs. GPU Architecture  GPUs are throughput-optimized  Each thread may take a long time, but thousands of threads  CPUs are latency-optimized  Each thread runs as fast as possible, but only a few threads  GPUs have hundreds of simple cores  CPUs have a few massive cores  GPUs excel at regular math-intensive work  Lots of ALUs for math, little hardware for control  CPUs excel at irregular control-intensive work  Lots of hardware for control, few ALUs
  • 22. GPU Architecture Features  Massively Parallel, 1000s of processors (today).  Power Efficient:  Fixed Function Hardware = area & power efficient.  Lack of speculation. More processing, less leaky cache.  Memory Bandwidth:  Memory Bandwidth is limited in CPU.  GPU is not dependent on large caches for performance.  Computing power = Frequency * Transistors  GPUs: 1.7X (pixels) to 2.3X (vertices) annual growth.  CPUs: 1.4X annual growth.
  • 24. CUDA  CUDA = Computer Unified Device Architecture.  A scalable parallel programming model and a software environment for parallel computing.  Threads:  GPU threads are extremely light-weight, with little creation overhead.  GPU needs 1000s of threads for full efficiency.  Multi-core CPU needed only a few.  GPU/CUDA address all three levels of parallelism:  Thread parallelism;  Data parallelism;  Task parallelism.
  • 25. Summary  All computers are now parallel computers!  Multi-core processors represent an important new trend in computer architecture.  Decreased power consumption and heat generation.  Minimized wire lengths and interconnect latencies.  They enable true thread-level parallelism with great energy efficiency and scalability.  Graphics requires a lot of computation and a huge amount of bandwidth - GPUs.  GPUs are coming to general purpose computing, since they deliver huge performance with small power.

Hinweis der Redaktion

  1. smp
  2. L1, L2 cache