SlideShare ist ein Scribd-Unternehmen logo
1 von 27
A Cost Effective Centralized
 Adaptive Routing for Networks
            on Chip
     Ran Manevich*, Israel Cidon*, Avinoam Kolodny*,
        Isask’har (Zigi) Walter* and Shmuel Wimer#

*Technion – Israel Institute        #Bar-Ilan   University
      of Technology
                                                          QNoC
                                                                         Module       Module   Module




                                                         Research
                                                                    Module   Module   Module   Module




                                                                    Module                     Module



                                                          Group     Module
                                                                                  Module

                                                                                               Module




                               May 2, 2011
Networks-on-Chip (NoCs)




         May 2, 2011
Global traffic information is essential to
make the right decision!




                           May 2, 2011
Adaptive Routing in NoCs – Local
    vs. Global Information
                                    I CAN MAKE
                   Source               IT!!!
 A Packet routed                                               Low
 from upper left to                                            Congestion

 bottom right                                                  Medium
                                                               Congestion
 corner utilizing
                                                               High
 local congestion                                              Congestion
 information.
 The same packet
 routed using
 global                                          
 information.                                    Destination


                            May 2, 2011
Route Selection - ATDOR
   ATDOR - Adaptive Toggle Dimension Ordered Routing
   Keep it simple! Centralized selection:




   The option with less congested bottleneck link is
    preferred.
   Routing tables in sources. One bit per destination.

                             May 2, 2011
ATDOR Illustration 1
   Five identical flows, 100
    MB/s each.
   Initial routing - XY

   Links modeled as M/M/1
    queues. Delay of a single
    link:
               Traffic
DLINK
           Capacity Traffic

   Links capacity is 210
    MB/s.

                                May 2, 2011
Centralized Routing – How?
• Option 1 – Continuous calculation of optimal routing
  for the active sessions:

        Achievable load balancing

        Speed and computation
        complexity

        System complexity




                             May 2, 2011
Centralized Routing – How?
• Option 2 – Iterative serial selection based on traffic
  load measurements between XY and YX for all source-
  destination pairs:

         Achievable load balancing

         Speed and computation
         complexity

         System complexity




                            May 2, 2011
ATDOR illustration 1
Step #        Re-Routed
                Flow
  1
  3
  2             1->15
                2->15
                 2->8




      Average Delay

       22 ns
       37
        ∞
                          May 2, 2011
What did we just see?
   For each flow we:
    1.     Calculated the better route.
    2.     Updated routing table of the source.
    3.     Waited for the update to take effect
           and measured global traffic load.
   Performing steps 1-3 for each flow is slow
    and not scalable.
   Steps 2 and 3 are unified for all destinations of a single source:
           Achievable load balancing
           Speed and computation complexity
           Scalability

                                 May 2, 2011
Back illustration 1
Step #        Re-Routed
                Flow
  4
  3
  1             4->15
                1->15
                2->8
  5
  2
                2->15




      Average Delay

       22 ns
        ∞
                          May 2, 2011
Problem #1
    Changing routing may enhance
     congestion and cause fluctuations.

    Solution: Change routing only if the
     alternative is better by the margin α, 0<
     α <1:
if (Current Route = XY)
                YX if MAX[Load YX ] a MAX[Load XY ]
    NextRoute =
                XY if MAX[Load YX ] > a MAX[Load XY ]
elseif (Current Route = YX)
                XY if MAX[Load XY ] a MAX[Load YX ]
    NextRoute =
                YX if MAX[Load XY ] > a MAX[Load YX ]


                                 May 2, 2011
ATDOR illustration 2
Step #        Re-Routed
                Flow
                1->14

  3
  2
  1             1->15

                1->16



      Average Delay

          ∞
                          May 2, 2011
Problem #2
   Coupling among flows sharing the same
    source.

   Solution: Re-Routing counters CI,J count
    routing changes of flows from source I to
    destination J (FI,J). When CI,J reaches a
    limit LI,J, routing of FI,J is locked. A
    Possible definition of Limits LI,J :


         LI , J   (I   J ) mod 3


                              May 2, 2011
Back to illustration 2
    Flows        R. Changes
                     Left
    1->16            0
                     1
                     2
    1->15            0
                     1
    2->14
    1->14            0


LI , J      (I   J ) mod 3

         Average Delay

          22
          73 ns
           ∞
                              May 2, 2011
Bring it all together
   Flows         R. Changes
                     Left
    1->15                0
                         1
     2->8                0
                         1
    2->15                0
                         1
                         2
    4->15                0
                         1


LI , J      (I   J ) mod 3

         Average Delay

            14
            22 ns
             ∞
                              May 2, 2011
Centralized Adaptive Routing for
      NoCs - Architecture
   Local traffic load
    measurements
    inside the routers.

   Traffic load
    measurements
    aggregation into
    Traffic Load Maps.

   Routing control.


                          May 2, 2011
Load Measurements Aggregation
   An illustration of
    aggregation of load
    values in a 4X4 2D
    mesh.
   A congestion value is
    written to each traffic
    load map every clock
    cycle.




                              May 2, 2011
ATDOR – Route Selection Circuit
      Maximally loaded links of the two
       alternatives are compared. Next
       route:
    if(Current Route = XY)
                  YX    if MAX[Load YX ] a MAX[Load XY ]
    NextRoute =
                  XY if MAX[Load YX ] > a MAX[Load XY ]
    elseif(Current Route = YX)
                  XY    if MAX[Load XY ] a MAX[Load YX ]
    NextRoute =
                  YX if MAX[Load XY ] > a MAX[Load YX ]
    0 < a <1


• Combinatorial pipelined
  implementation.
     Result every ATDOR clock cycle.


                                                May 2, 2011
Hardware Requirements
   The whole mechanism
    was implemented on
    xc5vlx50t VIRTEX 5
    FPGA.

   Estimated area for 45nm
    technology node.



   Per-Router hardware overheads in % for a NoC with typical size
    (50 KGates) virtual channel routers.


                              May 2, 2011
Average Packet Delay – Uniform
            Traffic
• Average delay vs. average load in links normalized to links
  capacity. 8X8 2D Mesh. Uniform traffic pattern.




                         May 2, 2011
Average Packet Delay – Transpose
             Traffic
 • Average delay vs. average load in links normalized to links
   capacity. 8X8 2D Mesh. Transpose traffic pattern.




                          May 2, 2011
Average Packet Delay – Hotspot
            Traffic
• Average delay vs. average load in links normalized to links
  capacity. 8X8 2D Mesh. 4 Hotspots traffic pattern.




                         May 2, 2011
Control Iteration Duration
• Number of re-routed flows vs. time.
• 8X8 2D Mesh, ATDOR clock of 100 MHz.




             α = 15/16                   α = 3/4


                         May 2, 2011
CMP DNUCA - Architecture
• 8X8 CMP DNUCA (Dynamic Non Uniform Cache Array)
  with 8 CPUs and 56 cache banks:




                     May 2, 2011
CMP DNUCA – Saturation
         Throughput
• Saturation throughput - Splash 2 and Parsec benchmarks
  on 8X8 CMP DNUCA with 8 CPUs and 56 cache banks:




                       May 2, 2011
Conclusions
• Centralized adaptive routing is feasible for NoCs.

   ATDOR: Centralized selection between XY and
    YX for each source-destination pair.
   Hardware overhead: <4% of an 8X8 typical NoC.
   Average saturation throughput improvement:
                          Vs. O1TURN       Vs. RCA
     Synthetic Patterns     19.3%          12.1%
     Spash 2 and Parsec     22.8%          12.8%
        Benchmarks


                          May 2, 2011

Weitere ähnliche Inhalte

Was ist angesagt?

IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD Editor
 
Efficient load aware routing scheme
Efficient load aware routing schemeEfficient load aware routing scheme
Efficient load aware routing scheme
ambitlick
 

Was ist angesagt? (9)

Term paper presentation
Term paper presentationTerm paper presentation
Term paper presentation
 
IRJET- A Novel Design of Flip Flop and its Application in Up Counter
IRJET-  	  A Novel Design of Flip Flop and its Application in Up CounterIRJET-  	  A Novel Design of Flip Flop and its Application in Up Counter
IRJET- A Novel Design of Flip Flop and its Application in Up Counter
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
Simulation and Performance Analysis of Long Term Evolution (LTE) Cellular Net...
 
Design and Analysis of Sequential Elements for Low Power Clocking System with...
Design and Analysis of Sequential Elements for Low Power Clocking System with...Design and Analysis of Sequential Elements for Low Power Clocking System with...
Design and Analysis of Sequential Elements for Low Power Clocking System with...
 
A Single-Phase Clock Multiband Low-Power Flexible Divider
A Single-Phase Clock Multiband Low-Power Flexible DividerA Single-Phase Clock Multiband Low-Power Flexible Divider
A Single-Phase Clock Multiband Low-Power Flexible Divider
 
Efficient load aware routing scheme
Efficient load aware routing schemeEfficient load aware routing scheme
Efficient load aware routing scheme
 
Bm044394397
Bm044394397Bm044394397
Bm044394397
 
Stack Contention-alleviated Precharge Keeper for Pseudo Domino Logic
Stack Contention-alleviated Precharge Keeper for Pseudo Domino LogicStack Contention-alleviated Precharge Keeper for Pseudo Domino Logic
Stack Contention-alleviated Precharge Keeper for Pseudo Domino Logic
 

Andere mochten auch

A novel adaptive algorithm for removal of power line interference from ecg si...
A novel adaptive algorithm for removal of power line interference from ecg si...A novel adaptive algorithm for removal of power line interference from ecg si...
A novel adaptive algorithm for removal of power line interference from ecg si...
International Journal of Computer and Communication System Engineering
 
An Adaptive Routing Algorithm for Communication Networks using Back Pressure...
An Adaptive Routing Algorithm for Communication Networks  using Back Pressure...An Adaptive Routing Algorithm for Communication Networks  using Back Pressure...
An Adaptive Routing Algorithm for Communication Networks using Back Pressure...
IJMER
 
Anna university-ug-pg-ppt-presentation-format
Anna university-ug-pg-ppt-presentation-formatAnna university-ug-pg-ppt-presentation-format
Anna university-ug-pg-ppt-presentation-format
Veera Victory
 
Designing of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration scheme
Designing of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration schemeDesigning of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration scheme
Designing of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration scheme
IOSR Journals
 
Presentation on project report
Presentation on project reportPresentation on project report
Presentation on project report
ramesh_x
 

Andere mochten auch (16)

A novel adaptive algorithm for removal of power line interference from ecg si...
A novel adaptive algorithm for removal of power line interference from ecg si...A novel adaptive algorithm for removal of power line interference from ecg si...
A novel adaptive algorithm for removal of power line interference from ecg si...
 
An Adaptive Routing Algorithm for Communication Networks using Back Pressure...
An Adaptive Routing Algorithm for Communication Networks  using Back Pressure...An Adaptive Routing Algorithm for Communication Networks  using Back Pressure...
An Adaptive Routing Algorithm for Communication Networks using Back Pressure...
 
NoC simulators presentation
NoC simulators presentationNoC simulators presentation
NoC simulators presentation
 
Presentation reliable NoC
Presentation reliable NoCPresentation reliable NoC
Presentation reliable NoC
 
Routing Presentation
Routing PresentationRouting Presentation
Routing Presentation
 
Noc ajal final
Noc ajal  finalNoc ajal  final
Noc ajal final
 
Router architectures in no c
Router architectures in no cRouter architectures in no c
Router architectures in no c
 
Parallel Computing on the GPU
Parallel Computing on the GPUParallel Computing on the GPU
Parallel Computing on the GPU
 
Jamming aware traffic allocation for multiple-path routing using portfolio se...
Jamming aware traffic allocation for multiple-path routing using portfolio se...Jamming aware traffic allocation for multiple-path routing using portfolio se...
Jamming aware traffic allocation for multiple-path routing using portfolio se...
 
Sample PowerPoint for Project Review
Sample PowerPoint for Project ReviewSample PowerPoint for Project Review
Sample PowerPoint for Project Review
 
Anna university-ug-pg-ppt-presentation-format
Anna university-ug-pg-ppt-presentation-formatAnna university-ug-pg-ppt-presentation-format
Anna university-ug-pg-ppt-presentation-format
 
Designing of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration scheme
Designing of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration schemeDesigning of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration scheme
Designing of a AMBA-AHB Multilayer Bus matrix Self-Motivated Arbitration scheme
 
First Review(Ppt)
First Review(Ppt)First Review(Ppt)
First Review(Ppt)
 
Presentation on project report
Presentation on project reportPresentation on project report
Presentation on project report
 
Final Year Project Presentation
Final Year Project PresentationFinal Year Project Presentation
Final Year Project Presentation
 
Neural network & its applications
Neural network & its applications Neural network & its applications
Neural network & its applications
 

Ähnlich wie Cost Effective centralized adpative routing for networks on chip

Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015
Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015
Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015
Hanin Almutairi
 
11.signal strength based congestion control in manet
11.signal strength based congestion control in manet11.signal strength based congestion control in manet
11.signal strength based congestion control in manet
Alexander Decker
 
An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...
An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...
An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...
ijfcstjournal
 
AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...
AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...
AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...
ijfcstjournal
 
Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286
Editor IJARCET
 
Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286
Editor IJARCET
 

Ähnlich wie Cost Effective centralized adpative routing for networks on chip (20)

IRJET- Comparison of ATTEMPT and SIMPLE Protocols for Wireless Body Area Netw...
IRJET- Comparison of ATTEMPT and SIMPLE Protocols for Wireless Body Area Netw...IRJET- Comparison of ATTEMPT and SIMPLE Protocols for Wireless Body Area Netw...
IRJET- Comparison of ATTEMPT and SIMPLE Protocols for Wireless Body Area Netw...
 
Extension of Optimized Linked State Routing Protocol For Energy Efficient App...
Extension of Optimized Linked State Routing Protocol For Energy Efficient App...Extension of Optimized Linked State Routing Protocol For Energy Efficient App...
Extension of Optimized Linked State Routing Protocol For Energy Efficient App...
 
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
 
Ijcnc050204
Ijcnc050204Ijcnc050204
Ijcnc050204
 
Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015
Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015
Hanin Almutairi_Hanin.Almutairi@unb.ca_expo2015
 
Review on variants of Power Aware AODV
Review on variants of Power Aware AODVReview on variants of Power Aware AODV
Review on variants of Power Aware AODV
 
4..[26 36]signal strength based congestion control in manet
4..[26 36]signal strength based congestion control in manet4..[26 36]signal strength based congestion control in manet
4..[26 36]signal strength based congestion control in manet
 
11.signal strength based congestion control in manet
11.signal strength based congestion control in manet11.signal strength based congestion control in manet
11.signal strength based congestion control in manet
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
An Analytical Analysis Of Neighbour And Isolatednodefor Geocast Routing In Va...
An Analytical Analysis Of Neighbour And Isolatednodefor Geocast Routing In Va...An Analytical Analysis Of Neighbour And Isolatednodefor Geocast Routing In Va...
An Analytical Analysis Of Neighbour And Isolatednodefor Geocast Routing In Va...
 
Enhancement of Routing Performance for Energy Efficiency and Critical Event M...
Enhancement of Routing Performance for Energy Efficiency and Critical Event M...Enhancement of Routing Performance for Energy Efficiency and Critical Event M...
Enhancement of Routing Performance for Energy Efficiency and Critical Event M...
 
M017358794
M017358794M017358794
M017358794
 
An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...
An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...
An Evaluation on Selfish Behaviour Attack and Jellyfish Attacks Under AODV Ro...
 
AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...
AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...
AN EVALUATION ON SELFISH BEHAVIOUR ATTACK AND JELLYFISH ATTACKS UNDER AODV RO...
 
Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286
 
Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286Ijarcet vol-2-issue-7-2281-2286
Ijarcet vol-2-issue-7-2281-2286
 
Energy aware routing for adhoc
Energy aware routing for adhocEnergy aware routing for adhoc
Energy aware routing for adhoc
 
Performance Evaluation of DSDV & OLSR Proactive Protocols in MANET
Performance Evaluation of DSDV & OLSR Proactive Protocols in MANETPerformance Evaluation of DSDV & OLSR Proactive Protocols in MANET
Performance Evaluation of DSDV & OLSR Proactive Protocols in MANET
 
Performance Evaluation of a Layered WSN Using AODV and MCF Protocols in NS-2
Performance Evaluation of a Layered WSN Using AODV and MCF Protocols in NS-2Performance Evaluation of a Layered WSN Using AODV and MCF Protocols in NS-2
Performance Evaluation of a Layered WSN Using AODV and MCF Protocols in NS-2
 

Mehr von chiportal

Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
chiportal
 

Mehr von chiportal (20)

Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Uri Weiser,Technion
Prof. Uri Weiser,TechnionProf. Uri Weiser,Technion
Prof. Uri Weiser,Technion
 
Ken Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, FaradayKen Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, Faraday
 
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 Prof. Danny Raz, Director, Bell Labs Israel, Nokia  Prof. Danny Raz, Director, Bell Labs Israel, Nokia
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, SynopsysMarco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
 
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzDr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazz
 
Eddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, IntelEddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, Intel
 
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 Dr. John Bainbridge, Principal Application Architect, NetSpeed  Dr. John Bainbridge, Principal Application Architect, NetSpeed
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 
Xavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, ArterisXavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, Arteris
 
Asi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, VtoolAsi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, Vtool
 
Zvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQZvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQ
 
Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC
 
Kunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-SiliconKunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-Silicon
 
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, SynopsysGert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
 
Tuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano RetinaTuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano Retina
 
Sagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-SiliconSagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-Silicon
 
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP SemiconductorRonen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
 
Prof. Emanuel Cohen, Technion
Prof. Emanuel Cohen, TechnionProf. Emanuel Cohen, Technion
Prof. Emanuel Cohen, Technion
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Cost Effective centralized adpative routing for networks on chip

  • 1. A Cost Effective Centralized Adaptive Routing for Networks on Chip Ran Manevich*, Israel Cidon*, Avinoam Kolodny*, Isask’har (Zigi) Walter* and Shmuel Wimer# *Technion – Israel Institute #Bar-Ilan University of Technology QNoC Module Module Module Research Module Module Module Module Module Module Group Module Module Module May 2, 2011
  • 3. Global traffic information is essential to make the right decision! May 2, 2011
  • 4. Adaptive Routing in NoCs – Local vs. Global Information I CAN MAKE Source IT!!! A Packet routed Low from upper left to Congestion bottom right Medium Congestion corner utilizing High local congestion Congestion information. The same packet routed using global   information. Destination May 2, 2011
  • 5. Route Selection - ATDOR  ATDOR - Adaptive Toggle Dimension Ordered Routing  Keep it simple! Centralized selection:  The option with less congested bottleneck link is preferred.  Routing tables in sources. One bit per destination. May 2, 2011
  • 6. ATDOR Illustration 1  Five identical flows, 100 MB/s each.  Initial routing - XY  Links modeled as M/M/1 queues. Delay of a single link: Traffic DLINK Capacity Traffic  Links capacity is 210 MB/s. May 2, 2011
  • 7. Centralized Routing – How? • Option 1 – Continuous calculation of optimal routing for the active sessions: Achievable load balancing Speed and computation complexity System complexity May 2, 2011
  • 8. Centralized Routing – How? • Option 2 – Iterative serial selection based on traffic load measurements between XY and YX for all source- destination pairs: Achievable load balancing Speed and computation complexity System complexity May 2, 2011
  • 9. ATDOR illustration 1 Step # Re-Routed Flow 1 3 2 1->15 2->15 2->8 Average Delay 22 ns 37 ∞ May 2, 2011
  • 10. What did we just see?  For each flow we: 1. Calculated the better route. 2. Updated routing table of the source. 3. Waited for the update to take effect and measured global traffic load.  Performing steps 1-3 for each flow is slow and not scalable.  Steps 2 and 3 are unified for all destinations of a single source: Achievable load balancing Speed and computation complexity Scalability May 2, 2011
  • 11. Back illustration 1 Step # Re-Routed Flow 4 3 1 4->15 1->15 2->8 5 2 2->15 Average Delay 22 ns ∞ May 2, 2011
  • 12. Problem #1  Changing routing may enhance congestion and cause fluctuations.  Solution: Change routing only if the alternative is better by the margin α, 0< α <1: if (Current Route = XY) YX if MAX[Load YX ] a MAX[Load XY ] NextRoute = XY if MAX[Load YX ] > a MAX[Load XY ] elseif (Current Route = YX) XY if MAX[Load XY ] a MAX[Load YX ] NextRoute = YX if MAX[Load XY ] > a MAX[Load YX ] May 2, 2011
  • 13. ATDOR illustration 2 Step # Re-Routed Flow 1->14 3 2 1 1->15 1->16 Average Delay ∞ May 2, 2011
  • 14. Problem #2  Coupling among flows sharing the same source.  Solution: Re-Routing counters CI,J count routing changes of flows from source I to destination J (FI,J). When CI,J reaches a limit LI,J, routing of FI,J is locked. A Possible definition of Limits LI,J : LI , J (I J ) mod 3 May 2, 2011
  • 15. Back to illustration 2 Flows R. Changes Left 1->16 0 1 2 1->15 0 1 2->14 1->14 0 LI , J (I J ) mod 3 Average Delay 22 73 ns ∞ May 2, 2011
  • 16. Bring it all together Flows R. Changes Left 1->15 0 1 2->8 0 1 2->15 0 1 2 4->15 0 1 LI , J (I J ) mod 3 Average Delay 14 22 ns ∞ May 2, 2011
  • 17. Centralized Adaptive Routing for NoCs - Architecture  Local traffic load measurements inside the routers.  Traffic load measurements aggregation into Traffic Load Maps.  Routing control. May 2, 2011
  • 18. Load Measurements Aggregation  An illustration of aggregation of load values in a 4X4 2D mesh.  A congestion value is written to each traffic load map every clock cycle. May 2, 2011
  • 19. ATDOR – Route Selection Circuit  Maximally loaded links of the two alternatives are compared. Next route: if(Current Route = XY) YX if MAX[Load YX ] a MAX[Load XY ] NextRoute = XY if MAX[Load YX ] > a MAX[Load XY ] elseif(Current Route = YX) XY if MAX[Load XY ] a MAX[Load YX ] NextRoute = YX if MAX[Load XY ] > a MAX[Load YX ] 0 < a <1 • Combinatorial pipelined implementation.  Result every ATDOR clock cycle. May 2, 2011
  • 20. Hardware Requirements  The whole mechanism was implemented on xc5vlx50t VIRTEX 5 FPGA.  Estimated area for 45nm technology node.  Per-Router hardware overheads in % for a NoC with typical size (50 KGates) virtual channel routers. May 2, 2011
  • 21. Average Packet Delay – Uniform Traffic • Average delay vs. average load in links normalized to links capacity. 8X8 2D Mesh. Uniform traffic pattern. May 2, 2011
  • 22. Average Packet Delay – Transpose Traffic • Average delay vs. average load in links normalized to links capacity. 8X8 2D Mesh. Transpose traffic pattern. May 2, 2011
  • 23. Average Packet Delay – Hotspot Traffic • Average delay vs. average load in links normalized to links capacity. 8X8 2D Mesh. 4 Hotspots traffic pattern. May 2, 2011
  • 24. Control Iteration Duration • Number of re-routed flows vs. time. • 8X8 2D Mesh, ATDOR clock of 100 MHz. α = 15/16 α = 3/4 May 2, 2011
  • 25. CMP DNUCA - Architecture • 8X8 CMP DNUCA (Dynamic Non Uniform Cache Array) with 8 CPUs and 56 cache banks: May 2, 2011
  • 26. CMP DNUCA – Saturation Throughput • Saturation throughput - Splash 2 and Parsec benchmarks on 8X8 CMP DNUCA with 8 CPUs and 56 cache banks: May 2, 2011
  • 27. Conclusions • Centralized adaptive routing is feasible for NoCs.  ATDOR: Centralized selection between XY and YX for each source-destination pair.  Hardware overhead: <4% of an 8X8 typical NoC.  Average saturation throughput improvement: Vs. O1TURN Vs. RCA Synthetic Patterns 19.3% 12.1% Spash 2 and Parsec 22.8% 12.8% Benchmarks May 2, 2011