SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Architectural tricks to
maximize Memory Bandwidth
Deepak Shankar
CEO, Mirabilis Design
Why Focus on Memory Sub-System
• Processors have huge number of cycles and
bandwidth
– How do you take advantage of this?
• Memory access is a major bottleneck
– Especially in high-performance systems like
multimedia and networking
• Memory access forms the largest power
consumption
– Too many ACT(RAS, RP and RCD) will dramatically
increase the power
Reports
Introduction
• Importance of improving Memory
Performance
• Addressing challenges with Architecture Level
Memory explorations
• Need for Performance vs. Power trade-off
analysis
• Memory addressing scheme on Performance
About Mirabilis Design
• Provider of system-level architecture exploration
solution for electronics and semiconductors
• Platform to conduct power-performance trade-
offs, hardware-software partitioning and
topology design
• VisualSim- Modeling and simulation software
• Based in Silicon Valley with experts in system
modeling and architectures
• Largest source of system modeling library with
embedded timing, functionality and power
Explore/Simulate a Memory System
• Key attributes
– DRAM datasheet
– Memory Controller attributes
– Connected Bus topology
– Workloads including rate, size, command and back
pressure
Statistical Memory Model for
Performance Analysis
Challenges in Memory Usage
• Product
– Multimedia, Networking, HPC, Avionics
• Situation
– Using an off-the-shelf Processor, FPGA or SoC
• Challenge
– What will be the performance and power
consumption for my use-cases?
• Metrics
– Power per frame or packet
– Latency from sensor input to HDMI output
Opportunities in Memory Usage
• Vary the data sizes
• Memory configuration
• Ordering of tasks in the use-case
• Multiple Masters making asynchronous
request to memory- Addresses
• Task and data distribution across multi-core
Full System Analysis
Processor Performance
Challenges in Memory System Design
• SoC interface to memory
• AXI bus and NoC topology to minimize the
overhead for each Master
• Single vs. dual channels
• Memory controller algorithm
Opportunity and Advantage of Design
• Consolidate read and write
• Split transaction
• Group transaction
• Read re-ordering
• Transaction priority assignment
• Lower clock frequency vs. wider bus
Cycle-accurate Memory Model for
Architecture Exploration
Power vs. Timing
About VisualSim
Architecture
Exploration
Performance
Analysis
Power
Analysis
HW-SW
Partitioning
Software
InterfacesRTOS
Hardware
• Graphical and hierarchical
modeling
• Large library of stochastic
and cycle-accurate
components and IP blocks
with embedded timing and
power
• Library blocks are used to
assemble hardware,
software, network, traffic,
reports and use-cases
System- vs. Pin-level Modeling
Mirabilis Design Inc.
One Router
System Design Transaction-level Cycle-accurate Signal-level
VisualSim
Schematics and RTL are very slow and to detailed for end-to-end metrics
System- vs. Pin-level Modeling
Similarity
• Hardware attributes- width,
clock speed, buffer depths
• Timing
• Algorithms & arbitration
• Data & control flow logic
• Use addresses
Differences
• Data & control combined in
transaction not bits
• No pin definitions
• No signal handshaking
• Skip cycles with no change
• Flexible to make major
changes
• 100-1000X Faster
2/18/2016 Mirabilis Design Inc. Confidential Slide18
System model accuracy and simulation is sufficient for the explorations
How can System Level Explorations Help
improve Memory Performance
• Evaluate performance and power advantages of
different types of memory technologies.
• Early prediction of latency, throughput, power,
and energy
• Evaluation of next gen Storage device for high
bandwidth and less latency requirements
• Spend more time on analysis and less time on
implementation
Modeling Libraries - Semiconductors
SoC
• AMBA (AHB/
APB/ AXI)
• CoreConnect-
PLB & OPB
• NoC, Virtual
Channel
• USB
Memory
• SDR, DDR,
DDR2, DDR3
• QDR, RDRAM
• LPDDR,
LPDDR2,
LPDDR3,
LPDDR4
• HBM
• Flash
Processors
• ARM
• PowerPC-
Freescale and
IBM
• Intel and AMD
• TI
• MIPS
• Tensilica
• Renesas SH
Interfaces
• PCI, PCI-X,
PCIe
• RapidIO
• NVMe
• Serial Switch
• Crossbar
• Ethernet
• Fibre Channel
Benefits
Features Benefits
Facilitating transition from concept to design
• Creating realistic workload scenarios
driving simulations
• Models enable experimentation and
enhance innovation
• Simulations facilitate analysis and
exchanges between teams
Increasing productivity • Rapid Exploration and analysis
• Graphics are better suited to handle
complexity
• Graphics are 10x more efficient than C/C++
programming
Optimizing design • HW Footprint, buffers, timings, power
Facilitating implementation and validation
• Providing executable specifications for
implementation
• Reusing test cases for validation
Deepak Shankar
CEO, Mirabilis Design
info@mirabilisdesign.com
www.mirabilisdesign.com/new/
Phone - 408-245-8992

Weitere ähnliche Inhalte

Was ist angesagt?

Task allocation on many core-multi processor distributed system
Task allocation on many core-multi processor distributed systemTask allocation on many core-multi processor distributed system
Task allocation on many core-multi processor distributed system
Deepak Shankar
 
Cognos transformer vs Tm1
Cognos transformer vs Tm1Cognos transformer vs Tm1
Cognos transformer vs Tm1
jasmeetsb
 

Was ist angesagt? (12)

Task allocation on many core-multi processor distributed system
Task allocation on many core-multi processor distributed systemTask allocation on many core-multi processor distributed system
Task allocation on many core-multi processor distributed system
 
Can communication using labview
Can communication using labview Can communication using labview
Can communication using labview
 
Solution Brief: Commvault & Red Hat Storage
Solution Brief: Commvault & Red Hat StorageSolution Brief: Commvault & Red Hat Storage
Solution Brief: Commvault & Red Hat Storage
 
SFDCP – sfdc project
SFDCP – sfdc projectSFDCP – sfdc project
SFDCP – sfdc project
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
 
03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slides03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slides
 
2012 334 Machiwal Ppt
2012 334 Machiwal Ppt2012 334 Machiwal Ppt
2012 334 Machiwal Ppt
 
What’s Going on in Your 4HRA?
What’s Going on in Your 4HRA?What’s Going on in Your 4HRA?
What’s Going on in Your 4HRA?
 
Implementation of Dynamic Simulation for Engineering and Human Factors
Implementation of Dynamic Simulation for Engineering and Human FactorsImplementation of Dynamic Simulation for Engineering and Human Factors
Implementation of Dynamic Simulation for Engineering and Human Factors
 
Cognos transformer vs Tm1
Cognos transformer vs Tm1Cognos transformer vs Tm1
Cognos transformer vs Tm1
 
Exploration of Radars and Software Defined Radios using VisualSim
Exploration of  Radars and Software Defined Radios using VisualSimExploration of  Radars and Software Defined Radios using VisualSim
Exploration of Radars and Software Defined Radios using VisualSim
 
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 6 spectrumscale el...
 

Andere mochten auch

Memory technology and optimization in Advance Computer Architechture
Memory technology and optimization in Advance Computer ArchitechtureMemory technology and optimization in Advance Computer Architechture
Memory technology and optimization in Advance Computer Architechture
Shweta Ghate
 
memory Interleaving and low order interleaving and high interleaving
memory Interleaving and low order interleaving and high interleavingmemory Interleaving and low order interleaving and high interleaving
memory Interleaving and low order interleaving and high interleaving
Jawwad Rafiq
 
Fosforilacion oxidativa - Inhibidores y Agentes Desacoplantes
Fosforilacion oxidativa - Inhibidores y Agentes DesacoplantesFosforilacion oxidativa - Inhibidores y Agentes Desacoplantes
Fosforilacion oxidativa - Inhibidores y Agentes Desacoplantes
Karen Alex
 

Andere mochten auch (8)

Latency Trumps All
Latency Trumps AllLatency Trumps All
Latency Trumps All
 
Performance Measurements: Improving Latency and Bandwidth of your DDR4 System
Performance Measurements:  Improving Latency and Bandwidth of your DDR4 SystemPerformance Measurements:  Improving Latency and Bandwidth of your DDR4 System
Performance Measurements: Improving Latency and Bandwidth of your DDR4 System
 
MemGuard: Memory Bandwidth Reservation System for Efficient Performance Isola...
MemGuard: Memory Bandwidth Reservation System for Efficient Performance Isola...MemGuard: Memory Bandwidth Reservation System for Efficient Performance Isola...
MemGuard: Memory Bandwidth Reservation System for Efficient Performance Isola...
 
Memory access control in multiprocessor for real-time system with mixed criti...
Memory access control in multiprocessor for real-time system with mixed criti...Memory access control in multiprocessor for real-time system with mixed criti...
Memory access control in multiprocessor for real-time system with mixed criti...
 
Memory technology and optimization in Advance Computer Architechture
Memory technology and optimization in Advance Computer ArchitechtureMemory technology and optimization in Advance Computer Architechture
Memory technology and optimization in Advance Computer Architechture
 
memory Interleaving and low order interleaving and high interleaving
memory Interleaving and low order interleaving and high interleavingmemory Interleaving and low order interleaving and high interleaving
memory Interleaving and low order interleaving and high interleaving
 
Psychology- Memory
Psychology- MemoryPsychology- Memory
Psychology- Memory
 
Fosforilacion oxidativa - Inhibidores y Agentes Desacoplantes
Fosforilacion oxidativa - Inhibidores y Agentes DesacoplantesFosforilacion oxidativa - Inhibidores y Agentes Desacoplantes
Fosforilacion oxidativa - Inhibidores y Agentes Desacoplantes
 

Ähnlich wie Architectural tricks to maximize memory bandwidth

Webinar: Detecting Deadlocks in Electronic Systems using Time-based Simulation
Webinar: Detecting Deadlocks in Electronic Systems using Time-based SimulationWebinar: Detecting Deadlocks in Electronic Systems using Time-based Simulation
Webinar: Detecting Deadlocks in Electronic Systems using Time-based Simulation
Deepak Shankar
 
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDKlecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
officeaiotfab
 

Ähnlich wie Architectural tricks to maximize memory bandwidth (20)

How to create innovative architecture using VisualSim?
How to create innovative architecture using VisualSim?How to create innovative architecture using VisualSim?
How to create innovative architecture using VisualSim?
 
How to create innovative architecture using VisualSim?
How to create innovative architecture using VisualSim?How to create innovative architecture using VisualSim?
How to create innovative architecture using VisualSim?
 
Designing memory controller for ddr5 and hbm2.0
Designing memory controller for ddr5 and hbm2.0Designing memory controller for ddr5 and hbm2.0
Designing memory controller for ddr5 and hbm2.0
 
Using VisualSim Architect for Semiconductor System Analysis
Using VisualSim Architect for Semiconductor System AnalysisUsing VisualSim Architect for Semiconductor System Analysis
Using VisualSim Architect for Semiconductor System Analysis
 
Webinar on Latency and throughput computation of automotive EE network
Webinar on Latency and throughput computation of automotive EE networkWebinar on Latency and throughput computation of automotive EE network
Webinar on Latency and throughput computation of automotive EE network
 
Node architecture
Node architectureNode architecture
Node architecture
 
Operational-Analytics
Operational-AnalyticsOperational-Analytics
Operational-Analytics
 
Breaking the Memory Wall
Breaking the Memory WallBreaking the Memory Wall
Breaking the Memory Wall
 
Mirabilis_Design AMD Versal System-Level IP Library
Mirabilis_Design AMD Versal System-Level IP LibraryMirabilis_Design AMD Versal System-Level IP Library
Mirabilis_Design AMD Versal System-Level IP Library
 
Processors selection
Processors selectionProcessors selection
Processors selection
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC Systems
 
Introduction to Architecture Exploration of Semiconductor, Embedded Systems, ...
Introduction to Architecture Exploration of Semiconductor, Embedded Systems, ...Introduction to Architecture Exploration of Semiconductor, Embedded Systems, ...
Introduction to Architecture Exploration of Semiconductor, Embedded Systems, ...
 
Webinar: Detecting Deadlocks in Electronic Systems using Time-based Simulation
Webinar: Detecting Deadlocks in Electronic Systems using Time-based SimulationWebinar: Detecting Deadlocks in Electronic Systems using Time-based Simulation
Webinar: Detecting Deadlocks in Electronic Systems using Time-based Simulation
 
Develop High-bandwidth/low latency electronic systems for AI/ML application
Develop High-bandwidth/low latency electronic systems for AI/ML applicationDevelop High-bandwidth/low latency electronic systems for AI/ML application
Develop High-bandwidth/low latency electronic systems for AI/ML application
 
GEN-Z: An Overview and Use Cases
GEN-Z: An Overview and Use CasesGEN-Z: An Overview and Use Cases
GEN-Z: An Overview and Use Cases
 
Network Processor - 2021.pptx
Network Processor - 2021.pptxNetwork Processor - 2021.pptx
Network Processor - 2021.pptx
 
Cloud Networking Trends
Cloud Networking TrendsCloud Networking Trends
Cloud Networking Trends
 
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
 
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDKlecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
 
A Closer Look at Apache Kudu
A Closer Look at Apache KuduA Closer Look at Apache Kudu
A Closer Look at Apache Kudu
 

Mehr von Deepak Shankar

Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power
Deepak Shankar
 
ROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERS
ROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERSROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERS
ROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERS
Deepak Shankar
 
Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021
Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021
Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021
Deepak Shankar
 
Energy efficient AI workload partitioning on multi-core systems
Energy efficient AI workload partitioning on multi-core systemsEnergy efficient AI workload partitioning on multi-core systems
Energy efficient AI workload partitioning on multi-core systems
Deepak Shankar
 
Webinar on Functional Safety Analysis using Model-based System Analysis
Webinar on Functional Safety Analysis using Model-based System AnalysisWebinar on Functional Safety Analysis using Model-based System Analysis
Webinar on Functional Safety Analysis using Model-based System Analysis
Deepak Shankar
 

Mehr von Deepak Shankar (16)

How to achieve 95%+ Accurate power measurement during architecture exploration?
How to achieve 95%+ Accurate power measurement during architecture exploration? How to achieve 95%+ Accurate power measurement during architecture exploration?
How to achieve 95%+ Accurate power measurement during architecture exploration?
 
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
 
Modeling Abstraction
Modeling AbstractionModeling Abstraction
Modeling Abstraction
 
Accelerated development in Automotive E/E Systems using VisualSim Architect
Accelerated development in Automotive E/E Systems using VisualSim ArchitectAccelerated development in Automotive E/E Systems using VisualSim Architect
Accelerated development in Automotive E/E Systems using VisualSim Architect
 
Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power
 
ROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERS
ROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERSROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERS
ROLE OF DIGITAL SIMULATION IN CONFIGURING NETWORK PARAMETERS
 
Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021
Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021
Compare Performance-power of Arm Cortex vs RISC-V for AI applications_oct_2021
 
Energy efficient AI workload partitioning on multi-core systems
Energy efficient AI workload partitioning on multi-core systemsEnergy efficient AI workload partitioning on multi-core systems
Energy efficient AI workload partitioning on multi-core systems
 
Capacity Planning and Power Management of Data Centers.
Capacity Planning and Power Management of Data Centers. Capacity Planning and Power Management of Data Centers.
Capacity Planning and Power Management of Data Centers.
 
Automotive network and gateway simulation
Automotive network and gateway simulationAutomotive network and gateway simulation
Automotive network and gateway simulation
 
Introduction to architecture exploration
Introduction to architecture explorationIntroduction to architecture exploration
Introduction to architecture exploration
 
Using ai for optimal time sensitive networking in avionics
Using ai for optimal time sensitive networking in avionicsUsing ai for optimal time sensitive networking in avionics
Using ai for optimal time sensitive networking in avionics
 
Webinar on radar
Webinar on radarWebinar on radar
Webinar on radar
 
System Architecture Exploration Training Class
System Architecture Exploration Training ClassSystem Architecture Exploration Training Class
System Architecture Exploration Training Class
 
Webinar on Functional Safety Analysis using Model-based System Analysis
Webinar on Functional Safety Analysis using Model-based System AnalysisWebinar on Functional Safety Analysis using Model-based System Analysis
Webinar on Functional Safety Analysis using Model-based System Analysis
 
Mirabilis design Inc - Brochure
Mirabilis design Inc - BrochureMirabilis design Inc - Brochure
Mirabilis design Inc - Brochure
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

Architectural tricks to maximize memory bandwidth

  • 1. Architectural tricks to maximize Memory Bandwidth Deepak Shankar CEO, Mirabilis Design
  • 2. Why Focus on Memory Sub-System • Processors have huge number of cycles and bandwidth – How do you take advantage of this? • Memory access is a major bottleneck – Especially in high-performance systems like multimedia and networking • Memory access forms the largest power consumption – Too many ACT(RAS, RP and RCD) will dramatically increase the power
  • 4. Introduction • Importance of improving Memory Performance • Addressing challenges with Architecture Level Memory explorations • Need for Performance vs. Power trade-off analysis • Memory addressing scheme on Performance
  • 5. About Mirabilis Design • Provider of system-level architecture exploration solution for electronics and semiconductors • Platform to conduct power-performance trade- offs, hardware-software partitioning and topology design • VisualSim- Modeling and simulation software • Based in Silicon Valley with experts in system modeling and architectures • Largest source of system modeling library with embedded timing, functionality and power
  • 6. Explore/Simulate a Memory System • Key attributes – DRAM datasheet – Memory Controller attributes – Connected Bus topology – Workloads including rate, size, command and back pressure
  • 7. Statistical Memory Model for Performance Analysis
  • 8. Challenges in Memory Usage • Product – Multimedia, Networking, HPC, Avionics • Situation – Using an off-the-shelf Processor, FPGA or SoC • Challenge – What will be the performance and power consumption for my use-cases? • Metrics – Power per frame or packet – Latency from sensor input to HDMI output
  • 9. Opportunities in Memory Usage • Vary the data sizes • Memory configuration • Ordering of tasks in the use-case • Multiple Masters making asynchronous request to memory- Addresses • Task and data distribution across multi-core
  • 12. Challenges in Memory System Design • SoC interface to memory • AXI bus and NoC topology to minimize the overhead for each Master • Single vs. dual channels • Memory controller algorithm
  • 13. Opportunity and Advantage of Design • Consolidate read and write • Split transaction • Group transaction • Read re-ordering • Transaction priority assignment • Lower clock frequency vs. wider bus
  • 14. Cycle-accurate Memory Model for Architecture Exploration
  • 16. About VisualSim Architecture Exploration Performance Analysis Power Analysis HW-SW Partitioning Software InterfacesRTOS Hardware • Graphical and hierarchical modeling • Large library of stochastic and cycle-accurate components and IP blocks with embedded timing and power • Library blocks are used to assemble hardware, software, network, traffic, reports and use-cases
  • 17. System- vs. Pin-level Modeling Mirabilis Design Inc. One Router System Design Transaction-level Cycle-accurate Signal-level VisualSim Schematics and RTL are very slow and to detailed for end-to-end metrics
  • 18. System- vs. Pin-level Modeling Similarity • Hardware attributes- width, clock speed, buffer depths • Timing • Algorithms & arbitration • Data & control flow logic • Use addresses Differences • Data & control combined in transaction not bits • No pin definitions • No signal handshaking • Skip cycles with no change • Flexible to make major changes • 100-1000X Faster 2/18/2016 Mirabilis Design Inc. Confidential Slide18 System model accuracy and simulation is sufficient for the explorations
  • 19. How can System Level Explorations Help improve Memory Performance • Evaluate performance and power advantages of different types of memory technologies. • Early prediction of latency, throughput, power, and energy • Evaluation of next gen Storage device for high bandwidth and less latency requirements • Spend more time on analysis and less time on implementation
  • 20. Modeling Libraries - Semiconductors SoC • AMBA (AHB/ APB/ AXI) • CoreConnect- PLB & OPB • NoC, Virtual Channel • USB Memory • SDR, DDR, DDR2, DDR3 • QDR, RDRAM • LPDDR, LPDDR2, LPDDR3, LPDDR4 • HBM • Flash Processors • ARM • PowerPC- Freescale and IBM • Intel and AMD • TI • MIPS • Tensilica • Renesas SH Interfaces • PCI, PCI-X, PCIe • RapidIO • NVMe • Serial Switch • Crossbar • Ethernet • Fibre Channel
  • 21. Benefits Features Benefits Facilitating transition from concept to design • Creating realistic workload scenarios driving simulations • Models enable experimentation and enhance innovation • Simulations facilitate analysis and exchanges between teams Increasing productivity • Rapid Exploration and analysis • Graphics are better suited to handle complexity • Graphics are 10x more efficient than C/C++ programming Optimizing design • HW Footprint, buffers, timings, power Facilitating implementation and validation • Providing executable specifications for implementation • Reusing test cases for validation
  • 22. Deepak Shankar CEO, Mirabilis Design info@mirabilisdesign.com www.mirabilisdesign.com/new/ Phone - 408-245-8992