SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Downloaden Sie, um offline zu lesen
FUSION APU AND TRENDS/
CHALLENGES IN FUTURE
SOC (PROCESSOR) DESIGN
Pankaj Singh,
Acknowledgement:
Denis Foley. Sr. Fellow, AMD
9th International SoC Conference
2nd & 3rd November 2011
2 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
TODAY’S TOPICS
 Trends:
– Three Eras of Processor Performance
– Evolution of Heterogeneous Computing
 FSA and Open Standard:
– Why Fusion ?
– Open Standard, Open CL
 Power, Performance
 High Speed, Scalable Interconnect: NoC’s
 3-D Stacking
 SoC Trends & Challenges
– Verification Effort
– IP Integration
– TLM, RTL Co-simulation challenges.
3 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
TRENDS: THREE ERAS OF PROCESSOR PERFORMANCE
Single-Core
Era
Single-threadPerformance
?
Time
we are
here
o
Enabled by:
 Moore’s Law
 Voltage Scaling
 MicroArchitecture
Constrained by:
Power
Complexity
Multi-Core
Era
ThroughputPerformance
Time
(# of Processors)
we are
here
o
Enabled by:
 Moore’s Law
 Desire for Throughput
 20 years of SMP arch
Constrained by:
Power
Parallel SW availability
Scalability
Heterogeneous
Systems Era
TargetedApplication
Performance
Time
(Data-parallel exploitation)
we are
here
o
Enabled by:
 Moore’s Law
 Abundant data parallelism
 Power efficient GPUs
Currently constrained by:
Programming models
Communication overheads
4 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
TRENDS: EVOLUTION OF HETEROGENEOUS COMPUTINGArchitectureMaturity&ProgrammerAccessibility
PoorExcellent
2012 - 20202009 - 20112002 - 2008
Graphics & Proprietary
Driver-based APIs
Proprietary Drivers Era
 “Adventurous” programmers
 Exploit early programmable
“shader cores” in the GPU
 Make your program look like
“graphics” to the GPU
 CUDA™, Brook+, etc
OpenCL™, DirectCompute
Driver-based APIs
Standards Drivers Era
 Expert programmers
 C and C++ subsets
 Compute centric APIs , data
types
 Multiple address spaces with
explicit data movement
 Specialized work queue based
structures
 Kernel mode dispatch
Fusion™ System Architecture
GPU Peer Processor
Architected Era
 Mainstream programmers
 Full C++
 GPU as a co-processor
 Unified coherent address space
 Task parallel runtimes
 Nested Data Parallel programs
 User mode dispatch
 Pre-emption and context
switching
More uptodate information on FSA:
http://developer.amd.com/afds/pages/keynote.aspx#/Dev_AFDS_Reb_2
5 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
FSA & OPEN STANDARD: ENTER FUSION
Dual Core CPU Northbridge DirectX®11 GPU
FUSION APU
(Accelerated Processing Unit)
Heterogeneous compute engine combining
x86 compute and parallel processing
capabilities of the GPU on a single die
6 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
FSA & OPEN STANDARD: WHY FUSION?
6
 Integrating CPUs, Northbridge and GPU enables:
– Unified Memory
– High-bandwidth, low latency access by GPU
– Saves on interface power and PHY area
– Shared Power Control and TDP envelope
Potential bandwidth bottleneck
Relatively long memory latency
7 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
COMMITTED TO OPEN STANDARDS
 AMD drives open and de-facto
standards
– Compete on the best
implementation
 Open standards are the basis for
large ecosystems
 Open standards always win over
time
– SW developers want their
applications to run on multiple
platforms from multiple
hardware vendors
DirectX®
8 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
OPENCL™ AND FSA
 FSA is an optimized platform
architecture for OpenCL™
– Not an alternative to OpenCL™
 OpenCL™ on FSA will benefit from
– Avoidance of wasteful copies
– Low latency dispatch
– Improved memory model
– Shared pointers
 FSA also exposes a lower level
programming interface, for those
that want the ultimate in control
and performance
 Optimized libraries may choose
the lower level interface
POWER & PERFORMANCE
10 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
POWER-THERMAL EFFECTS IN SYSTEMS ON CHIPS
¡ Local failures !
Part not working
 Complex SoCs: High power density
 Non-uniform power dissipation: Hotspots
 Spatial gradients: Cause malfunctions
 High on-chip temperatures cause
malfunctions affecting reliability.
 Power consumption depends on
frequency
 Setting frequencies to control power and
temperature
11 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
OPTIONS FOR POWER SAVINGS
 Convergence of Performance and Low Power
– Notebook->Netbook-> Tablet
Tablet<-Smartphone
12 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
PERFORMANCE AND POWER
S3 idle Static
Screen
MM07 Media
Playback
Full
Compute
APU Power vs. Use Case
Performance
Power
 Performance versus Power Efficiency
 Power Management versus Power reduction
 Performance & Thermal Design Power
HIGH SPEED, SCALABLE
INTERCONNECT
14 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
NOC’S: FROM BUSES TO NETWORKS:
[Friedman Harel:10]
Note: This slide presents industry specific information does not relate to AMD NoC status
15 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
NOC CHALLENGES: CAD TOOLS
 Capturing application traffic.
 Which Topology ?
 Mapping? Routes to use?
 Fixing communication
architecture : parameters.
 Verification for correctness, performance.
 Build models.
 QoS under un-reliable conditions.
Key to success: Automate & integrate the steps.
Mesh Topology
homogeneous systems, with
regular tiles
Customized Topology
heterogeneous systems, with
different cores & irregular FP
Software Services
Mapping, QoS, middleware...
Architecture
Packeting, buffering, flow control...
Physical Implementation
Synchronization, wires, power...
CAD Tools
16 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
Synchronous Delay Insensitive
Global None
Timing Assumptions
Less Detection
Local Clocks, Interaction
with data (becoming aperiodic)
 A complete spectrum of approaches to system-timing exist
[Mullins06-07]
NOC CHALLENGES: BEYOND GLOBAL SYNCHRONY
Delay Insensitive
3-D STACKING
18 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
3-D STACKING
 Supporting Heterogeneous computing: high density, high performance,
high memory B.W requirement.
 3-D NoC’s option
 Futuristic view:
Integrating Bio-sensor
Note:
This slide presents industry specific information does not relate to AMD 3-D stacking status
SOC TRENDS &
CHALLENGES:
1. VERIFICATION EFFORT
2. IP INTEGRATION
3. TLM-RTL CO-SIMULATION
20 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
WHAT’S NEW IN SOC DESIGN?
 Larger and more complex chips with heavy use of pre-existing cores.
 Heavy use of multi core processors and DSPs.
 Complex Interconnect.
 Shorter time to market and Smaller design teams.
 … and software.
 Leads to:
– Increased verification effort: Debugging is harder.
– Integration is more difficult.
– Need for scalable and high speed interconnect.
– SW / HW co-simulation is a major issue.
– Power –Performance challenge.
– How do we treat the system software?
21 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
VERIFICATION EFFORT
 Debugging
– Seamless debug across
h/w and software[especially SW]
 Testbench Development:
– Several methodologies
 VMM,OVMUVM.
 New developments
[Unified strategy]
– UCIS,UVM TLM2.0
– Coverage trend
 Address Gaps in VHDL,
System C coverage
22 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
VERIFICATION EFFORT
 Creating/Running Testcase:
– Direct & Random
– Run time improvement
Save-restore.
Verification Cycle per second instead of Cycles per second:
Configuring environment to dynamically select relevant
design/core.
Alternate options
23 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
Emulation Focus Areas:
1. Tests/regression run with Long run time
2. Corner case bugs that may escape traditional verification
3. Replicating System level scenarios
Ongoing Initiatives/Need:
1.Seemless support for assertions.
2.Improve portability between Simulation & Emulation
3. Common model from TLM-HDL-Emulation
VERIFICATION EFFORT
 Alternate Options
24 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
IP INTEGRATION CHALLENGE
 Integration of IP :
– Multiple IP’s, various configurations, design languages
– IP’s to be in Sync: macro’s , libraries.
– Complexity increases with mixed language designs
SYSTEM
C
SVLO
G
VERILOG
VHDL
Unique Strengths
of Languages
Diversity of Design
Teams
Importing Existing
IP
Legacy Testbench
Environment
25 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
IP INTEGRATION CHALLENGE COMPARISON OF CHOICES
Direct
Instantiation
SV Bind
Construct
SystemC
Control/Observe
SCV-
Connect()
SC-DPI
Source Code
Available
Yes Yes Yes Yes Yes
One IP
Compiled
Yes Yes Yes Yes Yes
Both IP
Compiled
No Yes No No No
Performance ++++ (3) +++ (2) + (1) + (1) +++++(4)
Delta Delay Yes Yes No No No
Languages
Supported
SV, SC,
VHDL
SV, SC,
VHDL
SC + SV/VHDL
SC +
SV/VHDL
SC + SV
Gap: No standardized automated methodology for integration.
Recommended Approach:
• Understand IP blocks: language, source code availability.
• Understand connection: 1-1, distributed, method port
• Option for optimized solution to quickly build a system
26 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
IP INTEGRATION CHALLENGE: GAPS WITH ANALOG IP
INTEGRATION IN SOC
Table1. Gaps with Analog IP Integration in SoC
Gaps Root Cause
Testchip setup
-Testchip scenario is different
-Tester used for testchip differs
Inbuilt debug
-Incomplete inbuilt SoC test/debug capability or derisk option for basic
functionality such as PLL clock
IP I/F verification -Incomplete test setup
Review process
-No common detailed review process between IP and SoC team. Incorrect
assumption based on past analog IP working silicon
IP Modelling
-Mismtach in version between IP simulation model and spice netlist
-Limitations of behavioral model to replicate actual analog IP functionality
-Timing issue
-DFT issue
EDA tools -Gaps in analog and digital simulation environment
27 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
 Verification Environment Bring-up
– Automated Assertions for early checks.
– Review forces, tie-off and relevant checkers from IP to SoC
– Bottleneck for SoC team to get started with verification: Option to use
fake model for initial bring up. Usage of system model.
– Super Block Concept: pre-verified IP blocks at similar frequency &
interface
 Requirement:
 Current solution: In-house methodology and process. No clear solution
from EDA vendors.
IP INTEGRATION CHALLENGE
IP
Block1
IP
Block2
Minimum
Manual
Effort
Hookup
Using ICU
No BUGS!
28 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
TLM, RTL Co-simulation
 Traditional use of System level models : Architecture profiling &
Performance Analysis
 Increasing Demand for Co-simulation: Tradeoff between Accuracy and
Performance.
 Open Challenges
 Different level of Abstraction.
 Need for improvement in Integration methodology and Test bench
development
 Seamless Debug and Coverage methodology.
 Using System Level model for HDL generation
 Legacy system model not written with conversion in mind.
 Current limitation: Incomplete translation.
 Lack of reliable Equivalence Check tool.
 Need: Merge top down (SystemC) and bottom-up (System Verilog)
methodology/flow.
 Gaps/Work to do: How to do Power analysis
29 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
THANK YOU!
30 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
REFERENCES
[1] Wilson Research Group-MGC study blog 2011.
[2] AMD Coolchip2011 presentation. Denis Foley, AMD Sr. Fellow.
[3] Fusion Processors and HPC-2011, Chuck Moore, AMD Corporate
Fellow & Technology Group CTO
[3] AMD Fusion Developer Summit 2011. Phil Rogers, AMD Corporate
Fellow
[4] Fully Asynchronous framework for GALS network on chip. Friedman H
[5]Future of EE, NoC’s presentation. Dr. Srinivasan Murali
[6] Analog IP integration in SoC, IP reuse’09. Mixed language IP integration
DVCoN 2010. Extending Fucntional coverage to SystemC, VHDL-IP’10.
Pankaj S
31 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
GLOSSARY
 GPU – Graphics processing unit
 APU: Accelerated Processing Unit
 Open CL: Open Computing Language
 TDP – Thermal Design power – a measure of a design
infrastructure’s ability to cool a device
 NoC: Network On Chip
 TLM: Transaction Level Modeling
 Turbo Core – AMD boost mechanism
 QoS: Quality of Service
 UVM: Universal Verification Methodology
 UCIS: Unified Coverage Interoperability Standard
32 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
BACKUP
33 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011
Disclaimer
The information presented in this document is for informational purposes only and may contain technical inaccuracies,
omissions and typographical errors.
The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not
limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases,
product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD
assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this
information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of
such revisions or changes.
AMD makes no representations or warranties with respect to the contents hereof and assumes no responsibility for any
inaccuracies, errors or omissions that appear in this information.
AMD specifically disclaims any implied warranties of merchantability or fitness for any particular purpose. In no event will AMD
be liable to any person for any direct, indirect, special or other consequential damages arising from the use of any information
contained herein, even if AMD is expressly advised of the possibility of such damages.
Trademark Attribution
AMD, the AMD Arrow logo, AMD Athlon, AMD Phenom, AMD Turion, AMD Radeon, and combinations thereof are trademarks
of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. Microsoft, Windows and DirectX are registered
trademarks of Microsoft Corporation in the United States and/or other jurisdictions. PCIe is a registered trademark of PCI-SIG.
Other names used in this presentation are for identification purposes only and may be trademarks of their respective owners.
©2011 Advanced Micro Devices, Inc. All rights reserved.

Weitere ähnliche Inhalte

Was ist angesagt?

HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...
HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...
HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...AMD Developer Central
 
Stinson post si and verification
Stinson post si and verificationStinson post si and verification
Stinson post si and verificationObsidian Software
 
IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...
IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...
IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...Christopher Diamantopoulos
 
4+yr Hardware Design Engineer_Richa
4+yr Hardware Design Engineer_Richa4+yr Hardware Design Engineer_Richa
4+yr Hardware Design Engineer_RichaRicha Verma
 
Public Seminar_Final 18112014
Public Seminar_Final 18112014Public Seminar_Final 18112014
Public Seminar_Final 18112014Hossam Hassan
 
Soc architecture and design
Soc architecture and designSoc architecture and design
Soc architecture and designSatya Harish
 
System On Chip
System On ChipSystem On Chip
System On Chipanishgoel
 
Preparing Codes for Intel Knights Landing (KNL)
Preparing Codes for Intel Knights Landing (KNL)Preparing Codes for Intel Knights Landing (KNL)
Preparing Codes for Intel Knights Landing (KNL)AllineaSoftware
 
Day 1 - 01 - Welcome
Day 1 - 01 - WelcomeDay 1 - 01 - Welcome
Day 1 - 01 - Welcomewebhostingguy
 
Soc - Intro, Design Aspects, HLS, TLM
Soc - Intro, Design Aspects, HLS, TLMSoc - Intro, Design Aspects, HLS, TLM
Soc - Intro, Design Aspects, HLS, TLMSubhash Iyer
 
LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)Linaro
 
Open compute technology
Open compute technologyOpen compute technology
Open compute technologyAMD
 
SOC - system on a chip
SOC - system on a chipSOC - system on a chip
SOC - system on a chipParth Kavi
 
System on Chip (SoC) for mobile phones
System on Chip (SoC) for mobile phonesSystem on Chip (SoC) for mobile phones
System on Chip (SoC) for mobile phonesJeffrey Funk
 
SOC Application Studies: Image Compression
SOC Application Studies: Image CompressionSOC Application Studies: Image Compression
SOC Application Studies: Image CompressionA B Shinde
 
System on chip architectures
System on chip architecturesSystem on chip architectures
System on chip architecturesA B Shinde
 
The Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft ProcessorThe Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft ProcessorDeepak Tomar
 
Qualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia SolutionsQualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia SolutionsQualcomm Developer Network
 
Implementation of Soft-core Processor on FPGA
Implementation of Soft-core Processor on FPGAImplementation of Soft-core Processor on FPGA
Implementation of Soft-core Processor on FPGADeepak Kumar
 

Was ist angesagt? (20)

HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...
HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...
HC-4020, Enhancing OpenCL performance in AfterShot Pro with HSA, by Michael W...
 
Stinson post si and verification
Stinson post si and verificationStinson post si and verification
Stinson post si and verification
 
IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...
IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...
IMAGE CAPTURE, PROCESSING AND TRANSFER VIA ETHERNET UNDER CONTROL OF MATLAB G...
 
4+yr Hardware Design Engineer_Richa
4+yr Hardware Design Engineer_Richa4+yr Hardware Design Engineer_Richa
4+yr Hardware Design Engineer_Richa
 
Public Seminar_Final 18112014
Public Seminar_Final 18112014Public Seminar_Final 18112014
Public Seminar_Final 18112014
 
Soc architecture and design
Soc architecture and designSoc architecture and design
Soc architecture and design
 
System On Chip
System On ChipSystem On Chip
System On Chip
 
Preparing Codes for Intel Knights Landing (KNL)
Preparing Codes for Intel Knights Landing (KNL)Preparing Codes for Intel Knights Landing (KNL)
Preparing Codes for Intel Knights Landing (KNL)
 
Day 1 - 01 - Welcome
Day 1 - 01 - WelcomeDay 1 - 01 - Welcome
Day 1 - 01 - Welcome
 
Soc - Intro, Design Aspects, HLS, TLM
Soc - Intro, Design Aspects, HLS, TLMSoc - Intro, Design Aspects, HLS, TLM
Soc - Intro, Design Aspects, HLS, TLM
 
LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)
 
Open compute technology
Open compute technologyOpen compute technology
Open compute technology
 
SOC - system on a chip
SOC - system on a chipSOC - system on a chip
SOC - system on a chip
 
System on Chip (SoC) for mobile phones
System on Chip (SoC) for mobile phonesSystem on Chip (SoC) for mobile phones
System on Chip (SoC) for mobile phones
 
SOC Application Studies: Image Compression
SOC Application Studies: Image CompressionSOC Application Studies: Image Compression
SOC Application Studies: Image Compression
 
System on chip architectures
System on chip architecturesSystem on chip architectures
System on chip architectures
 
The Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft ProcessorThe Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft Processor
 
Qualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia SolutionsQualcomm Hexagon SDK: Optimize Your Multimedia Solutions
Qualcomm Hexagon SDK: Optimize Your Multimedia Solutions
 
Implementation of Soft-core Processor on FPGA
Implementation of Soft-core Processor on FPGAImplementation of Soft-core Processor on FPGA
Implementation of Soft-core Processor on FPGA
 
SDC Server Sao Jose
SDC Server Sao JoseSDC Server Sao Jose
SDC Server Sao Jose
 

Ähnlich wie FUSION APU & TRENDS/ CHALLENGES IN FUTURE SoC DESIGN

Open Hardware for All - Webinar March 25, 2015
Open Hardware for All - Webinar March 25, 2015Open Hardware for All - Webinar March 25, 2015
Open Hardware for All - Webinar March 25, 2015Cumulus Networks
 
Learn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFVLearn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFVGhodhbane Mohamed Amine
 
GTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWERGTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWERAchronix
 
Sonic 7 Hentchel Performance Tuning
Sonic 7 Hentchel   Performance TuningSonic 7 Hentchel   Performance Tuning
Sonic 7 Hentchel Performance Tuninga i
 
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...PT Datacomm Diangraha
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureIntel® Software
 
PowerDRC/LVS 2.2 released by POLYTEDA
PowerDRC/LVS 2.2 released by POLYTEDAPowerDRC/LVS 2.2 released by POLYTEDA
PowerDRC/LVS 2.2 released by POLYTEDAAlexander Grudanov
 
Trends and challenges in IP based SOC design
Trends and challenges in IP based SOC designTrends and challenges in IP based SOC design
Trends and challenges in IP based SOC designAishwaryaRavishankar8
 
Polyteda Power DRC/LVS July 2016
Polyteda Power DRC/LVS July 2016Polyteda Power DRC/LVS July 2016
Polyteda Power DRC/LVS July 2016Oleksandra Nazola
 
Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...
Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...
Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...Michael Gschwind
 
ODSA Sub-Project Launch
ODSA Sub-Project LaunchODSA Sub-Project Launch
ODSA Sub-Project LaunchODSA Workgroup
 
ODSA Sub-Project Launch
 ODSA Sub-Project Launch ODSA Sub-Project Launch
ODSA Sub-Project LaunchNetronome
 
5 pipeline arch_rationale
5 pipeline arch_rationale5 pipeline arch_rationale
5 pipeline arch_rationalevideos
 
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...Matteo Ferroni
 
Tool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software EngineeringTool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software EngineeringHeiko Koziolek
 
Real time machine learning proposers day v3
Real time machine learning proposers day v3Real time machine learning proposers day v3
Real time machine learning proposers day v3mustafa sarac
 
Systems on chip (so c)
Systems on chip (so c)Systems on chip (so c)
Systems on chip (so c)sandeep1721
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLNordic APIs
 

Ähnlich wie FUSION APU & TRENDS/ CHALLENGES IN FUTURE SoC DESIGN (20)

Open Hardware for All - Webinar March 25, 2015
Open Hardware for All - Webinar March 25, 2015Open Hardware for All - Webinar March 25, 2015
Open Hardware for All - Webinar March 25, 2015
 
Learn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFVLearn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFV
 
GTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWERGTC15-Manoj-Roge-OpenPOWER
GTC15-Manoj-Roge-OpenPOWER
 
Sonic 7 Hentchel Performance Tuning
Sonic 7 Hentchel   Performance TuningSonic 7 Hentchel   Performance Tuning
Sonic 7 Hentchel Performance Tuning
 
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
 
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined InfrastructureRed Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
Red Hat® Ceph Storage and Network Solutions for Software Defined Infrastructure
 
PowerDRC/LVS 2.2 released by POLYTEDA
PowerDRC/LVS 2.2 released by POLYTEDAPowerDRC/LVS 2.2 released by POLYTEDA
PowerDRC/LVS 2.2 released by POLYTEDA
 
Trends and challenges in IP based SOC design
Trends and challenges in IP based SOC designTrends and challenges in IP based SOC design
Trends and challenges in IP based SOC design
 
Polyteda Power DRC/LVS July 2016
Polyteda Power DRC/LVS July 2016Polyteda Power DRC/LVS July 2016
Polyteda Power DRC/LVS July 2016
 
Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...
Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...
Michael Gschwind, Cell Broadband Engine: Exploiting multiple levels of parall...
 
ODSA Sub-Project Launch
ODSA Sub-Project LaunchODSA Sub-Project Launch
ODSA Sub-Project Launch
 
ODSA Sub-Project Launch
 ODSA Sub-Project Launch ODSA Sub-Project Launch
ODSA Sub-Project Launch
 
5 pipeline arch_rationale
5 pipeline arch_rationale5 pipeline arch_rationale
5 pipeline arch_rationale
 
01-06 OCRE Test Suite - Fernandes.pdf
01-06 OCRE Test Suite - Fernandes.pdf01-06 OCRE Test Suite - Fernandes.pdf
01-06 OCRE Test Suite - Fernandes.pdf
 
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
[EWiLi2016] Towards a performance-aware power capping orchestrator for the Xe...
 
Tool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software EngineeringTool-Driven Technology Transfer in Software Engineering
Tool-Driven Technology Transfer in Software Engineering
 
Real time machine learning proposers day v3
Real time machine learning proposers day v3Real time machine learning proposers day v3
Real time machine learning proposers day v3
 
Systems on chip (so c)
Systems on chip (so c)Systems on chip (so c)
Systems on chip (so c)
 
ECI OpenFlow 2.0 the Future of SDN
ECI OpenFlow 2.0 the Future of SDN ECI OpenFlow 2.0 the Future of SDN
ECI OpenFlow 2.0 the Future of SDN
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 

Mehr von Pankaj Singh

An Approach to Overcome Modeling Inaccuracies for Performance Simulation Sig...
An Approach to Overcome Modeling  Inaccuracies for Performance Simulation Sig...An Approach to Overcome Modeling  Inaccuracies for Performance Simulation Sig...
An Approach to Overcome Modeling Inaccuracies for Performance Simulation Sig...Pankaj Singh
 
Unified methodology for effective correlation of soc power
Unified methodology for effective correlation of soc powerUnified methodology for effective correlation of soc power
Unified methodology for effective correlation of soc powerPankaj Singh
 
Overcoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designsOvercoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designsPankaj Singh
 
Qualifying a high performance memory subsysten for Functional Safety
Qualifying a high performance memory subsysten for Functional SafetyQualifying a high performance memory subsysten for Functional Safety
Qualifying a high performance memory subsysten for Functional SafetyPankaj Singh
 
Safety Verification and Software aspects of Automotive SoC
Safety Verification and Software aspects of Automotive SoCSafety Verification and Software aspects of Automotive SoC
Safety Verification and Software aspects of Automotive SoCPankaj Singh
 
Managing securityforautomotivesoc
Managing securityforautomotivesocManaging securityforautomotivesoc
Managing securityforautomotivesocPankaj Singh
 
Panel:The secret of Indian leadership in Electronic Design skill... From Desi...
Panel:The secret of Indian leadership in Electronic Design skill... From Desi...Panel:The secret of Indian leadership in Electronic Design skill... From Desi...
Panel:The secret of Indian leadership in Electronic Design skill... From Desi...Pankaj Singh
 
Power Optimization with Efficient Test Logic Partitioning for Full Chip Design
Power Optimization with Efficient Test Logic Partitioning for Full Chip DesignPower Optimization with Efficient Test Logic Partitioning for Full Chip Design
Power Optimization with Efficient Test Logic Partitioning for Full Chip DesignPankaj Singh
 

Mehr von Pankaj Singh (9)

An Approach to Overcome Modeling Inaccuracies for Performance Simulation Sig...
An Approach to Overcome Modeling  Inaccuracies for Performance Simulation Sig...An Approach to Overcome Modeling  Inaccuracies for Performance Simulation Sig...
An Approach to Overcome Modeling Inaccuracies for Performance Simulation Sig...
 
Unified methodology for effective correlation of soc power
Unified methodology for effective correlation of soc powerUnified methodology for effective correlation of soc power
Unified methodology for effective correlation of soc power
 
Overcoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designsOvercoming challenges of_verifying complex mixed signal designs
Overcoming challenges of_verifying complex mixed signal designs
 
Qualifying a high performance memory subsysten for Functional Safety
Qualifying a high performance memory subsysten for Functional SafetyQualifying a high performance memory subsysten for Functional Safety
Qualifying a high performance memory subsysten for Functional Safety
 
Safety Verification and Software aspects of Automotive SoC
Safety Verification and Software aspects of Automotive SoCSafety Verification and Software aspects of Automotive SoC
Safety Verification and Software aspects of Automotive SoC
 
Thesis
ThesisThesis
Thesis
 
Managing securityforautomotivesoc
Managing securityforautomotivesocManaging securityforautomotivesoc
Managing securityforautomotivesoc
 
Panel:The secret of Indian leadership in Electronic Design skill... From Desi...
Panel:The secret of Indian leadership in Electronic Design skill... From Desi...Panel:The secret of Indian leadership in Electronic Design skill... From Desi...
Panel:The secret of Indian leadership in Electronic Design skill... From Desi...
 
Power Optimization with Efficient Test Logic Partitioning for Full Chip Design
Power Optimization with Efficient Test Logic Partitioning for Full Chip DesignPower Optimization with Efficient Test Logic Partitioning for Full Chip Design
Power Optimization with Efficient Test Logic Partitioning for Full Chip Design
 

Kürzlich hochgeladen

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Kürzlich hochgeladen (20)

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

FUSION APU & TRENDS/ CHALLENGES IN FUTURE SoC DESIGN

  • 1. FUSION APU AND TRENDS/ CHALLENGES IN FUTURE SOC (PROCESSOR) DESIGN Pankaj Singh, Acknowledgement: Denis Foley. Sr. Fellow, AMD 9th International SoC Conference 2nd & 3rd November 2011
  • 2. 2 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 TODAY’S TOPICS  Trends: – Three Eras of Processor Performance – Evolution of Heterogeneous Computing  FSA and Open Standard: – Why Fusion ? – Open Standard, Open CL  Power, Performance  High Speed, Scalable Interconnect: NoC’s  3-D Stacking  SoC Trends & Challenges – Verification Effort – IP Integration – TLM, RTL Co-simulation challenges.
  • 3. 3 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 TRENDS: THREE ERAS OF PROCESSOR PERFORMANCE Single-Core Era Single-threadPerformance ? Time we are here o Enabled by:  Moore’s Law  Voltage Scaling  MicroArchitecture Constrained by: Power Complexity Multi-Core Era ThroughputPerformance Time (# of Processors) we are here o Enabled by:  Moore’s Law  Desire for Throughput  20 years of SMP arch Constrained by: Power Parallel SW availability Scalability Heterogeneous Systems Era TargetedApplication Performance Time (Data-parallel exploitation) we are here o Enabled by:  Moore’s Law  Abundant data parallelism  Power efficient GPUs Currently constrained by: Programming models Communication overheads
  • 4. 4 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 TRENDS: EVOLUTION OF HETEROGENEOUS COMPUTINGArchitectureMaturity&ProgrammerAccessibility PoorExcellent 2012 - 20202009 - 20112002 - 2008 Graphics & Proprietary Driver-based APIs Proprietary Drivers Era  “Adventurous” programmers  Exploit early programmable “shader cores” in the GPU  Make your program look like “graphics” to the GPU  CUDA™, Brook+, etc OpenCL™, DirectCompute Driver-based APIs Standards Drivers Era  Expert programmers  C and C++ subsets  Compute centric APIs , data types  Multiple address spaces with explicit data movement  Specialized work queue based structures  Kernel mode dispatch Fusion™ System Architecture GPU Peer Processor Architected Era  Mainstream programmers  Full C++  GPU as a co-processor  Unified coherent address space  Task parallel runtimes  Nested Data Parallel programs  User mode dispatch  Pre-emption and context switching More uptodate information on FSA: http://developer.amd.com/afds/pages/keynote.aspx#/Dev_AFDS_Reb_2
  • 5. 5 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 FSA & OPEN STANDARD: ENTER FUSION Dual Core CPU Northbridge DirectX®11 GPU FUSION APU (Accelerated Processing Unit) Heterogeneous compute engine combining x86 compute and parallel processing capabilities of the GPU on a single die
  • 6. 6 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 FSA & OPEN STANDARD: WHY FUSION? 6  Integrating CPUs, Northbridge and GPU enables: – Unified Memory – High-bandwidth, low latency access by GPU – Saves on interface power and PHY area – Shared Power Control and TDP envelope Potential bandwidth bottleneck Relatively long memory latency
  • 7. 7 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 COMMITTED TO OPEN STANDARDS  AMD drives open and de-facto standards – Compete on the best implementation  Open standards are the basis for large ecosystems  Open standards always win over time – SW developers want their applications to run on multiple platforms from multiple hardware vendors DirectX®
  • 8. 8 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 OPENCL™ AND FSA  FSA is an optimized platform architecture for OpenCL™ – Not an alternative to OpenCL™  OpenCL™ on FSA will benefit from – Avoidance of wasteful copies – Low latency dispatch – Improved memory model – Shared pointers  FSA also exposes a lower level programming interface, for those that want the ultimate in control and performance  Optimized libraries may choose the lower level interface
  • 10. 10 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 POWER-THERMAL EFFECTS IN SYSTEMS ON CHIPS ¡ Local failures ! Part not working  Complex SoCs: High power density  Non-uniform power dissipation: Hotspots  Spatial gradients: Cause malfunctions  High on-chip temperatures cause malfunctions affecting reliability.  Power consumption depends on frequency  Setting frequencies to control power and temperature
  • 11. 11 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 OPTIONS FOR POWER SAVINGS  Convergence of Performance and Low Power – Notebook->Netbook-> Tablet Tablet<-Smartphone
  • 12. 12 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 PERFORMANCE AND POWER S3 idle Static Screen MM07 Media Playback Full Compute APU Power vs. Use Case Performance Power  Performance versus Power Efficiency  Power Management versus Power reduction  Performance & Thermal Design Power
  • 14. 14 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 NOC’S: FROM BUSES TO NETWORKS: [Friedman Harel:10] Note: This slide presents industry specific information does not relate to AMD NoC status
  • 15. 15 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 NOC CHALLENGES: CAD TOOLS  Capturing application traffic.  Which Topology ?  Mapping? Routes to use?  Fixing communication architecture : parameters.  Verification for correctness, performance.  Build models.  QoS under un-reliable conditions. Key to success: Automate & integrate the steps. Mesh Topology homogeneous systems, with regular tiles Customized Topology heterogeneous systems, with different cores & irregular FP Software Services Mapping, QoS, middleware... Architecture Packeting, buffering, flow control... Physical Implementation Synchronization, wires, power... CAD Tools
  • 16. 16 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 Synchronous Delay Insensitive Global None Timing Assumptions Less Detection Local Clocks, Interaction with data (becoming aperiodic)  A complete spectrum of approaches to system-timing exist [Mullins06-07] NOC CHALLENGES: BEYOND GLOBAL SYNCHRONY Delay Insensitive
  • 18. 18 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 3-D STACKING  Supporting Heterogeneous computing: high density, high performance, high memory B.W requirement.  3-D NoC’s option  Futuristic view: Integrating Bio-sensor Note: This slide presents industry specific information does not relate to AMD 3-D stacking status
  • 19. SOC TRENDS & CHALLENGES: 1. VERIFICATION EFFORT 2. IP INTEGRATION 3. TLM-RTL CO-SIMULATION
  • 20. 20 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 WHAT’S NEW IN SOC DESIGN?  Larger and more complex chips with heavy use of pre-existing cores.  Heavy use of multi core processors and DSPs.  Complex Interconnect.  Shorter time to market and Smaller design teams.  … and software.  Leads to: – Increased verification effort: Debugging is harder. – Integration is more difficult. – Need for scalable and high speed interconnect. – SW / HW co-simulation is a major issue. – Power –Performance challenge. – How do we treat the system software?
  • 21. 21 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 VERIFICATION EFFORT  Debugging – Seamless debug across h/w and software[especially SW]  Testbench Development: – Several methodologies  VMM,OVMUVM.  New developments [Unified strategy] – UCIS,UVM TLM2.0 – Coverage trend  Address Gaps in VHDL, System C coverage
  • 22. 22 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 VERIFICATION EFFORT  Creating/Running Testcase: – Direct & Random – Run time improvement Save-restore. Verification Cycle per second instead of Cycles per second: Configuring environment to dynamically select relevant design/core. Alternate options
  • 23. 23 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 Emulation Focus Areas: 1. Tests/regression run with Long run time 2. Corner case bugs that may escape traditional verification 3. Replicating System level scenarios Ongoing Initiatives/Need: 1.Seemless support for assertions. 2.Improve portability between Simulation & Emulation 3. Common model from TLM-HDL-Emulation VERIFICATION EFFORT  Alternate Options
  • 24. 24 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 IP INTEGRATION CHALLENGE  Integration of IP : – Multiple IP’s, various configurations, design languages – IP’s to be in Sync: macro’s , libraries. – Complexity increases with mixed language designs SYSTEM C SVLO G VERILOG VHDL Unique Strengths of Languages Diversity of Design Teams Importing Existing IP Legacy Testbench Environment
  • 25. 25 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 IP INTEGRATION CHALLENGE COMPARISON OF CHOICES Direct Instantiation SV Bind Construct SystemC Control/Observe SCV- Connect() SC-DPI Source Code Available Yes Yes Yes Yes Yes One IP Compiled Yes Yes Yes Yes Yes Both IP Compiled No Yes No No No Performance ++++ (3) +++ (2) + (1) + (1) +++++(4) Delta Delay Yes Yes No No No Languages Supported SV, SC, VHDL SV, SC, VHDL SC + SV/VHDL SC + SV/VHDL SC + SV Gap: No standardized automated methodology for integration. Recommended Approach: • Understand IP blocks: language, source code availability. • Understand connection: 1-1, distributed, method port • Option for optimized solution to quickly build a system
  • 26. 26 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 IP INTEGRATION CHALLENGE: GAPS WITH ANALOG IP INTEGRATION IN SOC Table1. Gaps with Analog IP Integration in SoC Gaps Root Cause Testchip setup -Testchip scenario is different -Tester used for testchip differs Inbuilt debug -Incomplete inbuilt SoC test/debug capability or derisk option for basic functionality such as PLL clock IP I/F verification -Incomplete test setup Review process -No common detailed review process between IP and SoC team. Incorrect assumption based on past analog IP working silicon IP Modelling -Mismtach in version between IP simulation model and spice netlist -Limitations of behavioral model to replicate actual analog IP functionality -Timing issue -DFT issue EDA tools -Gaps in analog and digital simulation environment
  • 27. 27 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011  Verification Environment Bring-up – Automated Assertions for early checks. – Review forces, tie-off and relevant checkers from IP to SoC – Bottleneck for SoC team to get started with verification: Option to use fake model for initial bring up. Usage of system model. – Super Block Concept: pre-verified IP blocks at similar frequency & interface  Requirement:  Current solution: In-house methodology and process. No clear solution from EDA vendors. IP INTEGRATION CHALLENGE IP Block1 IP Block2 Minimum Manual Effort Hookup Using ICU No BUGS!
  • 28. 28 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 TLM, RTL Co-simulation  Traditional use of System level models : Architecture profiling & Performance Analysis  Increasing Demand for Co-simulation: Tradeoff between Accuracy and Performance.  Open Challenges  Different level of Abstraction.  Need for improvement in Integration methodology and Test bench development  Seamless Debug and Coverage methodology.  Using System Level model for HDL generation  Legacy system model not written with conversion in mind.  Current limitation: Incomplete translation.  Lack of reliable Equivalence Check tool.  Need: Merge top down (SystemC) and bottom-up (System Verilog) methodology/flow.  Gaps/Work to do: How to do Power analysis
  • 29. 29 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 THANK YOU!
  • 30. 30 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 REFERENCES [1] Wilson Research Group-MGC study blog 2011. [2] AMD Coolchip2011 presentation. Denis Foley, AMD Sr. Fellow. [3] Fusion Processors and HPC-2011, Chuck Moore, AMD Corporate Fellow & Technology Group CTO [3] AMD Fusion Developer Summit 2011. Phil Rogers, AMD Corporate Fellow [4] Fully Asynchronous framework for GALS network on chip. Friedman H [5]Future of EE, NoC’s presentation. Dr. Srinivasan Murali [6] Analog IP integration in SoC, IP reuse’09. Mixed language IP integration DVCoN 2010. Extending Fucntional coverage to SystemC, VHDL-IP’10. Pankaj S
  • 31. 31 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 GLOSSARY  GPU – Graphics processing unit  APU: Accelerated Processing Unit  Open CL: Open Computing Language  TDP – Thermal Design power – a measure of a design infrastructure’s ability to cool a device  NoC: Network On Chip  TLM: Transaction Level Modeling  Turbo Core – AMD boost mechanism  QoS: Quality of Service  UVM: Universal Verification Methodology  UCIS: Unified Coverage Interoperability Standard
  • 32. 32 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 BACKUP
  • 33. 33 | 9th Intl. SoC Conference| Nov 2nd,3rd, 2011 Disclaimer The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD makes no representations or warranties with respect to the contents hereof and assumes no responsibility for any inaccuracies, errors or omissions that appear in this information. AMD specifically disclaims any implied warranties of merchantability or fitness for any particular purpose. In no event will AMD be liable to any person for any direct, indirect, special or other consequential damages arising from the use of any information contained herein, even if AMD is expressly advised of the possibility of such damages. Trademark Attribution AMD, the AMD Arrow logo, AMD Athlon, AMD Phenom, AMD Turion, AMD Radeon, and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. Microsoft, Windows and DirectX are registered trademarks of Microsoft Corporation in the United States and/or other jurisdictions. PCIe is a registered trademark of PCI-SIG. Other names used in this presentation are for identification purposes only and may be trademarks of their respective owners. ©2011 Advanced Micro Devices, Inc. All rights reserved.