The new AMD Opteron™ processor: The core of the cloud
* Designed for the inflection point around the hyper-efficient, virtualized Cloud
* Strong OEM and end-customer support out of the gate
* Superior performance, ranging from 24% to 84% in key trending workloads
* Increased virtualization scalability
* As much as 56% lower power-per-core
* Perfectly matched architecture for today’s highly threaded workloads including cloud, web, virtualization, database and HPC
1. THE CORE OF THE CLOUD
AMD Opteron™ 6200 and 4200 Series Processors
2. THE UNDENIABLE BUSINESS AND I.T . UPLIFT
FROM CLOUD-BASED ECONOMICS
Clouds
UNIX and
PCs Virtualized
Mainframes
and Minis and Cloud
Distributed Cloud deployments accelerate
Traditional IT gains in business agility and
efficiency while speeding the
rate of information exchange
TIMELINE
2 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
3. SUCCESSFUL NEW SERVER PLATFORMS MUST BALANCE
WORKLOADS AND DATA CENTER REQUIRMENTS
DATABASE VIRTUALIZATION CLOUD HPC
Performance for Scalability for Economics for
data centers Virtualization the Cloud
3 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
4. AVAILABLE TODAY:
SERVERS FEATURING THE ALL NEW AMD OPTERON PROCESSOR
Performance for data centers
Up to 84% higher performance*
Scalability for Virtualization
73% more memory bandwidth*
A BETTER OPTION FOR Economics for the Cloud
THE HYPEREFFICIENT, 1/2 the power-per-core*
VIRTUALIZED,
CLOUD-READY WORLD Requires 2/3 less floor space*
1/3 to 2/3 lower platform price*
*See backup processor and performance information on slides 22-23.
4 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
5. AMD “BULLDOZER” TECHNOLOGY:
BUILT TO WORK FOR SERVER TRENDS
World’s first 16 core x86 CPU is an innovative
architecture like nothing else in the market
– Flex FP – the world’s only dynamic, flexible floating
point complex
– Revamped integrated memory controller
– AMD Turbo CORE technology for additional
performance
– TDP Power Cap
Driving huge benefits to customers
– Outstanding value and power efficiency
– Modular architecture for future innovation
Flexible Shared & Dedicated Resources approach
helps maximize power efficiency and costs
– Shared components maximize performance per
watt Dedicated Components
– Dedicated components to help increase Shared at the module level
performance and scalability
Shared at the chip level
5 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
6. INTRODUCING THE NEW AMD OPTERON 6200 AND 4200 SERIES PROCESSORS
AMD Opteron™ 6200 Series Processor AMD Opteron™ 4200 Series Processor
(“Interlagos”) (“Valencia”)
The world’s first x86 The world’s lowest x86
16-core processor1 power-per-core2
Scalability Up to 4 sockets with up to 16 cores Up to 2 sockets with up to 8 cores
Memory 4 memory channels up to 1600 MHz memory 2 memory channels up to 1600 MHz memory
Frequency Up to 3.3 GHz base frequency & up to 3.7 GHz frequency using AMD Turbo CORE technology*
Cache: L1 - 16KB data per core + 64KB instruction Cache: L1 - 16KB data per core + 64KB instruction
Cache per module; per module; L2 - 1MB per core; L3 – 8MB per
L2 - 1MB per core; L3 - 16MB per socket socket
Four x16 HyperTransport™ technology 3.0 links Three x16 HyperTransport™ technology 3.0 links
I/O
@ up to 6.4GT/s per link @ up to 6.4GT/s per link
85W to 140 W TDP (Consistent with AMD Opteron 6100 35 to 95W TDP (Consistent with AMD Opteron 4100
Power
Series) Series)
1 Based on 16-core AMD Opteron 6200 Series processor compared to 6-core Intel Xeon 5600 Series and 10-core Intel Xeon E7 processors.
2 As of Nov 1, 2011, AMD Opteron™ processor Models 4200 EE have the lowest known power per core of any x86 server processor, at 35W TDP (35W /8
= 4.375W/core). Intel 's lowest power per core server processor, L5630, is 40W TDP (40W/4 = 10W/core). See
http://www.intel.com/Assets/PDF/prodbrief/323501.pdf. Previous record held by AMD Opteron processor Models 4100 EE at 35W TDP / 6 cores = 5.83
W/core.
6 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
7. WORLD’S MOST CONFIGURABLE CORES FOR AN AGILE CLOUD
Platform & Virtualization
Processing Flexibility Power Flexibility
Flexibility
AMD Turbo CORE TDP Power Cap Highest x86 core
count* for more VMs,
AMD Core Select C6 state better server utilization
Multiple Flex FP 6 power bands Scalable for rapid
processing modes growth/expansion
Low & Ultra Low Voltage
Virtual Cache Partitioning memory support 1P to 4P with same
New technical computing platform and chipset
instructions Supports 4 to 64 cores
Up to 46% lower power
Up to 84% greater at idle*; as little as Multi-generational
performance* 4.375W per core platform consistency
*See backup processor and performance information on slides 24.
7 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
8. AMD TURBO CORE TECHNOLOGY
Base frequency with All core boost activated Max turbo activated
TDP headroom (up to 500MHz) (up to 1GHz+, half cores)
+
All Core Boost Max Turbo Boost
When there is TDP headroom in a When a lightly threaded workload sends half the
given workload, AMD Turbo CORE “Bulldozer” modules into C6 sleep state but also
technology is automatically activated requests max performance, AMD Turbo CORE
and can increase clock speeds by technology can increase clock speeds by up to
300-500 MHz* across all cores. 1 GHz+* across half the cores.
*Based on AMD Opteron™ 6200 Series processors with up to 300 MHz in P1 boost state and up to 1 GHz+ in P0 boost state over base P2 clock frequency.
8 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
9. FLEX FP 256-BIT FPU AND NEW TECHNICAL COMPUTING INSTRUCTIONS
Instructions Applications/Use Cases
SSSE3, SSE4.1, • Video encoding and transcoding
SSE4.2 • Biometrics algorithms
(AMD and Intel) • Text-intensive applications
• Application using AES encryption
AESNI • Secure network transactions
PCLMULQDQ • Disk encryption (MSFT BitLocker)
(AMD and Intel) • Database encryption (Oracle)
• Cloud security
Floating point intensive applications:
• Signal processing / Seismic
• A flexible floating point unit shared AVX • Multimedia
between 2 integer cores (AMD and Intel) • Scientific simulations
• Financial analytics
• Simultaneously executes two 128-bit • 3D modeling
instructions or one 256-bit instruction • Vector and matrix multiplications
FMA4 • Polynomial evaluations
• Saves die space and conserves power (AMD Unique) • Chemistry, physics, quantum mechanics and
for majority of non-FP applications digital signal processing
• Dedicated floating point scheduler, XOP
• Numeric applications
which minimizes latency for floating • Multimedia applications
(AMD Unique) • Algorithms used for audio/radio
point applications
9 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
10. DESIGNED TO DRIVE DOWN POWER REQUIREMENTS
More Low Power Reduces Idle CPU Enables More Power Intelligent Circuit
Memory Choices Power By Up to 46%² Control for IT Design
Low and Ultra Low C6 power state TDP Power Cap All New Design
Voltage Memory Shuts down clocks and Flexibility to set power Minimizes the number
1.35v DIMMs reduce power to idle cores limits without capping of active transistors for
voltage by 10%; frequency lower power and better
1.25v DIMMs reduce performance
voltage by 16%¹
Up to 56% better power-per-core than Xeon³
¹Regular voltage=1.5v, low voltage=1.35v, ultra-low voltage=1.25v; ² ² Based on internal testing as of 8/2011: AMD Opteron™ processor model 6174 (12-core 2.2GHz) consumes 11.7W in
active idle C1E power state, while AMD Opteron™ processor model 6276 (16-core 2.3GHz) consumes only 6.4W in the active idle C1E power state with new C6 power gating employed.
System configuration: “Drachma” reference design kit, 32GB (8 x 4GB DDR3-1333) memory, 500GB SATA disk drive, Microsoft® Windows Server® 2008 x64 Enterprise Edition R2. SVR-
60; ³based on AMD Opteron 4200 Series processor with 8 cores at 35W TDP versus lowest wattage, highest core Intel Xeon processor with 6 cores at 60W TDP.
10 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
11. TDP POWER CAP – THE ULTIMATE POWER AND UTILIZATION TOOL
6000W Rack Example
Allows granular control of power
envelopes
Maximizes the density of servers
in the data center
Convert untapped space in the
rack into additional density
Customer-definable, up to 1W
increments
Many workloads will still operate 17 2U servers @ 20 2U servers @
within the same performance 350W max power 300W max power
range despite the lower TDP 5,950W total power 6,000W total power
8U of wasted space 17% more servers
11 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
12. BEATING THE MOST POPULAR XEON PROCESSORS BY 89%
AMD Opteron™ processor model 6276 beats Intel‟s most popular Xeon by 89%, at the same price
AMD delivers clearly superior performance where customers are actually spending their money
Intel 2011 Xeon 5600 Series Shipments1 Xeon E5640 AMD Opteron 6276
Xeon X5680 100%
160
Xeon X5677 90% 140
Xeon X5670 80%
89%
120
Xeon X5667 70%
100
Xeon X5660 60%
50% 80
Xeon X5650
40%
Xeon E5640 60
30%
Xeon E5630 40
20%
Xeon E5620
10% 20
0% 0
1Q11 2Q11 3Q11 Price Performance
1 Source:
Mercury Research, PC Processors and Chip Sets Updated Edition 3Q2011
²SPECint®_rate2006 Score= 480, 2 x AMD Opteron™ processors Model 6276 in Supermicro A+ Server 1022-URF server, 128GB (16 x 8GB DDR3-1333) memory, Red Hat Enterprise Linux 6.1, x86
Open64 4.2.5.2 Compiler Suite, based on AMD internal testing in Oct‟11. SPECint®_rate2006 Score = 253, 2 x Intel Xeon processors Model X5640 in Cisco UCS C210 M2 , 48GB (12 x 4GB DDR3-
1333) memory, SUSE Linux Enterprise Server 11 SP1, Intel C++ Compiler XE Version 12.0.1.116 Build 20101116, published Mar‟11 at http://www.spec.org/cpu2006/results/res2011q2/cpu2006-
20110329-15460.html
12 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
13. SAME PRICE, BETTER PERFORMANCE, WITH 2-3X THE BENEFIT¹
Intel AMD Advantage
2 AMD Opteron™ Series
Processor 2 Xeon E5640
Processors Model 6220
25% faster
Memory 16GB 32GB Twice the memory
RAID Controller None RAID 5 Controller Higher availability
Hard Drives 1 x 146GB 3 x 146GB Triple the storage
Price $4,736 $4,576 3% lower price
Helping businesses get the most for their budget
1 Pricing and specifications from www.hp.com as of 10/26/11. As of 10/26/11, pricing for the HP DL385 with AMD Opteron processor Model 6220 (1ku pricing $523) is
not yet available. The price of an HP DL385 with AMD Opteron processors Model 6134 (1ku pricing $523) was used as a proxy because pricing for AMD Opteron 6100
and 6200 Series processor-based servers is expected to be very similar. Server configurations are identical except on the specifications listed above and include
standard warranty, power supply and no OS. Difference in processor speeds is based on the top SPECint®_rate2006 results published by, or submitted by AMD to,
SPEC as of 10/26/2011 for 2P servers with the above identified processors and may differ from the performance of the servers priced and described above. For the
latest results, visit www.spec.org. See backup slide for configuration details.
13 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
14. AMD OPTERON ADVANTAGES FOR VIRTUALIZATION
Virtualization Lower Cost Greater Virtualization
per VM Density
Microsoft Hyper-V Intel AMD
VMware vSphere
Microsoft Remote FX
VMware View
Xen, KVM
Customer Requirements:
Core density At 1VM per core, AMD can host
2P 4P
High memory addressability 672 VMs in a standard 42U rack
Estimated 72 – 77% platform The same load would require
Large L3 cache
level savings on like to like 2.6 racks of Intel systems²,
Cost efficiency configurations¹ 3X the floor space
¹, ² See backup data on slide 25
14 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
15. PERFORMANCE THAT DRIVES THE CLOUD
Web / Cloud Unmatched Power Core Density
Efficiency for Greater Throughput
Windows® Azure Xeon 5690 AMD Opteron 6282
12
LAMP Stack 140
10
Java 8 120
100
Hadoop 6
80
4
OpenStack 60
2
40
0 20
Customer Requirements: Xeon AMD AMD
0
Power efficiency 5600 Opteron Opteron
6200 4200 Spec Int SPEC JBB
Core scalability
Node and thread density Lowest x86 watts/core in the 24-25% better performance in
High throughput industry: 5.3W for “Interlagos” web-focused workloads²
and 4.375W for “Valencia”¹
¹, ² See backup data on slide 26
15 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
16. AMD IS POWERING THE CLOUD TODAY
AMD’s Private Cloud
115K cores at work and counting
Supports thousands of engineers worldwide
40M+ tasks executed per month
90% utilization rate for both cores and systems
100% of engineering projects in the company
reside in the cloud
High core count, scalability and
power efficiency are driving
clouds to adopt AMD
There are more than 2 million
AMD Opteron™ processors at
work today in the cloud
16 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
17. THE HPC LEADER
HPC 73GB/s Superior Greatest FLOPs
memory Performance¹ per Sq. Foot
throughput3
Linux OS Xeon 5670 AMD Opteron 6276
Open64 73% more 200
memory
GCC bandwidth
150
than Intel3
PGI Compilers
100
Maximum
Customer Requirements: cores 50
Scalable performance per rack2
-
Strong floating point
NAMD
WRF
LAMMPS
SPEC FP
STREAM
LINPACK
performance More FLOPs
High memory throughput per sq. foot2
With almost twice the FLOPs
More cores for highly per sq. ft. with “Interlagos”, it
threaded apps would take Intel almost 2 racks
33% lower 24-84% better performance at
Wide range of technical cost per core4 significantly lower price¹ to match AMD in density and
instructions performance²
¹-3 See complete benchmark data on slides 27-29.
17 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
18. HPC: THE EARLY ADOPTER MARKET
HPC acts as leading indicator for new technology Half a million "Bulldozer" cores have already shipped to
Adopts and proves viability of new performance and some of the most demanding customers in the world
efficiency trends U.S. Department of Energy
As of June 2011, 1/3 of the world‟s top 100 – Oak Ridge National Lab
– Sandia National Lab
Supercomputers are based on the AMD Opteron™
– Los Alamos National Lab
processor – Pacific Northwest National Lab
AMD – the power behind the first system to break the National Oceanic & Atmospheric Administration
petaflop barrier - currently powers 4 petaflop-capable top
Department of Defense
10 systems
The HECToR initiative in Europe
University of Stuttgart‟s High Performance Computing Center
“ORNL is deploying more than 25,000 of AMD‟s „Interlagos‟ processors
over the next few months as we upgrade Jaguar to the new Titan system.
Our users are excited about the increase in performance over previous-
generation processors, and our sponsors are delighted with the power
savings that will make Titan one of the world‟s most powerful and efficient
research tools.”
Buddy Bland
Project Director, ORNL Leadership Computing Facility
18 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
19. CUSTOMERS AND PARTNERS
Systems and Infrastructure Software and Ecosystem Partners
Range of Infrastructure platforms
More than 30
expected server platforms
19 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
20. THE NEW AMD OPTERON™ PROCESSOR:
THE CORE OF THE CLOUD
Designed for the inflection point around the
hyper-efficient, virtualized Cloud
Strong OEM and end-customer support out of
the gate
Superior performance, ranging from 24% to 84%
in key trending workloads
Increased virtualization scalability
As much as 56% lower power-per-core
Perfectly matched architecture for today‟s highly
threaded workloads including cloud, web,
virtualization, database and HPC
20 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
21. BACKUP
21 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
22. DATA:
AVAILABLE TODAY: SERVERS FEATURING THE ALL NEW AMD OPTERON PROCESSOR
84% Better performance:
LINPACK (2P) AMD Opteron processor Model 6276 generates 84% more FLOPS than Intel Xeon processor Model X5670
– 239.1 FLOPS, AMD Opteron 6276, Trial 127, STEP B2G, Node 1, PROC. 32, PARAM WR01R2R4, N 83200, NB 100, P 4, Q 8, E .81, FC
gfortran 4.6, BLAS ACML 5.0.0.0, FCFLAGS -fomit-frame-pointer -O3 -funroll-loops -W -Wall -mavx -mfma4, NUMA hwloc, MPI OMPI 1.5.3,
DIMM MHz 1600, DIMM# 8, DIMM GB 8, Total Capacity 64, System SM H8DGT, OS SLES11 SP1, Kernel 2.6.32.28-fam15h-default, Notes
HPC mode
– 130.1 FLOPS, Intel Xeon X5670, Trial 5, STEP NA, Node 1, PROC. 1, PARAM WR01R2R4, N 51744, NB 112, P 2, Q 6, E .93, FC ifort 11.1,
BLAS MKL 10.3, FCFLAGS O3 -w -ansi-alias -i-static -openmp -nocompchk, NUMA –localalloc, MPI OMPI 1.5.1, DIMM MHz 1333, DIMM# 4,
DIMM GB 6, Total Capacity 24, System SM X8DTT, OS SLES11, Kernel 2.6.27, Notes Turbo
73% better scalability:
STREAM (2P) AMD Opteron processor Model 6276 has 73% higher memory bandwidth than Intel Xeon processor Model X5670
– 73, 2 x AMD Opteron™ processors Model 6276 in Supermicro H8DGT, 64GB (8 x 8GB DDR3-1600) memory, SuSE Linux® Enterprise Server
11 SP1 64-bit, x86 Open64 4.2.5-1 Compiler Suite
– 42, 2 x Intel Xeon processors Model X5670 in Supermicro X8DTT server, 24GB (6 x 4GB DDR3-1333) memory, SuSE Linux® Enterprise Server
11 SP1 64-bit, Intel Compiler v11.1.064
22 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
23. DATA:
AVAILABLE TODAY: SERVERS FEATURING THE ALL NEW AMD OPTERON PROCESSOR
1/2 the power-per-core*
– As of Nov 1, 2011, AMD Opteron™ processor Models 4200 EE have the lowest known power per core of any x86 server processor, at 35W
TDP (35W/8 = 4.375W/core). Intel 's lowest power per core server processor, L5630, is 40W TDP (40W/4 = 10W/core). See
http://www.intel.com/Assets/PDF/prodbrief/323501.pdf. Previous record held by AMD Opteron processor Models 4100 EE at 35W TDP / 6
cores = 5.83 W/core.
Requires 2/3 less floor space*
– VMs/rack (2P and 4P) One rack of AMD Opteron 6200 Series-based servers can support 672 VMs (1 VM per core, 2U servers)
– This would take three racks and 56 2U Intel Xeon 5600 Series-based servers to do the same.
– Assumes 1 VM/core, AMD Opteron 6200 Series-based 2P 2U server has up to 32 cores, supports up to 32 VMs/server x 21 servers per rack,
which equals 672 VMs per server. Intel Xeon 5600 Series-based 2P 2U server has up to 12 cores, supports up to 12 VMs/server x 21 servers
per rack, which equals 252 VMs per server, Intel specs as of 11/4/11 at www.intc.com/pricelist.cfm.
1/3 to 2/3 lower platform price*
– Top bin comparisons. Dell R710 with two top bin Intel Xeon processor Model X5690s is $7,103 . Since pricing for a Dell R715 with two top bin
AMD Opteron processor Model 6282 SE is not yet available, the similarly priced AMD Opteron processor Model 6140 was used and the server
yielded a price of $4564. That is a 36% price savings. Both servers were configured with 32GB RAM, 146GB 10K hdd, and 3yr base warranty
and large enterprise pricing is from www.dell.com as of 10/22/11. HP DL 580 with four top bin Intel Xeon processor Model E7-4870 is $29,336
at www.dell.com. Since pricing for an HP DL585 pricing with four top bin AMD Opteron Model 6828 SE is not yet available, the similarly priced
AMD Opteron processor Model 6140 was used and server yielded a price of $11,094. Both servers were configured with 64GB RAM, 72GB
15K hdd, and 3yr base warranty and large enterprise pricing is from www.hp.com as of 10/22/11. VMware vSphere pricing not included,
assuming both servers configured with versions 5.0 or 4.1u2, which are the same price for AMD- and Intel-based servers.
23 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
24. DATA: WORLD’S MOST CONFIGURABLE CORES FOR AN AGILE CLOUD
1 84% Better performance:
– LINPACK (2P) AMD Opteron processor Model 6276 generates 84% more FLOPS than Intel Xeon processor Model X5670
– 239.1 FLOPS, AMD Opteron 6276, Trial 127, STEP B2G, Node 1, PROC. 32, PARAM WR01R2R4, N 83200, NB 100, P 4, Q 8, E .81, FC gfortran 4.6, BLAS ACML 5.0.0.0,
FCFLAGS -fomit-frame-pointer -O3 -funroll-loops -W -Wall -mavx -mfma4, NUMA hwloc, MPI OMPI 1.5.3, DIMM MHz 1600, DIMM# 8, DIMM GB 8, Total Capacity 64, System SM
H8DGT, OS SLES11 SP1, Kernel 2.6.32.28-fam15h-default, Notes HPC mode
– 130.1 FLOPS, Intel Xeon X5670, Trial 5, STEP NA, Node 1, PROC. 1, PARAM WR01R2R4, N 51744, NB 112, P 2, Q 6, E .93, FC ifort 11.1, BLAS MKL 10.3, FCFLAGS O3 -w -
ansi-alias -i-static -openmp -nocompchk, NUMA –localalloc, MPI OMPI 1.5.1, DIMM MHz 1333, DIMM# 4, DIMM GB 6, Total Capacity 24, System SM X8DTT, OS SLES11, Kernel
2.6.27, Notes Turbo
² Based on internal testing as of 8/2011: AMD Opteron™ processor model 6174 (12-core 2.2GHz) consumes 11.7W in active idle C1E power state, while AMD Opteron™ processor
model 6276 (16-core 2.3GHz) consumes only 6.4W in the active idle C1E power state with new C6 power gating employed. System configuration: “Drachma” reference design kit, 32GB
(8 x 4GB DDR3-1333) memory, 500GB SATA disk drive, Microsoft® Windows Server® 2008 x64 Enterprise Edition R2. SVR-60
3 Based on 16-core AMD Opteron 6200 Series processor compared to 6-core Intel Xeon 5600 Series and 10-core Intel Xeon E7 processors.
24 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
25. DATA: “AMD OPTERON ADVANTAGES FOR VIRTUALIZATION”
1Server cost per VM (2P) AMD Opteron 6200-based server cost per VM is up to 72% less expensive
Assumes 1 VM/core, Dell R710 with two top bin Intel Xeon processor Model X5690 is $7,361, has 12 cores and 12VMs, $613/VM.
Since pricing for a Dell R715 with two top bin AMD Opteron processor Model 6282 SE is not yet available, the similarly priced AMD
Opteron processor Model 6140 was used and the server yielded a price of $5406, has 32 cores and 32VMs, $169/VM . Servers
were configured with 32GB RAM, 146GB 10K hdd, and 3yr base warranty and large enterprise pricing is from www.dell.com as of
10/27/11 and 10/28/11.
Server cost per VM (4P) AMD Opteron 6200-based server cost per VM is up to 77% less expensive
Assumes 1 VM/core, HP DL 580 G7 with four top bin Intel Xeon processor Model E7-4870 is $28,936, has 40 cores and 32VMs,
$723/VM. Since pricing for an HP DL585 with four top bin AMD Opteron processor Model 6282 SE is not yet available, the
similarly priced AMD Opteron processor Model 6140 was used and the server yielded a price of $10,698, has 64 cores and 64VMs,
$167/VM. Servers were configured with 64GB RAM, 72GB 15K hdd, and 3yr base warranty and large enterprise pricing is from
www.hp.com as of 11/4/11. VMware vSphere pricing not included, assuming both servers configured with versions 5.0 or 4.1u2,
which are the same price for AMD- and Intel-based servers.
²AMD Opteron 6200 Series-based 2P servers can support up to 32VMs each, assuming 1 VM per core. 21 servers fit in a rack x
32 VMs = 672 VMs per rack. Intel Xeon 5600 Series-based 2P servers can support up to 12VMs each, based on core counts
listed at www.intc.com/pricelist.cfm as of 10/24/11, which equates to 21 servers x 12 VMs = 252 VMs. # of vCPUs for 16 VMs =
max cores/16.
25 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
26. DATA: “PERFORMANCE THAT DRIVES THE CLOUD”
¹Intel Xeon processor Model L5609 is 10W/core (40W TDP/4 cores) as of 10/24/11, see http://www.intc.com/pricelist.cfm. AMD Opteron
6200 Series “HE” processors will be as low as 85W with 16 cores for 5.3 W/core. AMD Opteron 4200 Series “EE” processors will be as
low as 35W TDP with 8 cores for 4.375W/core.
²SPECint®_rate2006 Score= 526, 2 x AMD Opteron™ processors Model 6282 SE in Supermicro A+ Server 1022-URF server, 128GB (16
x 8GB DDR3-1333) memory, Red Hat Enterprise Linux 6.1, x86 Open64 4.2.5.2 Compiler Suite, based on AMD internal testing in Oct‟11.
SPECint®_rate2006 Score = 421, 2 x Intel Xeon processors Model X5690 in NEC Express5800/R120b-1 server, 96GB (12 x 8GB DDR3-
1333) memory, SUSE Linux Enterprise Server 11 SP1, Intel C++ Compiler XE v12.0.3.174, published Aug‟11 at
http://www.spec.org/cpu2006/results/res2011q3/cpu2006-20110801-17813.html.
SPECjbb2005 Score = 1211915 for AMD Opteron processor Model 6282 SE based on AMD internal testing in Oct‟11. SPECjbb2005
Score = 975257 for Intel Xeon processor Model X5690. Cisco UCS B200 M2 (Intel Xeon X5690, 3.46GHz) 48GB(12 x 4GB 2Rx4 PC3-
10600R, ECC, running 1333MHz) IBM Corporation IBM J9 JVM (build 2.4, JRE 1.6.0 IBM J9 2.4 Windows Server 2008 amd64-64
jvmwa6460sr6-20090923_42924 (JIT enabled, AOT enabled), published in Mar‟11 at
http://www.spec.org/osg/jbb2005/results/res2011q1/jbb2005-20110223-00958.html.
SPEC, SPECint, and SPECJBB are registered trademarks of the Standard Performance Evaluation Corporation. The results stated above
top reflect results published on http://www.spec.org/cpu2006/results/ as of 10/26/11
26 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
27. DATA: “THE HPC LEADER”
¹SPECfp
360, 2 x AMD Opteron™ processors Model 6276 in Supermicro A+ 1022-URFserver, 64GB (8 x 8GB DDR3-1600) memory, Red Hat Enterprise Linux 6.1 64-bit, x86 Open64 4.2.5.2
Compiler Suite
263, 2 x Intel Xeon processors Model X5670 in Cisco UCS B200 M2 server, 48GB (12 x 4GB DDR3-1333) memory, SUSE Linux® Enterprise Server 11 SP1 64-bit, Intel C++ Compiler
XE v12.0.1.116
http://www.spec.org/cpu2006/results/res2011q1/cpu2006-20110301-14786.html
STREAM
73, 2 x AMD Opteron™ processors Model 6276 in Supermicro H8DGT, 64GB (8 x 8GB DDR3-1600) memory, SuSE Linux® Enterprise Server 11 SP1 64-bit, x86 Open64 4.2.5-1
Compiler Suite
42, 2 x Intel Xeon processors Model X5670 in Supermicro X8DTT server, 24GB (6 x 4GB DDR3-1333) memory, SuSE Linux® Enterprise Server 11 SP1 64-bit, Intel Compiler v11.1.064
LINPACK
GFLOPS = # processors x processor clock speed x # cores per processor x # 32-bit operations per second
147.2, 1 x AMD Opteron™ processors Model 6276 x 16 cores x 2.3GHz x 4
70.32, 1 x Intel Xeon processors Model X5670 x 6 cores x 2.93GHz x 4
GROMACS
100s, Intel Xeon X5670, 1 node, 12 processes, 1 OMP, 1S, 1E, 40.67 GFLOPS, CC ICC 11.1, CCFLAGS-O3 -axS -no-prec-div, NUMA numactl, FFT fftw 3 MKL, PREC Single, MPI
OMPI 1.5.1, DIMM Spec 1333, DIMM Capacity(GB) 4, DIMM Count 6, Total Capacity (GB) 24, System SM X8DTT, OS SLES11 SP1, Kernel 2.6.32.12-0.7-default
84s, AMD Opteron 6276, 1 node, 32 processes, 1 OMP, 1S, 1E, 48.737 GFLOPS, GCC 4.60, CCFLAGS-O3 -O3 -funroll-loops, NUMA APPFILE, FFT 3.2.2, PREC Single, MPI OpenMPI
1.5.3, DIMM Spec 1600, DIMM Capacity(GB) 8, DIMM Count 4, Total Capacity (GB) 32, System SM H8DGT, OS SLES11 SP1, Kernel 2.6.32.12-0.7-default
27 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
28. DATA: “THE HPC LEADER” (CONT.)
LAMMPS
471s, Intel Xeon X5670, Trial 24, Nodes 1, CXX Intel 11.1.064, CXXFLAGS -O2 -funroll-loops -fstrict-aliasin, MPI OMPI 1.5.1 + knem 0.9.5, DIMM Speed (MHz) 1333, DIMM Capacity
(GB) 4, DIMM Count 6, Total Capacity (GB) 24, System SM X8DTT, OS SLES11 SP1, Kernel 2.6.32.12-0.7-default, Notes Turbo ON, HT OFF, -mca btl_sm_use_knem 1
333s, AMD Opteron 6276, Trial 9, Nodes 1, CXX openCC 4.2.5.2-1, CXXFLAGS -O3 -OPT:Ofast -OPT:rsqrt=2 -march=bdver1 -mavx -mfma4, MPI OpenMPI 1.5.3, DIMM Speed (MHz)
1600, DIMM Capacity (GB) 8, DIMM Count 8, Total Capacity (GB) 64, System SM H8DGT, OS SLES11 SP1, Kernel 2.6.32.28-fam15h-default, Notes hpc
NAMD
.636 day/ns, Intel Xeon X5670, Trial 40, NAMD Version 2.7, Nodes 1, MB memory 274.328, CC Intel 11.1, CCFLAGS -ip -fno-rtti -O3 -xSSE4.2 -no-prec-div, MPI OMPI 1.5.1, DIMM
Speed (MHz) 1333, DIMM Capacity (GB) 4, DIMM Count 6, Total Capacity (GB) 24, System SM X8DTT, OS SLES11 SP1, Kernel 2.6.32.12-0.7-default, Notes default
.375 day/ns, AMD Opteron 6276, Trial 77, NAMD Version 2.8, Nodes 1, MB memory 260.277, CC opencc 4.2.5-2.1, CCFLAGS -O3 -m64 -march=bdver1 -mfma4 -mavx -
CG:compute_to=ON -OPT:Olimit=40000, MPI OpenMPI 1.5.3+knem 0.9.6, DIMM Speed (MHz) 1600, DIMM Capacity (GB) 8, DIMM Count 8, Total Capacity (GB) 64, System SM
H8DGT, OS SLES11 SP1, Kernel 2.6.32.28-fam15h-default, Notes default
WRF
224s, Intel Xeon X5670, Trial 124, STEP NA, NODE 1, FC Intel 11.1.064, NETCDF 4.1.1, FCFLAGS -w -O3 -ip -xSSE4.2 -fp-model fast=2 -no-prec-div -no-prec-sqr, NUMA NA, MPI
OMPI 1.5.1 + knem 0.9.5, DIMM # 6, DIMM GB 4, DIMM MHz 2 1333, Total Capacity 24, System SM X8DTT, OS SLES11 SP1, Kernel 2.6.32.12-0.7-default, Notes -mca
btl_sm_use_knem 1
180s, AMD Opteron 6276, Trial 436, STEP B2g, NODE 1, FC open64 4.2.5-1, NETCDF 4.1.2, FCFLAGS -O3 -HP -march=bdver1 -mavx -mfma4 -DpgiFortran -OPT:unroll_size=256 -
LNO:blocking=off -LANG:copyinout=o, NUMA APP FILE, MPI OMPI 1.5.3, DIMM # 8, DIMM GB 8, DIMM MHz 2 1600, Total Capacity 64, System SM H8DGT, OS SLES11 SP1, Kernel
2.6.32.28-fam15h-default, Notes NA
2 Intel can only process one 128-bit instruction at a time. See page 13 of the Intel Hot Chips presentation (http://www.hotchips.org/archives/hc18/3_Tues/HC18.S9/HC18.S9T4.pdf).
AMD Opteron™ 6200 Series has FMAC that can execute an FMA4 execution (a=b+c*d) in one cycle vs. 2 cycles that would be required for FMA3 or standard SSE floating point
calculation. Intel cannot multiply and add in one clock cycle (FMA4 and XOP instructions). See page 9 of the Intel Hot Chips presentation
(http://www.hotchips.org/archives/hc18/3_Tues/HC18.S9/HC18.S9T4.pdf).
²GFLOPS = # processors x processor clock speed x # cores per processor x # 32-bit operations per second. 2P server using AMD Opteron 6282 SE can generate up to 358 GFLOPs. 21
(2P) servers per rack x 358 = 7518 GFLOPs per rack. 2P server using Intel Xeon 5690 can generate up to 166 GLOPS per server. Therefore, it take over 2 racks (over 45 Intel-based
servers) to generate 7518 GFLOPs.
Information on Intel‟s specs as of 9/26/11 at http://www.intc.com/priceList.cfm. AMD specs based on pre-production values.
28 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
29. DATA: THE HPC LEADER
3 STREAM (2P) AMD Opteron processor Model 6276 has 73% higher memory bandwidth than Intel Xeon processor Model X5670
– 73, 2 x AMD Opteron™ processors Model 6276 in Supermicro H8DGT, 64GB (8 x 8GB DDR3-1600) memory, SuSE Linux® Enterprise Server
11 SP1 64-bit, x86 Open64 4.2.5-1 Compiler Suite
– 42, 2 x Intel Xeon processors Model X5670 in Supermicro X8DTT server, 24GB (6 x 4GB DDR3-1333) memory, SuSE Linux® Enterprise Server
11 SP1 64-bit, Intel Compiler v11.1.064
4 Comparison of 12-core AMD Opteron™ processor Moel 6234 expected price of $377 at launch with 4-
core Intel Xeon E5603 price of $188 according to www.intel.com as of 11/4/11.
29 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
32. GET MORE WITH AMD FOR EVERY IT DOLLAR SPENT
Intel-based server AMD Opteron™-based server Advantage
$8,246 $7,915
Processor 2 x Xeon X5690 2 x AMD Opteron 6282SE 26% faster processor
Memory 48GB 64GB 33% more memory
RAS upgrades None RAID 5 Controller High availability
Redundant power supply
Hard Drives 1 x 300GB 3 x 300GB 3X the storage
Support 1yr 24x7 support Installation + 3yr 24x7 support 3X the support
upgrade upgrade
More hardware for the same IT budget
Pricing and specifications from www.hp.com as of 10/26/11. As of 10/26/11, pricing for the HP DL385 with AMD Opteron processor Model 6282 SE (1ku pricing $1019) is not yet
available. The price of an HP DL385 with AMD Opteron processors Model 6172 (1ku pricing $989) was used as a proxy because pricing for AMD Opteron 6100 and 6200 Series
processor-based servers is expected to be very similar. . Server configurations are identical except on the specifications listed above and include no OS.. Difference in processor
speeds is based on the top SPECint®_rate2006 results published by, or submitted by AMD to, SPEC as of 10/26/2011 for 2P servers with the above identified processors and
may differ from the performance of the servers priced and described above. For the latest results, visit www.spec.org. See backup slide for configuration details.
32 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |
33. BACKUP – SAME PRICE, 2-3X THE BENEFIT
526 using 2 x AMD Opteron™ processor Model 6282SE in Supermicro A+ 1022-URF server, 128GB (16 x
8GB DDR3-1333) memory, Red Hat Enterprise Linux Server release 6.1
419 using 2 x Intel Xeon X5690: http://www.spec.org/cpu2006/results/res2011q1/cpu2006-20110211-
14447.html
317 using 2 x AMD Opteron™ processor Model 6220 in Supermicro A+ 1022G-URF server, 64GB (8 x
8GB DDR3-1) memory, Red Hat Enterprise Linux Server release 6.1
253 using 2 x Intel Xeon E5640: http://www.spec.org/cpu2006/results/res2011q2/cpu2006-20110329-
15452.html
33 | AMD Opteron™ 6200/4200 Series Processors Launch Presentation | Under Embargo until 12:01 a.m. EST, November 14, 2011 |