X265 OPEN SOURCE H.265 ENCODER
OPTIMIZATION DETAILS
  
	
  
HEVC wrinkles
  
H.265/HEVC FINALIZED JANUARY 25, 2013
NOTABLE CHANGES FROM H.264

• H.264’s 16x16 macroblocks replaced with 64x64 CUs and quadtrees
  ‒ The coding quadtree can be recursively split down to 8x8 blocks
  ‒ At all levels, the coding blocks can choose inter or intra prediction
  ‒ The final coding blocks can be further split into prediction units (PUs)
  ‒ The residual is signaled in a second quadtree, which can have more depth than the coding QT
• Inter prediction has more accuracy
  ‒ HPEL filter has 8 taps, QPEL has 7 taps (H.264 has a 6-tap HPEL filter and averaged QPEL)
  ‒ Merge candidates replace H.264's direct and skip modes
  ‒ AMVP allows the motion vector predictor to be selected from a list; in H.264 it was entirely implicit
  
4 | PRESENTATION TITLE | NOVEMBER 19, 2013 | CONFIDENTIAL
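
As a rough illustration of the coding quadtree described on the slide above, the sketch below recursively splits a 64x64 block down to a minimum of 8x8. The names `split_ctu` and `should_split` are hypothetical, and the split rule is a stand-in: a real encoder would decide each split by comparing rate-distortion costs.

```python
def split_ctu(x, y, size, should_split, min_size=8):
    """Return the leaf coding blocks of a quadtree as (x, y, size) tuples.

    should_split(x, y, size) is a caller-supplied decision function
    (hypothetical; an encoder would compare rate-distortion costs here).
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        leaves = []
        # Visit the quadrants top-left, top-right, bottom-left, bottom-right.
        for dy in (0, half):
            for dx in (0, half):
                leaves += split_ctu(x + dx, y + dy, half, should_split, min_size)
        return leaves
    return [(x, y, size)]


# Toy rule: keep splitting only the block that touches the origin.
leaves = split_ctu(0, 0, 64, lambda x, y, size: x == 0 and y == 0)
for leaf in leaves:
    print(leaf)
```

With this toy rule the 64x64 block ends up as four 8x8, three 16x16 and three 32x32 leaves, which together still cover the full 64x64 area.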
  
H.265/HEVC FINALIZED JANUARY 25, 2013
NOTABLE CHANGES FROM H.264

• More intra predictions
  ‒ DC and planar modes, similar to H.264
  ‒ 33 angular predictions with emphasis on near-vertical and near-horizontal angles
  ‒ 35 predictions in total (for all block sizes from 32x32 to 4x4), with only a few special cases
• Sample Adaptive Offset loop filter for reduced compression artifacts
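
The mode count on this slide can be tallied with a small sketch. It assumes the HEVC mode numbering (0 for planar, 1 for DC, 2 through 34 for the angular directions); the helper name `intra_mode_name` is made up for illustration.

```python
PLANAR, DC = 0, 1
ANGULAR_MODES = range(2, 35)  # the 33 angular directions


def intra_mode_name(mode):
    """Map an intra mode index to a readable name (illustrative helper)."""
    if mode == PLANAR:
        return "planar"
    if mode == DC:
        return "DC"
    if mode in ANGULAR_MODES:
        return f"angular {mode}"
    raise ValueError(f"not an intra mode index: {mode}")


all_modes = [intra_mode_name(m) for m in range(35)]
print(len(ANGULAR_MODES), "angular +", 2, "non-angular =", len(all_modes))
```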
  

  
H.265/HEVC PARALLELIZATION CONSIDERATIONS
NOTABLE CHANGES FROM H.264

• Wavefront Parallel Processing (WPP)
  ‒ Each row of largest CU blocks can be encoded in parallel, with a two-block lag behind the row above
  ‒ The CABAC state of block 2 is communicated to block 0 of the row below
  ‒ <1% loss of compression efficiency, much more efficient than slices or tiles
• Tiles: split each frame into regular rectangular parts, encode each in parallel
• Deblocking only on 8x8 boundaries, and better ordering of operations
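
The two-block lag can be made concrete with a toy schedule. The sketch below is only a model, not encoder code: it assumes every block takes one time step and that unlimited threads are available, and the function names are invented for this example.

```python
from collections import Counter


def wavefront_schedule(rows, cols, lag=2):
    """Earliest start step of each block, assuming unit-time blocks."""
    start = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            ready = 0
            if c > 0:
                # The left neighbour in the same row must be finished.
                ready = max(ready, start[r][c - 1] + 1)
            if r > 0:
                # The row above must be `lag` blocks ahead (clamped at row end).
                dep = min(c + lag - 1, cols - 1)
                ready = max(ready, start[r - 1][dep] + 1)
            start[r][c] = ready
    return start


def total_steps(start):
    return start[-1][-1] + 1


def peak_parallelism(start):
    return max(Counter(t for row in start for t in row).values())


# 1080p with 64x64 blocks: 30 columns by 17 rows.
schedule = wavefront_schedule(17, 30)
print(total_steps(schedule), peak_parallelism(schedule))
```

Under these idealized assumptions, the 510 blocks of a 1080p frame finish in 62 steps with at most 15 blocks in flight at once. Real block times vary, so actual gains are smaller.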
  
  
H.265/HEVC PARALLELIZATION CONSIDERATIONS
THE FINE PRINT

• Larger block sizes reduce the effectiveness of frame parallelism
  ‒ Only a quarter as many block rows as H.264 for the same video resolution
  ‒ After accounting for deblocking and SAO, there is a three-row (192-line) lag between references
  ‒ Wavefront analysis or tiles must be used in conjunction with frame parallelism to make up for this
  ‒ A high percentage of B frames relative to P frames alleviates this bottleneck
• Large blocks increase serial operations and add longer data dependencies
  ‒ Each CU in the quadtree must be analyzed in Z-scan order
  ‒ Since each CU can choose intra, all prior blocks must generate reconstructed pixels; no shortcuts
  ‒ Variations in CU encode times reduce the effectiveness of wavefront analysis by causing stalls
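
The row counts behind the first bullet are simple arithmetic. The sketch below works them out for a few common frame heights, assuming 16x16 macroblocks for H.264 and 64x64 blocks for HEVC; `block_rows` is a helper named only for this example.

```python
def block_rows(height, block_size):
    """Number of block rows needed to cover `height` lines (ceiling division)."""
    return -(-height // block_size)


for height in (720, 1080, 2160):
    mb_rows = block_rows(height, 16)   # H.264 macroblock rows
    ctu_rows = block_rows(height, 64)  # HEVC largest-CU rows
    print(f"{height}p: {mb_rows} macroblock rows vs {ctu_rows} 64x64 rows")

# A three-row lag between reference frames, expressed in pixel lines.
lag_lines = 3 * 64
print(lag_lines)
```

For 1080p this gives 68 macroblock rows against 17 rows of 64x64 blocks, so a lag of three rows is a much larger share of the frame than it would be in H.264.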
  

  
Introducing x265
  
X265 – A SHORT HISTORY

• x265 Consortium founded in April of 2013
  ‒ Dual commercial and GPLv2+ license
  ‒ Development primarily centered in Chennai, India, with contributions from China and the US
  ‒ Started from the HEVC reference encoder (HM); less than half of the HM source remains today
  ‒ Achieved 1080p at 15 fps in June
  ‒ Public announcement and first open source release in July
• Optimizations
  ‒ WPP wavefront CTU analysis and frame parallelism
  ‒ Compiler-intrinsic SIMD-based performance primitives
  ‒ Hand-written assembly performance primitives
  ‒ Data flow improvements, early outs, RDO reductions
• Today
  ‒ 1080p@30fps or 720p@200fps on a 16-core Sandy Bridge Xeon

  
X265 – A SHORT HISTORY

• Ecosystem
  ‒ Licensed to reuse x264 source code and algorithms
  ‒ Open development on mailing list and IRC
  ‒ Public repositories on Bitbucket and VideoLan.org
  ‒ Integration into VLC, libav, ffmpeg, and Handbrake in various stages of completion
• x264 feature adoption
  ‒ Lookahead / slicetype decision and scene cut detection
  ‒ Motion estimation and bitcost functions
  ‒ CLI interface and public C interface
  ‒ Assembly primitives for SAD, SATD, SSD, etc.
  ‒ ABR and CRF rate control; VBV adoption in progress by an open source contributor
• It took eight years for x264 to dominate the H.264 encoding market
  ‒ We would like to achieve dominance in the HEVC market sooner
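
For readers unfamiliar with the primitives named above, here is a plain reference sketch of what SAD, SSD and SATD compute on a pair of 4x4 blocks. It is not the x264 or x265 code, which uses hand-tuned assembly. The SATD here is an unnormalized 4x4 Hadamard version, and encoders may scale the result differently.

```python
def diff(a, b):
    """Element-wise difference of two equally sized blocks (lists of rows)."""
    return [[pa - pb for pa, pb in zip(ra, rb)] for ra, rb in zip(a, b)]


def sad(a, b):
    """Sum of absolute differences."""
    return sum(abs(d) for row in diff(a, b) for d in row)


def ssd(a, b):
    """Sum of squared differences."""
    return sum(d * d for row in diff(a, b) for d in row)


H4 = [[1, 1, 1, 1],
      [1, -1, 1, -1],
      [1, 1, -1, -1],
      [1, -1, -1, 1]]


def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]


def satd4x4(a, b):
    """Sum of absolute Hadamard-transformed differences (unnormalized)."""
    # H4 is symmetric, so H4 * D * H4 is the 2-D Hadamard transform of D.
    t = matmul(matmul(H4, diff(a, b)), H4)
    return sum(abs(c) for row in t for c in row)


flat_a = [[10] * 4 for _ in range(4)]
flat_b = [[7] * 4 for _ in range(4)]
print(sad(flat_a, flat_b), ssd(flat_a, flat_b), satd4x4(flat_a, flat_b))
```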
  

  
Encoding and GPUs
  
GPU CONSIDERATIONS
A SAD HISTORY

• Historically, GPUs have been poor for video encoding
  ‒ Intra prediction requires blocks above and to the left to be fully encoded and decoded
  ‒ Inter prediction requires blocks above and to the left to be fully analyzed
  ‒ Rate distortion optimizations require all blocks to be encoded in scan order
  ‒ Together, these dependencies severely limit the amount of parallelism that can be exposed to the GPU
• Encoder data dependencies are complex
  ‒ The cost of copying data to and from GPU device memory generally outweighs any performance improvements
  ‒ Even zero-copy memory is insufficient; the CPU and GPU must share structures at full speed
• Previous attempts at GPU encoding take shortcuts
  ‒ One can ignore some of these dependencies at the cost of compression efficiency and quality
  ‒ In x264, we only used the GPU for lookahead analysis, which has no intra or RDO dependencies
  

  
APU CONSIDERATIONS
A WELL BALANCED COMPUTE PROCESSOR

• Heterogeneous architecture
  ‒ GPU compute units can perform high-bandwidth and highly parallel operations
  ‒ CPU performs necessary serial and logistical operations
  ‒ CPU and GPU can see each other’s memory
• x265 opportunity
  ‒ Via WPP and frame parallelism we can expose two dozen parallel CU blocks to be encoded
  ‒ Each parallel CU block requires recursive analysis
  ‒ Control must transfer between the CPU and GPU many times to complete analysis
  ‒ GPU performs all cost estimates for intra and inter compression, loop filters, and pixel weighting
  ‒ CPU makes QT split and encode decisions and performs entropy encoding and dependency tracking
  ‒ Many CUs can be busy on the GPU at once; only four may use the CPU cores at a time
  ‒ Making use of the GPU compute units with minimal CPU overhead is the key

  
DISCLAIMER & ATTRIBUTION

The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors.

The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes.

AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION.

AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.

ATTRIBUTION
© 2013 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. SPEC is a registered trademark of the Standard Performance Evaluation Corporation (SPEC). Other names are for informational purposes only and may be trademarks of their respective owners.
  
  

MM-4096, x265: Open Source H.265/HEVC Video Encoder, by Steve Borho

  • 1. X265 OPEN SOURCE H.265 ENCODER OPTIMIZATION DETAILS
  • 4. H.265/HEVC FINALIZED JANUARY 25, 2013: NOTABLE CHANGES FROM H.264
      • H.264’s 16x16 macroblocks replaced with 64x64 CUs and QuadTrees
        ‒ Coding QuadTree can be recursively split down to 8x8 blocks
        ‒ At all levels, the coding blocks can choose inter or intra prediction
        ‒ The final coding blocks can be further split
        ‒ The residual is signaled in a second QuadTree, which can have more depth than the coding QT
      • Inter prediction has more accuracy
        ‒ HPEL filter has 8 taps, QPEL has 7 taps (H.264 has 6-tap HPEL and averaged QPEL)
        ‒ Merge candidates replace the H.264 direct and skip modes
        ‒ AMVP allows the motion predictor to be selected from a list; in H.264 it was entirely implicit
      4 | PRESENTATION TITLE | NOVEMBER 19, 2013 | CONFIDENTIAL
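The recursive coding QuadTree on slide 4 can be sketched roughly as follows. This is only an illustration under stated assumptions: `split_ctu`, `should_split` and `corner_only` are hypothetical names, and the split decision is a stand-in callback, not the rate-distortion analysis a real encoder such as x265 performs.

```python
from typing import Callable, List, Tuple

Block = Tuple[int, int, int]  # (x, y, size) of one coding block, in pixels


def split_ctu(x: int, y: int, size: int,
              should_split: Callable[[int, int, int], bool],
              min_size: int = 8) -> List[Block]:
    """Return the leaf coding blocks of one 64x64 block, in Z-scan order.

    `should_split` is a hypothetical stand-in for the encoder's real
    split decision; it is not part of any actual encoder API.
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        leaves: List[Block] = []
        # Quadrants are visited top-left, top-right, bottom-left, bottom-right.
        for dy in (0, half):
            for dx in (0, half):
                leaves += split_ctu(x + dx, y + dy, half, should_split, min_size)
        return leaves
    # Either the minimum size (8x8) was reached or no split was requested.
    return [(x, y, size)]


def corner_only(x: int, y: int, size: int) -> bool:
    """Toy rule (an assumption for the demo): keep splitting the top-left corner."""
    return x == 0 and y == 0


leaves = split_ctu(0, 0, 64, corner_only)
for block in leaves:
    print(block)
```

With the toy rule, the top-left corner is refined down to four 8x8 blocks while the rest of the 64x64 area stays in larger blocks, and the leaves together always cover the whole area exactly once.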
  • 5. H.265/HEVC FINALIZED JANUARY 25, 2013: NOTABLE CHANGES FROM H.264
      • More intra predictions
        ‒ DC and planar modes, similar to H.264
        ‒ 33 angular predictions with emphasis on near-vertical and near-horizontal angles
        ‒ 35 predictions in total (for all block sizes from 32x32 to 4x4) but few special cases
      • Sample Adaptive Offset loop filter for reduced compression artifacts
  • 6. H.265/HEVC PARALLELIZATION CONSIDERATIONS: NOTABLE CHANGES FROM H.264
      • WaveFront Parallel Processing
        ‒ Each row of largest CU blocks can be encoded in parallel, with a two-block lag to the row above
        ‒ The CABAC state of block 2 is communicated to block 0 of the row below
        ‒ <1% loss of compression efficiency, much more efficient than slices or tiles
      • Tiles: split each frame into regular rectangular parts, encode each in parallel
      • Deblocking only on 8x8 boundaries, and better ordering of operations
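The two-block lag of wavefront parallel processing can be illustrated with a small scheduling model. This is a sketch under simplifying assumptions, not x265's scheduler: every block is assumed to take exactly one time step, there are unlimited worker threads, and `wpp_start_times` and `peak_parallelism` are hypothetical helper names.

```python
from collections import Counter
from typing import List


def wpp_start_times(rows: int, cols: int) -> List[List[int]]:
    """Earliest start step of each largest-CU block under the WPP rule.

    Simplifying assumption: every block takes exactly one step.
    """
    start = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            ready = 0
            if c > 0:
                # The left neighbour in the same row must be finished.
                ready = max(ready, start[r][c - 1] + 1)
            if r > 0:
                # The row above must stay two blocks ahead (clamped at the row end).
                above = min(c + 1, cols - 1)
                ready = max(ready, start[r - 1][above] + 1)
            start[r][c] = ready
    return start


def peak_parallelism(start: List[List[int]]) -> int:
    """Largest number of blocks that share the same start step."""
    counts = Counter(t for row in start for t in row)
    return max(counts.values())


# 1920x1080 with 64x64 blocks: 30 columns and 17 rows (1080 / 64, rounded up).
times = wpp_start_times(rows=17, cols=30)
print(times[1][0], times[16][29], peak_parallelism(times))
```

In this idealized model a block at row r, column c starts at step c + 2r, so the wavefront alone never keeps more than about half a row's worth of blocks busy at once.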
  • 7. H.265/HEVC PARALLELIZATION CONSIDERATIONS: THE FINE PRINT
      • Larger block sizes reduce the effectiveness of frame parallelism
        ‒ Only a quarter as many block rows as H.264 for the same video resolution
        ‒ After accounting for deblocking and SAO, there is a three-row (192-line) lag between references
        ‒ Wavefront analysis or tiles must be used in conjunction with frame parallelism to make up for this
        ‒ A high ratio of B frames to P frames alleviates this bottleneck
      • Large blocks increase serial operations and add longer data dependencies
        ‒ Each CU in the quad-tree must be analyzed in Z-scan order
        ‒ Since each CU can choose intra, all prior blocks must generate recon pixels: no shortcuts
        ‒ Variations in CU encode times reduce the effectiveness of wavefront analysis by causing stalls
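The row-count arithmetic behind slide 7 can be checked with a few lines. This is a back-of-the-envelope sketch that assumes 1080p video, 16x16 macroblocks for H.264 and 64x64 blocks for HEVC; `block_rows` is a hypothetical helper name.

```python
import math


def block_rows(height: int, block_size: int) -> int:
    """Number of block rows needed to cover `height` lines of video."""
    return math.ceil(height / block_size)


mb_rows = block_rows(1080, 16)    # H.264 macroblock rows
ctu_rows = block_rows(1080, 64)   # HEVC 64x64 block rows
lag_lines = 3 * 64                # three-row reference lag, in lines

print(mb_rows, ctu_rows, lag_lines)
```

For 1080p this gives 68 macroblock rows against 17 rows of 64x64 blocks, which is the quarter mentioned on the slide, and a lag of 192 lines out of 1080 before a dependent frame can start referencing.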
  • 9. X265 – A SHORT HISTORY
      • x265 Consortium founded in April of 2013
        ‒ Dual commercial and GPLv2+ license
        ‒ Development primarily centered in Chennai, India, with contributions from China and the US
        ‒ Started from the HEVC reference encoder (HM); less than half of the HM source remains today
        ‒ Achieved 1080p 15fps in June
        ‒ Public announcement and first open source release in July
      • Optimizations
        ‒ WPP wavefront CTU analysis and frame parallelism
        ‒ Compiler intrinsic SIMD based performance primitives
        ‒ Hand-written assembly performance primitives
        ‒ Data flow improvements, early outs, RDO reductions
      • Today
        ‒ 1080p@30fps or 720p@200fps on 16-core SandyBridge Xeon
  • 10. X265 – A SHORT HISTORY
      • Ecosystem
        ‒ Licensed to reuse x264 source code and algorithms
        ‒ Open development on mailing list and IRC
        ‒ Public repositories on Bitbucket and VideoLan.org
        ‒ Integration into VLC, libav, ffmpeg, and Handbrake in various stages of completion
      • x264 feature adoption
        ‒ Lookahead / slicetype decision and scene cut detection
        ‒ Motion estimation and bitcost functions
        ‒ CLI interface and public C interface
        ‒ Assembly primitives for SAD, SATD, SSD, etc.
        ‒ ABR and CRF rate control; VBV adoption in progress by an open source contributor
      • It took eight years for x264 to dominate the H.264 encoding market
        ‒ We would like to achieve dominance in the HEVC market sooner
  • 12. GPU CONSIDERATIONS: A SAD HISTORY
      • Historically, GPUs have been poor for video encoding
        ‒ Intra prediction requires blocks above and to the left to be fully encoded and decoded
        ‒ Inter prediction requires blocks above and to the left to be fully analyzed
        ‒ Rate distortion optimizations require all blocks to be encoded in scan order
        ‒ Together, these dependencies severely limit the amount of parallelism that can be exposed to the GPU
      • Encoder data dependencies are complex
        ‒ Copying data to and from GPU device memory generally outweighs any performance improvements
        ‒ Even zero-copy memory is insufficient; the CPU and GPU must share structures at full speed
      • Previous attempts at GPU encoding take shortcuts
        ‒ One can ignore some of these dependencies at the cost of compression efficiency and quality
        ‒ In x264, we only used the GPU for lookahead analysis, which has no intra and RDO dependencies
  • 13. APU CONSIDERATIONS: A WELL BALANCED COMPUTE PROCESSOR
      • Heterogeneous architecture
        ‒ GPU compute units can perform high-bandwidth and highly parallel operations
        ‒ CPU performs necessary serial and logistical operations
        ‒ CPU and GPU can see each other’s memory
      • x265 opportunity
        ‒ Via WPP and frame parallelism we can expose two dozen parallel CU blocks to be encoded
        ‒ Each parallel CU block requires recursive analysis
        ‒ Control must transfer between the CPU and GPU many times to complete analysis
        ‒ GPU performs all cost estimates for intra and inter compression, loop filters, and pixel weighting
        ‒ CPU makes QT split and encode decisions and handles entropy encoding and dependency tracking
        ‒ Many CUs can be busy on the GPU at once; only four may use the CPU cores at a time
        ‒ Making use of the GPU compute units with minimal CPU overhead is the key
  • 14. DISCLAIMER & ATTRIBUTION
      The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors.
      The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes.
      AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION.
      AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
      ATTRIBUTION
      © 2013 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. SPEC is a registered trademark of the Standard Performance Evaluation Corporation (SPEC). Other names are for informational purposes only and may be trademarks of their respective owners.