SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
HSA FOR APPLICATION
PROGRAMMING
Wen-mei W. Hwu
CTO, MulticoreWare, Inc.
Professor University of Illinois, Urbana-Champaign
CURRENT GPU COMPUTING PAIN
POINTS


Kernel launch overhead



Limited virtual and physical memory space



Extra data movement in I/O and networking



Tedious host code



Multiple source code development
KERNEL LAUNCH OVERHEAD
DESIRED DATA TRANSFER BEHAVIOR
Main Memory
(DRAM)

Network I/O

CPU
Disk I/O

Device
Memory

DMA

GPU card
(or other Accelerator cards)
SAMOS 2013
ACTUAL DATA TRANSFER BEHAVIOR

Main Memory
(DRAM)

Each additional copy
diminishes applicationperceived bandwidth
Network I/O

CPU
Disk I/O

Device
Memory

DMA

GPU card
(or other Accelerator cards)
SAMOS 2013
HSA COMMON ADDRESS SPACE
STANDARDIZED USER-LEVEL QUEUES
LOW OVERHEAD KERNEL LAUNCH
DYNAMIC COMPILATION AND BINARY
COMPATIBLITY
FAMILIAR COMPUTATION ORGANIZATION
EXAMPLE: COMPUTER VISION
EXAMPLE EXECUTION TIME
TOOLS WORKING GROUP


Tools Group will be looking at foundation for Developer Tools (compilation,
debugging and performance analysis)




Working with the working group to insure the right interface are in place Compilation,
Debugging, and Performance Analysis initially
We act as stewards for



HSA functional simulator with GDB support (AMD/MCW)



HSA performance simulator (AMD)





LLVM backend for generating HSAIL (AMD/MCW)

Loader Library for Simulator - BRIG object loading support ( AMD)

Longer Terms will spawn sub-group that will drive requirement for language
support for HSA


C, C++, C++ AMP



Java (JVM, Dalvik)



Python



JavaScript



DSL…



…
Hwu 2013
CURRENT TIME TABLE


Kickoff Conference and Refined Charter –
November 2013



Tools Roadmap – December 2013



Initial Tools – December 2013



Initial Language Tool Chains – January 2013

Please join us and contribute!
Hwu 2013
HAS BOOK IN THE MAKING

Heterogeneous System Architecture
For Application Programming


Audience
Performance application developers
 System architects
 Component architects
 Software stack developers


Hwu 2013
EDITORIAL BOARD


Gaster, Benedict – Qualcomm



Hegde Manju - AMD



Hwu, Wen-mei - MulticoreWare/UIUC



Jablin, Thomas – MultcoreWare



Lokhmotov, Anton – ARM



Lu, Chien-Ping – MediaTek



Whitecotton, Bob - AMD

Hwu 2013
CURRENT TABLE OF CONTENTS


HSA Overview (50 pages)



Chapter 1: What is HSA? (Hwu/Jablin)



Chapter 2: The HSA Architecture (Hwu/Jablin/Others)



Chapter 3: A Programmer's View of HSA (Hwu/Jablin)





Chapter 4: The Tools Framework for HAS,
(Hwu/Jablin/Others)
Chapter 5: Mapping of Classical algorithms onto the
HSA Architecture (Hwu/Jablin)
Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)



Representative Workloads:
Chapter 6: Photography, Minh Do, Professor UIUC, CTO
Personify



Chapter 7: Video-Audio Search, Ren Wu, Baidu



Chapter 8: Augmented Reality Chein-ping Lu, MeidaTek





Chapter 9: Biometrics (Face Detection) Mike Jones, MERL
& Harris Gasparakis, AMD
Chapter 10: Audio-Video processing Bill Herz, Sr. Fellow
AMD
Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)











Chapter 11: Ray-Tracing, TBD
Chapter 12: Natural user Interfaces, Navneett
CEO Flutter

Dallal,

Chapter 13: Physics Processing, Ronald Fedkiw,
Professor Stanford U. and Eftychios Sifakis
Chapter 14: Graphics enhancement, Ignacio Vargas, CTO,
Nextlimit (Interested)
Chapter 15: Communications and networking, Don BanksCisco (Interested)
Chapter 16: Data/Business Analytics , Zubin Dowlaty,
CTO Mu Sigma
Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)









Chapter 17: Hadoop, Memcached and cloud frameworks,
Bharath Mundlapudi, CTO Orzota

Chapter 18: Bioinformatics Wu Feng, VirginiaTech
Chapter 19: Scientific Computations, Byunghyun Jang,
Univ Miss.
Chapter 20: Computer Aided Design and Engineering,
Martin Wong, Professor UIUC

Chapter 21: Oil and Gas, Nacho Navarro, UPC/BSC
(interested)

Hwu 2013
CURRENT TABLE OF CONTENTS (CONT.)








Chapter 22: Financial Services & Analysis, Surra
Yanamadala, CA VP
Chapter 23: Computer Vision facial expression,
Elnar Hajiyev, Realeye

Chapter 24: Legacy Code Interoperability, Wenmei and Tom Jablin, MCW
Chapter 25: Java for HSA, Gary Frost, AMD

Hwu 2013
MAJOR UPCOMING MILESTONES


Author Algorithm and Code Due – February 2014



Author Chapters Due – April 2014



Review Due Back to Authors – May 2014



Chapters to Elsevier – June 2014



Books available – October 2014

Hwu 2013
THANK YOU!
ANY MORE QUESTIONS?

Hwu 2013

Weitere ähnliche Inhalte

Andere mochten auch

SWFs and Sustainable Infrastructure
SWFs and Sustainable InfrastructureSWFs and Sustainable Infrastructure
SWFs and Sustainable InfrastructureJ_Schmitz
 
Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15Nevada Agriculture
 
2014.05.14 cofely fabricom maintenance mechanical workshops brochure
2014.05.14   cofely fabricom maintenance mechanical workshops brochure2014.05.14   cofely fabricom maintenance mechanical workshops brochure
2014.05.14 cofely fabricom maintenance mechanical workshops brochureStefanieDeWilde
 
Redex formato encuesta
Redex formato encuestaRedex formato encuesta
Redex formato encuestagohan_go
 
Science Update - No 259 - May 2015
Science Update - No 259 - May 2015Science Update - No 259 - May 2015
Science Update - No 259 - May 2015DOILibrary1151
 
Diplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical FemeninoDiplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical FemeninoComunidadMujer Cm
 
Kultura zientifikoa
Kultura zientifikoaKultura zientifikoa
Kultura zientifikoaiazpiro1
 
Los proyectos de los filósofos
Los proyectos de los filósofosLos proyectos de los filósofos
Los proyectos de los filósofosROUSEBARBER
 
Como Guardar y Consultar Presentaciones Zip, y Pdf
Como Guardar y Consultar Presentaciones  Zip, y  PdfComo Guardar y Consultar Presentaciones  Zip, y  Pdf
Como Guardar y Consultar Presentaciones Zip, y Pdfgrodriguezcu
 
Proyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección SocialProyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección SocialkRyss
 
Tipos de motores eléctricos
Tipos de motores eléctricosTipos de motores eléctricos
Tipos de motores eléctricosEIYSC
 
PEG y RCIU
PEG y RCIUPEG y RCIU
PEG y RCIUlablih
 
Sociedad anonima exposicion
Sociedad anonima exposicionSociedad anonima exposicion
Sociedad anonima exposicionkathom
 

Andere mochten auch (17)

SWFs and Sustainable Infrastructure
SWFs and Sustainable InfrastructureSWFs and Sustainable Infrastructure
SWFs and Sustainable Infrastructure
 
Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15Buy Nevada Holiday Gift Guide 2014-15
Buy Nevada Holiday Gift Guide 2014-15
 
Boletín Oficial 2013 08-16
Boletín Oficial 2013 08-16Boletín Oficial 2013 08-16
Boletín Oficial 2013 08-16
 
Analisis reforma
Analisis reforma Analisis reforma
Analisis reforma
 
2014.05.14 cofely fabricom maintenance mechanical workshops brochure
2014.05.14   cofely fabricom maintenance mechanical workshops brochure2014.05.14   cofely fabricom maintenance mechanical workshops brochure
2014.05.14 cofely fabricom maintenance mechanical workshops brochure
 
Redex formato encuesta
Redex formato encuestaRedex formato encuesta
Redex formato encuesta
 
Science Update - No 259 - May 2015
Science Update - No 259 - May 2015Science Update - No 259 - May 2015
Science Update - No 259 - May 2015
 
Diplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical FemeninoDiplomado Liderazgo Sindical Femenino
Diplomado Liderazgo Sindical Femenino
 
Kultura zientifikoa
Kultura zientifikoaKultura zientifikoa
Kultura zientifikoa
 
Los proyectos de los filósofos
Los proyectos de los filósofosLos proyectos de los filósofos
Los proyectos de los filósofos
 
Como Guardar y Consultar Presentaciones Zip, y Pdf
Como Guardar y Consultar Presentaciones  Zip, y  PdfComo Guardar y Consultar Presentaciones  Zip, y  Pdf
Como Guardar y Consultar Presentaciones Zip, y Pdf
 
Proyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección SocialProyecto Social: Cuidado y Protección Social
Proyecto Social: Cuidado y Protección Social
 
Tipos de motores eléctricos
Tipos de motores eléctricosTipos de motores eléctricos
Tipos de motores eléctricos
 
PEG y RCIU
PEG y RCIUPEG y RCIU
PEG y RCIU
 
GRUPO DE A TERCERA FASE: PROFOCOM
GRUPO DE A TERCERA FASE: PROFOCOMGRUPO DE A TERCERA FASE: PROFOCOM
GRUPO DE A TERCERA FASE: PROFOCOM
 
Sociedad anonima exposicion
Sociedad anonima exposicionSociedad anonima exposicion
Sociedad anonima exposicion
 
Whatsup
WhatsupWhatsup
Whatsup
 

Ähnlich wie HSA-4130, HSA for Application Programming, by Wen Mei Hwu

assignment3
assignment3assignment3
assignment3Kirti J
 
What it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesWhat it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesDataWorks Summit
 
Presentation
PresentationPresentation
Presentationbutest
 
Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010nzhang
 
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopBig Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopHazelcast
 
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : NotesIs Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : NotesSubhajit Sahu
 
Worldranking universities final documentation
Worldranking universities final documentationWorldranking universities final documentation
Worldranking universities final documentationBhadra Gowdra
 
"Big Data" Bioinformatics
"Big Data" Bioinformatics"Big Data" Bioinformatics
"Big Data" BioinformaticsBrian Repko
 
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...Scott Donald
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridEvert Lammerts
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational ScienceChelle Gentemann
 
Cloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodCloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodIRJET Journal
 
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23Tomasz Sikora
 
Overview of big data & hadoop v1
Overview of big data & hadoop   v1Overview of big data & hadoop   v1
Overview of big data & hadoop v1Thanh Nguyen
 
HSA Introduction Hot Chips 2013
HSA Introduction  Hot Chips 2013HSA Introduction  Hot Chips 2013
HSA Introduction Hot Chips 2013HSA Foundation
 
Hadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University TalksHadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University Talksyhadoop
 

Ähnlich wie HSA-4130, HSA for Application Programming, by Wen Mei Hwu (20)

assignment3
assignment3assignment3
assignment3
 
What it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesWhat it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! Perspectives
 
Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)
 
Presentation
PresentationPresentation
Presentation
 
Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010Hive @ Hadoop day seattle_2010
Hive @ Hadoop day seattle_2010
 
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopBig Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
 
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : NotesIs Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
Is Multicore Hardware For General-Purpose Parallel Processing Broken? : Notes
 
Hadoop
HadoopHadoop
Hadoop
 
Worldranking universities final documentation
Worldranking universities final documentationWorldranking universities final documentation
Worldranking universities final documentation
 
"Big Data" Bioinformatics
"Big Data" Bioinformatics"Big Data" Bioinformatics
"Big Data" Bioinformatics
 
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...A Collaborative Research Proposal To The NSF  Research Accelerator For Multip...
A Collaborative Research Proposal To The NSF Research Accelerator For Multip...
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG Grid
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
 
Cloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodCloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control Method
 
MY_PRE.pptx
MY_PRE.pptxMY_PRE.pptx
MY_PRE.pptx
 
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23Sjug #26   ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
Sjug #26 ml is in java but is dl too - ver1.04 - tomasz sikora 2018-03-23
 
HSA Introduction
HSA IntroductionHSA Introduction
HSA Introduction
 
Overview of big data & hadoop v1
Overview of big data & hadoop   v1Overview of big data & hadoop   v1
Overview of big data & hadoop v1
 
HSA Introduction Hot Chips 2013
HSA Introduction  Hot Chips 2013HSA Introduction  Hot Chips 2013
HSA Introduction Hot Chips 2013
 
Hadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University TalksHadoop at Yahoo! -- University Talks
Hadoop at Yahoo! -- University Talks
 

Mehr von AMD Developer Central

DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsDX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsAMD Developer Central
 
Leverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesLeverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesAMD Developer Central
 
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAn Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAMD Developer Central
 
Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceAMD Developer Central
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...AMD Developer Central
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozAMD Developer Central
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellAMD Developer Central
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonAMD Developer Central
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornAMD Developer Central
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevAMD Developer Central
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasAMD Developer Central
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...AMD Developer Central
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...AMD Developer Central
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14AMD Developer Central
 

Mehr von AMD Developer Central (20)

DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsDX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
 
Leverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesLeverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math Libraries
 
Introduction to Node.js
Introduction to Node.jsIntroduction to Node.js
Introduction to Node.js
 
Media SDK Webinar 2014
Media SDK Webinar 2014Media SDK Webinar 2014
Media SDK Webinar 2014
 
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAn Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
 
DirectGMA on AMD’S FirePro™ GPUS
DirectGMA on AMD’S  FirePro™ GPUSDirectGMA on AMD’S  FirePro™ GPUS
DirectGMA on AMD’S FirePro™ GPUS
 
Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop Intelligence
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
 
Inside XBox- One, by Martin Fuller
Inside XBox- One, by Martin FullerInside XBox- One, by Martin Fuller
Inside XBox- One, by Martin Fuller
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas Thibieroz
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
 
Gcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodesGcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodes
 
Inside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin FullerInside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin Fuller
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan Nevraev
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
 

Kürzlich hochgeladen

Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 

Kürzlich hochgeladen (20)

Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 

HSA-4130, HSA for Application Programming, by Wen Mei Hwu

  • 1. HSA FOR APPLICATION PROGRAMMING Wen-mei W. Hwu CTO, MulticoreWare, Inc. Professor University of Illinois, Urbana-Champaign
  • 2. CURRENT GPU COMPUTING PAIN POINTS  Kernel launch overhead  Limited virtual and physical memory space  Extra data movement in I/O and networking  Tedious host code  Multiple source code development
  • 4. DESIRED DATA TRANSFER BEHAVIOR Main Memory (DRAM) Network I/O CPU Disk I/O Device Memory DMA GPU card (or other Accelerator cards) SAMOS 2013
  • 5. ACTUAL DATA TRANSFER BEHAVIOR Main Memory (DRAM) Each additional copy diminishes applicationperceived bandwidth Network I/O CPU Disk I/O Device Memory DMA GPU card (or other Accelerator cards) SAMOS 2013
  • 9. DYNAMIC COMPILATION AND BINARY COMPATIBLITY
  • 12.
  • 13.
  • 14.
  • 15.
  • 17. TOOLS WORKING GROUP  Tools Group will be looking at foundation for Developer Tools (compilation, debugging and performance analysis)   Working with the working group to insure the right interface are in place Compilation, Debugging, and Performance Analysis initially We act as stewards for   HSA functional simulator with GDB support (AMD/MCW)  HSA performance simulator (AMD)   LLVM backend for generating HSAIL (AMD/MCW) Loader Library for Simulator - BRIG object loading support ( AMD) Longer Terms will spawn sub-group that will drive requirement for language support for HSA  C, C++, C++ AMP  Java (JVM, Dalvik)  Python  JavaScript  DSL…  … Hwu 2013
  • 18. CURRENT TIME TABLE  Kickoff Conference and Refined Charter – November 2013  Tools Roadmap – December 2013  Initial Tools – December 2013  Initial Language Tool Chains – January 2013 Please join us and contribute! Hwu 2013
  • 19. HAS BOOK IN THE MAKING Heterogeneous System Architecture For Application Programming  Audience Performance application developers  System architects  Component architects  Software stack developers  Hwu 2013
  • 20. EDITORIAL BOARD  Gaster, Benedict – Qualcomm  Hegde Manju - AMD  Hwu, Wen-mei - MulticoreWare/UIUC  Jablin, Thomas – MultcoreWare  Lokhmotov, Anton – ARM  Lu, Chien-Ping – MediaTek  Whitecotton, Bob - AMD Hwu 2013
  • 21. CURRENT TABLE OF CONTENTS  HSA Overview (50 pages)  Chapter 1: What is HSA? (Hwu/Jablin)  Chapter 2: The HSA Architecture (Hwu/Jablin/Others)  Chapter 3: A Programmer's View of HSA (Hwu/Jablin)   Chapter 4: The Tools Framework for HAS, (Hwu/Jablin/Others) Chapter 5: Mapping of Classical algorithms onto the HSA Architecture (Hwu/Jablin) Hwu 2013
  • 22. CURRENT TABLE OF CONTENTS (CONT.)   Representative Workloads: Chapter 6: Photography, Minh Do, Professor UIUC, CTO Personify  Chapter 7: Video-Audio Search, Ren Wu, Baidu  Chapter 8: Augmented Reality Chein-ping Lu, MeidaTek   Chapter 9: Biometrics (Face Detection) Mike Jones, MERL & Harris Gasparakis, AMD Chapter 10: Audio-Video processing Bill Herz, Sr. Fellow AMD Hwu 2013
  • 23. CURRENT TABLE OF CONTENTS (CONT.)       Chapter 11: Ray-Tracing, TBD Chapter 12: Natural user Interfaces, Navneett CEO Flutter Dallal, Chapter 13: Physics Processing, Ronald Fedkiw, Professor Stanford U. and Eftychios Sifakis Chapter 14: Graphics enhancement, Ignacio Vargas, CTO, Nextlimit (Interested) Chapter 15: Communications and networking, Don BanksCisco (Interested) Chapter 16: Data/Business Analytics , Zubin Dowlaty, CTO Mu Sigma Hwu 2013
  • 24. CURRENT TABLE OF CONTENTS (CONT.)      Chapter 17: Hadoop, Memcached and cloud frameworks, Bharath Mundlapudi, CTO Orzota Chapter 18: Bioinformatics Wu Feng, VirginiaTech Chapter 19: Scientific Computations, Byunghyun Jang, Univ Miss. Chapter 20: Computer Aided Design and Engineering, Martin Wong, Professor UIUC Chapter 21: Oil and Gas, Nacho Navarro, UPC/BSC (interested) Hwu 2013
  • 25. CURRENT TABLE OF CONTENTS (CONT.)     Chapter 22: Financial Services & Analysis, Surra Yanamadala, CA VP Chapter 23: Computer Vision facial expression, Elnar Hajiyev, Realeye Chapter 24: Legacy Code Interoperability, Wenmei and Tom Jablin, MCW Chapter 25: Java for HSA, Gary Frost, AMD Hwu 2013
  • 26. MAJOR UPCOMING MILESTONES  Author Algorithm and Code Due – February 2014  Author Chapters Due – April 2014  Review Due Back to Authors – May 2014  Chapters to Elsevier – June 2014  Books available – October 2014 Hwu 2013
  • 27. THANK YOU! ANY MORE QUESTIONS? Hwu 2013