HSA FOR APPLICATION PROGRAMMING
Wen-mei W. Hwu
CTO, MulticoreWare, Inc.
Professor, University of Illinois at Urbana-Champaign
CURRENT GPU COMPUTING PAIN POINTS
• Kernel launch overhead
• Limited virtual and physical memory space
• Extra data movement in I/O and networking
• Tedious host code
• Multiple source code development
KERNEL LAUNCH OVERHEAD
DESIRED DATA TRANSFER BEHAVIOR
[Figure: diagram labeled Main Memory (DRAM), Network I/O, CPU, Disk I/O, DMA, and Device Memory on a GPU card (or other accelerator cards).]
SAMOS 2013
ACTUAL DATA TRANSFER BEHAVIOR
[Figure: diagram labeled Main Memory (DRAM), Network I/O, CPU, Disk I/O, DMA, and Device Memory on a GPU card (or other accelerator cards).]
Each additional copy diminishes application-perceived bandwidth.
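The effect of extra copies on perceived bandwidth can be made concrete with a little arithmetic. The sketch below is illustrative only: it assumes every copy moves the full payload and that the copies run back to back without overlapping, and the function name and the 8 GB/s figures are hypothetical rather than measurements from the talk.

```python
def effective_bandwidth(stage_bandwidths):
    """Bandwidth the application sees when a payload is copied once per stage.

    Assumption: copies are serialized (no pipelining), so the time to move
    one unit of data is the sum of the per-stage times.
    """
    if not stage_bandwidths:
        raise ValueError("need at least one copy stage")
    time_per_unit = sum(1.0 / b for b in stage_bandwidths)
    return 1.0 / time_per_unit


# Hypothetical link speeds in GB/s, chosen only for illustration.
one_copy = effective_bandwidth([8.0])               # direct transfer
two_copies = effective_bandwidth([8.0, 8.0])        # one staging buffer
three_copies = effective_bandwidth([8.0, 8.0, 8.0]) # two staging buffers

print(one_copy, two_copies, three_copies)
```

Under these assumptions, each extra copy at the same speed divides the perceived bandwidth further: one staging copy halves it, and two staging copies cut it to a third.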
HSA COMMON ADDRESS SPACE
STANDARDIZED USER-LEVEL QUEUES
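As a rough illustration of the user-level queue idea, the sketch below models a queue as a ring buffer that the application fills directly, followed by a doorbell update that tells the consumer how far it may read. It is a single-threaded toy: the class, method, and packet names are hypothetical, and it is not the HSA runtime API.

```python
class UserQueue:
    """Toy single-producer, single-consumer ring buffer (hypothetical names)."""

    def __init__(self, size):
        self.slots = [None] * size  # packet storage visible to both sides
        self.size = size
        self.write_index = 0        # next slot the application fills
        self.read_index = 0         # next slot the consumer processes
        self.doorbell = 0           # last write index announced to the consumer

    def enqueue(self, packet):
        """Application side: write a packet, then ring the doorbell."""
        if self.write_index - self.read_index == self.size:
            return False            # queue full; caller retries later
        self.slots[self.write_index % self.size] = packet
        self.write_index += 1
        self.doorbell = self.write_index  # plain memory update, no OS call
        return True

    def drain(self):
        """Stand-in for the device: consume everything the doorbell announced."""
        done = []
        while self.read_index < self.doorbell:
            done.append(self.slots[self.read_index % self.size])
            self.read_index += 1
        return done


queue = UserQueue(2)
accepted = [queue.enqueue(p) for p in ("kernel_a", "kernel_b", "kernel_c")]
first_batch = queue.drain()
queue.enqueue("kernel_c")
second_batch = queue.drain()
print(accepted, first_batch, second_batch)
```

The point of the sketch is that submitting work consists only of memory writes by the application; a real implementation would also need atomic index updates and memory ordering between producer and consumer.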
LOW OVERHEAD KERNEL LAUNCH
DYNAMIC COMPILATION AND BINARY COMPATIBILITY
FAMILIAR COMPUTATION ORGANIZATION
EXAMPLE: COMPUTER VISION
EXAMPLE EXECUTION TIME
TOOLS WORKING GROUP
• The Tools Group will look at the foundation for developer tools (compilation, debugging, and performance analysis)
• Working with the working group to ensure the right interfaces are in place, initially for compilation, debugging, and performance analysis
• We act as stewards for:
  • HSA functional simulator with GDB support (AMD/MCW)
  • HSA performance simulator (AMD)
  • LLVM backend for generating HSAIL (AMD/MCW)
  • Loader library for the simulator: BRIG object loading support (AMD)
• Longer term, the group will spawn a sub-group that will drive requirements for language support for HSA:
  • C, C++, C++ AMP
  • Java (JVM, Dalvik)
  • Python
  • JavaScript
  • DSL…
  • …
Hwu 2013
CURRENT TIME TABLE
• Kickoff Conference and Refined Charter – November 2013
• Tools Roadmap – December 2013
• Initial Tools – December 2013
• Initial Language Tool Chains – January 2013
Please join us and contribute!
HSA BOOK IN THE MAKING
Heterogeneous System Architecture for Application Programming
• Audience
  • Performance application developers
  • System architects
  • Component architects
  • Software stack developers
EDITORIAL BOARD
• Gaster, Benedict – Qualcomm
• Hegde, Manju – AMD
• Hwu, Wen-mei – MulticoreWare/UIUC
• Jablin, Thomas – MulticoreWare
• Lokhmotov, Anton – ARM
• Lu, Chien-Ping – MediaTek
• Whitecotton, Bob – AMD
CURRENT TABLE OF CONTENTS
HSA Overview (50 pages)
• Chapter 1: What is HSA? (Hwu/Jablin)
• Chapter 2: The HSA Architecture (Hwu/Jablin/Others)
• Chapter 3: A Programmer's View of HSA (Hwu/Jablin)
• Chapter 4: The Tools Framework for HSA (Hwu/Jablin/Others)
• Chapter 5: Mapping of Classical Algorithms onto the HSA Architecture (Hwu/Jablin)
CURRENT TABLE OF CONTENTS (CONT.)
Representative Workloads:
• Chapter 6: Photography, Minh Do, Professor, UIUC, and CTO, Personify
• Chapter 7: Video-Audio Search, Ren Wu, Baidu
• Chapter 8: Augmented Reality, Chien-Ping Lu, MediaTek
• Chapter 9: Biometrics (Face Detection), Mike Jones, MERL, and Harris Gasparakis, AMD
• Chapter 10: Audio-Video Processing, Bill Herz, Sr. Fellow, AMD
CURRENT TABLE OF CONTENTS (CONT.)
• Chapter 11: Ray-Tracing, TBD
• Chapter 12: Natural User Interfaces, Navneett Dallal, CEO, Flutter
• Chapter 13: Physics Processing, Ronald Fedkiw, Professor, Stanford U., and Eftychios Sifakis
• Chapter 14: Graphics Enhancement, Ignacio Vargas, CTO, Nextlimit (interested)
• Chapter 15: Communications and Networking, Don Banks, Cisco (interested)
• Chapter 16: Data/Business Analytics, Zubin Dowlaty, CTO, Mu Sigma
CURRENT TABLE OF CONTENTS (CONT.)
• Chapter 17: Hadoop, Memcached and Cloud Frameworks, Bharath Mundlapudi, CTO, Orzota
• Chapter 18: Bioinformatics, Wu Feng, Virginia Tech
• Chapter 19: Scientific Computations, Byunghyun Jang, Univ Miss.
• Chapter 20: Computer Aided Design and Engineering, Martin Wong, Professor, UIUC
• Chapter 21: Oil and Gas, Nacho Navarro, UPC/BSC (interested)
CURRENT TABLE OF CONTENTS (CONT.)
• Chapter 22: Financial Services & Analysis, Surra Yanamadala, CA VP
• Chapter 23: Computer Vision Facial Expression, Elnar Hajiyev, Realeye
• Chapter 24: Legacy Code Interoperability, Wen-mei Hwu and Tom Jablin, MCW
• Chapter 25: Java for HSA, Gary Frost, AMD
MAJOR UPCOMING MILESTONES
• Author Algorithm and Code Due – February 2014
• Author Chapters Due – April 2014
• Review Due Back to Authors – May 2014
• Chapters to Elsevier – June 2014
• Books available – October 2014
THANK YOU!
ANY MORE QUESTIONS?


HSA-4130, HSA for Application Programming, by Wen Mei Hwu
