For the full video of this presentation, please visit:
http://www.embedded-vision.com/platinum-members/embedded-vision-alliance/embedded-vision-training/videos/pages/sept-2016-member-meeting-hsa-foundation
For more information about embedded vision, please visit:
http://www.embedded-vision.com
Dr. John Glossner, President of the HSA Foundation and CEO of GPT-US, delivers the presentation "Enabling Efficient Heterogeneous Processing Through Coherency" at the September 2016 Embedded Vision Alliance Member Meeting. Glossner describes the organization's goals and deliverables for enabling heterogeneous programming.
32. Python on GPUs
Numba: NumPy-aware Python compiler
Open source. Available on GitHub
Sponsored by Continuum Analytics
Direct HSA Support
Automatic Parallelization
2x-200x speedup
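The "Automatic Parallelization" bullet can be illustrated with a minimal sketch. It assumes Numba's `njit(parallel=True)` and `prange`, which run the loop on CPU threads; it does not use the HSA target mentioned on the slide. The function name `sum_of_squares` is hypothetical, and the code falls back to plain Python when Numba is not installed.

```python
import numpy as np

try:
    from numba import njit, prange  # optional dependency
    HAVE_NUMBA = True
except ImportError:
    HAVE_NUMBA = False
    prange = range  # serial stand-in when Numba is missing

    def njit(*args, **kwargs):
        # Fallback decorator: return the function unchanged.
        if len(args) == 1 and callable(args[0]) and not kwargs:
            return args[0]
        return lambda func: func


@njit(parallel=True)
def sum_of_squares(values):
    # With Numba, prange marks the loop as safe to split across threads,
    # and the += on `total` is treated as a parallel reduction.
    total = 0.0
    for i in prange(values.shape[0]):
        total += values[i] * values[i]
    return total


if __name__ == "__main__":
    data = np.arange(10, dtype=np.float64)
    print(sum_of_squares(data))
```

Any speedup depends on the array size, the hardware, and the compilation overhead of the first call, so measure before relying on a figure.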
34. Python Geographic Locality
What is the distance from a set of points to a target point?
How many points are within a specified range?
Numba can auto-parallelize user universal functions (ufuncs) for HSA
Ufuncs broadcast an operation over the elements of a NumPy array
ZERO HSA developer knowledge required
1M points
>8X speedup
https://github.com/ContinuumIO/Numba-HSA-Webinar
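The two questions on this slide can be sketched as a ufunc plus a count. This is not the code from the linked webinar repository: it assumes planar Euclidean distance (the slide does not name a metric), uses Numba's CPU `vectorize` rather than an HSA target, and the names `distance` and `count_within` are hypothetical. Without Numba, the same function still broadcasts through ordinary NumPy arithmetic.

```python
import numpy as np

try:
    from numba import vectorize  # optional dependency
    HAVE_NUMBA = True
except ImportError:
    HAVE_NUMBA = False


def _distance(x, y, target_x, target_y):
    # Scalar kernel: distance from one point to the target (assumed Euclidean).
    dx = x - target_x
    dy = y - target_y
    return (dx * dx + dy * dy) ** 0.5


if HAVE_NUMBA:
    # Compile the scalar kernel into a ufunc that broadcasts over arrays.
    distance = vectorize(["float64(float64, float64, float64, float64)"])(_distance)
else:
    # NumPy array arithmetic already applies the kernel element by element.
    distance = _distance


def count_within(xs, ys, target, radius):
    # Number of points whose distance to the target is at most `radius`.
    d = distance(xs, ys, target[0], target[1])
    return int(np.count_nonzero(d <= radius))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    xs = rng.uniform(-1.0, 1.0, 1_000_000)  # 1M points, as on the slide
    ys = rng.uniform(-1.0, 1.0, 1_000_000)
    print(count_within(xs, ys, (0.0, 0.0), 0.5))
```

The user writes only the scalar kernel; the ufunc machinery decides how to apply it across the array, which is what lets a different target be swapped in without changing the kernel.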
37. Heterogeneous System Architecture Is At The Core of ROCm
Rich foundation for HPC and ultrascale computing, supporting our APUs and discrete GPUs
HSA drives rich capabilities into the ROCm systems architecture:
‒ User Mode Queues
‒ Architected Queuing Language
‒ Flat Memory Addressing
‒ Atomic Memory Transactions
‒ Process Concurrency & Preemption
HSA Runtime enables a programming-language-neutral systems interface
Supports standardized loader and linker interface
ROCm: Radeon Open Compute Platform
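The idea behind user mode queues can be sketched with a toy model: the application writes packets into a ring buffer it owns and then signals a doorbell, with no operating-system call per dispatch. This is a conceptual illustration only, not the HSA runtime API or the real AQL packet format; the class `ToyUserModeQueue`, its methods, and the dictionary packets are all hypothetical.

```python
class ToyUserModeQueue:
    """Toy ring buffer imitating a user-mode dispatch queue (illustrative only)."""

    def __init__(self, size):
        if size <= 0 or size & (size - 1):
            raise ValueError("size must be a power of two")
        self.slots = [None] * size
        self.mask = size - 1      # cheap wrap-around for power-of-two sizes
        self.write_index = 0      # advanced by the producer (application)
        self.read_index = 0       # advanced by the consumer (packet processor)
        self.doorbell = 0         # last write index published to the consumer

    def enqueue(self, packet):
        # Producer side: place a packet in the next free slot.
        if self.write_index - self.read_index > self.mask:
            raise BufferError("queue full")
        self.slots[self.write_index & self.mask] = packet
        self.write_index += 1

    def ring_doorbell(self):
        # Publish everything written so far to the consumer.
        self.doorbell = self.write_index

    def process(self):
        # Consumer side: handle only packets announced through the doorbell.
        done = []
        while self.read_index < self.doorbell:
            done.append(self.slots[self.read_index & self.mask])
            self.read_index += 1
        return done


if __name__ == "__main__":
    queue = ToyUserModeQueue(4)
    queue.enqueue({"kernel": "distance", "grid": 1_000_000})
    queue.ring_doorbell()
    print(queue.process())
```

In real hardware the index updates rely on the atomic memory transactions listed on the slide; the toy model is single-threaded and skips that entirely.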
38. ROCm-Enabled Hardware 2016
S9150, W9100
RADEON R9 Nano, S9300 x2, RADEON RX480 (Oct, ROCm 1.3)
S9170
AMD Proprietary and Confidential August 2016
AMD Embedded R-Series SOC
AMD FX 98xx, A12-97xx