Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures

President of insideHPC Media um inside-BigData.com
22. Jul 2018
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures
1 von 38

Más contenido relacionado

Was ist angesagt?

LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGATO project
Serving BERT Models in Production with TorchServeServing BERT Models in Production with TorchServe
Serving BERT Models in Production with TorchServeNidhin Pattaniyil
Graal Tutorial at CGO 2015 by Christian WimmerGraal Tutorial at CGO 2015 by Christian Wimmer
Graal Tutorial at CGO 2015 by Christian WimmerThomas Wuerthinger
Micro-Benchmarking Considered HarmfulMicro-Benchmarking Considered Harmful
Micro-Benchmarking Considered HarmfulThomas Wuerthinger
Braden Hancock "Programmatically creating and managing training data with Sno...Braden Hancock "Programmatically creating and managing training data with Sno...
Braden Hancock "Programmatically creating and managing training data with Sno...Fwdays
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...
Graal and Truffle: Modularity and Separation of Concerns as Cornerstones for ...Thomas Wuerthinger

Was ist angesagt?(20)

Similar a Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures

A Library for Emerging High-Performance Computing ClustersA Library for Emerging High-Performance Computing Clusters
A Library for Emerging High-Performance Computing ClustersIntel® Software
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsImproving Efficiency of Machine Learning Algorithms using HPCC Systems
Improving Efficiency of Machine Learning Algorithms using HPCC SystemsHPCC Systems
Assisting User’s Transition to Titan’s Accelerated ArchitectureAssisting User’s Transition to Titan’s Accelerated Architecture
Assisting User’s Transition to Titan’s Accelerated Architectureinside-BigData.com
Introduction to OpenSees by Frank McKennaIntroduction to OpenSees by Frank McKenna
Introduction to OpenSees by Frank McKennaopenseesdays
Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...
Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...Intel® Software
Current & Future Use-Cases of OpenDaylightCurrent & Future Use-Cases of OpenDaylight
Current & Future Use-Cases of OpenDaylightabhijit2511

Similar a Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures(20)

Más de inside-BigData.com

Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in ITinside-BigData.com
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com

Más de inside-BigData.com(20)

Último

Smart Contracts - The Blockchain Beyond BitcoinSmart Contracts - The Blockchain Beyond Bitcoin
Smart Contracts - The Blockchain Beyond BitcoinJim McKeeth
Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...
Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...The Digital Insurer
Document Understanding as Cloud APIs and Generative AI Pre-labeling Extractio...Document Understanding as Cloud APIs and Generative AI Pre-labeling Extractio...
Document Understanding as Cloud APIs and Generative AI Pre-labeling Extractio...DianaGray10
Roottoo Innovation V24_CP.pdfRoottoo Innovation V24_CP.pdf
Roottoo Innovation V24_CP.pdfroottooinnovation
Accelerating Data Science through Feature Platform, Transformers, and GenAIAccelerating Data Science through Feature Platform, Transformers, and GenAI
Accelerating Data Science through Feature Platform, Transformers, and GenAIFeatureByte
Zodhya intro deck Japan ILSZodhya intro deck Japan ILS
Zodhya intro deck Japan ILSRohithPallerla

Último(20)

Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures

Hinweis der Redaktion

  1. Simple Parallel: loop parallelism, recursion, task, master/worker, pipeline
  2. Robbie – I changed implementation to ->proof-of-concept Third bullet -> currently only slide 26 is for this bullet isn’t it? That is not enough. Think about adding CUDA and OpenMP limitations as 2 slides or in 1 slide, then this bullet will be covered.
  3. Mention that OpenMP 4.5 is now starting to support offloading since 2013, but that is not what it was originally intended for.
  4. Robbie –I removed SummitDev, you can mention it, our focus on this slide is Summit and Top500 Since you are presenting in Europe, I added ORNL to the machine names, coz not everyone is going to put 2 and 2 together that Titan, Beacon where ORNL machines.
  5. EXPLAIN GANG & VECTOR Talk about how to map wavefront using gang/vector
  6. EXPLAIN GANG & VECTOR
  7. EXPLAIN GANG & VECTOR
  8. Go through the motions of analyzing a complex parallel pattern Come up with an abstraction for the pattern in question Robbie – edited the bullet on polyhedral -> Polyhedral transformation tools require code refactoring; only supports affine loops For your notes-> Typical limitations of the polyhedral model is essentially anything that makes the code non-affine. These include array indirections, data-dependent conditionals, linearized arrays and use of pointers.  These limitations need to be addressed before broadening the applicability of polyhedral transformations.
  9. Go through the motions of analyzing a complex parallel pattern Come up with an abstraction for the pattern in question
  10. EXPLAIN FACE ARRAYS! Optimal memory size implies brutal index calculations.
  11. Note that minisweep actually runs 8 sweeps (one per direction) Need a 3rd layer of parallelism for this!
  12. Robbie – didn’t we have results on SummitDev? It is in the paper, so add that to the second bullet? Its OK if you are showing titan results in your chart. Keep the absolute runtime numbers on summitdev as a backup slide. Someone might ask about absolute runtime as people don’t like speedup chart
  13. .
  14. SC: After this slide, bring back the contribution slide, and we have to discuss what we have contributed to these proposed bullets. Because contribution slides means we have contributed something. So how about? Creating standardized high-level abstraction for complex parallel patterns Evaluate abstraction on multiple state-of-the-art platforms to study performance/portability Apply abstraction to real-world scientific applications of interest to Exascale projects
  15. Robbie – this can be your last slide and then switch to the contribution slide and then put the next slide up while taking questions.
  16. Center for Accelerated Application Readiness (CAAR) VASP material science Gaussian Quantum Chemistry ANSYS Fluent, CFD Just keep this slide, in case someone goes like who is using OpenACC, you can bring this up pretty quickly.
  17. current DOE INCITE project to model the International Thermonuclear Experimental Reactor (ITER) fusion reactor and is the primary radiation transport code used in the multi-industry Consortium for Advanced Simulation of Light Water Reactors (CASL) [2] headquartered at Oak Ridge National Laboratory (ORNL). impact of the Fukushima nuclear accident in Japan, highlights the need to increase the production of safe, abundant quantities of energy A transport solver that incorporates an adequately resolved discretization for all scales using current discrete models would require 1017–1021 degrees of freedom per step, which is well beyond even exascale resources. Space: in the x , y and z dimensions, each gridcell depends on results from three upstream cells, based on octant direction, resulting in a wavefront problem. Octant: results for different octants can be calculated indepen- dently, the only dependency being that results from different octants at the same entry of vo′ut are added together and thus are a race hazard, depending on how the computation is done. Energy: computations for different energy groups have no coupling and can be considered separate problem instances, enabling flexible parallelization. One of the six applications selected for ORNL CAAR project Similar to Sn wavefront codes including Kripke, Sn (Discrete Ordinates) Application Proxy (SNAP) and PARTISN