SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Downloaden Sie, um offline zu lesen
Tuning Parallel Applications to
Accelerate Scientific Discoveries

              Robert Henschel
           rhensche@indiana.edu


                October 2009
Contents
•   PTI / High Performance Applications
•   Performance of Scientific Codes
•   IU and TeraGrid Compute Resources
•   Optimizing for IU's HPC Systems
•   Using TeraGrid HPC Systems
•   HPA is Here to Help




                                          Robert Henschel
What this talk will be about
• Making you aware of compute resources that you
  can use for your work, to make you more
  productive.

• Introducing the High Performance Applications
  group and how we can help get research done
  faster.

• Give you examples of what we have done for
  researchers to make them more competitive in their
  field.



                                            Robert Henschel
PTI and High Performance Applications
• Pervasive Technology Institute
   – Develop and deliver innovative information technology
     to advance research, education, industry and society.
   – School of Informatics
   – School of Law
   – University Information Technology Services

• High Performance Applications
  – Part of the Digital Science Center of PTI
  – Part of the Research Technologies of UITS
  – Seven people that help IU researchers make efficient
    use of IU and TeraGrid compute resources

                                                Robert Henschel
Performance of Scientific Codes
• Supercomputing, or High Performance Computing (HPC),
  is not just for computer geeks!

• Performance for computer scientists
   – Amdahls law and scalability
   – Efficient usage of functional units of processors
   – Optimally using memory bandwidth
   – Trying to avoid I/O as much as possible

• Performance for researchers
   – When do I get the answer to my problem?
   – When does my job run and when is it done?

                                                   Robert Henschel
IU and TeraGrid Compute Resources
• Two HPC systems at IU
   – BigRed    30 TFLOPS            (3000 cores)
   – Quarry     7 TFLOPS            (1000 cores)

• Several special purpose systems
   – Small Cell B.E. Cluster
   – MDGRAPE-2 machine

• Several storage resources
   – IU Data Capacitor
   – GPFS, RFS, HPSS

• Policy of open access to compute resources
                                               Robert Henschel
IU and TeraGrid Compute Resources cont'd




                                     Robert Henschel
IU and TeraGrid Compute Resources cont'd
• TeraGrid
   – NSF funded HPC systems and support infrastructure
   – 11 resource providers
   – More than 1,500 TFLOPS      (150,000 cores)

• Central allocation and support structure




                                              Robert Henschel
Optimizing for IU's HPC Systems
• Help researchers access the central systems and
  determine what system to use
• Install and optimize applications
• Provide guidance on compiler and library optimization
• Help with job submission, especially running many
  thousands of jobs




                                                Robert Henschel
Using TeraGrid HPC Systems
• Low barrier of entry

• Identify if a problem and workflow will work on the
  TeraGrid
• Get a startup allocation
• Use it and identify if it is worth pursuing this further
• Submit a full allocation request




                                                      Robert Henschel
Contents – HPA is Here to Help
• HPA is Here to Help
  – What We Do

• Recent Examples
  – Integrating HPC Systems into an Electron
    Microscope Workflow
  – Migrating Research in Gas Giant Planets from IU
    to TeraGrid HPC Systems
  – Developing Computational Models to Predict
    Drug-Drug Interactions



                                           Robert Henschel
What We Do
• Consulting about HPC system usage
  – From start to finish
  – Optimize source code for architectures

• Help with TeraGrid allocation proposals
• Adapting and creating workflows for new environments
• Consulting for grant proposals




                                               Robert Henschel
HPC Systems and an Electron Microscope
General Case
  – Users have an instrument that produces a lot of data
    on a daily basis
  – This data needs to be stored and analyzed

Electron Microscope in Simon Hall (IU Bloomington)
   – Microscope stores data on a Windows workstation
   – Researcher does quality checks on local workstation
   – IU Data Capacitor links workstations, IU HPC systems
     and the IU long term archive together




                                                Robert Henschel
Gas Giant Planets on the TeraGrid
General Case
  – Users have a set workflow for analyzing data
  – Locally available compute resources are not big
    enough to keep up with demand

Understanding Gas Giant Planets
  – IDL is used to visualize simulation data
     • Commercial software, IU Astronomy has a license
  – Simulation application needs to run on a large shared
    memory system
  – TeraGrid and IU Data Capacitor tie this workflow
    together


                                                Robert Henschel
Predicting Drug-Drug Interactions
General Case
  – Researchers implement proof of concept research
    algorithms
  – Scaling from proof of concept to production science is
    difficult
  – The ability to add HPC expertise to grant proposals will
    make the proposal more competitive

Computational Models to Predict Drug-Drug Interactions
  – Drug exposure model developed in R
  – Scaling to real world data sets not possible without
    using HPC systems
  – Porting to C and running on UITS hardware

                                                 Robert Henschel
What this talk was about
• Made you aware of compute resources that you can
  use for your work, to make you more productive.

• Introduced the High Performance Applications
  group and how we can help get research done
  faster.

• Gave you examples of what we have done for
  researchers to make them more competitive in their
  field.




                                            Robert Henschel
Acknowledgments
This material is based upon work supported by the National Science
Foundation under Grant Numbers 0116050 and 0521433. Any
opinions, findings and conclusions or recommendations expressed in
this material are those of the author and do not necessarily reflect the
views of the National Science Foundation (NSF).

This work was support in part by the Indiana Metabolomics and
Cytomics Initiative (METACyt). METACyt is supported in part by Lilly
Endowment, Inc.

This work was support in part by the Indiana Genomics Initiative. The
Indiana Genomics Initiative of Indiana University is supported in part by
Lilly Endowment, Inc.

This work was supported in part by Shared University Research grants
from IBM, Inc. to Indiana University.

                                                             Robert Henschel

Weitere ähnliche Inhalte

Andere mochten auch

Ed tech story book ...my community.1
Ed tech story book ...my community.1Ed tech story book ...my community.1
Ed tech story book ...my community.1Carrie Chang
 
GeneIndex: an open source parallel program for enumerating and locating words...
GeneIndex: an open source parallel program for enumerating and locating words...GeneIndex: an open source parallel program for enumerating and locating words...
GeneIndex: an open source parallel program for enumerating and locating words...PTIHPA
 
5 Vampir Configuration At IU
5 Vampir Configuration At IU5 Vampir Configuration At IU
5 Vampir Configuration At IUPTIHPA
 
Big Iron and Parallel Processing, USArray Data Processing Workshop
Big Iron and Parallel Processing, USArray Data Processing WorkshopBig Iron and Parallel Processing, USArray Data Processing Workshop
Big Iron and Parallel Processing, USArray Data Processing WorkshopPTIHPA
 
Github:fi Presentation
Github:fi PresentationGithub:fi Presentation
Github:fi PresentationPTIHPA
 
2010 05 hands_on
2010 05 hands_on2010 05 hands_on
2010 05 hands_onPTIHPA
 
A Common Sense Approach Electronic
A Common Sense Approach   ElectronicA Common Sense Approach   Electronic
A Common Sense Approach ElectronicRegina Henderson
 
2010 vampir workshop_iu_configuration
2010 vampir workshop_iu_configuration2010 vampir workshop_iu_configuration
2010 vampir workshop_iu_configurationPTIHPA
 
Trace Visualization
Trace VisualizationTrace Visualization
Trace VisualizationPTIHPA
 
Air Traffic
Air TrafficAir Traffic
Air Traffichumair73
 
Appraisers Direct, Inc.
Appraisers Direct, Inc.Appraisers Direct, Inc.
Appraisers Direct, Inc.ClintCornett
 
How to Win the Moment in Real Time Events
How to Win the Moment in Real Time EventsHow to Win the Moment in Real Time Events
How to Win the Moment in Real Time EventsTammy Gordon
 
1 Vampir Overview
1 Vampir Overview1 Vampir Overview
1 Vampir OverviewPTIHPA
 
Overview: Event Based Program Analysis
Overview: Event Based Program AnalysisOverview: Event Based Program Analysis
Overview: Event Based Program AnalysisPTIHPA
 
3 Vampir Trace In Detail
3 Vampir Trace In Detail3 Vampir Trace In Detail
3 Vampir Trace In DetailPTIHPA
 

Andere mochten auch (18)

Ed tech story book ...my community.1
Ed tech story book ...my community.1Ed tech story book ...my community.1
Ed tech story book ...my community.1
 
GeneIndex: an open source parallel program for enumerating and locating words...
GeneIndex: an open source parallel program for enumerating and locating words...GeneIndex: an open source parallel program for enumerating and locating words...
GeneIndex: an open source parallel program for enumerating and locating words...
 
Aca Talent
Aca TalentAca Talent
Aca Talent
 
5 Vampir Configuration At IU
5 Vampir Configuration At IU5 Vampir Configuration At IU
5 Vampir Configuration At IU
 
Big Iron and Parallel Processing, USArray Data Processing Workshop
Big Iron and Parallel Processing, USArray Data Processing WorkshopBig Iron and Parallel Processing, USArray Data Processing Workshop
Big Iron and Parallel Processing, USArray Data Processing Workshop
 
Ciclismo Neiva
Ciclismo NeivaCiclismo Neiva
Ciclismo Neiva
 
Github:fi Presentation
Github:fi PresentationGithub:fi Presentation
Github:fi Presentation
 
2010 05 hands_on
2010 05 hands_on2010 05 hands_on
2010 05 hands_on
 
A Common Sense Approach Electronic
A Common Sense Approach   ElectronicA Common Sense Approach   Electronic
A Common Sense Approach Electronic
 
critical thinking
critical thinkingcritical thinking
critical thinking
 
2010 vampir workshop_iu_configuration
2010 vampir workshop_iu_configuration2010 vampir workshop_iu_configuration
2010 vampir workshop_iu_configuration
 
Trace Visualization
Trace VisualizationTrace Visualization
Trace Visualization
 
Air Traffic
Air TrafficAir Traffic
Air Traffic
 
Appraisers Direct, Inc.
Appraisers Direct, Inc.Appraisers Direct, Inc.
Appraisers Direct, Inc.
 
How to Win the Moment in Real Time Events
How to Win the Moment in Real Time EventsHow to Win the Moment in Real Time Events
How to Win the Moment in Real Time Events
 
1 Vampir Overview
1 Vampir Overview1 Vampir Overview
1 Vampir Overview
 
Overview: Event Based Program Analysis
Overview: Event Based Program AnalysisOverview: Event Based Program Analysis
Overview: Event Based Program Analysis
 
3 Vampir Trace In Detail
3 Vampir Trace In Detail3 Vampir Trace In Detail
3 Vampir Trace In Detail
 

Ähnlich wie Statewide It Robert Henschel

Intel life sciences_personalizedmedicine_stanford biomed 052214 dist
Intel life sciences_personalizedmedicine_stanford biomed 052214 distIntel life sciences_personalizedmedicine_stanford biomed 052214 dist
Intel life sciences_personalizedmedicine_stanford biomed 052214 distKetan Paranjape
 
big_data_casestudies_2.ppt
big_data_casestudies_2.pptbig_data_casestudies_2.ppt
big_data_casestudies_2.pptvishal choudhary
 
HPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHortonworks
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...Dell World
 
Using The Hadoop Ecosystem to Drive Healthcare Innovation
Using The Hadoop Ecosystem to Drive Healthcare InnovationUsing The Hadoop Ecosystem to Drive Healthcare Innovation
Using The Hadoop Ecosystem to Drive Healthcare InnovationDan Wellisch
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
Challenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing PlatformsChallenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing PlatformsFrederic Desprez
 
OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...
OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...
OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...Shuquan Huang
 
Enterprise data science at scale
Enterprise data science at scaleEnterprise data science at scale
Enterprise data science at scaleCarolyn Duby
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...inside-BigData.com
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceinside-BigData.com
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Ola Spjuth
 
High Performance Computing and the Opportunity with Cognitive Technology
 High Performance Computing and the Opportunity with Cognitive Technology High Performance Computing and the Opportunity with Cognitive Technology
High Performance Computing and the Opportunity with Cognitive TechnologyIBM Watson
 
HPC Resource Accounting
HPC Resource AccountingHPC Resource Accounting
HPC Resource AccountingKen Schumacher
 

Ähnlich wie Statewide It Robert Henschel (20)

Intel life sciences_personalizedmedicine_stanford biomed 052214 dist
Intel life sciences_personalizedmedicine_stanford biomed 052214 distIntel life sciences_personalizedmedicine_stanford biomed 052214 dist
Intel life sciences_personalizedmedicine_stanford biomed 052214 dist
 
big_data_casestudies_2.ppt
big_data_casestudies_2.pptbig_data_casestudies_2.ppt
big_data_casestudies_2.ppt
 
HPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare Transformation
 
HPC at NIBR
HPC at NIBRHPC at NIBR
HPC at NIBR
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...Dell High-Performance Computing solutions: Enable innovations, outperform exp...
Dell High-Performance Computing solutions: Enable innovations, outperform exp...
 
Using The Hadoop Ecosystem to Drive Healthcare Innovation
Using The Hadoop Ecosystem to Drive Healthcare InnovationUsing The Hadoop Ecosystem to Drive Healthcare Innovation
Using The Hadoop Ecosystem to Drive Healthcare Innovation
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Challenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing PlatformsChallenges and Issues of Next Cloud Computing Platforms
Challenges and Issues of Next Cloud Computing Platforms
 
OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...
OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...
OpenStack at SJTU: Predictive Data Mining in Clinical Medicine with Dynamical...
 
Enterprise data science at scale
Enterprise data science at scaleEnterprise data science at scale
Enterprise data science at scale
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
Big Data
Big Data Big Data
Big Data
 
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental science
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
High Performance Computing and the Opportunity with Cognitive Technology
 High Performance Computing and the Opportunity with Cognitive Technology High Performance Computing and the Opportunity with Cognitive Technology
High Performance Computing and the Opportunity with Cognitive Technology
 
HPC Resource Accounting
HPC Resource AccountingHPC Resource Accounting
HPC Resource Accounting
 
Collins seattle-2014-final
Collins seattle-2014-finalCollins seattle-2014-final
Collins seattle-2014-final
 

Mehr von PTIHPA

2010 02 instrumentation_and_runtime_measurement
2010 02 instrumentation_and_runtime_measurement2010 02 instrumentation_and_runtime_measurement
2010 02 instrumentation_and_runtime_measurementPTIHPA
 
2010 03 papi_indiana
2010 03 papi_indiana2010 03 papi_indiana
2010 03 papi_indianaPTIHPA
 
Switc Hpa
Switc HpaSwitc Hpa
Switc HpaPTIHPA
 
2 Vampir Trace Visualization
2 Vampir Trace Visualization2 Vampir Trace Visualization
2 Vampir Trace VisualizationPTIHPA
 
4 HPA Examples Of Vampir Usage
4 HPA Examples Of Vampir Usage4 HPA Examples Of Vampir Usage
4 HPA Examples Of Vampir UsagePTIHPA
 
Implementing 3D SPHARM Surfaces Registration on Cell B.E. Processor
Implementing 3D SPHARM Surfaces Registration on Cell B.E. ProcessorImplementing 3D SPHARM Surfaces Registration on Cell B.E. Processor
Implementing 3D SPHARM Surfaces Registration on Cell B.E. ProcessorPTIHPA
 

Mehr von PTIHPA (6)

2010 02 instrumentation_and_runtime_measurement
2010 02 instrumentation_and_runtime_measurement2010 02 instrumentation_and_runtime_measurement
2010 02 instrumentation_and_runtime_measurement
 
2010 03 papi_indiana
2010 03 papi_indiana2010 03 papi_indiana
2010 03 papi_indiana
 
Switc Hpa
Switc HpaSwitc Hpa
Switc Hpa
 
2 Vampir Trace Visualization
2 Vampir Trace Visualization2 Vampir Trace Visualization
2 Vampir Trace Visualization
 
4 HPA Examples Of Vampir Usage
4 HPA Examples Of Vampir Usage4 HPA Examples Of Vampir Usage
4 HPA Examples Of Vampir Usage
 
Implementing 3D SPHARM Surfaces Registration on Cell B.E. Processor
Implementing 3D SPHARM Surfaces Registration on Cell B.E. ProcessorImplementing 3D SPHARM Surfaces Registration on Cell B.E. Processor
Implementing 3D SPHARM Surfaces Registration on Cell B.E. Processor
 

Kürzlich hochgeladen

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Kürzlich hochgeladen (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

Statewide It Robert Henschel

  • 1. Tuning Parallel Applications to Accelerate Scientific Discoveries Robert Henschel rhensche@indiana.edu October 2009
  • 2. Contents • PTI / High Performance Applications • Performance of Scientific Codes • IU and TeraGrid Compute Resources • Optimizing for IU's HPC Systems • Using TeraGrid HPC Systems • HPA is Here to Help Robert Henschel
  • 3. What this talk will be about • Making you aware of compute resources that you can use for your work, to make you more productive. • Introducing the High Performance Applications group and how we can help get research done faster. • Give you examples of what we have done for researchers to make them more competitive in their field. Robert Henschel
  • 4. PTI and High Performance Applications • Pervasive Technology Institute – Develop and deliver innovative information technology to advance research, education, industry and society. – School of Informatics – School of Law – University Information Technology Services • High Performance Applications – Part of the Digital Science Center of PTI – Part of the Research Technologies of UITS – Seven people that help IU researchers make efficient use of IU and TeraGrid compute resources Robert Henschel
  • 5. Performance of Scientific Codes • Supercomputing, or High Performance Computing (HPC), is not just for computer geeks! • Performance for computer scientists – Amdahls law and scalability – Efficient usage of functional units of processors – Optimally using memory bandwidth – Trying to avoid I/O as much as possible • Performance for researchers – When do I get the answer to my problem? – When does my job run and when is it done? Robert Henschel
  • 6. IU and TeraGrid Compute Resources • Two HPC systems at IU – BigRed 30 TFLOPS (3000 cores) – Quarry 7 TFLOPS (1000 cores) • Several special purpose systems – Small Cell B.E. Cluster – MDGRAPE-2 machine • Several storage resources – IU Data Capacitor – GPFS, RFS, HPSS • Policy of open access to compute resources Robert Henschel
  • 7. IU and TeraGrid Compute Resources cont'd Robert Henschel
  • 8. IU and TeraGrid Compute Resources cont'd • TeraGrid – NSF funded HPC systems and support infrastructure – 11 resource providers – More than 1,500 TFLOPS (150,000 cores) • Central allocation and support structure Robert Henschel
  • 9. Optimizing for IU's HPC Systems • Help researchers access the central systems and determine what system to use • Install and optimize applications • Provide guidance on compiler and library optimization • Help with job submission, especially running many thousands of jobs Robert Henschel
  • 10. Using TeraGrid HPC Systems • Low barrier of entry • Identify if a problem and workflow will work on the TeraGrid • Get a startup allocation • Use it and identify if it is worth pursuing this further • Submit a full allocation request Robert Henschel
  • 11. Contents – HPA is Here to Help • HPA is Here to Help – What We Do • Recent Examples – Integrating HPC Systems into an Electron Microscope Workflow – Migrating Research in Gas Giant Planets from IU to TeraGrid HPC Systems – Developing Computational Models to Predict Drug-Drug Interactions Robert Henschel
  • 12. What We Do • Consulting about HPC system usage – From start to finish – Optimize source code for architectures • Help with TeraGrid allocation proposals • Adapting and creating workflows for new environments • Consulting for grant proposals Robert Henschel
  • 13. HPC Systems and an Electron Microscope General Case – Users have an instrument that produces a lot of data on a daily basis – This data needs to be stored and analyzed Electron Microscope in Simon Hall (IU Bloomington) – Microscope stores data on a Windows workstation – Researcher does quality checks on local workstation – IU Data Capacitor links workstations, IU HPC systems and the IU long term archive together Robert Henschel
  • 14. Gas Giant Planets on the TeraGrid General Case – Users have a set workflow for analyzing data – Locally available compute resources are not big enough to keep up with demand Understanding Gas Giant Planets – IDL is used to visualize simulation data • Commercial software, IU Astronomy has a license – Simulation application needs to run on a large shared memory system – TeraGrid and IU Data Capacitor tie this workflow together Robert Henschel
  • 15. Predicting Drug-Drug Interactions General Case – Researchers implement proof of concept research algorithms – Scaling from proof of concept to production science is difficult – The ability to add HPC expertise to grant proposals will make the proposal more competitive Computational Models to Predict Drug-Drug Interactions – Drug exposure model developed in R – Scaling to real world data sets not possible without using HPC systems – Porting to C and running on UITS hardware Robert Henschel
  • 16. What this talk was about • Made you aware of compute resources that you can use for your work, to make you more productive. • Introduced the High Performance Applications group and how we can help get research done faster. • Gave you examples of what we have done for researchers to make them more competitive in their field. Robert Henschel
  • 17. Acknowledgments This material is based upon work supported by the National Science Foundation under Grant Numbers 0116050 and 0521433. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation (NSF). This work was support in part by the Indiana Metabolomics and Cytomics Initiative (METACyt). METACyt is supported in part by Lilly Endowment, Inc. This work was support in part by the Indiana Genomics Initiative. The Indiana Genomics Initiative of Indiana University is supported in part by Lilly Endowment, Inc. This work was supported in part by Shared University Research grants from IBM, Inc. to Indiana University. Robert Henschel