SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
                 20-APR-2012
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

                                     2
Outline

●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   3
                               (c) with Histograms, 3 - Conclusions
Basic Information
Execution Environments
●   Personal Laptop
    ●   Ubuntu 11.10, 64-bit
    ●   Intel Quad Core i5
    ●   4GB RAM


●   Boada Server
    ●   Intel(R) Xeon(R) CPU E5645 @ 2.40GHz
    ●   12 Cores with HT support
    ●   24 GΒ RAM
     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   4
                                (c) with Histograms, 3 - Conclusions
Basic Information
NAS Parallel Benchmark
●   Evaluate the performance of parallel supercomputers

●   Several Applications                                   MG – MPI Version
    ●    IS, EP, CG, MG                                   Multi-Grid on a sequence
    ●    FT, BT, SP, LU                                              of meshes



●   Extrae → Produce traces
●   Paraver → Analyse traces
        1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   5
                                   (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   6
                               (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   7
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   8
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   9
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Initialization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   10
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Execution




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   11
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Finalization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   12
                             (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   13
                                (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Instructions




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   14
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Cycles




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   15
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
IPC: Instructions Per Cycle




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   16
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
L1 Cache Misses




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   17
                              (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   18
                              (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Time
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   19
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   20
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   21
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   22
                            (c) with Histograms, 3 - Conclusions
Conclusions
●   Scalability
    ●   In laptop: No way!
    ●   In Boada: Yes!

●   #Processors Increase
        → L1 Cache Misses Increase


●   Useful information very fast → Histograms!

     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   23
                               (c) with Histograms, 3 - Conclusions
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
               20-APR-2012
                                           24

Weitere ähnliche Inhalte

Andere mochten auch

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...Maria Stylianou
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your SecretsMaria Stylianou
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based SchedulingMaria Stylianou
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareMaria Stylianou
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Maria Stylianou
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBconfluent
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformconfluent
 

Andere mochten auch (10)

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your Secrets
 
Pregel - Paper Review
Pregel - Paper ReviewPregel - Paper Review
Pregel - Paper Review
 
Erlang in 10 minutes
Erlang in 10 minutesErlang in 10 minutes
Erlang in 10 minutes
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based Scheduling
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
 
Google's Dremel
Google's DremelGoogle's Dremel
Google's Dremel
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDB
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platform
 

Ähnlich wie Instrumenting the MG applicaiton of NAS Parallel Benchmark

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...EL-Hachemi Guerrout
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRAssuser58d6dc2
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmajayrampelli
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...South Tyrol Free Software Conference
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013Elsa von Licy
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Salvatore La Bua
 

Ähnlich wie Instrumenting the MG applicaiton of NAS Parallel Benchmark (8)

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
 
SRA final project
SRA final projectSRA final project
SRA final project
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRA
 
Dongliang_Slides
Dongliang_SlidesDongliang_Slides
Dongliang_Slides
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...
 

Kürzlich hochgeladen

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 

Kürzlich hochgeladen (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

Instrumenting the MG applicaiton of NAS Parallel Benchmark

  • 1. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012
  • 2. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 2
  • 3. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 3 (c) with Histograms, 3 - Conclusions
  • 4. Basic Information Execution Environments ● Personal Laptop ● Ubuntu 11.10, 64-bit ● Intel Quad Core i5 ● 4GB RAM ● Boada Server ● Intel(R) Xeon(R) CPU E5645 @ 2.40GHz ● 12 Cores with HT support ● 24 GΒ RAM 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 4 (c) with Histograms, 3 - Conclusions
  • 5. Basic Information NAS Parallel Benchmark ● Evaluate the performance of parallel supercomputers ● Several Applications MG – MPI Version ● IS, EP, CG, MG Multi-Grid on a sequence ● FT, BT, SP, LU of meshes ● Extrae → Produce traces ● Paraver → Analyse traces 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 5 (c) with Histograms, 3 - Conclusions
  • 6. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 6 (c) with Histograms, 3 - Conclusions
  • 7. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 7 (c) with Histograms, 3 - Conclusions
  • 8. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 8 (c) with Histograms, 3 - Conclusions
  • 9. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 9 (c) with Histograms, 3 - Conclusions
  • 10. Instrumentation by Observation Initialization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 10 (c) with Histograms, 3 - Conclusions
  • 11. Instrumentation by Observation Execution 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 11 (c) with Histograms, 3 - Conclusions
  • 12. Instrumentation by Observation Finalization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 12 (c) with Histograms, 3 - Conclusions
  • 13. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 13 (c) with Histograms, 3 - Conclusions
  • 14. Instrumentation using Performance Counters Instructions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 14 (c) with Histograms, 3 - Conclusions
  • 15. Instrumentation using Performance Counters Cycles 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 15 (c) with Histograms, 3 - Conclusions
  • 16. Instrumentation using Performance Counters IPC: Instructions Per Cycle 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 16 (c) with Histograms, 3 - Conclusions
  • 17. Instrumentation using Performance Counters L1 Cache Misses 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 17 (c) with Histograms, 3 - Conclusions
  • 18. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 18 (c) with Histograms, 3 - Conclusions
  • 19. Instrumentation using Histograms Time Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 19 (c) with Histograms, 3 - Conclusions
  • 20. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 20 (c) with Histograms, 3 - Conclusions
  • 21. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 21 (c) with Histograms, 3 - Conclusions
  • 22. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 22 (c) with Histograms, 3 - Conclusions
  • 23. Conclusions ● Scalability ● In laptop: No way! ● In Boada: Yes! ● #Processors Increase → L1 Cache Misses Increase ● Useful information very fast → Histograms! 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 23 (c) with Histograms, 3 - Conclusions
  • 24. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012 24