SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
                 20-APR-2012
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

                                     2
Outline

●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   3
                               (c) with Histograms, 3 - Conclusions
Basic Information
Execution Environments
●   Personal Laptop
    ●   Ubuntu 11.10, 64-bit
    ●   Intel Quad Core i5
    ●   4GB RAM


●   Boada Server
    ●   Intel(R) Xeon(R) CPU E5645 @ 2.40GHz
    ●   12 Cores with HT support
    ●   24 GΒ RAM
     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   4
                                (c) with Histograms, 3 - Conclusions
Basic Information
NAS Parallel Benchmark
●   Evaluate the performance of parallel supercomputers

●   Several Applications                                   MG – MPI Version
    ●    IS, EP, CG, MG                                   Multi-Grid on a sequence
    ●    FT, BT, SP, LU                                              of meshes



●   Extrae → Produce traces
●   Paraver → Analyse traces
        1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   5
                                   (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   6
                               (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   7
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   8
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   9
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Initialization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   10
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Execution




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   11
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Finalization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   12
                             (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   13
                                (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Instructions




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   14
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Cycles




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   15
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
IPC: Instructions Per Cycle




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   16
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
L1 Cache Misses




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   17
                              (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   18
                              (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Time
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   19
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   20
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   21
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   22
                            (c) with Histograms, 3 - Conclusions
Conclusions
●   Scalability
    ●   In laptop: No way!
    ●   In Boada: Yes!

●   #Processors Increase
        → L1 Cache Misses Increase


●   Useful information very fast → Histograms!

     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   23
                               (c) with Histograms, 3 - Conclusions
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
               20-APR-2012
                                           24

Weitere ähnliche Inhalte

Andere mochten auch

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...Maria Stylianou
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your SecretsMaria Stylianou
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based SchedulingMaria Stylianou
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareMaria Stylianou
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Maria Stylianou
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBconfluent
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformconfluent
 

Andere mochten auch (10)

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your Secrets
 
Pregel - Paper Review
Pregel - Paper ReviewPregel - Paper Review
Pregel - Paper Review
 
Erlang in 10 minutes
Erlang in 10 minutesErlang in 10 minutes
Erlang in 10 minutes
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based Scheduling
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
 
Google's Dremel
Google's DremelGoogle's Dremel
Google's Dremel
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDB
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platform
 

Ähnlich wie Instrumenting the MG applicaiton of NAS Parallel Benchmark

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...EL-Hachemi Guerrout
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRAssuser58d6dc2
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmajayrampelli
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...South Tyrol Free Software Conference
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013Elsa von Licy
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Salvatore La Bua
 

Ähnlich wie Instrumenting the MG applicaiton of NAS Parallel Benchmark (8)

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
 
SRA final project
SRA final projectSRA final project
SRA final project
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRA
 
Dongliang_Slides
Dongliang_SlidesDongliang_Slides
Dongliang_Slides
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...
 

Kürzlich hochgeladen

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Instrumenting the MG applicaiton of NAS Parallel Benchmark

  • 1. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012
  • 2. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 2
  • 3. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 3 (c) with Histograms, 3 - Conclusions
  • 4. Basic Information Execution Environments ● Personal Laptop ● Ubuntu 11.10, 64-bit ● Intel Quad Core i5 ● 4GB RAM ● Boada Server ● Intel(R) Xeon(R) CPU E5645 @ 2.40GHz ● 12 Cores with HT support ● 24 GΒ RAM 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 4 (c) with Histograms, 3 - Conclusions
  • 5. Basic Information NAS Parallel Benchmark ● Evaluate the performance of parallel supercomputers ● Several Applications MG – MPI Version ● IS, EP, CG, MG Multi-Grid on a sequence ● FT, BT, SP, LU of meshes ● Extrae → Produce traces ● Paraver → Analyse traces 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 5 (c) with Histograms, 3 - Conclusions
  • 6. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 6 (c) with Histograms, 3 - Conclusions
  • 7. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 7 (c) with Histograms, 3 - Conclusions
  • 8. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 8 (c) with Histograms, 3 - Conclusions
  • 9. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 9 (c) with Histograms, 3 - Conclusions
  • 10. Instrumentation by Observation Initialization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 10 (c) with Histograms, 3 - Conclusions
  • 11. Instrumentation by Observation Execution 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 11 (c) with Histograms, 3 - Conclusions
  • 12. Instrumentation by Observation Finalization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 12 (c) with Histograms, 3 - Conclusions
  • 13. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 13 (c) with Histograms, 3 - Conclusions
  • 14. Instrumentation using Performance Counters Instructions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 14 (c) with Histograms, 3 - Conclusions
  • 15. Instrumentation using Performance Counters Cycles 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 15 (c) with Histograms, 3 - Conclusions
  • 16. Instrumentation using Performance Counters IPC: Instructions Per Cycle 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 16 (c) with Histograms, 3 - Conclusions
  • 17. Instrumentation using Performance Counters L1 Cache Misses 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 17 (c) with Histograms, 3 - Conclusions
  • 18. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 18 (c) with Histograms, 3 - Conclusions
  • 19. Instrumentation using Histograms Time Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 19 (c) with Histograms, 3 - Conclusions
  • 20. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 20 (c) with Histograms, 3 - Conclusions
  • 21. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 21 (c) with Histograms, 3 - Conclusions
  • 22. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 22 (c) with Histograms, 3 - Conclusions
  • 23. Conclusions ● Scalability ● In laptop: No way! ● In Boada: Yes! ● #Processors Increase → L1 Cache Misses Increase ● Useful information very fast → Histograms! 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 23 (c) with Histograms, 3 - Conclusions
  • 24. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012 24