SlideShare a Scribd company logo
1 of 1
Download to read offline
LINUX
+
OPENMP
LEON3
LEON3
LEON3
LEON3
S 1
S 2
S 3
AHB
Controller
LEON3 LEON3 JTAG
Dbg Link
Ethernet
MAC
LEON3 LEON3 AHB/APB
Bridge
Memory
Controller
JTAG PHY
AMBA AHB
AMBA APB
UARTUART - USB SRAM
S1 S2 S3
Non-parallel region:
Master thread only
Parallel region starts:
#pragma omp parallel
ID:0
fork
ID:0 ID:1 ID:2 ID:3
Parallel region:
Several thread execute
simultaneously
join
Parallel region ends:
program waits for all
threads to terminate
ID:0
Program reverts to single
threaded execution
APB
Interface
Decode
Section
Event
Monitor
Time
Monitor
Counter
AHB - Adapter
APB
Bus
AHB Bus
SNIFFER BLOCK DIAGRAM
LEON3
7 - Stage
Integer Pipeline
3-Port Register File
IEEE-754 FPU
Co-Processor
HW MUL/DIV
Trace Buffer
Debug port
Interrupt port
I-Cache D-Cache
SRMMU
AHB I/F
Local IRAM
ITLB
Local DRAM
DTLB
AMBA AHB Master (32-bit)
SYSTEM BEHAVIOUR
Perfomance evaluation
of the platform by
means of Pi calculation
algorithm, proposed in
four different versions:
serial computation,
single process multiple
data (SPMD) technique
with false sharing,
SPMD technique
without false sharing
and OMP reduction
function.
Proposed profiling technique, used to monitor computational behaviour of the 4-LOOP platform, follows the
approach of runtime bus sampling.
LEGEND:
1 Thread
2 Threads
3 Threads
4 Threads
Event monitor:
strobe generation
(ld_ac_event) during
access on specified
address range
(delimited by
sig_out_inf and
sig_out_sup).
Time monitor:
counter activated by
read operation
(during_read) and
stopped by write
operation
(during_write), both
on specified address
(0x808).
LEON3 HW PROFILING SYSTEM
SYSTEM DESCRIPTION
The LEON3 processor is designed for Embedded applications,
combining high performance with low complexity and low power
consumption. The LEON3 processor is highly configurable.
A distributed hardware profiling system has been developed for runtime analysis.
It is composed of distributed AHB bus monitoring elements (sniffers) that moni-
tor AHB bus, initialized by means of APB bus. A global monitor unit, represented
by one LEON3 processor, provides sniffers initialization and collects results.
A Linux distribution, customized to work with multicore platform
in SMP mode, has been developed using Buildroot tool, starting
from LEON LINUX kernel (provided by Gaisler research).
Libraries required to implement parallel applications using shared memory,
developed with OpenMP C/C++, have been cross-compiled and added to the
adopted Linux distribution.
LINUX OPENMP
HARDWARE ARCHITECTURE
THE PLATFORM
OVERVIEW
The opportunity to build multi-processor systems exploiting soft-cores is increasing the
range of applications that can be implemented on FPGAs.
Inordertomaximizeperformanceaparallelprogrammingmodelshouldbeused:OpenMPAPI
is a specification for a set of compiler directives, library routines, and environment variables
that can be used to specify high-level parallelism.
Runtime analysis on SoC is useful to optimize reconfigurable systems. However, software
profiling systems impose software overhead to application execution.
1)
2)
3)
Proposed platform is composed of a working symmetric multi-processor
systems (SMP) based on four LEON3 cores, enhanced by adding a custom hardware
profiling system with no software overhead introduction. A SMP LINUX kernel
targeting the proposed system and including the device drivers needed to collect data
from the custom hardware profilers has been also built. The system has
been further customized to support the execution of OpenMP-based applications.
PROPOSED PLATFORMMOTIVATIONS
4-LOOP IS A PLATFORM DEVELOPED TO OFFER ADVANTAGE OF PARALLEL EXECUTION, WHILE MONITORING RUNTIME SYSTEM
BEHAVIOUR WITHOUT SOFTWARE OVERHEAD
4 - LOOP LEON34-CORE LEON3 WITH LINUX OPERATING SYSTEM, OPENMP LIBRARY AND HARDWARE PROFILING SYSTEM
G. Valente, V. Muttillo, L. Pomante, M. Faccio, F. Federici, A. Moro
Main Contacts: giacomo.valente@graduate.univaq.it, luigi.pomante@univaq.it, marco.faccio@univaq.it, fabio.federici@univaq.it
UNIVERSITA’ degli STUDI dell’AQUILA - CENTER of EXCELLENCE DEWS (ITALY)
http://dews.univaq.it
Graphic Designed By: Tania Valentina Ferro

More Related Content

Viewers also liked

הוראות למלוי טפסים
הוראות למלוי טפסיםהוראות למלוי טפסים
הוראות למלוי טפסים
shelly_zil1
 
Executive Judgement: Kevin K. Tang at SMECC - 20130809
Executive Judgement: Kevin K. Tang at SMECC - 20130809Executive Judgement: Kevin K. Tang at SMECC - 20130809
Executive Judgement: Kevin K. Tang at SMECC - 20130809
smecchk
 
Native Union Presentation
Native Union PresentationNative Union Presentation
Native Union Presentation
prestigioplaza
 

Viewers also liked (14)

Begins 08
Begins 08Begins 08
Begins 08
 
הוראות למלוי טפסים
הוראות למלוי טפסיםהוראות למלוי טפסים
הוראות למלוי טפסים
 
Decálogo de la gestión de la reputación en la red
Decálogo de la gestión de la reputación en la redDecálogo de la gestión de la reputación en la red
Decálogo de la gestión de la reputación en la red
 
Clinical trials in ahf
Clinical trials in ahfClinical trials in ahf
Clinical trials in ahf
 
niteshresume4oct
niteshresume4octniteshresume4oct
niteshresume4oct
 
Executive Judgement: Kevin K. Tang at SMECC - 20130809
Executive Judgement: Kevin K. Tang at SMECC - 20130809Executive Judgement: Kevin K. Tang at SMECC - 20130809
Executive Judgement: Kevin K. Tang at SMECC - 20130809
 
WiMAX Forum(R) anuncia el WiMAX Forum Congreso de las Americas para el aA�o 2009
WiMAX Forum(R) anuncia el WiMAX Forum Congreso de las Americas para el aA�o 2009WiMAX Forum(R) anuncia el WiMAX Forum Congreso de las Americas para el aA�o 2009
WiMAX Forum(R) anuncia el WiMAX Forum Congreso de las Americas para el aA�o 2009
 
New trends in Enterprise Social Networks
New trends in Enterprise Social NetworksNew trends in Enterprise Social Networks
New trends in Enterprise Social Networks
 
owen Resume final
owen Resume finalowen Resume final
owen Resume final
 
Control estádistico del proceso (spc) correlación xr
Control estádistico del proceso (spc) correlación xrControl estádistico del proceso (spc) correlación xr
Control estádistico del proceso (spc) correlación xr
 
Native Union Presentation
Native Union PresentationNative Union Presentation
Native Union Presentation
 
Xplanificacion estrategica
Xplanificacion estrategicaXplanificacion estrategica
Xplanificacion estrategica
 
Presentazione "Da Vinci - De Giorgio" - Istituto Professionale
Presentazione "Da Vinci - De Giorgio" -  Istituto ProfessionalePresentazione "Da Vinci - De Giorgio" -  Istituto Professionale
Presentazione "Da Vinci - De Giorgio" - Istituto Professionale
 
Project work ipe ubi pramerica
Project work ipe   ubi pramericaProject work ipe   ubi pramerica
Project work ipe ubi pramerica
 

More from Vittoriano Muttillo

More from Vittoriano Muttillo (7)

AIPHS: AdaptIve Profiling Hardware Sub-system
AIPHS: AdaptIve Profiling Hardware Sub-systemAIPHS: AdaptIve Profiling Hardware Sub-system
AIPHS: AdaptIve Profiling Hardware Sub-system
 
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
 
Progettazione e realizzazione di un sistema software per il time logging
Progettazione e realizzazione di un sistema software per il time loggingProgettazione e realizzazione di un sistema software per il time logging
Progettazione e realizzazione di un sistema software per il time logging
 
Invarianza di un politopo
Invarianza di un politopoInvarianza di un politopo
Invarianza di un politopo
 
Ieee 802.15.4e nel wirelessHART
Ieee 802.15.4e nel wirelessHARTIeee 802.15.4e nel wirelessHART
Ieee 802.15.4e nel wirelessHART
 
Algoritmi per l'ottimizzazione convessa
Algoritmi per l'ottimizzazione convessaAlgoritmi per l'ottimizzazione convessa
Algoritmi per l'ottimizzazione convessa
 
Advances in gml for geospatial applications slide
Advances in gml for geospatial applications slideAdvances in gml for geospatial applications slide
Advances in gml for geospatial applications slide
 

Recently uploaded

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
rknatarajan
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 

Recently uploaded (20)

CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 

4-LOOP: 4-core Leon 3 with linux Operating system, OpenMP library and hardware Profiling system

  • 1. LINUX + OPENMP LEON3 LEON3 LEON3 LEON3 S 1 S 2 S 3 AHB Controller LEON3 LEON3 JTAG Dbg Link Ethernet MAC LEON3 LEON3 AHB/APB Bridge Memory Controller JTAG PHY AMBA AHB AMBA APB UARTUART - USB SRAM S1 S2 S3 Non-parallel region: Master thread only Parallel region starts: #pragma omp parallel ID:0 fork ID:0 ID:1 ID:2 ID:3 Parallel region: Several thread execute simultaneously join Parallel region ends: program waits for all threads to terminate ID:0 Program reverts to single threaded execution APB Interface Decode Section Event Monitor Time Monitor Counter AHB - Adapter APB Bus AHB Bus SNIFFER BLOCK DIAGRAM LEON3 7 - Stage Integer Pipeline 3-Port Register File IEEE-754 FPU Co-Processor HW MUL/DIV Trace Buffer Debug port Interrupt port I-Cache D-Cache SRMMU AHB I/F Local IRAM ITLB Local DRAM DTLB AMBA AHB Master (32-bit) SYSTEM BEHAVIOUR Perfomance evaluation of the platform by means of Pi calculation algorithm, proposed in four different versions: serial computation, single process multiple data (SPMD) technique with false sharing, SPMD technique without false sharing and OMP reduction function. Proposed profiling technique, used to monitor computational behaviour of the 4-LOOP platform, follows the approach of runtime bus sampling. LEGEND: 1 Thread 2 Threads 3 Threads 4 Threads Event monitor: strobe generation (ld_ac_event) during access on specified address range (delimited by sig_out_inf and sig_out_sup). Time monitor: counter activated by read operation (during_read) and stopped by write operation (during_write), both on specified address (0x808). LEON3 HW PROFILING SYSTEM SYSTEM DESCRIPTION The LEON3 processor is designed for Embedded applications, combining high performance with low complexity and low power consumption. The LEON3 processor is highly configurable. A distributed hardware profiling system has been developed for runtime analysis. It is composed of distributed AHB bus monitoring elements (sniffers) that moni- tor AHB bus, initialized by means of APB bus. A global monitor unit, represented by one LEON3 processor, provides sniffers initialization and collects results. A Linux distribution, customized to work with multicore platform in SMP mode, has been developed using Buildroot tool, starting from LEON LINUX kernel (provided by Gaisler research). Libraries required to implement parallel applications using shared memory, developed with OpenMP C/C++, have been cross-compiled and added to the adopted Linux distribution. LINUX OPENMP HARDWARE ARCHITECTURE THE PLATFORM OVERVIEW The opportunity to build multi-processor systems exploiting soft-cores is increasing the range of applications that can be implemented on FPGAs. Inordertomaximizeperformanceaparallelprogrammingmodelshouldbeused:OpenMPAPI is a specification for a set of compiler directives, library routines, and environment variables that can be used to specify high-level parallelism. Runtime analysis on SoC is useful to optimize reconfigurable systems. However, software profiling systems impose software overhead to application execution. 1) 2) 3) Proposed platform is composed of a working symmetric multi-processor systems (SMP) based on four LEON3 cores, enhanced by adding a custom hardware profiling system with no software overhead introduction. A SMP LINUX kernel targeting the proposed system and including the device drivers needed to collect data from the custom hardware profilers has been also built. The system has been further customized to support the execution of OpenMP-based applications. PROPOSED PLATFORMMOTIVATIONS 4-LOOP IS A PLATFORM DEVELOPED TO OFFER ADVANTAGE OF PARALLEL EXECUTION, WHILE MONITORING RUNTIME SYSTEM BEHAVIOUR WITHOUT SOFTWARE OVERHEAD 4 - LOOP LEON34-CORE LEON3 WITH LINUX OPERATING SYSTEM, OPENMP LIBRARY AND HARDWARE PROFILING SYSTEM G. Valente, V. Muttillo, L. Pomante, M. Faccio, F. Federici, A. Moro Main Contacts: giacomo.valente@graduate.univaq.it, luigi.pomante@univaq.it, marco.faccio@univaq.it, fabio.federici@univaq.it UNIVERSITA’ degli STUDI dell’AQUILA - CENTER of EXCELLENCE DEWS (ITALY) http://dews.univaq.it Graphic Designed By: Tania Valentina Ferro