SlideShare ist ein Scribd-Unternehmen logo
1 von 6
Downloaden Sie, um offline zu lesen
International
OPEN ACCESS Journal
Of Modern Engineering Research (IJMER)
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 50 |
Accelerating Real Time Applications on Heterogeneous Platforms
Vivek Vanpariya1
, Prof. S. Ramani2
1, 2
(SCSE, VIT University, Vellore, India)
I. INTRODUCTION
General purpose CPU performance scaling has opened up a new direction for computer scientist and
architect. Scaling performance of new generation GP-CPU has not as predicted by Gordon moor. After
observing it closely the throughput for sequential code in GP-CPU is getting fluctuated between subsequent
processor generations. This problem has motivated the scientist to innovate the high performance and cost
effective hardware. As a solution computer scientist comes up with a low level GP-GPU core which has features
like out of order execution and lack branch prediction to optimize the sequential code [1]. Khronos has a
framework and standard set for heterogeneous parallel computing on cross-vendor and cross-platform hardware
[7]. It provides an abstraction layer via which code can scale from simple embedded microcontrollers to GP-
CPU’s from Intel and AMD, up to massively-parallel GP-GPU hardware pipelines, all without modifying code
[8]. Most important task of graphical processing unit (GPU) is to accelerate the graphical applications. It
contains large amount of streaming processors for commutations when compared to Central Processing Unit
(CPU). Also GPU have high-bandwidth for data transfer. Graphics processing unit’s hasty increase in both
programmability and capability has spawned a researcher’s community that has effectively mapped a broad
range of computationally demanding, complex problems to the GPU. In other words GPU computing is using it
as to boost up central processing units for general-purpose scientific and engineering computations. It can be
achieved by offloading some of the intensive and overwhelming portions of the code and the remaining part of
the code will remain in the CPU. From user’s view point, the application run more rapidly because of the
extremely parallel processing power of the general purpose unit to boost performance which is well- known as
“heterogeneous” or “hybrid” computing. Some critical factors like power, cost, bandwidth on which we have to
focus when we working on GPU.
Figure 1: CPU vs. GPU
II. GPU ARCHITECTURE
Graphics processing units will be able to process independent fragments and vertices, but also can
process many of them in parallel. This comes handy when the programmer needs to process many vertices or
fragments in the same way. In this logic, GPUs are stream processors also called as stream at once. Stream can
be said as collection of records that requires similar kind of computation.
Abstract: In this paper we describe about the novel implementations of depth estimation from a stereo
images using feature extraction algorithms that run on the graphics processing unit (GPU) which is
suitable for real time applications like analyzing video in real-time vision systems. Modern graphics
cards contain large number of parallel processors and high-bandwidth memory for accelerating the
processing of data computation operations. In this paper we give general idea of how to accelerate the
real time application using heterogeneous platforms. We have proposed to use some added resources to
grasp more computationally involved optimization methods. This proposed approach will indirectly
accelerate a database by producing better plan quality.
Index Terms: Graphics processing unit (GPU), depth estimation, and video analysis.
Accelerating Real Time Applications on heterogeneous Platforms
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 51 |
Figure 2: GPU Architecture
The 8800 is composed of 128 stream processors each rotating at the rate of 1350 Mhz. Each processor is capable
to do calculation like MAD and MUL per clock cycle. Each processor need 4 cycles for executing specials
instructions like EXP, LOG, RCP, RSQ, SIN, COS which is managed by an additional unit.
III. OPEN COMPUTING LANGUAGE
Open CL is popularly known as one of the free of royalty standard for general purpose parallel
programming across CPUs, GPUs and other processors, giving software developer’s transportable and capable
access to the power of these assorted processing platforms. OpenCL consists of an application programming
interface for coordinating parallel computation across heterogeneous processors and a cross-platform
programming language with a well précised computation background.
Standards of Open CL:
 It supports data-based and task-based parallel programming models.
 It utilizes a rift of ISO C99 with extensions for parallelism.
 It defines steady numerical supplies based on IEEE 754.
 It defines a configuration report for handheld and embedded devices.
 It efficiently interoperates with OpenGL, OpenGL ES and other graphics APIs.
OPENCL memory model
Open CL framework provides the distributed memory. As shown in figure 3, each processor has its own private
memory and each core has its own local memory.
Figure 3: Architecture of Conceptual Open CL device having processing elements (PE), computing units and
devices.
Accelerating Real Time Applications on heterogeneous Platforms
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 52 |
IV. TEST CASE FOR SERIAL VS PARALLEL CODE
In this section we give the basic comparison of serial and parallel code performance. If we take simple
example of matrix multiplication in which each element is computed sequentially from the row of the first
matrix and column of the second matrix. Same matrix multiplication can be done on parallel environment
because there will be no previous data dependency. The below discussion give knowledge how to write parallel
code using Open CL.
Parallel code of matrix multiplication is ported on Storm which is a tool of STMicroelectronics, which provide
the parallel environment with 64 parallel processing units.
No. of Data
Elements per
matrix
No. of
Cycles
required for
sequential
code to run
No. of
Cycles
required for
parallel code
to run
Speed Up
16x16 1654850 32230 51
32x32 13185830 212402 62
48x48 44479008 701355 63
64x64 105413896 1653468 63
128x128 843161792 13180778 64
256x256 6744844288 105394568 64
512x512 53957111808 843086272 64
Table 2: Comparison table of Serial vs Parallel code of Matrix multiplication.
Tools have got 4 clusters and each cluster contains 16 processing elements (cores). Above table give comparison
chart of matrix multiplication using Storm. If we take the advantage of local memory and memory bandwidth
for accessing the data from host part to device part then we can get better speedup i.e., speed of data transfer
from global memory to compute device is slower than the speed of data transfer from local memory to compute
device. Because local memory is more near to clusters and it is local to each clusters. Memory coalscaling is
another feature of parallel processing. Suppose my bit transfer capacity from host to device is 32 bit’s then why
should I transfer 8 bit at a time. Fetch 32 bit’s at a time and store it on device local memory until four bytes
process.
No. of Data
Elements per
matrix
No. of
Cycles using
Global
Memory
No. of
Cycles using
Local
Memory
Speed Up
16x16 32230 20477 1.57
32x32 212402 114949 1.85
48x48 701355 368612 1.90
64x64 1653468 860118 1.92
Table 3: Comparison table of global vs serial memory for Matrix multiplication.
Figure 4: Cycles Needs for Global Vs. Local Memory for Matrix Multiplication
Accelerating Real Time Applications on heterogeneous Platforms
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 53 |
V. REAL TIME APPLICATION
Real time applications like depth estimation for stereo images, video analytics for survellience and etc.
are the good applications on image processing technology where repetitive work on pixels is needed. For this we
are concentrating on throughput rather than the latency. Filtering and/or convolution for images are one of the
important modules for depth estimation or in video analytics. In below discussion we briefly discuss
convolution and result analysis after porting it on NVIDIA’s Quadro FX 4600 and Storm tool of
STMicroelectronics.
Convolution
Kernel: It is a collection of numbers in the matrix form which is used in convolution of images. They
are available in different size having different number patterns producing different results. The dimension of
each kernel is subjective but most often 3x3 is preferred.
A convolution is usually performed by doing the multiplication of a pixel and its adjacent pixels color value by a
matrix.
Convolve is likely used to Enhance, Sharpen or smoothing the image.
We have the formulae for doing convolution of image f(x, y) and kernel k(x, y) represented as
f(x,y) * k(x,y)= 





f(u,v) k(x-u,y-v)dudv ….(1)
In discrete form,
f(x,y)*k(x,y)= 


1
0
w
i



1
0
H
j
f(i,j) k(x-i, y-j) …..(2)
W, H is the width and height of image respectively.
Figure 5: Masking on image.
VI. RESULT ANALYSIS
Function Name Time
require
for
assembly
code
(ms)
Time
require
for
C/C++
code
(ms)
Time
require
for
Parallel
code
(ms)
convolve_cols_3x3 50 360 136
convolve_121_row_3x3_16bit 10 50 30
convolve_101_row_3x3_16bit 20 60 23
Table 4: Time analysis for assembly, serial and parallel code (using NVIDIA device).
Accelerating Real Time Applications on heterogeneous Platforms
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 54 |
Figure 6: Comparison graph for table 4
Function Name Time require
for Assembly
Code (ms)
Time
require
for
C/C++
Code
(ms)
Time
require for
Parallel
Code on
xp70(ms)
convolve_cols_3x3 50 360 128
convolve_121_row_
3x3_16bit
10 50 28
convolve_101_row_
3x3_16bit
20 60 20
Table 6: Time analysis for assembly, serial and parallel code (using Sthorm device).
VII. CONCLUSION AND FUTURE WORK
In this paper we have discussed the behavior of serial and parallel code with respect to different
heterogeneous platforms. OpenCL has lot of task like getting a platform, kernel setting, memory/buffer
initialization, filling the read buffer with input data, taking the device information, setting the work group and
work items etc.. Due to this, performance results are not expected for small size data. By analyzing the
comparison tables, we can accelerate the sequential code with the help of parallel computing elements on
heterogeneous platforms.
In future related work, one can apply the optimization technique like memory co-aliasing, vector data
types and pipeline futures on complex real time applications.
REFERENCES
[1] I. Blthoff, H. Blthoff, and P. Sinha. “Topdown influences on stereoscopic depth-perception”. Nature Neuroscience,
1:254.257.
[2] S. Das and N. Ahuja. “Performance analysis of stereo, vergence, and focus as depth cues for active vision”. IEEE
Trans PAMI, 17:1213.1219.
[3] A. Smolic and P. Kauff, “Interactive 3-D video representation and coding technologies,” Proc. IEEE, Special Issue
on Advances in Video Coding and Delivery, vol. 93, no. 1, pp. 98–110, Jan. 2005.
[4] K. Mueller, P.Merkle, H. Schwarz, T. Hinz, A. Smolic, T. Oelbaum, and T. Wiegand, “Multi-view video coding
based on H.264.MPEG4-AVC using hierarchical B pictures,” in Proc. PCS’ 2006,
Apr. 2006, paper SS3–3. [Online.] http://www.engineeringvillage2.org.
[5] Niem W, Buschmann R, ”Automatic Modelling of 3D Natural Objects fromMultiple Views”, European Workshop
on Combined real and synthetic image processing for broadcast and video productions, 23–24. 11. 1994, Hamburg,
Germany.
[6] L. Falkenhagen, A. Kopernik, M. Strintzis, ”Disparity Estimation based on 3D Arbi- trarily Shaped Regions”,
RACE Project Deliverable, R2045/UH/DS/P/023/b1.
[7] Y. Yakimovski, R. Cunningham, ”A System for Extracting 3D Measurements from a Stereo Pair of TV Cameras”,
CGVIP, Vol. 7, 1978, pp. 195 – 210.
[8] Koch, R., ”3–D Scene Modelling from Stereoscopic Image Sequences”, European Workshop on Combined real
and synthetic image processing for broadcast and video productions, 23–24. 11. 1994, Hamburg, Germany.
Accelerating Real Time Applications on heterogeneous Platforms
| IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 55 |
Vivek Vanpariya received his B.E degree from A.D Patel institute of Technology affiliated to Sardar Patel
University, Gujrat, India. He did his M. Tech in Computer Science and Engineering from VIT University
Vellore, India. His area of interests includes Computer Network, Algorithms and Operating Systems.
Ramani S received his M.Tech degree from Bharathidasan Institute of Technology affiliated to Bharathidasan
University, TamilNadu, India. He is pursuing his Ph. D from VIT University Vellore, India. His area of interests
includes Database, Web Technologies and Data Mining.

Weitere ähnliche Inhalte

Was ist angesagt?

Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...
Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...
Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...Ilham Amezzane
 
PhD defense talk (portfolio of my expertise)
PhD defense talk (portfolio of my expertise)PhD defense talk (portfolio of my expertise)
PhD defense talk (portfolio of my expertise)Gernot Ziegler
 
Architecture and Performance of Runtime Environments for Data Intensive Scala...
Architecture and Performance of Runtime Environments for Data Intensive Scala...Architecture and Performance of Runtime Environments for Data Intensive Scala...
Architecture and Performance of Runtime Environments for Data Intensive Scala...jaliyae
 
Spine net learning scale permuted backbone for recognition and localization
Spine net learning scale permuted backbone for recognition and localizationSpine net learning scale permuted backbone for recognition and localization
Spine net learning scale permuted backbone for recognition and localizationDevansh16
 
(Im2col)accelerating deep neural networks on low power heterogeneous architec...
(Im2col)accelerating deep neural networks on low power heterogeneous architec...(Im2col)accelerating deep neural networks on low power heterogeneous architec...
(Im2col)accelerating deep neural networks on low power heterogeneous architec...Bomm Kim
 
Real Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCLReal Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCLcsandit
 
AI On the Edge: Model Compression
AI On the Edge: Model CompressionAI On the Edge: Model Compression
AI On the Edge: Model CompressionApache MXNet
 
Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...
Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...
Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...Förderverein Technische Fakultät
 
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
AI optimizing HPC simulations (presentation from  6th EULAG Workshop)AI optimizing HPC simulations (presentation from  6th EULAG Workshop)
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)byteLAKE
 
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation..."Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...Edge AI and Vision Alliance
 
Example uses of gpu compute models
Example uses of gpu compute modelsExample uses of gpu compute models
Example uses of gpu compute modelsPedram Mazloom
 
AVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUS
AVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUSAVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUS
AVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUScsandit
 
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)Shunta Saito
 
Big data 2.0, deep learning and financial Usecases
Big data 2.0, deep learning and financial UsecasesBig data 2.0, deep learning and financial Usecases
Big data 2.0, deep learning and financial UsecasesArvind Rapaka
 
Hybrid Model Based Testing Tool Architecture for Exascale Computing System
Hybrid Model Based Testing Tool Architecture for Exascale Computing SystemHybrid Model Based Testing Tool Architecture for Exascale Computing System
Hybrid Model Based Testing Tool Architecture for Exascale Computing SystemCSCJournals
 
Computer architecture pptx
Computer architecture pptxComputer architecture pptx
Computer architecture pptxMDSHABBIR12
 
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...Tahmid Abtahi
 
HPC with Clouds and Cloud Technologies
HPC with Clouds and Cloud TechnologiesHPC with Clouds and Cloud Technologies
HPC with Clouds and Cloud TechnologiesInderjeet Singh
 

Was ist angesagt? (19)

Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...
Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...
Hardware Acceleration of SVM Training for Real-time Embedded Systems: An Over...
 
PhD defense talk (portfolio of my expertise)
PhD defense talk (portfolio of my expertise)PhD defense talk (portfolio of my expertise)
PhD defense talk (portfolio of my expertise)
 
Architecture and Performance of Runtime Environments for Data Intensive Scala...
Architecture and Performance of Runtime Environments for Data Intensive Scala...Architecture and Performance of Runtime Environments for Data Intensive Scala...
Architecture and Performance of Runtime Environments for Data Intensive Scala...
 
An35225228
An35225228An35225228
An35225228
 
Spine net learning scale permuted backbone for recognition and localization
Spine net learning scale permuted backbone for recognition and localizationSpine net learning scale permuted backbone for recognition and localization
Spine net learning scale permuted backbone for recognition and localization
 
(Im2col)accelerating deep neural networks on low power heterogeneous architec...
(Im2col)accelerating deep neural networks on low power heterogeneous architec...(Im2col)accelerating deep neural networks on low power heterogeneous architec...
(Im2col)accelerating deep neural networks on low power heterogeneous architec...
 
Real Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCLReal Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCL
 
AI On the Edge: Model Compression
AI On the Edge: Model CompressionAI On the Edge: Model Compression
AI On the Edge: Model Compression
 
Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...
Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...
Intra-coding using non-linear prediction, KLT and Texture Synthesis: AV1 enco...
 
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
AI optimizing HPC simulations (presentation from  6th EULAG Workshop)AI optimizing HPC simulations (presentation from  6th EULAG Workshop)
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
 
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation..."Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
 
Example uses of gpu compute models
Example uses of gpu compute modelsExample uses of gpu compute models
Example uses of gpu compute models
 
AVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUS
AVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUSAVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUS
AVOIDING DUPLICATED COMPUTATION TO IMPROVE THE PERFORMANCE OF PFSP ON CUDA GPUS
 
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
[unofficial] Pyramid Scene Parsing Network (CVPR 2017)
 
Big data 2.0, deep learning and financial Usecases
Big data 2.0, deep learning and financial UsecasesBig data 2.0, deep learning and financial Usecases
Big data 2.0, deep learning and financial Usecases
 
Hybrid Model Based Testing Tool Architecture for Exascale Computing System
Hybrid Model Based Testing Tool Architecture for Exascale Computing SystemHybrid Model Based Testing Tool Architecture for Exascale Computing System
Hybrid Model Based Testing Tool Architecture for Exascale Computing System
 
Computer architecture pptx
Computer architecture pptxComputer architecture pptx
Computer architecture pptx
 
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
 
HPC with Clouds and Cloud Technologies
HPC with Clouds and Cloud TechnologiesHPC with Clouds and Cloud Technologies
HPC with Clouds and Cloud Technologies
 

Andere mochten auch

Secure Multi-Owner Group Signature Based Secure M-Health Records in Cloud
Secure Multi-Owner Group Signature Based Secure M-Health  Records in Cloud Secure Multi-Owner Group Signature Based Secure M-Health  Records in Cloud
Secure Multi-Owner Group Signature Based Secure M-Health Records in Cloud IJMER
 
Calculating Voltage Instability Using Index Analysis in Radial Distribution ...
Calculating Voltage Instability Using Index Analysis in Radial  Distribution ...Calculating Voltage Instability Using Index Analysis in Radial  Distribution ...
Calculating Voltage Instability Using Index Analysis in Radial Distribution ...IJMER
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276IJMER
 
An efficient model for design of 64-bit High Speed Parallel Prefix VLSI adder
An efficient model for design of 64-bit High Speed Parallel Prefix VLSI adderAn efficient model for design of 64-bit High Speed Parallel Prefix VLSI adder
An efficient model for design of 64-bit High Speed Parallel Prefix VLSI adderIJMER
 
Design of Shaft Is an Important Tool in Mercerization Machine
Design of Shaft Is an Important Tool in Mercerization Machine Design of Shaft Is an Important Tool in Mercerization Machine
Design of Shaft Is an Important Tool in Mercerization Machine IJMER
 
Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...
Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...
Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...IJMER
 
Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...
Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...
Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...IJMER
 
Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...
Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...
Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...IJMER
 
Total Harmonic Distortion Alleviation by using Shunt Active Filter
Total Harmonic Distortion Alleviation by using Shunt Active FilterTotal Harmonic Distortion Alleviation by using Shunt Active Filter
Total Harmonic Distortion Alleviation by using Shunt Active FilterIJMER
 
Design and Implementation of wireless heart monitor for expectant mothers in ...
Design and Implementation of wireless heart monitor for expectant mothers in ...Design and Implementation of wireless heart monitor for expectant mothers in ...
Design and Implementation of wireless heart monitor for expectant mothers in ...IJMER
 
Af32633636
Af32633636Af32633636
Af32633636IJMER
 
Design &Analysis of Waste Heat Recovery System for Domestic Refrigerator
Design &Analysis of Waste Heat Recovery System for Domestic  RefrigeratorDesign &Analysis of Waste Heat Recovery System for Domestic  Refrigerator
Design &Analysis of Waste Heat Recovery System for Domestic RefrigeratorIJMER
 
Secure Ticket- Based Anonymity and Traceability in Wireless Mesh Networks
Secure Ticket- Based Anonymity and Traceability in Wireless Mesh NetworksSecure Ticket- Based Anonymity and Traceability in Wireless Mesh Networks
Secure Ticket- Based Anonymity and Traceability in Wireless Mesh NetworksIJMER
 
Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...
Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...
Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...IJMER
 
Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...
Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...
Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...IJMER
 
Developing Infrared Controlled Automated Door System
Developing Infrared Controlled Automated Door SystemDeveloping Infrared Controlled Automated Door System
Developing Infrared Controlled Automated Door SystemIJMER
 
Seismic Assessment of Existing Bridge Using OPENSEES
Seismic Assessment of Existing Bridge Using OPENSEESSeismic Assessment of Existing Bridge Using OPENSEES
Seismic Assessment of Existing Bridge Using OPENSEESIJMER
 
Ijmer 46067782
Ijmer 46067782Ijmer 46067782
Ijmer 46067782IJMER
 
An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...
An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...
An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...IJMER
 
Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...
Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...
Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...IJMER
 

Andere mochten auch (20)

Secure Multi-Owner Group Signature Based Secure M-Health Records in Cloud
Secure Multi-Owner Group Signature Based Secure M-Health  Records in Cloud Secure Multi-Owner Group Signature Based Secure M-Health  Records in Cloud
Secure Multi-Owner Group Signature Based Secure M-Health Records in Cloud
 
Calculating Voltage Instability Using Index Analysis in Radial Distribution ...
Calculating Voltage Instability Using Index Analysis in Radial  Distribution ...Calculating Voltage Instability Using Index Analysis in Radial  Distribution ...
Calculating Voltage Instability Using Index Analysis in Radial Distribution ...
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276
 
An efficient model for design of 64-bit High Speed Parallel Prefix VLSI adder
An efficient model for design of 64-bit High Speed Parallel Prefix VLSI adderAn efficient model for design of 64-bit High Speed Parallel Prefix VLSI adder
An efficient model for design of 64-bit High Speed Parallel Prefix VLSI adder
 
Design of Shaft Is an Important Tool in Mercerization Machine
Design of Shaft Is an Important Tool in Mercerization Machine Design of Shaft Is an Important Tool in Mercerization Machine
Design of Shaft Is an Important Tool in Mercerization Machine
 
Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...
Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...
Effect of artificial roughness on Thermal and Thermohydraulic efficiency in R...
 
Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...
Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...
Design Of Hand-Held Alert System Providing Security For Individuals Using Veh...
 
Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...
Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...
Improving the Stability of Cascaded DC Power Supply System by Adaptive Active...
 
Total Harmonic Distortion Alleviation by using Shunt Active Filter
Total Harmonic Distortion Alleviation by using Shunt Active FilterTotal Harmonic Distortion Alleviation by using Shunt Active Filter
Total Harmonic Distortion Alleviation by using Shunt Active Filter
 
Design and Implementation of wireless heart monitor for expectant mothers in ...
Design and Implementation of wireless heart monitor for expectant mothers in ...Design and Implementation of wireless heart monitor for expectant mothers in ...
Design and Implementation of wireless heart monitor for expectant mothers in ...
 
Af32633636
Af32633636Af32633636
Af32633636
 
Design &Analysis of Waste Heat Recovery System for Domestic Refrigerator
Design &Analysis of Waste Heat Recovery System for Domestic  RefrigeratorDesign &Analysis of Waste Heat Recovery System for Domestic  Refrigerator
Design &Analysis of Waste Heat Recovery System for Domestic Refrigerator
 
Secure Ticket- Based Anonymity and Traceability in Wireless Mesh Networks
Secure Ticket- Based Anonymity and Traceability in Wireless Mesh NetworksSecure Ticket- Based Anonymity and Traceability in Wireless Mesh Networks
Secure Ticket- Based Anonymity and Traceability in Wireless Mesh Networks
 
Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...
Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...
Investigations of Surface Dissolution on Fatigue Crack Nucleation in Ni-22Cr-...
 
Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...
Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...
Tannin gel derived from Leaves of Ricinus Communis as an adsorbent for the Re...
 
Developing Infrared Controlled Automated Door System
Developing Infrared Controlled Automated Door SystemDeveloping Infrared Controlled Automated Door System
Developing Infrared Controlled Automated Door System
 
Seismic Assessment of Existing Bridge Using OPENSEES
Seismic Assessment of Existing Bridge Using OPENSEESSeismic Assessment of Existing Bridge Using OPENSEES
Seismic Assessment of Existing Bridge Using OPENSEES
 
Ijmer 46067782
Ijmer 46067782Ijmer 46067782
Ijmer 46067782
 
An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...
An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...
An Optimal Cooperative Provable Data Possession Scheme for Distributed Cloud ...
 
Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...
Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...
Effect of Radiation on Mixed Convection Flow of a Non-Newtonian Nan fluid ove...
 

Ähnlich wie Accelerating Real Time Applications on Heterogeneous Platforms

Performance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using openclPerformance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using opencleSAT Publishing House
 
IRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET Journal
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONScseij
 
Hybrid Multicore Computing : NOTES
Hybrid Multicore Computing : NOTESHybrid Multicore Computing : NOTES
Hybrid Multicore Computing : NOTESSubhajit Sahu
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY cscpconf
 
Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8AbdullahMunir32
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYcsandit
 
Multi Processor Architecture for image processing
Multi Processor Architecture for image processingMulti Processor Architecture for image processing
Multi Processor Architecture for image processingideas2ignite
 
Image Processing Application on Graphics processors
Image Processing Application on Graphics processorsImage Processing Application on Graphics processors
Image Processing Application on Graphics processorsCSCJournals
 
An Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIM
An Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIMAn Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIM
An Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIMjournalBEEI
 
Conference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environmentConference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environmentEricsson
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Editor IJARCET
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Editor IJARCET
 
Design and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGADesign and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGAIJERA Editor
 
An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...eSAT Publishing House
 
Performance evaluation of ecc in single and multi( eliptic curve)
Performance evaluation of ecc in single and multi( eliptic curve)Performance evaluation of ecc in single and multi( eliptic curve)
Performance evaluation of ecc in single and multi( eliptic curve)Danilo Calle
 
Effective Sparse Matrix Representation for the GPU Architectures
 Effective Sparse Matrix Representation for the GPU Architectures Effective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU ArchitecturesIJCSEA Journal
 
Effective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU ArchitecturesEffective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU ArchitecturesIJCSEA Journal
 

Ähnlich wie Accelerating Real Time Applications on Heterogeneous Platforms (20)

Performance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using openclPerformance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using opencl
 
IRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CL
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
 
Hybrid Multicore Computing : NOTES
Hybrid Multicore Computing : NOTESHybrid Multicore Computing : NOTES
Hybrid Multicore Computing : NOTES
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
 
Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
 
Multi Processor Architecture for image processing
Multi Processor Architecture for image processingMulti Processor Architecture for image processing
Multi Processor Architecture for image processing
 
Image Processing Application on Graphics processors
Image Processing Application on Graphics processorsImage Processing Application on Graphics processors
Image Processing Application on Graphics processors
 
An Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIM
An Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIMAn Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIM
An Enhanced FPGA Based Asynchronous Microprocessor Design Using VIVADO and ISIM
 
Conference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environmentConference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environment
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
Design and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGADesign and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGA
 
An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...
 
Performance evaluation of ecc in single and multi( eliptic curve)
Performance evaluation of ecc in single and multi( eliptic curve)Performance evaluation of ecc in single and multi( eliptic curve)
Performance evaluation of ecc in single and multi( eliptic curve)
 
Open power ddl and lms
Open power ddl and lmsOpen power ddl and lms
Open power ddl and lms
 
20120140505010
2012014050501020120140505010
20120140505010
 
Effective Sparse Matrix Representation for the GPU Architectures
 Effective Sparse Matrix Representation for the GPU Architectures Effective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU Architectures
 
Effective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU ArchitecturesEffective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU Architectures
 

Mehr von IJMER

A Study on Translucent Concrete Product and Its Properties by Using Optical F...
A Study on Translucent Concrete Product and Its Properties by Using Optical F...A Study on Translucent Concrete Product and Its Properties by Using Optical F...
A Study on Translucent Concrete Product and Its Properties by Using Optical F...IJMER
 
Developing Cost Effective Automation for Cotton Seed Delinting
Developing Cost Effective Automation for Cotton Seed DelintingDeveloping Cost Effective Automation for Cotton Seed Delinting
Developing Cost Effective Automation for Cotton Seed DelintingIJMER
 
Study & Testing Of Bio-Composite Material Based On Munja Fibre
Study & Testing Of Bio-Composite Material Based On Munja FibreStudy & Testing Of Bio-Composite Material Based On Munja Fibre
Study & Testing Of Bio-Composite Material Based On Munja FibreIJMER
 
Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)
Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)
Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)IJMER
 
Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...
Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...
Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...IJMER
 
Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...
Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...
Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...IJMER
 
Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...
Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...
Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...IJMER
 
Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...
Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...
Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...IJMER
 
Static Analysis of Go-Kart Chassis by Analytical and Solid Works Simulation
Static Analysis of Go-Kart Chassis by Analytical and Solid Works SimulationStatic Analysis of Go-Kart Chassis by Analytical and Solid Works Simulation
Static Analysis of Go-Kart Chassis by Analytical and Solid Works SimulationIJMER
 
High Speed Effortless Bicycle
High Speed Effortless BicycleHigh Speed Effortless Bicycle
High Speed Effortless BicycleIJMER
 
Integration of Struts & Spring & Hibernate for Enterprise Applications
Integration of Struts & Spring & Hibernate for Enterprise ApplicationsIntegration of Struts & Spring & Hibernate for Enterprise Applications
Integration of Struts & Spring & Hibernate for Enterprise ApplicationsIJMER
 
Microcontroller Based Automatic Sprinkler Irrigation System
Microcontroller Based Automatic Sprinkler Irrigation SystemMicrocontroller Based Automatic Sprinkler Irrigation System
Microcontroller Based Automatic Sprinkler Irrigation SystemIJMER
 
On some locally closed sets and spaces in Ideal Topological Spaces
On some locally closed sets and spaces in Ideal Topological SpacesOn some locally closed sets and spaces in Ideal Topological Spaces
On some locally closed sets and spaces in Ideal Topological SpacesIJMER
 
Intrusion Detection and Forensics based on decision tree and Association rule...
Intrusion Detection and Forensics based on decision tree and Association rule...Intrusion Detection and Forensics based on decision tree and Association rule...
Intrusion Detection and Forensics based on decision tree and Association rule...IJMER
 
Natural Language Ambiguity and its Effect on Machine Learning
Natural Language Ambiguity and its Effect on Machine LearningNatural Language Ambiguity and its Effect on Machine Learning
Natural Language Ambiguity and its Effect on Machine LearningIJMER
 
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcessEvolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcessIJMER
 
Material Parameter and Effect of Thermal Load on Functionally Graded Cylinders
Material Parameter and Effect of Thermal Load on Functionally Graded CylindersMaterial Parameter and Effect of Thermal Load on Functionally Graded Cylinders
Material Parameter and Effect of Thermal Load on Functionally Graded CylindersIJMER
 
Studies On Energy Conservation And Audit
Studies On Energy Conservation And AuditStudies On Energy Conservation And Audit
Studies On Energy Conservation And AuditIJMER
 
An Implementation of I2C Slave Interface using Verilog HDL
An Implementation of I2C Slave Interface using Verilog HDLAn Implementation of I2C Slave Interface using Verilog HDL
An Implementation of I2C Slave Interface using Verilog HDLIJMER
 
Discrete Model of Two Predators competing for One Prey
Discrete Model of Two Predators competing for One PreyDiscrete Model of Two Predators competing for One Prey
Discrete Model of Two Predators competing for One PreyIJMER
 

Mehr von IJMER (20)

A Study on Translucent Concrete Product and Its Properties by Using Optical F...
A Study on Translucent Concrete Product and Its Properties by Using Optical F...A Study on Translucent Concrete Product and Its Properties by Using Optical F...
A Study on Translucent Concrete Product and Its Properties by Using Optical F...
 
Developing Cost Effective Automation for Cotton Seed Delinting
Developing Cost Effective Automation for Cotton Seed DelintingDeveloping Cost Effective Automation for Cotton Seed Delinting
Developing Cost Effective Automation for Cotton Seed Delinting
 
Study & Testing Of Bio-Composite Material Based On Munja Fibre
Study & Testing Of Bio-Composite Material Based On Munja FibreStudy & Testing Of Bio-Composite Material Based On Munja Fibre
Study & Testing Of Bio-Composite Material Based On Munja Fibre
 
Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)
Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)
Hybrid Engine (Stirling Engine + IC Engine + Electric Motor)
 
Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...
Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...
Fabrication & Characterization of Bio Composite Materials Based On Sunnhemp F...
 
Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...
Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...
Geochemistry and Genesis of Kammatturu Iron Ores of Devagiri Formation, Sandu...
 
Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...
Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...
Experimental Investigation on Characteristic Study of the Carbon Steel C45 in...
 
Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...
Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...
Non linear analysis of Robot Gun Support Structure using Equivalent Dynamic A...
 
Static Analysis of Go-Kart Chassis by Analytical and Solid Works Simulation
Static Analysis of Go-Kart Chassis by Analytical and Solid Works SimulationStatic Analysis of Go-Kart Chassis by Analytical and Solid Works Simulation
Static Analysis of Go-Kart Chassis by Analytical and Solid Works Simulation
 
High Speed Effortless Bicycle
High Speed Effortless BicycleHigh Speed Effortless Bicycle
High Speed Effortless Bicycle
 
Integration of Struts & Spring & Hibernate for Enterprise Applications
Integration of Struts & Spring & Hibernate for Enterprise ApplicationsIntegration of Struts & Spring & Hibernate for Enterprise Applications
Integration of Struts & Spring & Hibernate for Enterprise Applications
 
Microcontroller Based Automatic Sprinkler Irrigation System
Microcontroller Based Automatic Sprinkler Irrigation SystemMicrocontroller Based Automatic Sprinkler Irrigation System
Microcontroller Based Automatic Sprinkler Irrigation System
 
On some locally closed sets and spaces in Ideal Topological Spaces
On some locally closed sets and spaces in Ideal Topological SpacesOn some locally closed sets and spaces in Ideal Topological Spaces
On some locally closed sets and spaces in Ideal Topological Spaces
 
Intrusion Detection and Forensics based on decision tree and Association rule...
Intrusion Detection and Forensics based on decision tree and Association rule...Intrusion Detection and Forensics based on decision tree and Association rule...
Intrusion Detection and Forensics based on decision tree and Association rule...
 
Natural Language Ambiguity and its Effect on Machine Learning
Natural Language Ambiguity and its Effect on Machine LearningNatural Language Ambiguity and its Effect on Machine Learning
Natural Language Ambiguity and its Effect on Machine Learning
 
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcessEvolvea Frameworkfor SelectingPrime Software DevelopmentProcess
Evolvea Frameworkfor SelectingPrime Software DevelopmentProcess
 
Material Parameter and Effect of Thermal Load on Functionally Graded Cylinders
Material Parameter and Effect of Thermal Load on Functionally Graded CylindersMaterial Parameter and Effect of Thermal Load on Functionally Graded Cylinders
Material Parameter and Effect of Thermal Load on Functionally Graded Cylinders
 
Studies On Energy Conservation And Audit
Studies On Energy Conservation And AuditStudies On Energy Conservation And Audit
Studies On Energy Conservation And Audit
 
An Implementation of I2C Slave Interface using Verilog HDL
An Implementation of I2C Slave Interface using Verilog HDLAn Implementation of I2C Slave Interface using Verilog HDL
An Implementation of I2C Slave Interface using Verilog HDL
 
Discrete Model of Two Predators competing for One Prey
Discrete Model of Two Predators competing for One PreyDiscrete Model of Two Predators competing for One Prey
Discrete Model of Two Predators competing for One Prey
 

Kürzlich hochgeladen

KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 

Kürzlich hochgeladen (20)

Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 

Accelerating Real Time Applications on Heterogeneous Platforms

  • 1. International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 50 | Accelerating Real Time Applications on Heterogeneous Platforms Vivek Vanpariya1 , Prof. S. Ramani2 1, 2 (SCSE, VIT University, Vellore, India) I. INTRODUCTION General purpose CPU performance scaling has opened up a new direction for computer scientist and architect. Scaling performance of new generation GP-CPU has not as predicted by Gordon moor. After observing it closely the throughput for sequential code in GP-CPU is getting fluctuated between subsequent processor generations. This problem has motivated the scientist to innovate the high performance and cost effective hardware. As a solution computer scientist comes up with a low level GP-GPU core which has features like out of order execution and lack branch prediction to optimize the sequential code [1]. Khronos has a framework and standard set for heterogeneous parallel computing on cross-vendor and cross-platform hardware [7]. It provides an abstraction layer via which code can scale from simple embedded microcontrollers to GP- CPU’s from Intel and AMD, up to massively-parallel GP-GPU hardware pipelines, all without modifying code [8]. Most important task of graphical processing unit (GPU) is to accelerate the graphical applications. It contains large amount of streaming processors for commutations when compared to Central Processing Unit (CPU). Also GPU have high-bandwidth for data transfer. Graphics processing unit’s hasty increase in both programmability and capability has spawned a researcher’s community that has effectively mapped a broad range of computationally demanding, complex problems to the GPU. In other words GPU computing is using it as to boost up central processing units for general-purpose scientific and engineering computations. It can be achieved by offloading some of the intensive and overwhelming portions of the code and the remaining part of the code will remain in the CPU. From user’s view point, the application run more rapidly because of the extremely parallel processing power of the general purpose unit to boost performance which is well- known as “heterogeneous” or “hybrid” computing. Some critical factors like power, cost, bandwidth on which we have to focus when we working on GPU. Figure 1: CPU vs. GPU II. GPU ARCHITECTURE Graphics processing units will be able to process independent fragments and vertices, but also can process many of them in parallel. This comes handy when the programmer needs to process many vertices or fragments in the same way. In this logic, GPUs are stream processors also called as stream at once. Stream can be said as collection of records that requires similar kind of computation. Abstract: In this paper we describe about the novel implementations of depth estimation from a stereo images using feature extraction algorithms that run on the graphics processing unit (GPU) which is suitable for real time applications like analyzing video in real-time vision systems. Modern graphics cards contain large number of parallel processors and high-bandwidth memory for accelerating the processing of data computation operations. In this paper we give general idea of how to accelerate the real time application using heterogeneous platforms. We have proposed to use some added resources to grasp more computationally involved optimization methods. This proposed approach will indirectly accelerate a database by producing better plan quality. Index Terms: Graphics processing unit (GPU), depth estimation, and video analysis.
  • 2. Accelerating Real Time Applications on heterogeneous Platforms | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 51 | Figure 2: GPU Architecture The 8800 is composed of 128 stream processors each rotating at the rate of 1350 Mhz. Each processor is capable to do calculation like MAD and MUL per clock cycle. Each processor need 4 cycles for executing specials instructions like EXP, LOG, RCP, RSQ, SIN, COS which is managed by an additional unit. III. OPEN COMPUTING LANGUAGE Open CL is popularly known as one of the free of royalty standard for general purpose parallel programming across CPUs, GPUs and other processors, giving software developer’s transportable and capable access to the power of these assorted processing platforms. OpenCL consists of an application programming interface for coordinating parallel computation across heterogeneous processors and a cross-platform programming language with a well précised computation background. Standards of Open CL:  It supports data-based and task-based parallel programming models.  It utilizes a rift of ISO C99 with extensions for parallelism.  It defines steady numerical supplies based on IEEE 754.  It defines a configuration report for handheld and embedded devices.  It efficiently interoperates with OpenGL, OpenGL ES and other graphics APIs. OPENCL memory model Open CL framework provides the distributed memory. As shown in figure 3, each processor has its own private memory and each core has its own local memory. Figure 3: Architecture of Conceptual Open CL device having processing elements (PE), computing units and devices.
  • 3. Accelerating Real Time Applications on heterogeneous Platforms | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 52 | IV. TEST CASE FOR SERIAL VS PARALLEL CODE In this section we give the basic comparison of serial and parallel code performance. If we take simple example of matrix multiplication in which each element is computed sequentially from the row of the first matrix and column of the second matrix. Same matrix multiplication can be done on parallel environment because there will be no previous data dependency. The below discussion give knowledge how to write parallel code using Open CL. Parallel code of matrix multiplication is ported on Storm which is a tool of STMicroelectronics, which provide the parallel environment with 64 parallel processing units. No. of Data Elements per matrix No. of Cycles required for sequential code to run No. of Cycles required for parallel code to run Speed Up 16x16 1654850 32230 51 32x32 13185830 212402 62 48x48 44479008 701355 63 64x64 105413896 1653468 63 128x128 843161792 13180778 64 256x256 6744844288 105394568 64 512x512 53957111808 843086272 64 Table 2: Comparison table of Serial vs Parallel code of Matrix multiplication. Tools have got 4 clusters and each cluster contains 16 processing elements (cores). Above table give comparison chart of matrix multiplication using Storm. If we take the advantage of local memory and memory bandwidth for accessing the data from host part to device part then we can get better speedup i.e., speed of data transfer from global memory to compute device is slower than the speed of data transfer from local memory to compute device. Because local memory is more near to clusters and it is local to each clusters. Memory coalscaling is another feature of parallel processing. Suppose my bit transfer capacity from host to device is 32 bit’s then why should I transfer 8 bit at a time. Fetch 32 bit’s at a time and store it on device local memory until four bytes process. No. of Data Elements per matrix No. of Cycles using Global Memory No. of Cycles using Local Memory Speed Up 16x16 32230 20477 1.57 32x32 212402 114949 1.85 48x48 701355 368612 1.90 64x64 1653468 860118 1.92 Table 3: Comparison table of global vs serial memory for Matrix multiplication. Figure 4: Cycles Needs for Global Vs. Local Memory for Matrix Multiplication
  • 4. Accelerating Real Time Applications on heterogeneous Platforms | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 53 | V. REAL TIME APPLICATION Real time applications like depth estimation for stereo images, video analytics for survellience and etc. are the good applications on image processing technology where repetitive work on pixels is needed. For this we are concentrating on throughput rather than the latency. Filtering and/or convolution for images are one of the important modules for depth estimation or in video analytics. In below discussion we briefly discuss convolution and result analysis after porting it on NVIDIA’s Quadro FX 4600 and Storm tool of STMicroelectronics. Convolution Kernel: It is a collection of numbers in the matrix form which is used in convolution of images. They are available in different size having different number patterns producing different results. The dimension of each kernel is subjective but most often 3x3 is preferred. A convolution is usually performed by doing the multiplication of a pixel and its adjacent pixels color value by a matrix. Convolve is likely used to Enhance, Sharpen or smoothing the image. We have the formulae for doing convolution of image f(x, y) and kernel k(x, y) represented as f(x,y) * k(x,y)=       f(u,v) k(x-u,y-v)dudv ….(1) In discrete form, f(x,y)*k(x,y)=    1 0 w i    1 0 H j f(i,j) k(x-i, y-j) …..(2) W, H is the width and height of image respectively. Figure 5: Masking on image. VI. RESULT ANALYSIS Function Name Time require for assembly code (ms) Time require for C/C++ code (ms) Time require for Parallel code (ms) convolve_cols_3x3 50 360 136 convolve_121_row_3x3_16bit 10 50 30 convolve_101_row_3x3_16bit 20 60 23 Table 4: Time analysis for assembly, serial and parallel code (using NVIDIA device).
  • 5. Accelerating Real Time Applications on heterogeneous Platforms | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 54 | Figure 6: Comparison graph for table 4 Function Name Time require for Assembly Code (ms) Time require for C/C++ Code (ms) Time require for Parallel Code on xp70(ms) convolve_cols_3x3 50 360 128 convolve_121_row_ 3x3_16bit 10 50 28 convolve_101_row_ 3x3_16bit 20 60 20 Table 6: Time analysis for assembly, serial and parallel code (using Sthorm device). VII. CONCLUSION AND FUTURE WORK In this paper we have discussed the behavior of serial and parallel code with respect to different heterogeneous platforms. OpenCL has lot of task like getting a platform, kernel setting, memory/buffer initialization, filling the read buffer with input data, taking the device information, setting the work group and work items etc.. Due to this, performance results are not expected for small size data. By analyzing the comparison tables, we can accelerate the sequential code with the help of parallel computing elements on heterogeneous platforms. In future related work, one can apply the optimization technique like memory co-aliasing, vector data types and pipeline futures on complex real time applications. REFERENCES [1] I. Blthoff, H. Blthoff, and P. Sinha. “Topdown influences on stereoscopic depth-perception”. Nature Neuroscience, 1:254.257. [2] S. Das and N. Ahuja. “Performance analysis of stereo, vergence, and focus as depth cues for active vision”. IEEE Trans PAMI, 17:1213.1219. [3] A. Smolic and P. Kauff, “Interactive 3-D video representation and coding technologies,” Proc. IEEE, Special Issue on Advances in Video Coding and Delivery, vol. 93, no. 1, pp. 98–110, Jan. 2005. [4] K. Mueller, P.Merkle, H. Schwarz, T. Hinz, A. Smolic, T. Oelbaum, and T. Wiegand, “Multi-view video coding based on H.264.MPEG4-AVC using hierarchical B pictures,” in Proc. PCS’ 2006, Apr. 2006, paper SS3–3. [Online.] http://www.engineeringvillage2.org. [5] Niem W, Buschmann R, ”Automatic Modelling of 3D Natural Objects fromMultiple Views”, European Workshop on Combined real and synthetic image processing for broadcast and video productions, 23–24. 11. 1994, Hamburg, Germany. [6] L. Falkenhagen, A. Kopernik, M. Strintzis, ”Disparity Estimation based on 3D Arbi- trarily Shaped Regions”, RACE Project Deliverable, R2045/UH/DS/P/023/b1. [7] Y. Yakimovski, R. Cunningham, ”A System for Extracting 3D Measurements from a Stereo Pair of TV Cameras”, CGVIP, Vol. 7, 1978, pp. 195 – 210. [8] Koch, R., ”3–D Scene Modelling from Stereoscopic Image Sequences”, European Workshop on Combined real and synthetic image processing for broadcast and video productions, 23–24. 11. 1994, Hamburg, Germany.
  • 6. Accelerating Real Time Applications on heterogeneous Platforms | IJMER | ISSN: 2249–6645 | www.ijmer.com | Vol. 4 | Iss. 4 | Apr. 2014 | 55 | Vivek Vanpariya received his B.E degree from A.D Patel institute of Technology affiliated to Sardar Patel University, Gujrat, India. He did his M. Tech in Computer Science and Engineering from VIT University Vellore, India. His area of interests includes Computer Network, Algorithms and Operating Systems. Ramani S received his M.Tech degree from Bharathidasan Institute of Technology affiliated to Bharathidasan University, TamilNadu, India. He is pursuing his Ph. D from VIT University Vellore, India. His area of interests includes Database, Web Technologies and Data Mining.