20131024

•Als PPTX, PDF herunterladen•

0 gefällt mir•251 views

Jocelyn

2013/10/24 meeting at Delta R621

Technologie Bildung

WEEKLY REPORT
Thur., Oct 24, 2013
Pin Yi Tsai

OUTLINE
• Current Work
• Compute Integral Image – parallel version
• Why the difference is so implicit?
• An accidental Error

• In Process
• Compute 11 types of Features

COMPUTE INTEGRAL IMAGE – PARALLEL VERSION
• Computation and communication time

 input 16x16:
 serial version: 0.006336 ms
 for loop outside of kernel function:

 parallel version: 6.80778 ms
 for loop inside of kernel function:
 parallel version: 5.88559e-39 ms

COMPUTE INTEGRAL IMAGE (CONT.)
 input 640x480:

 serial version: 5.1607 ms
 parallel version: 4.94058 ms

AN ACCIDENTAL ERROR
• Occurred when copy data from Mat to one-dimension float array

0 175 175 175 175 175
0 174 174 174 174 174
6.78807e-29 175 175 175 175 175
0 175 175 175 175 175
0 175 175 175 175 175
6.79909e-29 134 151 158 136 142
0 138 132 135 140 135
6.80354e-29 136 136 143 142 137

AN ACCIDENTAL ERROR (CONT.)
• Why?

•

memset(ar,0,sizeof(float)*(image1.step+1)*(image1.rows+1));

• The size is not correct.

Empfohlen

ReactivePranav E K

PROCESS WARP祐司伊藤

Workshop on OSHWYann Chemin

DSD-INT 2014 - NGHS Workshop Scripting in SOBEK 3 & Delft3D Flexible Mesh - P...Deltares

20131107Jocelyn

20131002Jocelyn

Hedland Variable Area Flow MetersBell Flow Systems

Macnaught MX Series of Oval Gear Flow MetersBell Flow Systems

Empfohlen

ReactivePranav E K

PROCESS WARP祐司伊藤

Workshop on OSHWYann Chemin

DSD-INT 2014 - NGHS Workshop Scripting in SOBEK 3 & Delft3D Flexible Mesh - P...Deltares

20131107Jocelyn

20131002Jocelyn

Hedland Variable Area Flow MetersBell Flow Systems

Macnaught MX Series of Oval Gear Flow MetersBell Flow Systems

20131121Jocelyn

Blancett Turbine Flow MetersBell Flow Systems

20140109Jocelyn

20131219Jocelyn

Electromagnetic Flow Meters Overview (Badger Meter)Bell Flow Systems

Pricelist Zubit Life Care - PCD Pharma Company | PCD Pharma Franchise | Pharm...Zubit Life Care

The DXN Portable Ultrasonic Flow MeterBell Flow Systems

Vortex Flow Meters- Badger MeterBell Flow Systems

Hydraulic Turbine Flow Meters (flo-tech)Bell Flow Systems

VuHeat Ultrasonic Heat Meters DatasheetsBell Flow Systems

20131212Jocelyn

20131114Jocelyn

Etica del contadorMoltem8

토토추천 ＃＃＃too93.com＃＃＃ 토토추천 토토추천waqregfd

JEEVAN.KHOT_BE _MECH_ENGGjeevan khot

The Little Foxes College Night ReleaseMeara Brady

Teaching Summit - oGIP EP Backgrounds and Target MarketsBun Socheata

Declaración por escrito del Estado Plurinacional de Bolivia en la excepción p...Miguel Angel De La Roca

ไวยกรณ์ต้องรู้สำหรับสอบวัดระดับ3ラッキーね

NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflowNVIDIA Taiwan

Toronto meetup 20190917Bill Liu

Weitere ähnliche Inhalte

Andere mochten auch

20131121Jocelyn

Blancett Turbine Flow MetersBell Flow Systems

20140109Jocelyn

20131219Jocelyn

Electromagnetic Flow Meters Overview (Badger Meter)Bell Flow Systems

Pricelist Zubit Life Care - PCD Pharma Company | PCD Pharma Franchise | Pharm...Zubit Life Care

The DXN Portable Ultrasonic Flow MeterBell Flow Systems

Vortex Flow Meters- Badger MeterBell Flow Systems

Hydraulic Turbine Flow Meters (flo-tech)Bell Flow Systems

VuHeat Ultrasonic Heat Meters DatasheetsBell Flow Systems

20131212Jocelyn

20131114Jocelyn

Etica del contadorMoltem8

토토추천 ＃＃＃too93.com＃＃＃ 토토추천 토토추천waqregfd

JEEVAN.KHOT_BE _MECH_ENGGjeevan khot

The Little Foxes College Night ReleaseMeara Brady

Teaching Summit - oGIP EP Backgrounds and Target MarketsBun Socheata

Declaración por escrito del Estado Plurinacional de Bolivia en la excepción p...Miguel Angel De La Roca

ไวยกรณ์ต้องรู้สำหรับสอบวัดระดับ3ラッキーね

Andere mochten auch (20)

20131121

Blancett Turbine Flow Meters

20140109

20131219

Electromagnetic Flow Meters Overview (Badger Meter)

Pricelist Zubit Life Care - PCD Pharma Company | PCD Pharma Franchise | Pharm...

The DXN Portable Ultrasonic Flow Meter

Vortex Flow Meters- Badger Meter

Hydraulic Turbine Flow Meters (flo-tech)

VuHeat Ultrasonic Heat Meters Datasheets

20131212

20131114

Etica del contador

토토추천 ＃＃＃too93.com＃＃＃ 토토추천 토토추천

JEEVAN.KHOT_BE _MECH_ENGG

The Little Foxes College Night Release

Teaching Summit - oGIP EP Backgrounds and Target Markets

Declaración por escrito del Estado Plurinacional de Bolivia en la excepción p...

ไวยกรณ์ต้องรู้สำหรับสอบวัดระดับ3

Ähnlich wie 20131024

NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflowNVIDIA Taiwan

Toronto meetup 20190917Bill Liu

Adaptive Query Execution: Speeding Up Spark SQL at RuntimeDatabricks

Ehsan parallel accelerator-dec2015Christian Peel

Ask the expert - App performance on Series 40 phonesMicrosoft Mobile Developer

Spark Summit EU talk by Nick PentreathSpark Summit

Accelerate Machine Learning on Google CloudSamantha Guerriero

Oracle Database In-Memory Option in ActionTanel Poder

In Memory Database In Action by Tanel Poder and Kerry OsborneEnkitec

Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...Intel® Software

Using The New Flash Stage3D Web Technology To Build Your Own Next 3D Browser ...Daosheng Mu

Low latency & mechanical sympathy issues and solutionsJean-Philippe BEMPEL

SPARKNaCl: A verified, fast cryptographic libraryAdaCore

Sathya Final reviewSathiyasainathan Soundararajan

Predicting Optimal Parallelism for Data AnalyticsDatabricks

What’s eating python performancePiotr Przymus

Python高级编程（二）Qiangning Hong

MIT's experience on OpenPOWER/POWER 9 platformGanesan Narayanasamy

Reproducible Computational Pipelines with Docker and Nextflowinside-BigData.com

2018 03 25 system ml ai and openpower meetupGanesan Narayanasamy

Ähnlich wie 20131024 (20)

NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflow

Toronto meetup 20190917

Adaptive Query Execution: Speeding Up Spark SQL at Runtime

Ehsan parallel accelerator-dec2015

Ask the expert - App performance on Series 40 phones

Spark Summit EU talk by Nick Pentreath

Accelerate Machine Learning on Google Cloud

Oracle Database In-Memory Option in Action

In Memory Database In Action by Tanel Poder and Kerry Osborne

Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...

Using The New Flash Stage3D Web Technology To Build Your Own Next 3D Browser ...

Low latency & mechanical sympathy issues and solutions

SPARKNaCl: A verified, fast cryptographic library

Sathya Final review

Predicting Optimal Parallelism for Data Analytics

What’s eating python performance

Python高级编程（二）

MIT's experience on OpenPOWER/POWER 9 platform

Reproducible Computational Pipelines with Docker and Nextflow

2018 03 25 system ml ai and openpower meetup

Kürzlich hochgeladen

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

Slack Application Development 101 Slidespraypatel2

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Developing An App To Navigate The Roads of BrazilV3cube

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Histor y of HAM Radio presentation slidevu2urc

Partners Life - Insurer Innovation Award 2024The Digital Insurer

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Kürzlich hochgeladen (20)

How to Troubleshoot Apps for the Modern Connected Worker

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

2024: Domino Containers - The Next Step. News from the Domino Container commu...

08448380779 Call Girls In Friends Colony Women Seeking Men

Slack Application Development 101 Slides

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Presentation on how to chat with PDF using ChatGPT code interpreter

Developing An App To Navigate The Roads of Brazil

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

Histor y of HAM Radio presentation slide

Partners Life - Insurer Innovation Award 2024

Data Cloud, More than a CDP by Matt Robison

Boost PC performance: How more available memory can improve productivity

Driving Behavioral Change for Information Management through Data-Driven Gree...

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Handwritten Text Recognition for manuscripts and early printed texts

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

20131024

1. WEEKLY REPORT Thur., Oct 24, 2013 Pin Yi Tsai

2. OUTLINE • Current Work • Compute Integral Image – parallel version • Why the difference is so implicit? • An accidental Error • In Process • Compute 11 types of Features

3. COMPUTE INTEGRAL IMAGE – PARALLEL VERSION • Computation and communication time  input 16x16:  serial version: 0.006336 ms  for loop outside of kernel function:  parallel version: 6.80778 ms  for loop inside of kernel function:  parallel version: 5.88559e-39 ms

4. COMPUTE INTEGRAL IMAGE (CONT.)  input 640x480:  serial version: 5.1607 ms  parallel version: 4.94058 ms

5. WHY THE DIFFERENCE IS SO IMPLICIT? • Profile: Time : 4.91024 ms ======== Profiling result: Time(%) 71.71 Time Calls 2.75ms 1 Avg 2.75ms Min 2.75ms Max Name 2.75ms computeByColumn(float*, int) 10.91 418.56us 2 209.28us 209.06us 209.50us [CUDA memcpy HtoD] 10.08 386.46us 2 193.23us 191.10us 195.36us [CUDA memcpy DtoH] 7.31 280.22us int) 1 280.22us 280.22us 280.22us computeByRow(float*, int,  Access the inconsistent memory  Memory Access is too time-consuming

6. AN ACCIDENTAL ERROR • Occurred when copy data from Mat to one-dimension float array 0 175 175 175 175 175 0 174 174 174 174 174 6.78807e-29 175 175 175 175 175 0 175 175 175 175 175 0 175 175 175 175 175 6.79909e-29 134 151 158 136 142 0 138 132 135 140 135 6.80354e-29 136 136 143 142 137

7. AN ACCIDENTAL ERROR (CONT.) • Why? • memset(ar,0,sizeof(float)*(image1.step+1)*(image1.rows+1)); • The size is not correct.

8. The End