SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Downloaden Sie, um offline zu lesen
Scientific applications using
GPU
Overview about field of study and practical
applications
June 22, 2017
Raul Sena Ferreira
IT Coordinator at IPEA (Research Institute for Applied Economics)
Lecturer in Computer Science at UFRRJ
MSc Student in Data Engineering (Major in Machine Learning) at PESC-UFRJ
www.raulferreira.com.br
June 22, 2017
GPU
Graphics processing unit
GPU-accelerated computing is the use of a graphics processing unit (GPU)
together with a CPU to accelerate deep learning, analytics, and engineering
applications [1]
CUDA® is a parallel computing platform and programming model invented by
NVIDIA. It enables dramatic increases in computing performance by harnessing
the power of the graphics processing unit (GPU) [2]
June 22, 2017
How it works?
June 22, 2017
Example
June 22, 2017
Published work
June 22, 2017
RAIC - 2014
Paralelização do algoritmo de Método de Estimação
Não-Paramétrico por Núcleo Estimador Multivariado
(KDE) utilizando GPU/CUDA
Research problem
Kernel Density Estimation (KDE): Non-parametric statistical algorithm to perform
probabilities about events within a population
Multidimensional KDE: O(n2
k)
How to parallelize this method?
There are real some advantages?
June 22, 2017
Serial vs Parallel
● Processor: Intel I5 2500
● Graphic card: GeForce GTX 650 Ti Boost
● Nº of cores: 4 (Hyperthread)
● Input: 30.000 points (x,y)
3024% faster than serial. 6 times faster than parallel matlab.
KDE (serial) Matlab KDE (parallelized) KDE with CUDA
31.08 6.35 1.03
June 22, 2017
ERSI - 2016
W-SAGE: Ferramenta Web para Análise de Dados
Geoespaciais
Research problem
How to infer statistically some events from geographic coordinates ?
How to make an Information System with this approach ?
How to get the benefits of GPU computing inside this Information System ?
How to “glue” all of them ?
June 22, 2017
System flow
June 22, 2017
Integrating tools for GPU programming
PyCUDA: Nvidia‘s CUDA parallel computation API from Python
● Maps all of CUDA into Python
● Enables run-time code generation (RTCG) for flexible, fast, automatically
tuned codes
● Added robustness: automatic management of object lifetimes, automatic error
checking
● Added convenience: comes with ready-made on-GPU linear algebra,
reduction, scan. Add-on packages for FFT and LAPACK available
● Fast. Near-zero wrapping overhead
● Complete, helpful documentation
June 22, 2017
Study case
Dataset: 14027 registers collected from UFRRJ undergraduate students between
2000 and 2013
Two dimensional data: Coordinates from students home
Question: How the probability of a person that lives in certain regions to study in
UFRRJ ? How these probabilities are distributed across the country ?
June 22, 2017
Results
June 22, 2017
ERAD - 2017
Análise de desempenho da biblioteca Theano com
GPUs sob a ótica do CUDA Profiler
ERAD-RJ 2017
How efficient are the libraries regarding hardware architectures, specially using
GPUs ?
Literature often evaluate performance measuring total processing time
Goal is measure the performance in different GPUs using CUDA Profiler as
collecting metric tool for GPU
Repository: https://github.com/felipe-melo/Erad-Code
June 22, 2017
Motivation
Deep Learning are a class of machine learning algorithms that uses many layers
to process something:
● Non-linear transformations
● Feature extraction, image classification, etc
(Returning of) Neural networks:
● CNN, RNN, Autoencoders ...
June 22, 2017
Neural Network & MultiLayer Perceptron
June 22, 2017
Theano
June 22, 2017
CUDA Profile
Performance analysis tool from NVIDIA
Commands:
--print-gpu-trace -u s
--metrics flop_count_sp, flop_sp_efficiency, flop_count_dp, flop_dp_efficiency
June 22, 2017
Neural network operations
● Matrix summing
○ Cmn
= Amn
+ Bmn
● Matrix multiplication
○ Cmo
= Amn
* Bno
● Hadamard product
○ Cmn
= Amn
o Bmn
● Scalar
○ Cmn
= a * Bmn
● Sigmoid
○ Cmn
= Sigmoid(Amn
)
● Tanh
○ Cmn
= Tanh(Amn
)
June 22, 2017
Results
June 22, 2017
Results
June 22, 2017
Ongoing research
June 22, 2017
Paraphrase detection + deep learning + cuda
Comparison of Recursive Auto Encoders implementations between Tensorflow
and Theano
Datasets: Microsoft Research Paraphrase Corpus & Webis Crowd Paraphrase
Corpus 2011
Main contributions: Speedup differences using different libraries; Evaluating if use
of hypernyms improves the outcomes
Peripheral contributions: Comparing two widely used frameworks within an
important problem of neural networks
June 22, 2017
Applications in GPU
computing
Bioinformatics: Sequencing protein processing using NVBIO[3]
Computational Finance: Monte Carlo simulation using NVIDIA TESLA GPU [4]
Computational Fluid Dynamics: Navier-Stokes models and Lattice methods
with very large speedups using CUDA-enabled GPUs [5]
June 22, 2017
Media and Entertainment: GPUs to deliver high performance graphics and
parallelism for video editing, digital animation, rendering and media creation[7]
Data Science, Analytics and Databases: GPUs for big data analytics to make
better, real –time business decisions [6]
June 22, 2017
Electronic Design and Automation: Recent trends in HPC are increasingly
exploiting many core GPUs to achieve speedup of computationally intensive
simulations including verilog simulation, signal integrity & Electromagnetic [8]
Weather and Climate: Weather Research and Forecasting model and tsunami
simulations. [12]
Defense and Intelligence: Deep Learning Tools for Defense, Safety, and Security
Applications. [9]
Imaging and Computer Visions: Applications can achieve interactive video
frame-rate performance. Libraries: GPU4Vision, OpenVIDIA, MinGPU, etc [10]
June 22, 2017
Medical Imaging: Advanced MRI reconstruction, Computed Tomography
Reconstruction [13]
June 22, 2017
Structural mechanics: CUDA-enabled GPUs ANSY, Abaqus, MSC Nastran,
IMPETUS Afea and additional structural mechanics applications [11]
Machine Learning: Many machine learning tools [14]:
● Caffe: Framework for convolutional neural network algorithms
● Theano: Python library to define, optimize, and evaluate mathematical
expressions
● cxxnet: Neural network toolkit
● cuBLAS: GPU-accelerated version of the complete standard BLAS library
June 22, 2017
Want to join in ?
Ongoing research or new ideas. Send me an email :)
Requirements:
● Python programming
● English (basic reading and writing)
● Basic understanding of neural networks
● Eager to learning and resilience
June 22, 2017
Good places to start
NVIDIA Website
Intro to Parallel Programming Using CUDA to Harness the Power of GPUs
CUDA University Courses
Multicore and GPU Programming for Video Games
CUDA Tutorials | The Supercomputing Blog
June 22, 2017
"In the U.S. there are two types of hipsters: those
who know how to program and those who serve
coffee."
César Hidalgo
June 22, 2017
Thank you!
raulsf@cos.ufrj.br
https://www.linkedin.com/in/raulsenaferreira/
https://github.com/raulsenaferreira
June 22, 2017

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction to multi gpu deep learning with DIGITS 2 - Mike Wang
Introduction to multi gpu deep learning with DIGITS 2 - Mike WangIntroduction to multi gpu deep learning with DIGITS 2 - Mike Wang
Introduction to multi gpu deep learning with DIGITS 2 - Mike WangPAPIs.io
 
Deep learning for FinTech
Deep learning for FinTechDeep learning for FinTech
Deep learning for FinTechgeetachauhan
 
Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland mictc
 
On-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on AndroidOn-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on AndroidYufeng Guo
 
TensorFlow in Context
TensorFlow in ContextTensorFlow in Context
TensorFlow in ContextAltoros
 
Mastering Computer Vision Problems with State-of-the-art Deep Learning
Mastering Computer Vision Problems with State-of-the-art Deep LearningMastering Computer Vision Problems with State-of-the-art Deep Learning
Mastering Computer Vision Problems with State-of-the-art Deep LearningMiguel González-Fierro
 
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ..."Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...Edge AI and Vision Alliance
 
Deeplearning on Hadoop @OSCON 2014
Deeplearning on Hadoop @OSCON 2014Deeplearning on Hadoop @OSCON 2014
Deeplearning on Hadoop @OSCON 2014Adam Gibson
 
Smaller and Easier: Machine Learning on Embedded Things
Smaller and Easier: Machine Learning on Embedded ThingsSmaller and Easier: Machine Learning on Embedded Things
Smaller and Easier: Machine Learning on Embedded ThingsNUS-ISS
 
Deep learning with Tensorflow in R
Deep learning with Tensorflow in RDeep learning with Tensorflow in R
Deep learning with Tensorflow in Rmikaelhuss
 
Deep Learning with Microsoft R Open
Deep Learning with Microsoft R OpenDeep Learning with Microsoft R Open
Deep Learning with Microsoft R OpenPoo Kuan Hoong
 
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기 Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기 Mario Cho
 
Squeezing Deep Learning Into Mobile Phones
Squeezing Deep Learning Into Mobile PhonesSqueezing Deep Learning Into Mobile Phones
Squeezing Deep Learning Into Mobile PhonesAnirudh Koul
 
Deep Learning Explained
Deep Learning ExplainedDeep Learning Explained
Deep Learning ExplainedMelanie Swan
 
Applying Transfer Learning in TensorFlow
Applying Transfer Learning in TensorFlowApplying Transfer Learning in TensorFlow
Applying Transfer Learning in TensorFlowScott Thompson
 
Deep Learning for Computer Vision - PyconDE 2017
Deep Learning for Computer Vision - PyconDE 2017Deep Learning for Computer Vision - PyconDE 2017
Deep Learning for Computer Vision - PyconDE 2017Alex Conway
 
[Thomas chamberlain] learning_om_ne_t++(z-lib.org)
[Thomas chamberlain] learning_om_ne_t++(z-lib.org)[Thomas chamberlain] learning_om_ne_t++(z-lib.org)
[Thomas chamberlain] learning_om_ne_t++(z-lib.org)wissem hammouda
 
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ..."Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...Edge AI and Vision Alliance
 
Deep Learning: Application Landscape - March 2018
Deep Learning: Application Landscape - March 2018Deep Learning: Application Landscape - March 2018
Deep Learning: Application Landscape - March 2018Grigory Sapunov
 
Samsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of PythonSamsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of PythonInsuk (Chris) Cho
 

Was ist angesagt? (20)

Introduction to multi gpu deep learning with DIGITS 2 - Mike Wang
Introduction to multi gpu deep learning with DIGITS 2 - Mike WangIntroduction to multi gpu deep learning with DIGITS 2 - Mike Wang
Introduction to multi gpu deep learning with DIGITS 2 - Mike Wang
 
Deep learning for FinTech
Deep learning for FinTechDeep learning for FinTech
Deep learning for FinTech
 
Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland
 
On-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on AndroidOn-device machine learning: TensorFlow on Android
On-device machine learning: TensorFlow on Android
 
TensorFlow in Context
TensorFlow in ContextTensorFlow in Context
TensorFlow in Context
 
Mastering Computer Vision Problems with State-of-the-art Deep Learning
Mastering Computer Vision Problems with State-of-the-art Deep LearningMastering Computer Vision Problems with State-of-the-art Deep Learning
Mastering Computer Vision Problems with State-of-the-art Deep Learning
 
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ..."Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
"Deep Learning and Vision Algorithm Development in MATLAB Targeting Embedded ...
 
Deeplearning on Hadoop @OSCON 2014
Deeplearning on Hadoop @OSCON 2014Deeplearning on Hadoop @OSCON 2014
Deeplearning on Hadoop @OSCON 2014
 
Smaller and Easier: Machine Learning on Embedded Things
Smaller and Easier: Machine Learning on Embedded ThingsSmaller and Easier: Machine Learning on Embedded Things
Smaller and Easier: Machine Learning on Embedded Things
 
Deep learning with Tensorflow in R
Deep learning with Tensorflow in RDeep learning with Tensorflow in R
Deep learning with Tensorflow in R
 
Deep Learning with Microsoft R Open
Deep Learning with Microsoft R OpenDeep Learning with Microsoft R Open
Deep Learning with Microsoft R Open
 
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기 Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
 
Squeezing Deep Learning Into Mobile Phones
Squeezing Deep Learning Into Mobile PhonesSqueezing Deep Learning Into Mobile Phones
Squeezing Deep Learning Into Mobile Phones
 
Deep Learning Explained
Deep Learning ExplainedDeep Learning Explained
Deep Learning Explained
 
Applying Transfer Learning in TensorFlow
Applying Transfer Learning in TensorFlowApplying Transfer Learning in TensorFlow
Applying Transfer Learning in TensorFlow
 
Deep Learning for Computer Vision - PyconDE 2017
Deep Learning for Computer Vision - PyconDE 2017Deep Learning for Computer Vision - PyconDE 2017
Deep Learning for Computer Vision - PyconDE 2017
 
[Thomas chamberlain] learning_om_ne_t++(z-lib.org)
[Thomas chamberlain] learning_om_ne_t++(z-lib.org)[Thomas chamberlain] learning_om_ne_t++(z-lib.org)
[Thomas chamberlain] learning_om_ne_t++(z-lib.org)
 
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ..."Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...
"Collaboratively Benchmarking and Optimizing Deep Learning Implementations," ...
 
Deep Learning: Application Landscape - March 2018
Deep Learning: Application Landscape - March 2018Deep Learning: Application Landscape - March 2018
Deep Learning: Application Landscape - March 2018
 
Samsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of PythonSamsung SDS OpeniT - The possibility of Python
Samsung SDS OpeniT - The possibility of Python
 

Ähnlich wie Raul sena - Apresentação Analiticsemtudo - Scientific Applications using GPU

Real-Time Implementation and Performance Optimization of Local Derivative Pat...
Real-Time Implementation and Performance Optimization of Local Derivative Pat...Real-Time Implementation and Performance Optimization of Local Derivative Pat...
Real-Time Implementation and Performance Optimization of Local Derivative Pat...IJECEIAES
 
Chainer OpenPOWER developer congress HandsON 20170522_ota
Chainer OpenPOWER developer congress HandsON 20170522_otaChainer OpenPOWER developer congress HandsON 20170522_ota
Chainer OpenPOWER developer congress HandsON 20170522_otaPreferred Networks
 
A Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen DonigianA Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen DonigianData Con LA
 
Accelerating Data Science With GPUs
Accelerating Data Science With GPUsAccelerating Data Science With GPUs
Accelerating Data Science With GPUsiguazio
 
Graphics Processing Unit: An Introduction
Graphics Processing Unit: An IntroductionGraphics Processing Unit: An Introduction
Graphics Processing Unit: An Introductionijtsrd
 
Fueling the AI Revolution with Gaming
Fueling the AI Revolution with GamingFueling the AI Revolution with Gaming
Fueling the AI Revolution with GamingAlison B. Lowndes
 
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...Codemotion
 
OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021OpenACC
 
Update on the Mont-Blanc Project for ARM-based HPC
Update on the Mont-Blanc Project for ARM-based HPCUpdate on the Mont-Blanc Project for ARM-based HPC
Update on the Mont-Blanc Project for ARM-based HPCinside-BigData.com
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC
 
Introduction to GPUs for Machine Learning
Introduction to GPUs for Machine LearningIntroduction to GPUs for Machine Learning
Introduction to GPUs for Machine LearningSri Ambati
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYcsandit
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONScseij
 
GPU Computing: An Introduction
GPU Computing: An IntroductionGPU Computing: An Introduction
GPU Computing: An Introductionijtsrd
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Predictionmiyurud
 
Machine Learning with JavaScript
Machine Learning with JavaScriptMachine Learning with JavaScript
Machine Learning with JavaScriptIvo Andreev
 
OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC
 
Hand Finger Counting using Deep Convolutional Neural Network (CNN) on GPU
Hand Finger Counting using Deep Convolutional Neural Network (CNN) on GPUHand Finger Counting using Deep Convolutional Neural Network (CNN) on GPU
Hand Finger Counting using Deep Convolutional Neural Network (CNN) on GPUMahesh Khadatare
 

Ähnlich wie Raul sena - Apresentação Analiticsemtudo - Scientific Applications using GPU (20)

Real-Time Implementation and Performance Optimization of Local Derivative Pat...
Real-Time Implementation and Performance Optimization of Local Derivative Pat...Real-Time Implementation and Performance Optimization of Local Derivative Pat...
Real-Time Implementation and Performance Optimization of Local Derivative Pat...
 
Chainer OpenPOWER developer congress HandsON 20170522_ota
Chainer OpenPOWER developer congress HandsON 20170522_otaChainer OpenPOWER developer congress HandsON 20170522_ota
Chainer OpenPOWER developer congress HandsON 20170522_ota
 
A Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen DonigianA Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen Donigian
 
Accelerating Data Science With GPUs
Accelerating Data Science With GPUsAccelerating Data Science With GPUs
Accelerating Data Science With GPUs
 
Pycon9 dibernado
Pycon9 dibernadoPycon9 dibernado
Pycon9 dibernado
 
Graphics Processing Unit: An Introduction
Graphics Processing Unit: An IntroductionGraphics Processing Unit: An Introduction
Graphics Processing Unit: An Introduction
 
GPU Computing
GPU ComputingGPU Computing
GPU Computing
 
Fueling the AI Revolution with Gaming
Fueling the AI Revolution with GamingFueling the AI Revolution with Gaming
Fueling the AI Revolution with Gaming
 
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
 
OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021OpenACC Monthly Highlights: January 2021
OpenACC Monthly Highlights: January 2021
 
Update on the Mont-Blanc Project for ARM-based HPC
Update on the Mont-Blanc Project for ARM-based HPCUpdate on the Mont-Blanc Project for ARM-based HPC
Update on the Mont-Blanc Project for ARM-based HPC
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020
 
Introduction to GPUs for Machine Learning
Introduction to GPUs for Machine LearningIntroduction to GPUs for Machine Learning
Introduction to GPUs for Machine Learning
 
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
 
GPU Computing: An Introduction
GPU Computing: An IntroductionGPU Computing: An Introduction
GPU Computing: An Introduction
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
 
Machine Learning with JavaScript
Machine Learning with JavaScriptMachine Learning with JavaScript
Machine Learning with JavaScript
 
OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019
 
Hand Finger Counting using Deep Convolutional Neural Network (CNN) on GPU
Hand Finger Counting using Deep Convolutional Neural Network (CNN) on GPUHand Finger Counting using Deep Convolutional Neural Network (CNN) on GPU
Hand Finger Counting using Deep Convolutional Neural Network (CNN) on GPU
 

Kürzlich hochgeladen

Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Chameera Dedduwage
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMoumonDas2
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaKayode Fayemi
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )Pooja Nehwal
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...henrik385807
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxraffaeleoman
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Kayode Fayemi
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesPooja Nehwal
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Vipesco
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubssamaasim06
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...Sheetaleventcompany
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxNikitaBankoti2
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024eCommerce Institute
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Hasting Chen
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyPooja Nehwal
 

Kürzlich hochgeladen (20)

Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptx
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubs
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
 

Raul sena - Apresentação Analiticsemtudo - Scientific Applications using GPU

  • 1. Scientific applications using GPU Overview about field of study and practical applications June 22, 2017
  • 2. Raul Sena Ferreira IT Coordinator at IPEA (Research Institute for Applied Economics) Lecturer in Computer Science at UFRRJ MSc Student in Data Engineering (Major in Machine Learning) at PESC-UFRJ www.raulferreira.com.br June 22, 2017
  • 3. GPU
  • 4. Graphics processing unit GPU-accelerated computing is the use of a graphics processing unit (GPU) together with a CPU to accelerate deep learning, analytics, and engineering applications [1] CUDA® is a parallel computing platform and programming model invented by NVIDIA. It enables dramatic increases in computing performance by harnessing the power of the graphics processing unit (GPU) [2] June 22, 2017
  • 8. RAIC - 2014 Paralelização do algoritmo de Método de Estimação Não-Paramétrico por Núcleo Estimador Multivariado (KDE) utilizando GPU/CUDA
  • 9. Research problem Kernel Density Estimation (KDE): Non-parametric statistical algorithm to perform probabilities about events within a population Multidimensional KDE: O(n2 k) How to parallelize this method? There are real some advantages? June 22, 2017
  • 10. Serial vs Parallel ● Processor: Intel I5 2500 ● Graphic card: GeForce GTX 650 Ti Boost ● Nº of cores: 4 (Hyperthread) ● Input: 30.000 points (x,y) 3024% faster than serial. 6 times faster than parallel matlab. KDE (serial) Matlab KDE (parallelized) KDE with CUDA 31.08 6.35 1.03 June 22, 2017
  • 11. ERSI - 2016 W-SAGE: Ferramenta Web para Análise de Dados Geoespaciais
  • 12. Research problem How to infer statistically some events from geographic coordinates ? How to make an Information System with this approach ? How to get the benefits of GPU computing inside this Information System ? How to “glue” all of them ? June 22, 2017
  • 14. Integrating tools for GPU programming PyCUDA: Nvidia‘s CUDA parallel computation API from Python ● Maps all of CUDA into Python ● Enables run-time code generation (RTCG) for flexible, fast, automatically tuned codes ● Added robustness: automatic management of object lifetimes, automatic error checking ● Added convenience: comes with ready-made on-GPU linear algebra, reduction, scan. Add-on packages for FFT and LAPACK available ● Fast. Near-zero wrapping overhead ● Complete, helpful documentation June 22, 2017
  • 15. Study case Dataset: 14027 registers collected from UFRRJ undergraduate students between 2000 and 2013 Two dimensional data: Coordinates from students home Question: How the probability of a person that lives in certain regions to study in UFRRJ ? How these probabilities are distributed across the country ? June 22, 2017
  • 17. ERAD - 2017 Análise de desempenho da biblioteca Theano com GPUs sob a ótica do CUDA Profiler
  • 18. ERAD-RJ 2017 How efficient are the libraries regarding hardware architectures, specially using GPUs ? Literature often evaluate performance measuring total processing time Goal is measure the performance in different GPUs using CUDA Profiler as collecting metric tool for GPU Repository: https://github.com/felipe-melo/Erad-Code June 22, 2017
  • 19. Motivation Deep Learning are a class of machine learning algorithms that uses many layers to process something: ● Non-linear transformations ● Feature extraction, image classification, etc (Returning of) Neural networks: ● CNN, RNN, Autoencoders ... June 22, 2017
  • 20. Neural Network & MultiLayer Perceptron June 22, 2017
  • 22. CUDA Profile Performance analysis tool from NVIDIA Commands: --print-gpu-trace -u s --metrics flop_count_sp, flop_sp_efficiency, flop_count_dp, flop_dp_efficiency June 22, 2017
  • 23. Neural network operations ● Matrix summing ○ Cmn = Amn + Bmn ● Matrix multiplication ○ Cmo = Amn * Bno ● Hadamard product ○ Cmn = Amn o Bmn ● Scalar ○ Cmn = a * Bmn ● Sigmoid ○ Cmn = Sigmoid(Amn ) ● Tanh ○ Cmn = Tanh(Amn ) June 22, 2017
  • 27. Paraphrase detection + deep learning + cuda Comparison of Recursive Auto Encoders implementations between Tensorflow and Theano Datasets: Microsoft Research Paraphrase Corpus & Webis Crowd Paraphrase Corpus 2011 Main contributions: Speedup differences using different libraries; Evaluating if use of hypernyms improves the outcomes Peripheral contributions: Comparing two widely used frameworks within an important problem of neural networks June 22, 2017
  • 29. Bioinformatics: Sequencing protein processing using NVBIO[3] Computational Finance: Monte Carlo simulation using NVIDIA TESLA GPU [4] Computational Fluid Dynamics: Navier-Stokes models and Lattice methods with very large speedups using CUDA-enabled GPUs [5] June 22, 2017
  • 30. Media and Entertainment: GPUs to deliver high performance graphics and parallelism for video editing, digital animation, rendering and media creation[7] Data Science, Analytics and Databases: GPUs for big data analytics to make better, real –time business decisions [6] June 22, 2017
  • 31. Electronic Design and Automation: Recent trends in HPC are increasingly exploiting many core GPUs to achieve speedup of computationally intensive simulations including verilog simulation, signal integrity & Electromagnetic [8] Weather and Climate: Weather Research and Forecasting model and tsunami simulations. [12] Defense and Intelligence: Deep Learning Tools for Defense, Safety, and Security Applications. [9] Imaging and Computer Visions: Applications can achieve interactive video frame-rate performance. Libraries: GPU4Vision, OpenVIDIA, MinGPU, etc [10] June 22, 2017
  • 32. Medical Imaging: Advanced MRI reconstruction, Computed Tomography Reconstruction [13] June 22, 2017
  • 33. Structural mechanics: CUDA-enabled GPUs ANSY, Abaqus, MSC Nastran, IMPETUS Afea and additional structural mechanics applications [11] Machine Learning: Many machine learning tools [14]: ● Caffe: Framework for convolutional neural network algorithms ● Theano: Python library to define, optimize, and evaluate mathematical expressions ● cxxnet: Neural network toolkit ● cuBLAS: GPU-accelerated version of the complete standard BLAS library June 22, 2017
  • 34. Want to join in ? Ongoing research or new ideas. Send me an email :) Requirements: ● Python programming ● English (basic reading and writing) ● Basic understanding of neural networks ● Eager to learning and resilience June 22, 2017
  • 35. Good places to start NVIDIA Website Intro to Parallel Programming Using CUDA to Harness the Power of GPUs CUDA University Courses Multicore and GPU Programming for Video Games CUDA Tutorials | The Supercomputing Blog June 22, 2017
  • 36. "In the U.S. there are two types of hipsters: those who know how to program and those who serve coffee." César Hidalgo June 22, 2017