智啓出川

38 Followers

gpgpu gpu cuda university lecture programming numerical simulation finite difference method fdm shared memory vector addition parallel processing education cpu gnuplot cuda fortran architecture cusparse python cfd linear simultaneous equations gpu accelerated library cublas euler method time integration double buffering constant memory optimization fortran modern fortran fortran 2003 computational fluid dynamics cavity flow diffusion equation matrix-matrix multiplication parallel reduction naive implementation memory hierarchy global memory box filter image processing cluster openmp object-oriented vorticity streamfunction convection equation thrust monte carlo curand particle laplacian bank conflict opencl jetson tk1 hierarchy mosaic negative thread processor multicore educational material goal orientation curriculum software development vscode computational science visual studio code lagrange polynomial sympy array of characters string power approximation excel scipy best practices generation schematic diagram multiple gpu fortran 95 fortran 90 cylinder fem iso_c_binding incompressible flow project-based learning micro intelligent robot system numazu national collage of technology nagaoka university of technology lbm d2q9 model lattice boltzmann method bounceback pycuda stream overlap asynchronous cooperative processing concurrent processing multi-gpu uva gpu direct unified virtual addressing marching porting fluid dynamics cpu implementation vorticity equation taylor-green vortex laplace equation conjugate gradient method poisson equation residual red-black ordering sor method rotating cone fastmath atomic operation flops compute-bound roofline flop/byte memory-bound performance pinned memory zero-copy page-locked memory transpose warp branch divergence branch memory access stride access coalesce access cuda event pi csr library cufft all-pair loop unroll interaction n-body problem order of accuracy runge-kutta method modified euler method grayscale blur gaussian blur bitmap uchar4 template occupancy profiler profiling moving average openacc embedded platform tegra opencv memory flip multi-thread universitiy fermi lectura tesla m2050 mpi process co-processor accelerator software hardware open source

Aktivität
Info

智啓出川

Präsentationen

技術系大学におけるGPU教育の一試行

2015年度GPGPU実践基礎工学　第1回　学際的分野における先端シミュレーション技術の歴史

2015年度GPGPU実践基礎工学　第2回　GPGPUの歴史と応用例

2015年度GPGPU実践基礎工学　第3回　GPUクラスタ上でのプログラミング（CUDA）

2015年度GPGPU実践基礎工学　第4回　CPUのアーキテクチャ

2015年度GPGPU実践基礎工学　第5回　ハードウェアによるCPUの高速化技術

2015年度GPGPU実践基礎工学　第6回　ソフトウェアによるCPUの高速化技術

2015年度GPGPU実践基礎工学　第7回　シングルコアとマルチコア

2015年度GPGPU実践基礎工学　第8回　並列計算の概念（プロセスとスレッド）

2015年度GPGPU実践基礎工学　第9回　GPUのアーキテクチャ

2015年度GPGPU実践基礎工学　第9回補足　GROUSEの利用方法

2015年度GPGPU実践基礎工学　第10回　GPUのプログラム構造

2015年度GPGPU実践基礎工学　第11回　GPUでの並列プログラミング（ベクトル和）

2015年度GPGPU実践基礎工学　第12回　GPUによる画像処理

2015年度GPGPU実践基礎工学　第13回　GPUのメモリ階層

2015年度GPGPU実践基礎工学　第14回　GPGPU組込開発環境

2015年度GPGPU実践基礎工学　第15回　GPGPU開発環境（OpenCL）

2015年度GPGPU実践プログラミング　第1回　GPGPUの歴史と応用例

2015年度GPGPU実践プログラミング　第2回　GPUのアーキテクチャとプログラム構造

2015年度GPGPU実践プログラミング　第3回　GPGPUプログラミング環境

2015年度GPGPU実践プログラミング　第4回　GPUでの並列プログラミング（ベクトル和，移動平均，差分法）

2015年度GPGPU実践プログラミング　第5回　GPUのメモリ階層

2015年度GPGPU実践プログラミング　第6回　パフォーマンス解析ツール

2015年度GPGPU実践プログラミング　第7回　総和計算

2015年度GPGPU実践プログラミング　第8回　総和計算（高度な最適化）

2015年度GPGPU実践プログラミング　第9回　行列計算（行列－行列積）

2015年度GPGPU実践プログラミング　第10回　行列計算（行列－行列積の高度な最適化）

2015年度GPGPU実践プログラミング　第11回　画像処理

2015年度GPGPU実践プログラミング　第12回　偏微分方程式の差分計算

2015年度GPGPU実践プログラミング　第13回　多粒子の運動

Gefällt mir

ゼロから始める技術書執筆 by 湊川あい

智啓 出川

Präsentationen

Gefällt mir

智啓出川