CENG 443

Heterogenous Parallel Programming

Heterogeneous GPU architecture, GPU programming models and techniques, CUDA programming.

Topics
Introduction to heterogeneous parallel programming
GPU hardware, trends and architecture
Introduction to CUDA
CUDA thread basics
CUDA memory
CUDA shared memory
Midterm
CUDA performance
Warp level CUDA
Parallel patterns
Parallel patterns-2
Performance optimization
CUDA streams
Multi-GPUs