CUDA Programming for NVIDIA H100s – Comprehensive Course
Learn CUDA programming for NVIDIA Hopper GPUs. You will learn to build efficient WGMMA pipelines and leverage Cutlass optimizations to perform the massive matrix multiplications that power modern AI. Beyond single-chip performance, the curriculum covers multi-GPU scaling and NCCL primitives necessary for training trillion-parameter models. To get the most out of these lessons, you should have a foundational grasp of C++ syntax and linear algebra, particularly how matrices are tiled and multiplied.
- Course website: https://cudacourseh100.github.io
- Course repo: https://github.com/cudacourseh…
Watch on YouTube ↗
(saves to browser)
DeepCamp AI