Aussie AI
CUDA C++ Optimization Book
-
Book Excerpt from "CUDA C++ Optimization: Coding Faster GPU Kernels"
-
by David Spuler
CUDA C++ Optimization Book
Table of Contents
10. Data Transfer Optimizations
13. Warp Divergence
15. Compile-Time Optimizations
Appendix: CUDA C++ Slugs
Bonus Materials
- Fused and Shared Epilogues
- Rubin and Feynman Optimizations
- Branchless Coding Tricks
- Hopper and Blackwell Optimizations
- Grace CPU Optimizations
- CUDA C++ BF16x9 Emulation in Blackwell
|
• Online: Table of Contents • PDF: Free PDF book download • Buy: CUDA C++ Optimization |
|
The new CUDA C++ Optimization book:
Get your copy from Amazon: CUDA C++ Optimization |