Aussie AI

Publications

List of Aussie AI Publications

New LLM inference optimization research book:

More LLM and C++ books for CPU and GPU:

Free AI and C++ books full text or PDF download:

Published books for AI engineers:

  1. LLM Inference Optimization: State-of-the-Art Research, new book by David Spuler, table of contents, buy on Amazon.
  2. RAG Optimization: Accurate and Efficient LLM Applications:, 2025 (Free PDF download)
  3. Generative AI Applications: Planning, Design, and Implementation, 2025 (Free PDF download)
  4. Generative AI in C++: Coding Transformers and LLMs, 2025 (Full text online free, Free PDF download)
  5. CUDA C++ Optimization: Programming Faster GPU Kernels, 2024 (Free PDF download)
  6. CUDA C++ Debugging: Safer GPU Kernels, 2024 (Free PDF download)
  7. The Sweetest Lesson: Your Brain Versus AI, 2025 (Free PDF download)

Published books for C++ software developers:

  1. LLM Inference Optimization: State-of-the-Art Research Breakthroughs, 2026 — theoretical examination of recent and emerging LLM breakthroughs.
  2. GPU Bit Tricks: LLM Kernel Arithmetic Optimizations, 2026 — low-level integer and floating-point kernel development tricks.
  3. C++ Branchless Coding: CPU and GPU Efficiency , 2026 — low latency C++ coding.
  4. C++ Bit Tricks: Integer and Floating-Point Optimizations, 2026 — low-level bitwise coding, floating-point, SWAR, etc.
  5. C++ AVX Optimization: CPU SIMD Vectorization, 2025 (Free PDF download)
  6. C++ Ultra-Low Latency: Multithreading and Low-Level Optimizations, 2025 (Free PDF download)
  7. Advanced C++ Memory Techniques: Efficiency & Safety, 2025 (Free PDF download)
  8. Efficient C++ Multithreading: Modern Concurrency Optimization, 2025 (Free PDF download)
  9. Efficient Modern C++ Data Structures, 2025 (Free PDF download)
  10. Safe C++: Fixing Memory Safety Issues, 2024 (Free PDF download)
  11. Low Latency C++: Multithreading and Hotpath Optimizations, 2024 (Free PDF download)

Patent filings:

AI Articles and Papers

Articles and general publications:

Generative AI Applications Book



Generative AI in C++ The new Generative AI Applications book by Aussie AI co-founders:
  • Deciding on your AI project
  • Planning for success and safety
  • Designs and LLM architectures
  • Expediting development
  • Implementation and deployment

Get your copy from Amazon: Generative AI Applications

Advanced C++ Coding for AI Developers

Generative AI in C++ The new Generative AI programming book by Aussie AI co-founders:
  • Generative AI coding in C++
  • Transformer engines & LLM models
  • Phone and desktop AI
  • Code examples
  • Research citations
  • Full text online: Table of Contents
  • Buy your copy: Generative AI in C++

CUDA C++ Optimization Book



CUDA C++ Optimization The new CUDA C++ Optimization book:
  • Faster CUDA C++ kernels
  • Optimization tools & techniques
  • Compute optimization
  • Memory optimization

Get your copy from Amazon: CUDA C++ Optimization

CUDA C++ Debugging Book



CUDA C++ Optimization The new CUDA C++ Debugging book:
  • Debugging CUDA C++ kernels
  • Tools & techniques
  • Self-testing & reliability
  • Common GPU kernel bugs

Get your copy from Amazon: CUDA C++ Debugging

Safe C++ Book



Safe C++: Fixing Memory Safety Issues The new Safe C++ coding book:
  • Memory Safety
  • Rust versus C++
  • The Safe C++ Standard
  • Pragmatic Memory Safety

Get your copy from Amazon: Safe C++: Fixing Memory Safety Issues