Aussie AI
Publications
List of Aussie AI Publications
New LLM inference optimization research book:
More LLM and C++ books for CPU and GPU:
- LLM Inference Optimization: State-of-the-Art Research Breakthroughs, 2026 — theoretical examination of recent and emerging LLM breakthroughs.
- GPU Bit Tricks: LLM Kernel Arithmetic Optimizations, 2026 — low-level integer and floating-point kernel development tricks.
- C++ Branchless Coding: CPU and GPU Efficiency , 2026 — low latency C++ coding.
- C++ Bit Tricks: Integer and Floating-Point Optimizations, 2026 — low-level bitwise coding, floating-point, SWAR, etc.
Free AI and C++ books full text or PDF download:
- Free AI books (full text online)
- Free PDF book downloads
Published books for AI engineers:
- LLM Inference Optimization: State-of-the-Art Research, new book by David Spuler, table of contents, buy on Amazon.
- RAG Optimization: Accurate and Efficient LLM Applications:, 2025 (Free PDF download)
- Generative AI Applications: Planning, Design, and Implementation, 2025 (Free PDF download)
- Generative AI in C++: Coding Transformers and LLMs, 2025 (Full text online free, Free PDF download)
- CUDA C++ Optimization: Programming Faster GPU Kernels, 2024 (Free PDF download)
- CUDA C++ Debugging: Safer GPU Kernels, 2024 (Free PDF download)
- The Sweetest Lesson: Your Brain Versus AI, 2025 (Free PDF download)
Published books for C++ software developers:
- LLM Inference Optimization: State-of-the-Art Research Breakthroughs, 2026 — theoretical examination of recent and emerging LLM breakthroughs.
- GPU Bit Tricks: LLM Kernel Arithmetic Optimizations, 2026 — low-level integer and floating-point kernel development tricks.
- C++ Branchless Coding: CPU and GPU Efficiency , 2026 — low latency C++ coding.
- C++ Bit Tricks: Integer and Floating-Point Optimizations, 2026 — low-level bitwise coding, floating-point, SWAR, etc.
- C++ AVX Optimization: CPU SIMD Vectorization, 2025 (Free PDF download)
- C++ Ultra-Low Latency: Multithreading and Low-Level Optimizations, 2025 (Free PDF download)
- Advanced C++ Memory Techniques: Efficiency & Safety, 2025 (Free PDF download)
- Efficient C++ Multithreading: Modern Concurrency Optimization, 2025 (Free PDF download)
- Efficient Modern C++ Data Structures, 2025 (Free PDF download)
- Safe C++: Fixing Memory Safety Issues, 2024 (Free PDF download)
- Low Latency C++: Multithreading and Hotpath Optimizations, 2024 (Free PDF download)
Patent filings:
- Optimizing On-Device Transformer Inference for Source Code Checking: IP Australia Patent Filing, June 2024
- Heuristic Optimization of Transformer On-Device Inference: IP Australia Patent Filing, June 2024
- Speculative Decoding With Early Exit for Optimized Transformer On-Device Inference: IP Australia Patent Filing, June 2024
- Edit Decoding With Early Exit for Optimized Transformer On-Device Inference: IP Australia Patent Filing, June 2024
AI Articles and Papers
Articles and general publications:
- Blog Articles
- Original Research
- Aussie AI Research
- AI Research Literature Survey
- Patents by Aussie AI
Generative AI Applications Book
|
The new Generative AI Applications book by Aussie AI co-founders:
Get your copy from Amazon: Generative AI Applications |
Advanced C++ Coding for AI Developers
|
The new Generative AI programming book by Aussie AI co-founders:
|
CUDA C++ Optimization Book
|
The new CUDA C++ Optimization book:
Get your copy from Amazon: CUDA C++ Optimization |
CUDA C++ Debugging Book
|
The new CUDA C++ Debugging book:
Get your copy from Amazon: CUDA C++ Debugging |
Safe C++ Book
|
The new Safe C++ coding book:
Get your copy from Amazon: Safe C++: Fixing Memory Safety Issues |