Aussie AI Blog
by David Spuler, Ph.D.
Latest Blog Articles
- Vector Dot Product Optimization in C++ with Instruction-Level Parallelism
- List of 100+ AI Smartness Techniques
- C++ Low Latency Book
Most Popular
Low Latency C++ Blog Articles
- C++ Low Latency Book
- False Sharing and Cache Line Sizes in Multithreading
- Overview of C++ Multithreading Optimizations
- Low Latency Programming
CUDA C++ Efficiency Articles
CUDA C++ Safety Articles
C++ Safety and Debugging Articles
- DIY Preventive C++ Memory Safety
- Canary Values & Redzones for Memory-Safe C++
- Use-After-Free Memory Errors in C++
- Array Bounds Violations and Memory Safe C++
- Poisoning Memory Blocks for Safer C++
- Uninitialized Memory Safety in C++
- DIY Memory Safety in C++
- Memory Safe C++ Library Functions
- Smart Stack Buffers for Memory Safe C++
- Safe C++ Text Buffers with snprintf
- Safe C++ Standard and Memory Safety Book
Aussie AI Book Releases
- C++ Low Latency Book
- Generative AI Applications: Planning, Design, and Implementation
- CUDA C++ Optimization
- Debugging CUDA C++ Kernels
- Safe C++ Standard and Memory Safety Book
- Generative AI in C++: Coding Transformers and LLMs
March 2025 Blog Articles
- C++ Low Latency Book
- False Sharing and Cache Line Sizes in Multithreading
- Overview of C++ Multithreading Optimizations
- What's Hot in LLM Inference Optimization in 2025?
- AI Research by Country
- What's New in Speculative Decoding?
February 2025 Blog Articles
- DeepSeek is Good for NVIDIA and the AI Industry
- DeepSeek Upends Progress in Reasoning and AGI
- Low Latency Programming
January 2025 Blog Articles
- Debugging OpenAI Node.js API Wrappers
- Chain-of-Thought Efficiency Optimization
- Reasoning Decoding Algorithms
- Reasoning Inference Optimization
December 2024 Blog Articles
- AI Hitting the Wall?
- Reasoning is the New AI Middleware
- Humans are the Top Layer of the AI Stack
- The AI Application Layer
- Consumer vs Enterprise AI
November 2024 Blog Articles
- DIY Preventive C++ Memory Safety
- Canary Values & Redzones for Memory-Safe C++
- Use-After-Free Memory Errors in C++
- Array Bounds Violations and Memory Safe C++
- Poisoning Memory Blocks for Safer C++
- Uninitialized Memory Safety in C++
- DIY Memory Safety in C++
October 2024 Blog Articles
- CUDA C++ Floating Point Exceptions
- Memory Safe C++ Library Functions
- Smart Stack Buffers for Memory Safe C++
- Safe C++ Text Buffers with snprintf
- Weight Clustering Needs a Refresh
- Generalizing Prefix KV Caching to RAG Chunks
- RAG Optimization via Caching
- CUDA Memory Coalescing Optimizations
- CUDA GPU Thread Divergence
September 2024 Blog Articles
- Deciding on Your AI Business Project
- Planning Your AI Business Project
- CUDA Basic C++ Programming Mistakes
- 500+ Techniques for LLM Inference Optimization
August 2024 Blog Articles
- State-of-the-Art LLM Backends
- Hot Inference Optimization Techniques
- Inference Optimization Research Ideas
- Sequential Speculative Decoding
- Generative AI Textbook Free Online
AI Books from Aussie AI
- The Sweetest Lesson: Your Brain Versus AI (new book on AI intelligence theory). Get your copy from Amazon: The Sweetest Lesson
- RAG Optimization: Accurate and Efficient LLM Applications (new book on RAG architectures). Get your copy from Amazon: RAG Optimization
- Generative AI Applications book. Get your copy from Amazon: Generative AI Applications
- Generative AI programming book. Get your copy from Amazon: Generative AI in C++
- CUDA C++ Optimization book. Get your copy from Amazon: CUDA C++ Optimization
- CUDA C++ Debugging book. Get your copy from Amazon: CUDA C++ Debugging
More AI Research Topics
Read more about: