Aussie AI Original Research
-
by David Spuler, Ph.D.
Research Papers & Articles
Innovative research areas with publications and patents:
- Optimizing On-Device Transformer Inference for Source Code Checking: IP Australia Patent Filing, June 2024
- Heuristic Optimization of Transformer On-Device Inference: IP Australia Patent Filing, June 2024
- Speculative Decoding With Early Exit for Optimized Transformer On-Device Inference: IP Australia Patent Filing, June 2024
- Edit Decoding With Early Exit for Optimized Transformer On-Device Inference: IP Australia Patent Filing, June 2024
- Sequential Speculative Decoding
Research Overviews
Lists, surveys, and overviews of research areas:
- 500+ Techniques for LLM Inference Optimization
- Curated literature survey of AI research papers
- List of 100+ AI Smartness Techniques
- What's Hot in LLM Inference Optimization in 2025?
- Chain-of-Thought Efficiency Optimization
- What's New in Speculative Decoding?
- Hot Inference Optimization Techniques
- LLM Inference Optimization Research Ideas
- Promising LLM Inference Optimization Research
Other Research Articles
Innovative articles on research topics:
- Vector Dot Product Optimization in C++ with Instruction-Level Parallelism
- Reasoning Inference Optimization
- Weight Clustering Needs a Refresh
- State-of-the-Art LLM Backends
- Generative AI Textbook Free Online
AI Books from Aussie AI
|
The Sweetest Lesson: Your Brain Versus AI: new book on AI intelligence theory:
Get your copy from Amazon: The Sweetest Lesson |
|
RAG Optimization: Accurate and Efficient LLM Applications:
new book on RAG architectures:
Get your copy from Amazon: RAG Optimization |
|
Generative AI Applications book:
Get your copy from Amazon: Generative AI Applications |
|
Generative AI programming book:
Get your copy from Amazon: Generative AI in C++ |
|
CUDA C++ Optimization book:
Get your copy from Amazon: CUDA C++ Optimization |
|
CUDA C++ Debugging book:
Get your copy from Amazon: CUDA C++ Debugging |
Free AI and C++ Books
Generative AI programming books:
- The Sweetest Lesson: Your Brain Versus AI, November 2025: full text online, free PDF available
- RAG Optimization: Accurate and Efficient LLM Applications, June 2025: full text online, free PDF available
- Generative AI Applications: Planning, Design and Implementation, November 2024: full text online, free PDF available
- Generative AI in C++ (Spuler, March 2024): full text online, free PDF available, table of contents, bonus materials, reference lists, source code
CUDA C++ GPU Programming Books:
- CUDA C++ Optimization: Coding Faster GPU Kernels, July 2024: full text online, bonus materials, free PDF available
- CUDA C++ Debugging: Safer GPU Kernel Programming, July 2024: full text online, free PDF available
Modern C++ Programming Books
- C++ AVX Optimization: CPU SIMD Vectorization, 2025: full text online, free PDF available
- C++ Ultra-Low Latency: Multithreading and Low-Level Optimizations, 2025: full text online, free PDF available
- Advanced C++ Memory Techniques: Efficiency and Safety, 2025: full text online, free PDF available
- Efficient C++ Multithreading: Modern Concurrency Optimization, 2025: free PDF available
- Efficient Modern C++ Data Structures: Container and Algorithm Optimizations, 2025: free PDF available
- C++ Low Latency: Multithreading and Hotpath Optimizations, 2025: free PDF available
- Safe C++: Fixing Memory Safety Issues, Oct 2024: full text online, free PDF available
More AI Research Topics
Read more about: