Aussie AI

RAG Ontology Architectures

  • Last Updated 22 October, 2025
  • by David Spuler, Ph.D.

Research on RAG Ontology Architectures

Research papers include:

  • Prajwal Kailas, Max Homilius, Rahul C. Deo, Calum A. MacRae, 16 Dec 2024, NoteContrast: Contrastive Language-Diagnostic Pretraining for Medical Text, https://arxiv.org/abs/2412.11477
  • Muhayy Ud Din, Jan Rosell, Waseem Akram, Isiah Zaplana, Maximo A Roa, Lakmal Seneviratne, Irfan Hussain, 10 Dec 2024, Ontology-driven Prompt Tuning for LLM-based Task and Motion Planning, https://arxiv.org/abs/2412.07493 https://muhayyuddin.github.io/llm-tamp/ (Detecting objects in the prompt text and then using a RALM algorithm to query an ontology database.)
  • Oleksandr Palagin, Vladislav Kaverinskiy, Anna Litvin, Kyrylo Malakhov, 11 Jul 2023, OntoChatGPT Information System: Ontology-Driven Structured Prompts for ChatGPT Meta-Learning, International Journal of Computing, 22(2), 170-183, https://arxiv.org/abs/2307.05082 https://doi.org/10.47839/ijc.22.2.3086 https://computingonline.net/computing/article/view/3086
  • Alhassan Mumuni, Fuseini Mumuni, 6 Jan 2025, Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches, https://arxiv.org/abs/2501.03151
  • Kartik Sharma, Peeyush Kumar, Yunqing Li, 12 Dec 2024, OG-RAG: Ontology-Grounded Retrieval-Augmented Generation For Large Language Models, https://arxiv.org/abs/2412.15235
  • Chengshuai Zhao, Garima Agrawal, Tharindu Kumarage, Zhen Tan, Yuli Deng, Ying-Chih Chen, Huan Liu, 10 Dec 2024, Ontology-Aware RAG for Improved Question-Answering in Cybersecurity Education, https://arxiv.org/abs/2412.14191
  • Ramona Kühn, Jelena Mitrović, Michael Granitzer, 18 Dec 2024, Enhancing Rhetorical Figure Annotation: An Ontology-Based Web Application with RAG Integration, https://arxiv.org/abs/2412.13799
  • Xueli Pan, Jacco van Ossenbruggen, Victor de Boer, Zhisheng Huang, 13 Sep 2024, A RAG Approach for Generating Competency Questions in Ontology Engineering, https://arxiv.org/abs/2409.08820
  • Rafael Teixeira de Lima, Shubham Gupta, Cesar Berrospi, Lokesh Mishra, Michele Dolfi, Peter Staar, Panagiotis Vagenas, 29 Nov 2024, Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems, https://arxiv.org/abs/2411.19710
  • Yuxing Lu, Sin Yee Goi, Xukai Zhao, Jinzhuo Wang, 22 Jan 2025 (v2), Biomedical Knowledge Graph: A Survey of Domains, Tasks, and Real-World Applications, https://arxiv.org/abs/2501.11632
  • Battazza, I. F. C., Rodrigues, C. M. d. O., & Oliveira, J. F. L. d. (2025). A Framework for Market State Prediction with Ontological Asset Selection: A Multimodal Approach. Applied Sciences, 15(3), 1034. https://doi.org/10.3390/app15031034 https://www.mdpi.com/2076-3417/15/3/1034
  • AD Al Hauna, AP Yunus, M Fukui, S Khomsah - International Journal on Robotics, Apr 2025, Enhancing LLM Efficiency: A Literature Review of Emerging Prompt Optimization Strategies, https://doi.org/10.33093/ijoras.2025.7.1.9 https://mmupress.com/index.php/ijoras/article/view/1311 PDF: https://mmupress.com/index.php/ijoras/article/view/1311/834
  • Jean-Philippe Corbeil, Amin Dada, Jean-Michel Attendu, Asma Ben Abacha, Alessandro Sordoni, Lucas Caccia, François Beaulieu, Thomas Lin, Jens Kleesiek, Paul Vozila, 15 May 2025, A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment, https://arxiv.org/abs/2505.10717
  • Junde Wu, Jiayuan Zhu, Yunli Qi, Jingkun Chen, Min Xu, Filippo Menolascina, Yueming Jin, Vicente Grau, Aug 2025, Medical Graph RAG: Evidence-based Medical Large Language Model via Graph Retrieval-Augmented Generation, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 28443–28467, July 27 - August 1, 2025, https://aclanthology.org/2025.acl-long.1381.pdf
  • Ziheng Zhang, Zhenxi Lin, Yefeng Zheng, and Xian Wu. 2025. How much Medical Knowledge do LLMs have? An Evaluation of Medical Knowledge Coverage for LLMs. In Proceedings of the ACM on Web Conference 2025 (WWW '25). Association for Computing Machinery, New York, NY, USA, 5330–5341. https://doi.org/10.1145/3696410.3714535 https://dl.acm.org/doi/abs/10.1145/3696410.3714535 https://dl.acm.org/doi/pdf/10.1145/3696410.3714535
  • Yan Ting Chok, Soyon Park, Seungheun Baek, Hajung Kim, Junhyun Lee, Jaewoo Kang, 14 Aug 2025, HiRef: Leveraging Hierarchical Ontology and Network Refinement for Robust Medication Recommendation, https://arxiv.org/abs/2508.10425
  • Yiping Song, Jiaoyan Chen and Renate A. Schmidt, 14 Aug 2025, GenOM: Ontology Matching with Description Generation and Large Language Model, https://arxiv.org/abs/2508.10703
  • Qing Cheng, Zefan Zeng, Xingchen Hu, Yuehang Si, Zhong Liu, 23 Jul 2025, A Survey of Event Causality Identification: Taxonomy, Challenges, Assessment, and Prospects, https://arxiv.org/abs/2411.10371
  • Stefan Borgwardt, Duy Nhu, Gabriele Röger, 23 Jul 2025, Automated planning with ontologies under coherence update semantics (Extended Version), https://arxiv.org/abs/2507.15120
  • Lam Nguyen, Erika Barcelos, Roger French, Yinghui Wu, 18 Jul 2025, KROMA: Ontology Matching with Knowledge Retrieval and Large Language Models, https://arxiv.org/abs/2507.14032
  • Oussama Bouaggad, Natalia Grabar, 18 Jul 2025, Search-Optimized Quantization in Biomedical Ontology Alignment, https://arxiv.org/abs/2507.13742
  • Hui Yang, Jiaoyan Chen, Yuan He, Yongsheng Gao, Ian Horrocks, 18 Jul 2025, Language Models as Ontology Encoders, https://arxiv.org/abs/2507.14334
  • Anna Sofia Lippolis, Mohammad Javad Saeedizade, Robin Keskisärkkä, Aldo Gangemi, Eva Blomqvist, Andrea Giovanni Nuzzolese, 19 Jul 2025, Large Language Models Assisting Ontology Evaluation, https://arxiv.org/abs/2507.14552
  • Ritesh Chandra, Shashi Shekhar Kumar, Rushil Patra, Sonali Agarwal, 21 Jul 2025, Decision support system for Forest fire management using Ontology with Big Data and LLMs, https://arxiv.org/abs/2405.11346
  • Devichand Budagam, Ashutosh Kumar, Mahsa Khoshnoodi, Sankalp KJ, Vinija Jain, Aman Chadha, 21 Jul 2025, Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles, https://arxiv.org/abs/2406.12644
  • Soumen Sinha, Tanisha Rana, Rahul Roy, 22 Jul 2025, A novel approach to navigate the taxonomic hierarchy to address the Open-World Scenarios in Medicinal Plant Classification, https://arxiv.org/abs/2502.17289
  • Maurice Funk, Marvin Grosser, Carsten Lutz, 11 Aug 2025, Fitting Description Logic Ontologies to ABox and Query Examples, https://arxiv.org/abs/2508.08007
  • Xiaohua Feng, Jiaming Zhang, Fengyuan Yu, Chengye Wang, Li Zhang, Kaixiang Li, Yuyuan Li, Chaochao Chen, Jianwei Yin, 26 Jul 2025, A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction, https://arxiv.org/abs/2507.19894
  • Md Fantacher Islam, Jarrod Mosier, Vignesh Subbian, 26 Jul 2025, NIRS: An Ontology for Non-Invasive Respiratory Support in Acute Care, https://arxiv.org/abs/2507.19992
  • Joydeep Chandra and Satyam Kumar Navneet, 26 Jul 2025, Policy-Driven AI in Dataspaces: Taxonomy, Explainability, and Pathways for Compliant Innovation, https://arxiv.org/abs/2507.20014
  • Wenbin Guo, Xin Wang, Jiaoyan Chen, Zhao Li and Zirui Chen, 28 Jul 2025, Ontology-Enhanced Knowledge Graph Completion using Large Language Models, https://arxiv.org/abs/2507.20643
  • Federico Donato and Adrien Barton, 26 Jul 2025, An ontological analysis of risk in Basic Formal Ontology, https://arxiv.org/abs/2507.21171
  • Vishal Raman, Vijai Aravindh R, 29 Jul 2025, Evo-DKD: Dual-Knowledge Decoding for Autonomous Ontology Evolution in Large Language Models, https://arxiv.org/abs/2507.21438
  • Sabrina Patania, Luca Annese, Cansu Koyuturk, Azzurra Ruggeri, Dimitri Ognibene, 25 May 2025, Dialogic Social Learning for Artificial Agents: Enhancing LLM Ontology Acquisition through Mixed-Initiative Educational Interactions, https://arxiv.org/abs/2507.21065
  • Meghyn Bienvenu, Diego Figueira, Pierre Lafourcade, 31 Jul 2025, Tractable Responsibility Measures for Ontology-Mediated Query Answering, https://arxiv.org/abs/2507.23191
  • Zhangcheng Qiang, Kerry Taylor, Weiqing Wang, Jing Jiang, 25 Mar 2025, OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching, https://arxiv.org/abs/2503.21813
  • Renato Vukovic, Carel van Niekerk, Michael Heck, Benjamin Ruppik, Hsien-Chin Lin, Shutong Feng, Nurul Lubis, Milica Gasic, 31 Jul 2025, Text-to-SQL Task-oriented Dialogue Ontology Construction, https://arxiv.org/abs/2507.23358
  • Haonan Bian, Yutao Qi, Rui Yang, Yuanxi Che, Jiaqian Wang, Heming Xia, Ranran Zhen, 2 Aug 2025, From Query to Logic: Ontology-Driven Multi-Hop Reasoning in LLMs, https://arxiv.org/abs/2508.01424
  • Manuel Cossio, 3 Aug 2025, A comprehensive taxonomy of hallucinations in Large Language Models, https://arxiv.org/abs/2508.01781
  • Yuki Yamagata, Koji Kyoda, Hiroya Itoga, Emi Fujisawa and Shuichi Onami, 4 Aug 2025, SSBD Ontology: A Two-Tier Approach for Interoperable Bioimaging Metadata, https://arxiv.org/abs/2508.02084
  • Haoran Sun, Yusen Wu, Peng Wang, Wei Chen, Yukun Cheng, Xiaotie Deng, Xu Chu, 5 Aug 2025, Game Theory Meets Large Language Models: A Systematic Survey with Taxonomy and New Frontiers, https://arxiv.org/abs/2502.09053
  • Alessia Pisu, Livio Pompianu, Francesco Osborne, Diego Reforgiato Recupero, Daniele Riboni, Angelo Salatino, 6 Aug 2025, A Hybrid AI Methodology for Generating Ontologies of Research Topics from Scientific Paper Corpora, https://arxiv.org/abs/2508.04213
  • Yuyang Liu, Qiuhe Hong, Linlan Huang, Alexandra Gomez-Villa, Dipam Goswami, Xialei Liu, Joost van de Weijer, Yonghong Tian, 6 Aug 2025, Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting, https://arxiv.org/abs/2508.04227
  • Sigma Jahan, Saurabh Singh Rajput, Tushar Sharma, Mohammad Masudur Rahman, 6 Aug 2025, Taxonomy of Faults in Attention-Based Neural Networks, https://arxiv.org/abs/2508.04925
  • Anouk Oudshoorn, Magdalena Ortiz, Mantas Simkus, 16 Jul 2025, SHACL Validation in the Presence of Ontologies: Semantics and Rewriting Techniques, https://arxiv.org/abs/2507.12286
  • Sviatoslav Lushnei, Dmytro Shumskyi, Severyn Shykula, Ernesto Jimenez-Ruiz, Artur d'Avila Garcez, 11 Aug 2025, Large Language Models as Oracles for Ontology Alignment, https://arxiv.org/abs/2508.08500
  • Amir Mohammad Salehoof, Ali Ramezani, Yadollah Yaghoobzadeh, Majid Nili Ahmadabadi, 12 Aug 2025, A Dual-Axis Taxonomy of Knowledge Editing for LLMs: From Mechanisms to Functions, https://arxiv.org/abs/2508.08795
  • Farzana Zahid, Anjalika Sewwandi, Lee Brandon, Vimal Kumar, Roopak Sinha, 12 Aug 2025, Securing Educational LLMs: A Generalised Taxonomy of Attacks on LLMs and DREAD Risk Assessment, https://arxiv.org/abs/2508.08629
  • Jiawei Zhou, Amy Z. Chen, Darshi Shah, Laura M. Schwab Reese, and Munmun De Choudhury, 11 Aug 2025, A Risk Taxonomy and Reflection Tool for Large Language Model Adoption in Public Health, https://arxiv.org/abs/2411.02594
  • David J. Moore, 18 Aug 2025, A Taxonomy of Hierarchical Multi-Agent Systems: Design Patterns, Coordination Mechanisms, and Industrial Applications, https://arxiv.org/abs/2508.12683
  • Zabir Al Nazi, Vagelis Hristidis, Aaron Lawson McLean, Jannat Ara Meem and Md Taukir Azam Chowdhury, 15 Aug 2025, Ontology-Guided Query Expansion for Biomedical Document Retrieval using Large Language Models, https://arxiv.org/abs/2508.11784
  • Simon Hosemann, Jean Christoph Jung, Carsten Lutz, Sebastian Rudolph, 11 Aug 2025, Fitting Ontologies and Constraints to Relational Structures, https://arxiv.org/abs/2508.13176
  • Hui Wei, Dong Yoon Lee, Shubham Rohal, Zhizhang Hu, Ryan Rossi, Shiwei Fang, Shijia Pan, 21 Aug 2025, A Survey of Foundation Models for IoT: Taxonomy and Criteria-Based Analysis, https://arxiv.org/abs/2506.12263
  • Runxuan Liu, Bei Luo, Jiaqi Li, Baoxin Wang, Ming Liu, Dayong Wu, Shijin Wang, Bing Qin, 21 Aug 2025, Ontology-Guided Reverse Thinking Makes Large Language Models Stronger on Knowledge Graph Question Answering, https://arxiv.org/abs/2502.11491
  • John Beverley and Danielle Limbaugh, 26 Jul 2025, Ontological Foundations of State Sovereignty, https://arxiv.org/abs/2507.21172
  • Michael Banf and Johannes Kuhn, 22 Aug 2025, Tripartite-GraphRAG via Plugin Ontologies, https://arxiv.org/abs/2504.19667
  • Natalie Abreu, Edwin Zhang, Eran Malach, Naomi Saphra, 25 Aug 2025, A Taxonomy of Transcendence, https://arxiv.org/abs/2508.17669
  • Aarush Kumbhakern, Saransh Kumar Gupta, Lipika Dey, Partha Pratim Das, 4 Sep 2025, Towards an Action-Centric Ontology for Cooking Procedures Using Temporal Graphs, https://arxiv.org/abs/2509.04159
  • John Wentworth, David Lorell, 4 Sep 2025, Natural Latents: Latent Variables Stable Across Ontologies, https://arxiv.org/abs/2509.03780
  • Barbara Gendron (LORIA, UL), Gaël Guibon (LIPN, LORIA), Mathieu D'aquin (LORIA, UL), 5 Sep 2025, Towards Ontology-Based Descriptions of Conversations with Qualitatively-Defined Concepts, https://arxiv.org/abs/2509.04926
  • Heinke Hihn, Dennis A. V. Dittrich, Carl Jeske, Cayo Costa Sobral, Helio Pais, and Timm Lochmann, 5 Sep 2025, Ontology-Aligned Embeddings for Data-Driven Labour Market Analytics, https://arxiv.org/abs/2509.04942
  • Cosmin-Andrei Hatfaludi and Alex Serban, 5 Sep 2025, Foundational Models and Federated Learning: Survey, Taxonomy, Challenges and Practical Insights, https://arxiv.org/abs/2509.05142
  • Alice Schiavone, Marco Fraccaro, Lea Marie Pehrson, Silvia Ingala, Rasmus Bonnevie, Michael Bachmann Nielsen, Vincent Beliveau, Melanie Ganz, Desmond Elliott, 29 Aug 2025, MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification, https://arxiv.org/abs/2509.04471
  • Samira Khorshidi, Azadeh Nikfarjam, Suprita Shankar, Yisi Sang, Yash Govind, Hyun Jang, Ali Kasgari, Alexis McClimans, Mohamed Soliman, Vishnu Konda, Ahmed Fakhry, Xiaoguang Qi, 4 Sep 2025, ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs, https://arxiv.org/abs/2509.04696
  • Hudson de Martim, 26 Aug 2025, An Ontology-Driven Graph RAG for Legal Norms: A Hierarchical, Temporal, and Deterministic Approach, https://arxiv.org/abs/2505.00039
  • Felix Nützel, Mischa Dombrowski, Bernhard Kainz, 27 Aug 2025, Ontology-Based Concept Distillation for Radiology Report Retrieval and Labeling, https://arxiv.org/abs/2508.19915
  • Generoso Immediato, 26 Aug 2025, Epistemic Trade-Off: An Analysis of the Operational Breakdown and Ontological Limits of "Certainty-Scope" in AI, https://arxiv.org/abs/2508.19304
  • Samah Alkhuzaey, Floriana Grasso, Terry R. Payne and Valentina Tamma, 27 Aug 2025, Evaluating the Fitness of Ontologies for the Task of Question Generation, https://arxiv.org/abs/2504.07994
  • Mohsen Nayebi Kerdabadi, Arya Hadizadeh Moghaddam, Dongjie Wang, Zijun Yao, 29 Aug 2025, Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation, https://arxiv.org/abs/2508.21320
  • Maijunxian Wang, Ran Ji, 2 Sep 2025, AGI as Second Being: The Structural-Generative Ontology of Intelligence, https://arxiv.org/abs/2509.02089
  • Aryan Amit Barsainyan, Jing Yu Lim, Dianbo Liu, 1 Sep 2025, Toward a Unified Benchmark and Taxonomy of Stochastic Environments, https://arxiv.org/abs/2509.01793
  • Luca Cotti, Anisa Rula, Devis Bianchini, Federico Cerutti, 26 Aug 2025, Enabling Transparent Cyber Threat Intelligence Combining Large Language Models and Domain Ontologies, https://arxiv.org/abs/2509.00081
  • Songhui Yue, 29 Aug 2025, LLM-based Triplet Extraction for Automated Ontology Generation in Software Engineering Standards, https://arxiv.org/abs/2509.00140
  • Peter Stockinger (ESCOM, PLIDAM, Inalco, CIS), 1 Sep 2025, Animer une base de connaissance: des ontologies aux modèles d'I.A. générative, https://arxiv.org/abs/2509.01304
  • Theodor Stoecker, Samed Bayer, and Ingo Weber, 28 Aug 2025, Bias Mitigation for AI-Feedback Loops in Recommender Systems: A Systematic Literature Review and Taxonomy, https://arxiv.org/abs/2509.00109
  • Khalid M. Saqr, 2 Sep 2025, A Novel Kuhnian Ontology for Epistemic Classification of STM Scholarly Articles, https://arxiv.org/abs/2002.03531
  • Shriyank Somvanshi, Md Monzurul Islam, Syed Aaqib Javed, Gaurab Chhetri, Kazi Sifatul Islam, Tausif Islam Chowdhury, Sazzad Bin Bashar Polock, Anandi Dutta, Subasish Das, 31 Aug 2025, A Comprehensive Survey on Bio-Inspired Algorithms: Taxonomy, Applications, and Future Directions, https://arxiv.org/abs/2506.04238
  • Chengshuai Zhao, Riccardo De Maria, Tharindu Kumarage, Kumar Satvik Chaudhary, Garima Agrawal, Yiwen Li, Jongchan Park, Yuli Deng, Ying-Chih Chen, Huan Liu, 3 Sep 2025, CyberBOT: Towards Reliable Cybersecurity Education via Ontology-Grounded Retrieval Augmented Generation, https://arxiv.org/abs/2504.00389
  • Aleksandr Boldachev, 11 Sep 2025, Executable Ontologies: Synthesizing Event Semantics with Dataflow Architecture, https://arxiv.org/abs/2509.09775
  • Hanna Abi Akl, 12 Sep 2025, Investigating Language Model Capabilities to Represent and Process Formal Knowledge: A Preliminary Study to Assist Ontology Engineering, https://arxiv.org/abs/2509.10249
  • Teresa Salazar, Helder Araújo, Alberto Cano, Pedro Henriques Abreu, 12 Sep 2025, A Survey on Group Fairness in Federated Learning: Challenges, Taxonomy of Solutions and Directions for Future Research, https://arxiv.org/abs/2410.03855
  • Lukas Laakmann, Seyyid A. Ciftci, Christian Janiesch, 19 Sep 2025, A Nascent Taxonomy of Machine Learning in Intelligent Robotic Process Automation, https://arxiv.org/abs/2509.15730
  • Natallia Kokash, Bernard de Bono and Tom Gillespie, 19 Sep 2025, Ontology Creation and Management Tools: the Case of Anatomical Connectivity, https://arxiv.org/abs/2509.15780
  • Xinyu Zhang, Pei Zhang, Shuang Luo, Jialong Tang, Yu Wan, Baosong Yang, Fei Huang, 13 Sep 2025, CultureSynth: A Hierarchical Taxonomy-Guided and Retrieval-Augmented Framework for Cultural Question-Answer Synthesis, https://arxiv.org/abs/2509.10886
  • Haoye Tian, Chong Wang, BoYang Yang, Lyuye Zhang, Yang Liu, 17 Sep 2025, A Taxonomy of Prompt Defects in LLM Systems, https://arxiv.org/abs/2509.14404
  • Tom Westermann, Malte Ramonat, Johannes Hujer, Felix Gehlhoff, Alexander Fay, 18 Sep 2025, Automatic Mapping of AutomationML Files to Ontologies for Graph Queries and Validation, https://arxiv.org/abs/2504.21694
  • Pranav Pawar, Kavish Shah, Akshat Bhalani, Komal Kasat, Dev Mittal, Hadi Gala, Deepali Patil, Nikita Raichada, Monali Deshmukh, 10 Sep 2025, Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models, https://arxiv.org/abs/2509.08270
