Aussie AI

Applications of Generative AI

Last Updated 17 November, 2025

by David Spuler, Ph.D.

Apps Built on AI

Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
Evelyn Cheng Apr 17, 2024 Baidu releases new AI tools to promote application development, https://www.cnbc.com/2024/04/18/baidu-releases-new-ai-tools-to-promote-application-development.html
David Cahn, Sep 20, 2023, AI’s $200B Question: GPU capacity is getting overbuilt. Long-term, this is good. Short-term, things could get messy, https://www.sequoiacap.com/article/follow-the-gpus-perspective/
Kirill Kolodiazhnyi, May 15, 2020, Hands-On Machine Learning with C++: Build, train, and deploy end-to-end machine learning and deep learning pipelines, https://www.amazon.com/Hands-Machine-Learning-end-end/dp/1789955335/
Prakash Joshi Pax, Aug 26, 2024, Fabric: The Best AI Tool That Nobody is Talking About. An open-source AI tool to automate every day tasks https://beingpax.medium.com/why-fabric-ai-can-change-the-way-you-use-ai-973e725354da
Andrew Ng, Sep 2024, X post, https://x.com/AndrewYNg/status/1829190549842321758 (Dropping token prices for LLMs means developers can focus on the app layer.)
Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
Sonya Huang, Pat Grady, and o1, Sequoia, October 9, 2024 Generative AI’s Act o1, https://www.sequoiacap.com/article/generative-ais-act-o1/
Apple, December 16, 2024, Apple reveals 2024’s most downloaded apps and games on the App Store, https://www.apple.com/newsroom/2024/12/apple-reveals-2024s-most-downloaded-apps-and-games-on-the-app-store/
Sarah Perez, December 16, 2024, Temu is the most downloaded app on the US App Store in 2024, https://techcrunch.com/2024/12/16/temu-is-the-most-downloaded-app-on-the-u-s-app-store-in-2024/
Jess Weatherbed, Dec 10, 2024, AI is booming on the App Store, and developers are taking advantage of it. Many high-ranking AI apps feel like an attempted cash grab, and it’s not easy to spot the trash from the treasure. https://www.theverge.com/2024/12/9/24314972/apple-app-store-ai-apps-art-design-photography
A16Z, The Top 100 [Gen AI] Consumer Apps, August 27, 2025, https://a16z.com/100-gen-ai-apps-5/

Building Applications for Generative AI

Research on building Gen AI apps:

Metin Karatas, June 25, 2024, Developing AI Applications: An Introduction (New Edition), Rheinwerk Computing; New edition, https://www.amazon.com/Developing-AI-Applications-Metin-Karatas/dp/1493226010/
Mistral AI Team, Aug 7, 2024, Build, tweak, repeat: Making it easier to develop and share generative AI applications, https://mistral.ai/news/build-tweak-repeat/
Yorick Sens, Henriette Knopp, Sven Peldszus, Thorsten Berger, 12 Aug 2024, A Large-Scale Study of Model Integration in ML-Enabled Software Systems, https://arxiv.org/abs/2408.06226
Google, 2024, L’Oréal: Launching Gen AI as a Service in 3 months with Cloud Run and LangChain, https://services.google.com/fh/files/misc/google_loreal_with_langchain_case_study.pdf
Raymond Lo, Jul 10, 2024, How to Build Faster GenAI Apps with Fewer Lines of Code using OpenVINO™ GenAI API, https://medium.com/openvino-toolkit/how-to-build-faster-genai-apps-with-fewer-lines-of-code-using-openvino-genai-api-5dd5fcabea17
Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247
Abhimanyu Bambhaniya, Ritik Raj, Geonhwa Jeong, Souvik Kundu, Sudarshan Srinivasan, Midhilesh Elavazhagan, Madhu Kumar, Tushar Krishna, 3 Jun 2024, Demystifying Platform Requirements for Diverse LLM Inference Use Cases, https://arxiv.org/abs/2406.01698 Code: https://github.com/abhibambhaniya/GenZ-LLM-Analyzer (Analysis of cost of serving LLMs, including separate profiles of prefill versus decoding phases, and the cost of extra prompt processing in RAG architectures with prepended information.)
Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
Fareed Khan, March 2024, BasicLINGUA: LLM Based NLP Library, https://github.com/FareedKhan-dev/basiclingua-LLM-Based-NLP
Eugene Yan, Bryan Bischof, Charles Frye, Hamel Husain, Jason Liu and Shreya Shankar, May 28, 2024, What We Learned from a Year of Building with LLMs (Part I), https://www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/
Dell Technologies, May 20, 2024, Dell Technologies Expands Dell AI Factory with NVIDIA to Turbocharge AI Adoption, PR Newswire, https://www.prnewswire.com/news-releases/dell-technologies-expands-dell-ai-factory-with-nvidia-to-turbocharge-ai-adoption-302150245.html
JH Jones, May 2024, A Quantitative Comparison of Pre-Trained Model Registries to Traditional Software Package Registries, Masters Thesis, Electrical and Computer Engineering, Purdue University, https://hammer.purdue.edu/articles/thesis/A_Quantitative_Comparison_of_Pre-Trained_Model_Registries_to_Traditional_Software_Package_Registries/25686447/1 PDF: https://hammer.purdue.edu/ndownloader/files/46096152
Evelyn Cheng Apr 17, 2024 Baidu releases new AI tools to promote application development, https://www.cnbc.com/2024/04/18/baidu-releases-new-ai-tools-to-promote-application-development.html
Priyank Rathod, May 21, 2024, Efficient Usage of RAG Systems in the World of LLMs, https://www.techrxiv.org/doi/full/10.36227/techrxiv.171625877.73379410/v1
Kirill Kolodiazhnyi, May 15, 2020, Hands-On Machine Learning with C++: Build, train, and deploy end-to-end machine learning and deep learning pipelines, https://www.amazon.com/Hands-Machine-Learning-end-end/dp/1789955335/
Mozilla, June 3, 2024, Announcing Mozilla Builders: 2024 Accelerator Theme: Local AI, https://future.mozilla.org/builders/blog/announcing-mozilla-builders/
June 2024 (accessed), R2R: The ultimate open-source RAG framework, https://github.com/SciPhi-AI/R2R
Hesam Sheikh, Jun 1, 2024, Towards AI Build Blog Writer and Researcher AI Agents with Ollama (100% local): Creating AI agents with Crewai and using Ollama to run them 100% locally in 5 very easy steps!, https://pub.towardsai.net/build-your-first-ai-agent-in-5-easy-steps-100-local-2fb771438a8f
Simeon Emanuilov, Apr 4, 2024 LLM agent operating system (AIOS) and the future of LLM-powered agents, https://medium.com/@simeon.emanuilov/llm-agent-operating-system-aios-and-the-future-of-llm-powered-agents-3d08b4e91c34 https://unfoldai.com/aios-llm-powered-agents/
Grant Gross, 13 Jun 2024, IT leaders go small for purpose-built AI, https://www.cio.com/article/2139985/it-leaders-go-small-for-purpose-built-ai.html
Will Larson, April 8, 2024, Notes on how to use LLMs in your product. https://lethain.com/mental-model-for-how-to-use-llms-in-products/
Matt Murphy, Tim Tully, Grace Ge, Derek Xiao, Katie Keller, January 18, 2024, The Modern AI Stack: Design Principles for the Future of Enterprise AI Architectures, https://menlovc.com/perspective/the-modern-ai-stack-design-principles-for-the-future-of-enterprise-ai-architectures/?tpcc=NL_Marketing
NLUX: The 𝗣𝗼𝘄𝗲𝗿𝗳𝘂𝗹 Conversational AI JavaScript Library, https://github.com/nlkitai/nlux
Jesse Clayton, Kedar Potdar and Annamalai Chockalingam, Jun 02, 2024, Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs, NVIDIA Technical Blog, https://developer.nvidia.com/blog/streamline-ai-powered-app-development-with-nvidia-rtx-ai-toolkit-for-windows-rtx-pcs/
John Borthwick, May 28, 2024, Announcing AI Camp: Native Applications, https://render.betaworks.com/announcing-ai-camp-native-applications-e1358061c601
Julian Yip, Apr 2, 2024, Build Autonomous AI Agents with Function Calling: Transform your chatbot into an agent that can interact with external APIs, https://towardsdatascience.com/build-autonomous-ai-agents-with-function-calling-0bb483753975 (Implement agents via models that output a JSON object that describes the API to call and the parmaeters to send.)
Benedict Evans, 2024, Building AI products, https://www.ben-evans.com/benedictevans/2024/6/8/building-ai-products
David Spuler, March 2024, Generative AI in C++: Coding Transformers and LLMs, https://www.amazon.com/dp/B0CXJKCWX9
Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
Olivier Caelen and Marie-Alice Blete, Oct 3, 2023 Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098152484/
Douglas C. Youvan , June 15, 2024, Developing and Deploying AI Applications on NVIDIA Jetson Orin NX: A Comprehensive Guide, https://www.researchgate.net/profile/Douglas-Youvan/publication/381434888_Developing_and_Deploying_AI_Applications_on_NVIDIA_Jetson_Orin_NX_A_Comprehensive_Guide/links/666d7390de777205a32fceb6/Developing-and-Deploying-AI-Applications-on-NVIDIA-Jetson-Orin-NX-A-Comprehensive-Guide.pdf
Lak Lakshmanan, March 7, 2024, Building an AI Assistant with DSPy: A way to program and tune prompt-agnostic LLM agent pipelines, https://towardsdatascience.com/building-an-ai-assistant-with-dspy-2e1e749a1a95
Michael Lin, June 2024, How to Successfully Manage AI Software Projects: The 4 Phases of AI Projects I Shared at VixulCon https://medium.com/@_michaellin/how-to-successfully-manage-ai-software-projects-a8344b5b76a9
Fabian Both, June 2024, why we no longer use LangChain for building our AI agents , https://www.octomind.dev/blog/why-we-no-longer-use-langchain-for-building-our-ai-agents (Replaces LangChain with their own more-focused internal tool sets.)
Charles Lamanna, March 28, 2023, Companies innovate with low-code and fusion development, Microsoft, https://www.microsoft.com/en-us/industry/microsoft-in-business/business-transformation/2023/03/28/companies-innovate-with-low-code-and-fusion-development/ (States that 750 million new apps are required in the next two years, but there are only 4 million developers.)
McKinsey & Company, June 14, 2024, Scott Johnston on designing and building scalable platforms, https://www.mckinsey.com/industries/technology-media-and-telecommunications/our-insights/scott-johnston-on-designing-and-building-scalable-platforms (Docker CEO states that 750 million new apps are required.)
Valentina Alto, May 2024, Building LLM Powered Applications: Create intelligent apps and agents with large language models, Packt Publishing, https://www.amazon.com/Building-LLM-Apps-Intelligent-Language/dp/1835462316/
Irene Weber, 13 Jun 2024, Large Language Models as Software Components: A Taxonomy for LLM-Integrated Applications, https://arxiv.org/abs/2406.10300
Aarushi Kansal, Building Generative AI-Powered Apps: A Hands-on Guide for Developers, Apress, https://www.amazon.com/Building-Generative-AI-Powered-Apps-Hands-ebook/dp/B0CTXXP1S4/
Louis-François Bouchard, Louie Peters, May 2024, Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG, https://www.amazon.com/Building-LLMs-Production-Reliability-Fine-Tuning/dp/B0D4FFPFW8/
Kristian McCann July 15, 2024, AWS Unveils AI Service That Makes Enterprise Apps in Minutes, https://aimagazine.com/articles/aws-unveils-ai-service-that-builds-enterprise-apps-in-minute (Low-code enterprise AI app builder from AWS.)
Gene Rapoport, Sanjin Bicanic, Jue Wang, Richard Lichtenstein, Arjun Dutt, June 20, 2024, AI Survey: Four Themes Emerging: If 2023 was about experimentation, 2024 is all about results. Bain & Company, https://www.bain.com/insights/ai-survey-four-themes-emerging/ (Bain reports that use cases have been broadly successful in the use cases of sales, sales operations, software development, marketing, customer service, and customer onboarding, but less successful in HR, operations and legal. Interestingly, the main reason for AI project failures was that it couldn't perform the necessary task.)
Chip Huyen, Jul 25, 2024, Building A Generative AI Platform, https://huyenchip.com/2024/07/25/genai-platform.html
Juan Pablo Bottaro, April 25, 2024, Musings on building a Generative AI product, https://www.linkedin.com/blog/engineering/generative-ai/musings-on-building-a-generative-ai-product?_l=en_US
Writer, Aug 2024 (accessed), Writer AI Studio: The fastest way to build AI apps, https://writer.com/product/ai-studio/
OpenAI, Aug 2024 (accessed), .NET library, https://platform.openai.com/docs/libraries/dotnet-library https://github.com/openai/openai-dotnet
Travis Wilson, Jun 07 2024, Azure OpenAI Service expands .NET SDK support, https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-openai-service-expands-net-sdk-support/ba-p/4162940
Makhkamova, Ozoda, and Doohyun Kim. 2021. "A Conversation History-Based Q&A Cache Mechanism for Multi-Layered Chatbot Services" Applied Sciences 11, no. 21: 9981. https://doi.org/10.3390/app11219981 https://www.mdpi.com/2076-3417/11/21/9981 https://www.mdpi.com/2076-3417/11/21/9981/pdf
Yanxi Chen, Yaliang Li, Bolin Ding, Jingren Zhou, 20 Jul 2024, On the Design and Analysis of LLM-Based Algorithms, https://arxiv.org/abs/2407.14788 https://github.com/modelscope/agentscope/tree/main/examples/paper_llm_based_algorithm
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun, Sep 2024, Configurable Foundation Models: Building LLMs from a Modular Perspective, https://arxiv.org/pdf/2409.02877
Lior Solomon, Sep 2024, Gen AI testing strategies and tools, https://medium.com/ai-in-grc/gen-ai-testing-strategies-and-tools-257383e5cbfb
Carl Franzen, September 13, 2024, What OpenAI’s new o1-preview and o1-mini models mean for developers, https://venturebeat.com/programming-development/what-openais-new-o1-preview-and-o1-mini-models-mean-for-developers/
Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
Timothy Mugayi, Sep 2024, LLM Practical Ideas to Build Your Next AI-Powered Application: Realistic Use Cases to Unleash the Power of AI in Your Next Project, https://levelup.gitconnected.com/llm-practical-ideas-to-build-your-next-ai-powered-application-9379feba6cbc
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Kif Leswing, Fri, Oct 4 2024, As Apple enters AI race, iPhone maker turns to its army of developers for an edge, https://www.cnbc.com/2024/10/04/apple-is-turning-to-its-army-of-developers-for-an-edge-in-the-ai-race.html
Nicola Sessions, Oct 15, 2024, DataStax Announces New AI Development Platform, Built with NVIDIA AI, https://developer.nvidia.com/blog/datastax-announces-new-ai-development-platform-built-with-nvidia-ai/
Anurag Guda and Shruthii Sathyanarayanan, Oct 16, 2024, Simplify AI Application Development with NVIDIA Cloud Native Stack, https://developer.nvidia.com/blog/simplify-ai-application-development-with-nvidia-cloud-native-stack/
Sid Chatterjee, Matt Silverlock, Celso Martinho, 2024-10-24, Build durable applications on Cloudflare Workers: you write the Workflows, we take care of the rest, https://blog.cloudflare.com/building-workflows-durable-execution-on-workers/
LangChain, Nov 7, 2024. SCIPE - Systematic Chain Improvement and Problem Evaluation, https://blog.langchain.dev/scipe-systematic-chain-improvement-and-problem-evaluation/ https://github.com/garg-ankush/scipe/tree/main
Lak Lakshmanan, Oct 4, 2024, How to Choose the Architecture for Your GenAI Application. A framework to select the simplest, fastest, cheapest architecture that will balance LLMs’ creativity and risk, https://towardsdatascience.com/how-to-choose-the-architecture-for-your-genai-application-6053e862c457
Siyun Zhao, Yuqing Yang, Zilong Wang, Zhiyuan He, Luna K. Qiu, Lili Qiu, 23 Sep 2024, Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely, https://arxiv.org/abs/2409.14924
Dhavalkumar Patel, Ganesh Raut, Satya Narayan Cheetirala, Girish N Nadkarni, Robert Freeman, Benjamin S. Glicksberg, Eyal Klang, Prem Timsina, 8 Dec 2024, Cloud Platforms for Developing Generative AI Solutions: A Scoping Review of Tools and Services, https://arxiv.org/abs/2412.06044
Isabel Hulseman and Ruchika Kharwar, Dec 11, 2024, Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint, https://developer.nvidia.com/blog/three-building-blocks-for-creating-ai-virtual-assistants-for-customer-service-with-an-nvidia-nim-agent-blueprint/
Jason Redmond, Jan 2025, Microsoft CEO Nadella forms new AI group to build and run apps for customers. Microsoft hired DeepMind co-founder Mustafa Suleyman to lead Copilot AI initiatives last year. https://www.nbcnews.com/business/business-news/microsoft-ceo-nadella-forms-new-ai-group-build-run-apps-customers-rcna187506
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, and Yong Liu. 2025. An Empirical Study on Challenges for LLM Application Developers. ACM Trans. Softw. Eng. Methodol. Just Accepted (January 2025). https://doi.org/10.1145/3715007 https://dl.acm.org/doi/pdf/10.1145/3715007
Bharani Subramaniam, 13 February 2025, Emerging Patterns in Building GenAI Products, https://martinfowler.com/articles/gen-ai-patterns/
Michael Nuñez, May 19, 2025, Microsoft announces over 50 AI tools to build the ‘agentic web’ at Build 2025, https://venturebeat.com/ai/microsoft-announces-over-50-ai-tools-to-build-the-agentic-web-at-build-2025/
Maximilian Schreiner, Jun 26, 2025, Anthropics Claude can now build AI apps, https://the-decoder.com/anthropics-claude-can-now-build-ai-apps/

Inference Frameworks

Research papers include:

Yiheng Liu, Hao He, Tianle Han, Xu Zhang, Mengyuan Liu, Jiaming Tian, Yutong Zhang, Jiaqi Wang, Xiaohui Gao, Tianyang Zhong, Yi Pan, Shaochen Xu, Zihao Wu, Zhengliang Liu, Xin Zhang, Shu Zhang, Xintao Hu, Tuo Zhang, Ning Qiang, Tianming Liu, Bao Ge, Jan 2024, Understanding LLMs: A Comprehensive Overview from Training to Inference https://arxiv.org/abs/2401.02038
MLC team. 2023. MLC-LLM. https://github.com/mlc-ai/mlc-llm
tinygrad. 2023. Tinygrad. https://github.com/tinygrad/tinygrad
Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, Ion Stoica, Oct 2023, Efficient Memory Management for Large Language Model Serving with PagedAttention, SOSP ’23, October 23–26, 2023, Koblenz, Germany, https://dl.acm.org/doi/pdf/10.1145/3600006.3613165 (The original Paged Attention and vLLM paper, focusing on optimizing memory size of the KV cache using methods similar to operating-system memory paging.)
Vince Lam, Mar 12, 2024, 50+ Open-Source Options for Running LLMs Locally, https://medium.com/thedeephub/50-open-source-options-for-running-llms-locally-db1ec6f5a54f
Jason Perlow, Aug. 6, 2024, How to run dozens of AI models on your Mac or PC - no third-party cloud needed, https://www.zdnet.com/article/how-to-run-dozens-of-ai-models-on-your-mac-or-pc-no-third-party-cloud-needed/
Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng, 6 Jun 2024 (v2), SGLang: Efficient Execution of Structured Language Model Programs, https://arxiv.org/abs/2312.07104 https://github.com/sgl-project/sglang
The SGLang Team, Jul 25, 2024, Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM), https://lmsys.org/blog/2024-07-25-sglang-llama3/
Anna Popovych, Sofiya Merenych, February 16, 2024, Top AI Frameworks in 2024: Comparison of Artificial Intelligence Frameworks, https://clockwise.software/blog/artificial-intelligence-framework/
Hugging Face, 2024, Text Generation Inference, https://huggingface.co/docs/text-generation-inference/index
ZML, Sep 2024, ZML: High performance AI inference stack. Built for productionl https://docs.zml.ai/ https://github.com/zml/zml?tab=readme-ov-file
Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Hongyi Jin, Tianqi Chen, Zhihao Jia, 23 Dec 2023, Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems, https://arxiv.org/abs/2312.15234
Ruihao Gong, Yifu Ding, Zining Wang, Chengtao Lv, Xingyu Zheng, Jinyang Du, Haotong Qin, Jinyang Guo, Michele Magno, Xianglong Liu, 25 Sep 2024, A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms, https://arxiv.org/abs/2409.16694
Sebastian Petrus, Sep 4, 2024, Top 10 RAG Frameworks Github Repos 2024, https://sebastian-petrus.medium.com/top-10-rag-frameworks-github-repos-2024-12b2a81f4a49
Rick Zhou, Larme Zhao, Bo Jiang, and Sean Sheng, June 5, 2024, Benchmarking LLM Inference Backends: vLLM, LMDeploy, MLC-LLM, TensorRT-LLM, and TGI, https://www.bentoml.com/blog/benchmarking-llm-inference-backends
Wenchao Xu, Jinyu Chen, Peirong Zheng, Xiaoquan Yi, Tianyi Tian, Wenhui Zhu, Quan Wan, Haozhao Wang, Yunfeng Fan, Qinliang Su, Xuemin Shen, https://arxiv.org/abs/2412.13437 18 Dec 2024, Deploying Foundation Model Powered Agent Services: A Survey, (A survey of not just deployment, but many inference optimization techniques.)
Meta, Jan 2025 (accessed), Llama Stack: Composable building blocks to build Llama Apps, https://github.com/meta-llama/llama-stack
Mozhgan Navardi, Romina Aalishah, Yuzhe Fu, Yueqian Lin, Hai Li, Yiran Chen, Tinoosh Mohsenin, 19 Feb 2025, GenAI at the Edge: Comprehensive Survey on Empowering Edge Devices, https://arxiv.org/abs/2502.15816
Amr Elmeleegy, Harry Kim, David Zier, Kyle Kranen, Neelay Shah, Ryan Olson and Omri Kahalon, Mar 18, 2025, Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models, https://developer.nvidia.com/blog/introducing-nvidia-dynamo-a-low-latency-distributed-inference-framework-for-scaling-reasoning-ai-models/
Matthias Jobst, Tim Langer, Chen Liu, Mehmet Alici, Hector A. Gonzalez, Christian Mayr, 18 Jul 2025, An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC, https://arxiv.org/abs/2507.13736
Kaichuan Kong, Dongjie Liu, Xiaobo Jin, Guanggang Geng, Zhiying Li, Jian Weng, 6 Aug 2025, DMFI: Dual-Modality Fine-Tuning and Inference Framework for LLM-Based Insider Threat Detection, https://arxiv.org/abs/2508.05694
Soorya Ram Shimgekar, Shayan Vassef, Abhay Goyal, Navin Kumar, Koustuv Saha, 24 Jul 2025, Agentic AI framework for End-to-End Medical Data Inference, https://arxiv.org/abs/2507.18115
Raghav Singhal, Zachary Horvitz, Ryan Teehan, Mengye Ren, Zhou Yu, Kathleen McKeown, Rajesh Ranganath, 18 Jul 2025, A General Framework for Inference-time Scaling and Steering of Diffusion Models, https://arxiv.org/abs/2501.06848
Jiawen Qi, Chang Gao, Zhaochun Ren, Qinyu Chen, 25 Jul 2025, DeltaLLM: A Training-Free Framework Exploiting Temporal Sparsity for Efficient Edge LLM Inference, https://arxiv.org/abs/2507.19608
Riddhi J. Pitliya, Ozan Catal, Toon Van de Maele, Corrado Pezzato, Tim Verbelen, 1 Aug 2025, Theory of Mind Using Active Inference: A Framework for Multi-Agent Cooperation, https://arxiv.org/abs/2508.00401
Chakattrai Sookkongwaree, Tattep Lakmuang, and Chainarong Amornbunchornvej, 1 Aug 2025, Multi-Band Variable-Lag Granger Causality: A Unified Framework for Causal Time Series Inference across Frequencies, https://arxiv.org/abs/2508.00658
Bo Wen, 7 Aug 2025, A Framework for Inherently Safer AGI through Language-Mediated Active Inference, https://arxiv.org/abs/2508.05766
Bj\"orn Volkmann, Jan-Hendrik Ewering, Michael Meindl, Simon F. G. Ehlers, Thomas Seel, 21 Aug 2025, Bayesian Inference and Learning in Nonlinear Dynamical Systems: A Framework for Incorporating Explicit and Implicit Prior Knowledge, https://arxiv.org/abs/2508.15345
Zucheng Liang, Wenxin Wei, Kaijie Zhang, Hongyi Chen, 5 Sep 2025, Research on Multi-hop Inference Optimization of LLM Based on MQUAKE Framework, https://arxiv.org/abs/2509.04770
Yongsheng Feng, Yuetonghui Xu, Jiehui Luo, Hongjia Liu, Xiaobing Li, Feng Yu, Wei Li, 19 Sep 2025, TISDiSS: A Training-Time and Inference-Time Scalable Framework for Discriminative Source Separation, https://arxiv.org/abs/2509.15666
Enyu Zhou, Kai Sheng, Hao Chen, Xin He, 19 Sep 2025, CARD: A Cache-Assisted Parallel Speculative Decoding Framework via Query-and-Correct Paradigm for Accelerating LLM Inference, https://arxiv.org/abs/2508.04462
Yudong Shen, Wenyu Wu, Jiali Mao, Yixiao Tong, Guoping Liu, Chaoya Wang, 15 Sep 2025, Bridging the Gap Between Sparsity and Redundancy: A Dual-Decoding Framework with Global Context for Map Inference, https://arxiv.org/abs/2509.11731
Giorgos Armeniakos, Alexis Maras, Sotirios Xydis, Dimitrios Soudris, 18 Sep 2025, MaRVIn: A Cross-Layer Mixed-Precision RISC-V Framework for DNN Inference, from ISA Extension to Hardware Acceleration, https://arxiv.org/abs/2509.15187
Nathanael Jo, Ashia Wilson, 23 Sep 2025, What Does Your Benchmark Really Measure? A Framework for Robust Inference of AI Capabilities, https://arxiv.org/abs/2509.19590
Miruna Oprescu, David K. Park, Xihaier Luo, Shinjae Yoo, Nathan Kallus, 28 Oct 2025, GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding, https://arxiv.org/abs/2502.05295
Aditya Puttaparthi Tirumala, 23 Oct 2025, DeepCausalMMM: A Deep Learning Framework for Marketing Mix Modeling with Causal Inference, https://arxiv.org/abs/2510.13087
Qilin Liao, Anamika Lochab, Ruqi Zhang, 20 Oct 2025, VERA-V: Variational Inference Framework for Jailbreaking Vision-Language Models, https://arxiv.org/abs/2510.17759
Arshika Lalan, Rajat Ghosh, Aditya Kolsur, Debojyoti Dutta, 8 Oct 2025, A Multi-Agent Framework for Stateful Inference-Time Search, https://arxiv.org/abs/2510.07147
Haojie Ouyang, Jianwei Lv, Lei Ren, Chen Wei, Xiaojie Wang, Fangxiang Feng, 28 Sep 2025, ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference, https://arxiv.org/abs/2510.02361
Ke Wang, Felix Qu, Libin Xia, Zishuo Zhao, Chris Tong, Lynn Ai, Eric Yang, 29 Sep 2025, VeriLLM: A Lightweight Framework for Publicly Verifiable Decentralized Inference, https://arxiv.org/abs/2509.24257
Subhodip Panda, MS Varun, Shreyans Jain, Sarthak Kumar Maharana and Prathosh A.P, 5 Oct 2025, Variational Diffusion Unlearning: A Variational Inference Framework for Unlearning in Diffusion Models under Data Constraints, https://arxiv.org/abs/2510.04058
Christopher Klugmann and Daniel Kondermann, 5 Oct 2025, Quantifying Ambiguity in Categorical Annotations: A Measure and Statistical Inference Framework, https://arxiv.org/abs/2510.04366
Yuxin Ma, Lun Du, Lanning Wei, Kun Chen, Qian Xu, Kangyu Wang, Guofeng Feng, Guoshan Lu, Lin Liu, Xiaojing Qi, Xinyuan Zhang, Zhen Tao, Haibo Feng, Ziyun Jiang, Ying Xu, Zenan Huang, Yihong Zhuang, Haokai Xu, Jiaqi Hu, Zhenzhong Lan, Junbo Zhao, Jianguo Li, Da Zheng, 9 Oct 2025, dInfer: An Efficient Inference Framework for Diffusion Language Models, https://arxiv.org/abs/2510.08666

Orchestration Frameworks

Research papers include:

Konstantinos Papaioannou, Thaleia Dimitra Doudali, April 2024, The Importance of Workload Choice in Evaluating LLM Inference Systems, EuroMLSys '24: Proceedings of the 4th Workshop on Machine Learning and Systems, April 2024, Pages 39–46, https://doi.org/10.1145/3642970.3655823 https://dl.acm.org/doi/abs/10.1145/3642970.3655823
Jacob Robbins, January 4, 2024, Why generative AI orchestration startups are poised for growth in 2024, Pitch Book, https://pitchbook.com/news/articles/generative-ai-orchestration-startups-venture-capital-unicorns
Xin Tan, Yimin Jiang, Yitao Yang, Hong Xu, 29 Jun 2024, Teola: Towards End-to-End Optimization of LLM-based Applications, https://arxiv.org/abs/2407.00326
Chip Huyen, Jul 25, 2024, Building A Generative AI Platform, https://huyenchip.com/2024/07/25/genai-platform.html
Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng, 6 Jun 2024 (v2), SGLang: Efficient Execution of Structured Language Model Programs, https://arxiv.org/abs/2312.07104 https://github.com/sgl-project/sglang
The SGLang Team, Jul 25, 2024, Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM), https://lmsys.org/blog/2024-07-25-sglang-llama3/
An Efficient Network Orchestrator for Distributed Compound Language Model Systems Muhammad Shahir Abdurrahman, Stanford University, Stanford, California, USA, https://www.scs.stanford.edu/24sp-cs244b/projects/An_Efficient_Network_Orchestrator_for_Distributed_Compound_Language_Model_Systems.pdf
Melissa Malec, June 5, 2024, AI Orchestration Explained: The What, Why & How for 2024, https://hatchworks.com/blog/gen-ai/ai-orchestration/
Manish Kochar, May 19, 2024, Compounding GenAI Success: Why Orchestration is the Key to Mastering Generative AI, https://medium.com/@mkochar/compounding-genai-success-why-orchestration-is-the-key-to-mastering-generative-ai-543a2952acfe
Carl Franzen, August 23, 2024, Grok-2 gets a speed bump after developers rewrite code in three days, https://venturebeat.com/ai/grok-2-gets-a-speed-bump-after-developers-rewrite-code-in-three-days/ (Inference speed improvement by rewriting using the SGLang orchestration framework.)
Gary Grossman, September 8, 2024, AI orchestration: Crafting harmony or creating dependency? https://venturebeat.com/ai/ai-orchestration-crafting-harmony-or-creating-dependency/
A. R. Ali, K. Kumar, M. A. Siddiqui and M. Zahid, 2024, An Open-source Cross-Industry and Cloud-agnostic Generative AI Platform, 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 2024, pp. 1-10, doi: 10.1109/IJCNN60899.2024.10650688, https://ieeexplore.ieee.org/abstract/document/10650688
LiLMod, Aug 27, 2024, Haystack: the new LLM framework that is shaking its competitors, https://ai.plainenglish.io/haystack-the-new-llm-framework-that-is-shaking-its-competitors-1a083a153fd9
Yiyuan He, Minxian Xu, Jingfeng Wu, Wanyi Zheng, Kejiang Ye, Chengzhong Xu, 24 Sep 2024 (v2), UELLM: A Unified and Efficient Approach for LLM Inference Serving, https://arxiv.org/abs/2409.14961
Michael Nuñez, September 25, 2024, AI for all: Meta’s ‘Llama Stack’ promises to simplify enterprise adoption, https://venturebeat.com/ai/ai-for-all-meta-llama-stack-promises-to-simplify-enterprise-ai-adoption/
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Kabir Nagrecha, Oct 2024, Thesis, Orchestration Systems to Support Deep Learning at Scale Doctor of Philosophy, Computer Science, University of California San Diego, https://escholarship.org/content/qt3pp6k1p4/qt3pp6k1p4_noSplash_457f4c7c0435172a3d0a17428455894c.pdf (Pipeline and data parallelism systems.)
Emilia David, November 19, 2024, Orchestrator agents: Integration, human interaction, and enterprise knowledge at the core, https://venturebeat.com/ai/orchestrator-agents-integration-human-interaction-and-enterprise-knowledge-at-the-core/
Guanzi Yao, Heyao Liu, Linyan Dai, 14 Aug 2025, Multi-Agent Reinforcement Learning for Adaptive Resource Orchestration in Cloud-Native Clusters, https://arxiv.org/abs/2508.10253
Rodrigo Moreira and Rafael Pasquini and Joberto S. B. Martins and Tereza C. Carvalho and Fl\'avio de Oliveira Silva, 21 Jul 2025, AI-driven Orchestration at Scale: Estimating Service Metrics on National-Wide Testbeds, https://arxiv.org/abs/2507.16077
Konstantinos I. Roumeliotis, Ranjan Sapkota, Manoj Karkee, Nikolaos D. Tselikas, 18 Jul 2025, Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning, https://arxiv.org/abs/2507.10571
Botao Zhu, Xianbin Wang, and Dusit Niyato, 31 Jul 2025, Semantic Chain-of-Trust: Autonomous Trust Orchestration for Collaborator Selection via Hypergraph-Aided Agentic AI, https://arxiv.org/abs/2507.23565
Yuge Zhang, Nan Chen, Jiahang Xu, Yuqing Yang, 19 Aug 2025, Prompt Orchestration Markup Language, https://arxiv.org/abs/2508.13948
Jusheng Zhang, Yijia Fan, Kaitong Cai, Xiaofei Sun, Keze Wang, 5 Sep 2025, OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration, https://arxiv.org/abs/2509.04876
Gohar Irfan Chaudhry, Esha Choukse, Haoran Qiu, \'I\~nigo Goiri, Rodrigo Fonseca, Adam Belay, Ricardo Bianchini, 22 Aug 2025, Murakkab: Resource-Efficient Agentic Workflow Orchestration in Cloud Platforms, https://arxiv.org/abs/2508.18298
Xiyu Guo, Shan Wang, Chunfang Ji, Xuefeng Zhao, Wenhao Xi, Yaoyao Liu, Qinglan Li, Chao Deng, Junlan Feng, 9 Sep 2025, Towards Generalized Routing: Model and Agent Orchestration for Adaptive and Efficient Inference, https://arxiv.org/abs/2509.07571
Liangxuan Guo, Bin Zhu, Qingqian Tao, Kangning Liu, Xun Zhao, Xianzhe Qin, Jin Gao and Guangfu Hao, 16 Sep 2025, Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration, https://arxiv.org/abs/2509.11067
Xingchen Wan, Han Zhou, Ruoxi Sun, Hootan Nakhost, Ke Jiang, Rajarishi Sinha, Sercan \"O. Ar{\i}k, 12 Sep 2025, Maestro: Self-Improving Text-to-Image Generation via Agent Orchestration, https://arxiv.org/abs/2509.10704
Jinwei Su, Yinghui Xia, Qizhen Lan, Xinyuan Song, Yang Jingsong, Lewei He, Tianyu Shi, 14 Sep 2025, Difficulty-Aware Agent Orchestration in LLM-Powered Workflows, https://arxiv.org/abs/2509.11079
Aaron Xuxiang Tian, Ruofan Zhang, Jiayao Tang, Young Min Cho, Xueqian Li, Qiang Yi, Ji Wang, Zhunping Zhang, Danrui Qi, Zekun Li, Xingyu Xiang, Sharath Chandra Guntuku, Lyle Ungar, Tianyu Shi, Chi Wang, 1 Oct 2025, Beyond the Strongest LLM: Multi-Turn Multi-Agent Orchestration vs. Single LLMs on Benchmarks, https://arxiv.org/abs/2509.23537
Hassen Dhrif, 30 Sep 2025, Reasoning-Aware Prompt Orchestration: A Foundation Model for Multi-Agent Language Model Coordination, https://arxiv.org/abs/2510.00326
Danilo Trombino, Vincenzo Pecorella, Alessandro de Giulii, Davide Tresoldi, 23 Sep 2025, Knowledge Base-Aware Orchestration: A Dynamic, Privacy-Preserving Method for Multi-Agent Systems, https://arxiv.org/abs/2509.19599
Yifu Lu, Shengjie Liu, Li Dong, 28 Oct 2025, OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs, https://arxiv.org/abs/2510.24663
Kushagra Agrawal, Nisharg Nargund, 26 Sep 2025, Neural Orchestration for Multi-Agent Systems: A Deep Learning Framework for Optimal Agent Selection in Multi-Domain Task Environments, https://arxiv.org/abs/2505.02861
Chiara Mignacco, Matthieu Jonckheere, Gilles Stoltz, 7 Oct 2025, Online Matching via Reinforcement Learning: An Expert Policy Orchestration Strategy, https://arxiv.org/abs/2510.06515
Yuncheng Hua, Sion Weatherhead, Mehdi Jafari, Hao Xue, Flora D. Salim, 21 Oct 2025, SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation, https://arxiv.org/abs/2510.18551
Shumaila Javaid, and Nasir Saeed, 1 Oct 2025, Carbon-Aware Orchestration of Integrated Satellite Aerial Terrestrial Networks via Digital Twin, https://arxiv.org/abs/2510.17825
Yufan Dang, Chen Qian, Xueheng Luo, Jingru Fan, Zihao Xie, Ruijie Shi, Weize Chen, Cheng Yang, Xiaoyin Che, Ye Tian, Xuantang Xiong, Lei Han, Zhiyuan Liu, Maosong Sun, 21 Oct 2025, Multi-Agent Collaboration via Evolving Orchestration, https://arxiv.org/abs/2505.19591
Weifan Jiang, Rana Shahout, Yilun Du, Michael Mitzenmacher, Minlan Yu, 29 Sep 2025, Intra-request branch orchestration for efficient LLM reasoning, https://arxiv.org/abs/2509.24957
Yan Ke, Xin Yu, Heming Du, Scott Chapman, Helen Huang, 29 Sep 2025, Dynamic Orchestration of Multi-Agent System for Real-World Multi-Image Agricultural VQA, https://arxiv.org/abs/2509.24350
Ahmad Raeisi, Mahdi Dolati, Sina Darabi, Sadegh Talebi, Patrick Eugster, and Ahmad Khonsari, 17 Oct 2025, GOGH: Correlation-Guided Orchestration of GPUs in Heterogeneous Clusters, https://arxiv.org/abs/2510.15652
Zainab Saad, Jialin Yang, Henry Leung, Steve Drew, 4 Oct 2025, Towards Carbon-Aware Container Orchestration: Predicting Workload Energy Consumption with Federated Learning, https://arxiv.org/abs/2510.03970
Mohanakrishnan Hariharan, Satish Arvapalli, Seshu Barma, Evangeline Sheela, 12 Oct 2025, Agentic RAG for Software Testing with Hybrid Vector-Graph and Multi-Agent Orchestration, https://arxiv.org/abs/2510.10824
Jinling Gan, Churong Liang, Runnan Li, 9 Oct 2025, Prepared mind, fast response: A temporal decoupling framework for adaptive knowledge orchestration in open-domain dialogue, https://arxiv.org/abs/2510.08175
Cheng Qian, Zuxin Liu, Shirley Kokane, Akshara Prabhakar, Jielin Qiu, Haolin Chen, Zhiwei Liu, Heng Ji, Weiran Yao, Shelby Heinecke, Silvio Savarese, Caiming Xiong, Huan Wang, 9 Oct 2025, xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning, https://arxiv.org/abs/2510.08439
Chunhao Tian, Yutong Wang, Xuebo Liu, Zhexuan Wang, Liang Ding, Miao Zhang, Min Zhang, 23 Sep 2025, AgentInit: Initializing LLM-based Multi-Agent Systems via Diversity and Expertise Orchestration for Effective and Efficient Collaboration, https://arxiv.org/abs/2509.19236
Jia-Kai Dong, I-Wei Huang, Chun-Tin Wu, Yi-Tien Tsai, 22 Oct 2025, MSC-Bench: A Rigorous Benchmark for Multi-Server Tool Orchestration, https://arxiv.org/abs/2510.19423
Lunyiu Nie, Nedim Lipka, Ryan A. Rossi, Swarat Chaudhuri, 2 Oct 2025, FlashResearch: Real-time Agent Orchestration for Efficient Deep Research, https://arxiv.org/abs/2510.05145

LangChain

LangChain is an AI orchestration framework that allows "chaining" of multiple components in a sequence. Research papers on LangChain usage:

Timo Lehto, June 2024, Developing LLM-powered Applications Using Modern Frameworks, Bachelor’s Thesis, Information and Communications Technology, Jamk University of Applied Sciences, Finland, June 2024, 53 pages., https://www.theseus.fi/bitstream/handle/10024/862271/Lehto_Timo.pdf?sequence=2 (Building LLM-based applications in RAG architecture using LangChain.)
Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
Fabian Both, June 2024, why we no longer use LangChain for building our AI agents , https://www.octomind.dev/blog/why-we-no-longer-use-langchain-for-building-our-ai-agents (Replaces LangChain with their own more-focused internal tool sets.)
Louis-François Bouchard, Louie Peters, May 2024, Chapter 4: Prompting, and Chapter 6, Prompting with LangChain, Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG, https://www.amazon.com/Building-LLMs-Production-Reliability-Fine-Tuning/dp/B0D4FFPFW8/
Aarushi Kansal, Chapter 2: LangChain: Your Swiss Army Knife, Building Generative AI-Powered Apps: A Hands-on Guide for Developers, Apress, https://www.amazon.com/Building-Generative-AI-Powered-Apps-Hands-ebook/dp/B0CTXXP1S4/
Eddie Forson, Apr 29, 2024, Why I’m building my own AI Agent library, https://medium.com/@Ed_Forson/why-im-building-my-own-ai-agent-library-e20ec9aa3647
AI Agent Workflows: A Complete Guide on Whether to Build With LangGraph or LangChain, Sandi Besen, Oct 2024, https://towardsdatascience.com/ai-agent-workflows-a-complete-guide-on-whether-to-build-with-langgraph-or-langchain-117025509fa0
R Szilágyi, 2024, OpenSource alternatives of Generative Artifical Intelligence for SME's, Journal of Agricultural Informatics, Vol. 15 No. 2 (2024), https://doi.org/10.17700/jai.2024.15.2.733 https://journal.magisz.org/index.php/jai/article/view/733 https://journal.magisz.org/index.php/jai/article/view/733/412
Marina Temkin, July 8, 2025, LangChain is about to become a unicorn, sources say, https://techcrunch.com/2025/07/08/langchain-is-about-to-become-a-unicorn-sources-say/

Wrap Architectures for Gen AI Applications

The simplest architectures for AI applications are those that simply "wrap" around LLMs, whether it is commercial LLMs like GPT, or open source LLMs like Mistral or Llama.

A16Z, April 2nd, 2024 (accessed), AI Getting Started https://github.com/a16z-infra/ai-getting-started (Javascript wrapper kits for several commercial AI APIs.)
Ben Auffarth, Dec 22, 2023 Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT and other LLMs,https://www.amazon.com/Generative-AI-LangChain-language-ChatGPT/dp/1835083463/
Thiyagarajan Maruthavan (Rajan), Apr 12, 2024, So what if it is a thin wrapper on OpenAI? https://medium.com/@mtrajan/so-what-if-it-is-a-thin-wrapper-on-openai-274dd005b6d3
Adva Nakash Peleg, May 30, 2024, An LLM Journey: From POC to Production, https://medium.com/cyberark-engineering/an-llm-journey-from-poc-to-production-6c5ec6a172fb
Apurv Sibal, February 26, 2025, Hands-On Prompt Engineering: Learning to Program ChatGPT Using OpenAI APIs, Wiley, https://www.amazon.com/Hands-Prompt-Engineering-Learning-Program/dp/1394210760/
Dr Kris Jamsa, Dec 2023, OpenAI and ChatGPT Programming: Using Python to Unlock OpenAI and ChatGPT, https://www.amazon.com/OpenAI-ChatGPT-Programming-Python-Unlock/dp/B0CQK41P6B/
Cuantum Technologies, May 2023, ChatGPT API Bible: Mastering Python Programming for Conversational AI: Build Intelligent Chatbots and AI Applications with ChatGPT API and Python (Mastering AI and Python), https://www.amazon.com/ChatGPT-API-Bible-Conversational-Applications/dp/B0C47NWRT7/
Mike Gold, October 6, 2023, Crafting Applications with ChatGPT API: Using Python, Green Belt Book LLC, https://www.amazon.com/Crafting-Applications-ChatGPT-API-Python-ebook/dp/B0CHJX36X3/
Henry Habib, Paul Siegel, March 12, 2024, OpenAI API Cookbook: Build intelligent applications including chatbots, virtual assistants, and content generators, Packt Publishing, https://www.amazon.com/OpenAI-API-Cookbook-intelligent-applications-ebook/dp/B0CT8W7B79/
Olivier Caelen, Marie-Alice Blete, August 13, 2024, Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, 2nd edition, O'Reilly Media; https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098168100/
Michael J. Lever, Aug 2024, AI or API? | Chatbot cuckoos are bloating tech OpenAI wrappers are becoming a shortcut for start-ups, but are they sustainable? https://medium.com/future-ux/ai-or-api-chatbot-cuckoos-are-bloating-tech-d6b8d8255279
Yorick Sens, Henriette Knopp, Sven Peldszus, Thorsten Berger, 12 Aug 2024, A Large-Scale Study of Model Integration in ML-Enabled Software Systems, https://arxiv.org/abs/2408.06226
Raymond Lo, Jul 10, 2024, How to Build Faster GenAI Apps with Fewer Lines of Code using OpenVINO™ GenAI API, https://medium.com/openvino-toolkit/how-to-build-faster-genai-apps-with-fewer-lines-of-code-using-openvino-genai-api-5dd5fcabea17
Rachel Curry, Aug 28 2024, Why companies including JPMorgan and Walmart are opting for internal gen AI assistants after initially restricting usage, https://www.cnbc.com/2024/08/28/why-jpmorgan-and-walmart-are-opting-for-internal-gen-ai-assistants.html
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
Chandra Irugalbandara, Ashish Mahendra, Roland Daynauth, Tharuka Kasthuri Arachchige, Jayanaka Dantanarayana, Krisztian Flautner, Lingjia Tang, Yiping Kang, Jason Mars, 16 Apr 2024 (v3), Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production, https://arxiv.org/abs/2312.14972
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
Dennis Rall, Bernhard Bauer, Thomas Fraunholz, 8 Nov 2023, Towards Democratizing AI: A Comparative Analysis of AI as a Service Platforms and the Open Space for Machine Learning Approach, https://arxiv.org/abs/2311.04518
David Spuler, March 2024, API Wrapper Architecture Optimizations, in Generative AI in C++, https://www.aussieai.com/book/ch7-api-wrapper-optimizations
Andrew Zuo, Sep 2024, Don’t Judge An LLM Only By The Web App, https://andrewzuo.com/dont-judge-an-llm-only-by-the-web-app-0a47d29390c3
Emilia David, September 3, 2024, Anthropic to release system prompts for Artifacts, latest Claude family prompts found incomplete, https://venturebeat.com/ai/anthropic-to-release-system-prompts-for-artifacts-latest-claude-family-prompts-found-incomplete/
Emilia David, August 27, 2024, Anthropic releases AI model system prompts, winning praise for transparency, https://venturebeat.com/ai/anthropic-releases-ai-model-system-prompts-winning-praise-for-transparency/
Gian Segato, September 2024, The dawn of a new startup era, https://giansegato.com/essays/dawn-new-startup-era
Kris Ograbek, Aug 30, 2024, 6 Hard-learned Lessons from My First Project as a Freelance AI Engineer, https://ai.gopubby.com/6-hard-learned-lessons-from-my-first-project-as-a-freelance-ai-engineer-9519e6edee90
Asankhaya Sharma (codelion), Sep 2024, Optillm: Optimizing inference proxy for LLMs, https://github.com/codelion/optillm
Xiaoxia Liu, Jingyi Wang, Jun Sun, Xiaohan Yuan, Guoliang Dong, Peng Di, Wenhai Wang, Dongxia Wang, 21 Nov 2023, Prompting Frameworks for Large Language Models: A Survey, https://arxiv.org/abs/2311.12785
Carl Franzen, September 13, 2024, What OpenAI’s new o1-preview and o1-mini models mean for developers, https://venturebeat.com/programming-development/what-openais-new-o1-preview-and-o1-mini-models-mean-for-developers/
Sascha Heyer, Sep 2024, RAG API: 30 lines of code is all you need for RAG. The easiest way to get started with RAG. https://medium.com/google-cloud/google-cloud-rag-api-c7e3c9931b3e
Kyle Wiggers, September 16, 2024, Runway announces an API for its video-generating AI models, https://techcrunch.com/2024/09/16/runway-announces-an-api-for-its-video-generating-models/
Kyle Wiggers, October 21, 2024, xAI, Elon Musk’s AI startup, launches an API, https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/
Quang H. Nguyen, Duy C. Hoang, Juliette Decugis, Saurav Manchanda, Nitesh V. Chawla, Khoa D. Doan, 24 Jul 2024 (v2), MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs, https://arxiv.org/abs/2407.10834
K. Balázs Neszlényi, A. Milos and A. Kiss, "AssistantGPT: Enhancing User Interaction with LLM Integration," 2024 IEEE 22nd Jubilee International Symposium on Intelligent Systems and Informatics (SISY), Pula, Croatia, 2024, pp. 000619-000624, doi: 10.1109/SISY62279.2024.10737548. https://ieeexplore.ieee.org/abstract/document/10737548
swyx, Sep 2024, What Works in AI UX (lightning talk + Q&A), https://www.youtube.com/watch?v=PkHjoihjo6U
Latent Space, Nov 2024, Why GPT Wrappers Are Good, Actually, https://www.latent.space/p/gpt-wrappers
Tegan Jones, 22 November, 2024, Neural Notes: Stop building AI startups with “the same crap” as everyone else. In this edition: warnings for startups relying too heavily on generic AI models and how AI has changed the relationship between VCs and founders. https://www.smartcompany.com.au/artificial-intelligence/neural-notes-stop-building-ai-startups-same-crap-everyone-else/
Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su, 22 Feb 2024, Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments, https://arxiv.org/abs/2402.14672
Narcisa Guran, Florian Knauf, Man Ngo, Stefan Petrescu, Jan S. Rellermeyer, 21 Nov 2024, Towards a Middleware for Large Language Models, https://arxiv.org/abs/2411.14513
Andrew Ng, Nov 2024, Simple, unified interface to multiple Generative AI providers, https://github.com/andrewyng/aisuite
Angular Ventures, December 03, 2024, Engines or plastics? How we talk about LLMs and how we use them. The Angle Issue #249, https://newsletter.angularventures.com/p/engines-or-plastics-how-we-talk-about-llms-and-how-we-use-them
Chris Pedregal, December 9, 2024, How to Build a Truly Useful AI Product. Generative AI breaks the old startup playbook, https://every.to/thesis/how-to-build-a-truly-useful-ai-product
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Lester Mapp, Feb. 6, 2025, From zero to millions? How regular people are cashing in on AI. Every day people are using AI in ways you wouldn't expect. You can too. Here's how, https://www.zdnet.com/article/from-zero-to-millions-how-regular-people-are-cashing-in-on-ai/
Andrew Chen, Feb 05, 2025, Revenge of the GPT Wrappers: Defensibility in a world of commoditized AI models: Why network effects and distribution will be king, once more, https://andrewchen.substack.com/p/revenge-of-the-gpt-wrappers-defensibility
Mandar Karhade, Feb 2025, Tired of LLM Chaos? LiteLLM Should Be Your Default. Stop juggling multiple LLM APIs and their “standards”. https://pub.towardsai.net/tired-of-llm-chaos-litellm-should-be-your-default-e04730b3c33c
Alex Fazio, Feb 2025, How to Build an LLM Chat App: The New Litmus Test for Junior Devs, https://x.com/alxfazio/status/1893242657331101976 (How to build a wrapper chat app that scales by taking care of message queueing, API rate limits, history database management, caching, and other real-world deployment issues.)
Jovan Cicmil, Feb 2025, Your ‘AI Startup’ Is Just OpenAI’s API: Why 99% of AI Companies Are Just Wrappers Around GPT and Will Die a Quick Death, https://blog.startupstash.com/your-ai-startup-is-just-openai-s-api-85940e81d2bd
Mary Ann Azevedo, February 27, 2025, Stripe says AI startups are growing faster than SaaS ever did, and calling them wrappers ‘misses the point’, https://techcrunch.com/2025/02/27/stripe-ceo-says-ai-startups-are-growing-faster-than-saas-ever-did-and-calling-them-wrappers-misses-the-point/
John Webber, January 6, 2025, Building an AI Wrapper SaaS in 2025: Opportunities and Challenges, https://saasminded.dev/building-an-ai-wrapper-saas-in-2025-opportunities-and-challenges/ ("...a faster route to market, the ability to tap into cutting-edge technology, and the potential for rapid scaling... requires a deep understanding of the market dynamics, a commitment to continuous innovation, a strategic approach to building defensibility, and a relentless focus on delivering unique and irreplaceable value to users.")
Wil Chung, 21 Nov 2024, The moats are in the GPT-wrappers, https://interjectedfuture.com/the-moats-are-in-the-gpt-wrappers/ ("...the GPT-wrapper application layer is where the value accrues.")
Stewart Townsend, 16 August 2024, The Future of AI Wrapper Companies: Will They Survive in 2024? https://stewarttownsend.com/the-future-of-ai-wrapper-companies-will-they-survive-in-2024/ ("Conclusion: The future of AI wrapper companies indeed looks promising and intriguing.")
Kate Clark Fri, March 7, 2025, The Hottest AI Companies Right Now Are ‘Apps’, Bloomberg, https://finance.yahoo.com/news/hottest-ai-companies-now-apps-140037730.html
Supreeth Koundinya, March 10, 2025, Manus is a Wrapper of Anthropic’s Claude, and It’s Okay, https://analyticsindiamag.com/ai-features/manus-is-a-wrapper-of-anthropics-claude-and-its-okay/ (“Manus didn’t just slap an API on a model. They built an autonomous system that can execute deep research, deep thinking, and multi-step tasks in a way that no other AI have.”)
Garry Tan, March 2025, X post, https://x.com/garrytan/status/1898949767335752019 ("... the models are plenty smart already and all the alpha is in custom prompting, tool use, clever workflow and evals.")
Nickie Louise, March 31, 2025, The Rise of AI Wrappers: Why Value Is Moving Up the Stack from Foundation Models to AI Apps, https://techstartups.com/2025/03/31/the-rise-of-ai-wrappers-why-value-is-moving-up-the-stack-from-foundation-models-to-ai-apps/
Alex Duffy, May 10, 2025, Rise of the AI Wrappers, https://every.to/context-window/rise-of-the-ai-wrappers
Kyle Wiggers, April 29, 2025, Meta previews an API for its Llama AI models, https://techcrunch.com/2025/04/29/meta-previews-an-api-for-its-llama-ai-models/

OpenAI API Applications

One particular type of "wrap" AI application is to use the OpenAI API (e.g. for ChatGPT).

Dr Kris Jamsa, Dec 2023, OpenAI and ChatGPT Programming: Using Python to Unlock OpenAI and ChatGPT, https://www.amazon.com/OpenAI-ChatGPT-Programming-Python-Unlock/dp/B0CQK41P6B/
Cuantum Technologies, May 2023, ChatGPT API Bible: Mastering Python Programming for Conversational AI: Build Intelligent Chatbots and AI Applications with ChatGPT API and Python (Mastering AI and Python), https://www.amazon.com/ChatGPT-API-Bible-Conversational-Applications/dp/B0C47NWRT7/
Mike Gold, October 6, 2023, Crafting Applications with ChatGPT API: Using Python, Green Belt Book LLC, https://www.amazon.com/Crafting-Applications-ChatGPT-API-Python-ebook/dp/B0CHJX36X3/
Henry Habib, Paul Siegel, March 12, 2024, OpenAI API Cookbook: Build intelligent applications including chatbots, virtual assistants, and content generators, Packt Publishing, https://www.amazon.com/OpenAI-API-Cookbook-intelligent-applications-ebook/dp/B0CT8W7B79/
Olivier Caelen, Marie-Alice Blete, August 13, 2024, Developing Apps with GPT-4 and ChatGPT: Build Intelligent Chatbots, Content Generators, and More, 2nd edition, O'Reilly Media; https://www.amazon.com/Developing-Apps-GPT-4-ChatGPT-Intelligent/dp/1098168100/

Batch API for Inference

Michael Nuñez, October 8, 2024, Anthropic challenges OpenAI with affordable batch processing, https://venturebeat.com/ai/anthropic-challenges-openai-with-affordable-batch-processing/
Microsoft Nov 2024, Getting started with Azure OpenAI global batch deployments, https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/batch
OpenAI, Nov 2024, Batch API FAQ. Batch API endpoint for asynchronous batch processing, https://help.openai.com/en/articles/9197833-batch-api-faq
Anthropic, 9 Oct 2024, Introducing the Message Batches API, https://www.anthropic.com/news/message-batches-api
Katia Gil Guzman Apr 24, 2024, Batch processing with the Batch API, https://cookbook.openai.com/examples/batch_processing
Lunary, Oct 22, 2024, Using the Batch API with Azure OpenAI, https://lunary.ai/blog/batch-api-azure-openai
Sukalp Tripathi, Sep 8, 2024, Batch API: OpenAI, https://sukalp.medium.com/batch-api-openai-831a0b09690c
Google, Nov 2024, Get batch predictions for Gemini, https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/batch-prediction-api
Google, Nov 2024, Send a batch process documents request, https://cloud.google.com/document-ai/docs/samples/documentai-batch-process-document
Gibion AI, Jan 15, 2024, Efficient Batch Processing with LangChain and OpenAI: Overcoming RateLimitError, https://medium.com/@hey_16878/efficient-batch-processing-with-langchain-and-openai-overcoming-ratelimiterror-daa9de4bbd8b
Bingli Liao, Danilo Vasconcellos Vargas, 13 Jul 2024, Beyond KV Caching: Shared Attention for Efficient LLMs, https://arxiv.org/abs/2407.12866 (Layerwise weight sharing in attention.)

Application Layer

The "application layer" is the whole range of applications that can be built on top of generative AI and its LLMs as building blocks. Research includes:

Ashu Garg, Oct 25, 2024, Why OpenAI’s $157B valuation misreads AI’s future, https://foundationcapital.com/why-openais-157b-valuation-misreads-ais-future/ (Bullish on the "application layer" saying "The top of the stack is where I see the most promise. ...the most valuable companies of the AI era don’t exist yet."... "The cloud era created over 20 application companies with $1B+ revenue. In AI, we believe this number could exceed 100.")
Akash Bajwa, Nov 18, 2024, Opinionated AI Products: Strong Technologies Forms Beliefs, https://akashbajwa.substack.com/p/opinionated-ai-products
Meno Ventures, Nov 2024, 2024: The State of Generative AI in the Enterprise: The enterprise AI landscape is being rewritten in real time, https://menlovc.com/2024-the-state-of-generative-ai-in-the-enterprise/
Tegan Jones, 22 November, 2024, Neural Notes: Stop building AI startups with “the same crap” as everyone else. In this edition: warnings for startups relying too heavily on generic AI models and how AI has changed the relationship between VCs and founders. https://www.smartcompany.com.au/artificial-intelligence/neural-notes-stop-building-ai-startups-same-crap-everyone-else/
Angular Ventures, December 03, 2024, Engines or plastics? How we talk about LLMs and how we use them. The Angle Issue #249, https://newsletter.angularventures.com/p/engines-or-plastics-how-we-talk-about-llms-and-how-we-use-them
Charles Rollet, December 4, 2024, Key leaders behind Google’s viral NotebookLM are leaving to create their own startup, https://techcrunch.com/2024/12/04/key-leaders-behind-googles-viral-notebooklm-are-leaving-to-create-their-own-startup/ ("As the frontier models and their capabilities continue to grow, thoughtful products are required to make the benefits of this technology accessible, useful, and obvious to everyday people — so our team is going to be focused on building a user-first AI product...the team wanted to create something that leverages the latest AI models to build something useful to regular people.")
Leah Hodgson, December 7, 2024, Where are all the consumer AI startups—and why aren’t VCs funding them? https://pitchbook.com/news/articles/where-are-all-the-consumer-ai-startups-and-why-arent-vcs-funding-them ("...consumer AI market by 2032 will be twice the size of the enterprise market for AI."..."According to Zion Market Research, the market size for consumer AI is predicted to grow to around $1.3 trillion by 2032. For enterprise, it is estimated to reach only around $560 billion by the same year, according to Precedence research.")
Kevin Mahaffey, Dec 13, 2024, Defensibility: Applications. Part 7: Where bucks are born, https://writing.snr.vc/p/defensibility-applications
Chris Pedregal, December 9, 2024, How to Build a Truly Useful AI Product. Generative AI breaks the old startup playbook, https://every.to/thesis/how-to-build-a-truly-useful-ai-product
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product
Akash Bajwa, Dec 16, 2024, Vertical Integration: Model vs Product Companies: The False Dichotomy of Model & App Layer, https://akashbajwa.substack.com/p/vertical-integration-model-vs-product
Apple, December 16, 2024, Apple reveals 2024’s most downloaded apps and games on the App Store, https://www.apple.com/newsroom/2024/12/apple-reveals-2024s-most-downloaded-apps-and-games-on-the-app-store/
Sarah Perez, December 16, 2024, Temu is the most downloaded app on the US App Store in 2024, https://techcrunch.com/2024/12/16/temu-is-the-most-downloaded-app-on-the-u-s-app-store-in-2024/
Jess Weatherbed, Dec 10, 2024, AI is booming on the App Store, and developers are taking advantage of it. Many high-ranking AI apps feel like an attempted cash grab, and it’s not easy to spot the trash from the treasure. https://www.theverge.com/2024/12/9/24314972/apple-app-store-ai-apps-art-design-photography
Johan Uddståhl, Jan 2, 2025, …when all we ever needed was a text box, or how 2025 will be back to basics for the web, https://medium.com/@baktakt/when-all-we-ever-needed-was-a-text-box-c672c52a0dca
Rex Woodbury, Jun 05, 2024, The Consumer Renaissance: From Predicting Consumer AI Applications to Analyzing Consumer Spend, https://www.digitalnative.tech/p/the-consumer-renaissance
Rex Woodbury, Jun 13, 2024, The Consumer Renaissance (Part II): Shopping, Consumer Health, and Patterns of Household Spend, https://www.digitalnative.tech/p/the-consumer-renaissance-part-ii
James Currier, Jan 2025, Consumer is Back – And Why It’s Been So Hard Since 2014, https://www.nfx.com/post/consumer-is-back
Alex Kantrowitz, Jan 28, 2025, Notes on DeepSeek: Generative AI is All About the Applications Now: Building with AI might cost 5% of what it did a week ago, so what gets built has never been more important. https://www.bigtechnology.com/p/notes-on-deepseek-generative-ai-is
What’s 🔥 in Enterprise IT/VC #431, Feb 01, 2025, 🙏🏼 DeepSeek - years compressed into days - the cost 💰 of intelligence 🧠 has dramatically 📉 - the time to build 🏗️ is now! https://www.whatshotit.vc/p/whats-in-enterprise-itvc-431
Alex Kantrowitz, Feb 01, 2025, OpenAI is an App Company Now. After DeepSeek, OpenAI is an app builder above all else. Perhaps that was always the way. https://www.bigtechnology.com/p/openai-is-an-app-company-now
Olivia Moore, Feb 20025, AI Voice Agent Update - 2025, A16Z, https://a16z.com/ai-voice-agents-2025-update/ https://gamma.app/docs/a16z-AI-Voice-Update-2025--ttkorld8iy6wfnj?mode=doc (Thesis that voice will be the primary AI interface for consumers.)
AL Anany, Feb 2025, Now That AI is Affordable — It’s Time To Build. It is time for perfect use cases. https://entreprenal.com/now-that-ai-is-affordable-its-time-to-build-8e84337355eb
Tanay Jaipuria, Feb 11, 2025, How Big Tech Sees DeepSeek: Five Key Takeaways: On diffusion of innovation, the need for strong business models, lower inference costs benefiting apps and investing in infrastructure as a strategic advantage, https://www.tanayj.com/p/how-big-tech-sees-deepseek-five-key
Leah Hodgson, February 8, 2025, DeepSeek's gift to the AI app space: DeepSeek might be just what the AI app space needs, https://pitchbook.com/news/articles/deepseek-might-be-just-what-the-ai-app-space-needs
Andrew Chen, Feb 05, 2025, Revenge of the GPT Wrappers: Defensibility in a world of commoditized AI models: Why network effects and distribution will be king, once more, https://andrewchen.substack.com/p/revenge-of-the-gpt-wrappers-defensibility
Jan Kammerath, Feb 11, 2025, Programmers’ New Goldrush: Seizing Opportunities With Local AI, https://medium.com/@jankammerath/programmers-new-goldrush-seizing-opportunities-with-local-ai-12b1a3e2692f
Yaakov Carno, Feb 24, 2025, The surprising patterns behind viral AI products: A deep dive into Bolt, Cursor, Granola, PhotoRoom, Replit and more, https://open.substack.com/pub/kylepoyar/p/ai-ux-patterns (The "surprising pattern" in successful AI products is that they all have a slick UI.)
Kyle Wiggers, February 25, 2025, Quora’s Poe now lets users create and share custom AI-powered apps, https://techcrunch.com/2025/02/25/quoras-poe-now-lets-users-create-and-share-custom-ai-powered-apps/
CNBC, Feb 2025, Hugging Face co-founder: The next step in AI will be applications, https://www.msn.com/en-au/money/other/hugging-face-co-founder-the-next-step-in-ai-will-be-applications/vi-AA1xFd8X
Joe McKendrick, Feb. 20, 2025, Brace yourself: The era of 'citizen developers' creating apps is here, thanks to AI, https://www.zdnet.com/article/brace-yourself-the-era-of-citizen-developers-creating-apps-is-here-thanks-to-ai/
Craig Le Clair, Oct 23 2024, Predictions 2025: GenAI, Citizen Developers, And Caution Influence Automation, https://www.forrester.com/blogs/predictions-2025-automation/
Rex Woodbury, Feb 26, 2025, The ChatGPT Prompts That Can Be $1B+ Companies: The Unbundling of ChatGPT? https://open.substack.com/pub/digitalnative/p/the-chatgpt-prompts-that-can-be-1b
Rex Woodbury, Feb 20, 2025, How Consumer Psychology Informs AI Product Design: The IKEA Effect, the Paradox of Choice, and AI's Interface Problem, https://www.digitalnative.tech/p/how-consumer-psychology-informs-ai
John Webber, January 6, 2025, Building an AI Wrapper SaaS in 2025: Opportunities and Challenges, https://saasminded.dev/building-an-ai-wrapper-saas-in-2025-opportunities-and-challenges/ ("...a faster route to market, the ability to tap into cutting-edge technology, and the potential for rapid scaling... requires a deep understanding of the market dynamics, a commitment to continuous innovation, a strategic approach to building defensibility, and a relentless focus on delivering unique and irreplaceable value to users.")
Wil Chung, 21 Nov 2024, The moats are in the GPT-wrappers, https://interjectedfuture.com/the-moats-are-in-the-gpt-wrappers/ ("...the GPT-wrapper application layer is where the value accrues.")
Stewart Townsend, 16 August 2024, The Future of AI Wrapper Companies: Will They Survive in 2024? https://stewarttownsend.com/the-future-of-ai-wrapper-companies-will-they-survive-in-2024/ ("Conclusion: The future of AI wrapper companies indeed looks promising and intriguing.")
Kate Clark Fri, March 7, 2025, The Hottest AI Companies Right Now Are ‘Apps’, Bloomberg, https://finance.yahoo.com/news/hottest-ai-companies-now-apps-140037730.html
Garry Tan, March 2025, X post, https://x.com/garrytan/status/1898949767335752019 ("... the models are plenty smart already and all the alpha is in custom prompting, tool use, clever workflow and evals.")
Julio Pessan, Mar 7, 2025, Don’t Sell AI Agents, Sell AI Infrastructures Instead — The Billion-Dollar Opportunity, https://medium.com/@julio.pessan.pessan/dont-sell-ai-agents-sell-ai-infrastructures-instead-the-billion-dollar-opportunity-04eb7166b3d9
Nickie Louise, March 31, 2025, The Rise of AI Wrappers: Why Value Is Moving Up the Stack from Foundation Models to AI Apps, https://techstartups.com/2025/03/31/the-rise-of-ai-wrappers-why-value-is-moving-up-the-stack-from-foundation-models-to-ai-apps/
Alex Duffy, May 10, 2025, Rise of the AI Wrappers, https://every.to/context-window/rise-of-the-ai-wrappers
OnlyCFO, Apr 29, 2025, Bullish: Vertical & Compound Software: In a world of AI, companies need to be more multi-product and vertical to win, https://www.onlycfo.io/p/bullish-vertical-and-compound-software
Rex Woodbury, Jun 18, 2025, The Opportunities in Consumer AI: Mapping Where to Build + Examining Changes in Product Design & Business Model, https://www.digitalnative.tech/p/the-opportunities-in-consumer-ai
Sameer Singh, June 4, 2025, Stop Asking, Start Showing: Why GUIs Still Win in the Age of AI, https://www.speedinvest.com/blog/consumer-ai-cognitive-load-and-the-gui
Maximilian Schreiner, Jun 26, 2025, Anthropics Claude can now build AI apps, https://the-decoder.com/anthropics-claude-can-now-build-ai-apps/
Tomasz Tunguz, Jun 27, 2025, Voice, Context & Control: The Three Pillars of Useful AI Email, https://tomtunguz.com/my-own-ai-email-generator/ ("We’re still in the horseless carriage era of AI applications. The breakthrough will come when software adapts to us instead of forcing us to adapt to it.")

Code Generation Applications of Generative AI

Hadi Ghaemi, Zakieh Alizadehsani, Amin Shahraki, Juan M. Corchado, June 2024, Transformers in source code generation: A comprehensive survey, Journal of Systems Architecture, 103193, https://www.sciencedirect.com/science/article/abs/pii/S1383762124001309
Franklin Huang, May 17, 2024, Machine Learning Systems with Reduced Memory Requirements, Masters of Science, Electrical Engineering and Computer Sciences, University of California, Berkeley, Technical Report No. UCB/EECS-2024-120 http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-120.html https://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-120.pdf Code: https://github.com/hongyihuang/spec-mcts/blob/main/triton (Broad paper that examines a lot of different optimizations that reduce memory costs, including quantization, kernel fusion, sparsity, MatMul optimizations, KV cache compression, and various other methods.)
Lianghong Guo, Yanlin Wang, Ensheng Shi, Wanjun Zhong, Hongyu Zhang, Jiachi Chen, Ruikai Zhang, Yuchi Ma, Zibin Zheng, 29 Jul 2024, When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention, https://arxiv.org/abs/2407.20042 Code: https://github.com/DeepSoftwareAnalytics/CodeFast
AIM, 2024, Mistral AI Unveils Mistral Large 2, Beats Llama 3.1 on Code and Math, https://analyticsindiamag.com/ai-news-updates/mistral-ai-unveils-mistral-large-2-beats-llama-3-1-on-code-and-math/
Kevin Zhang, Jun 26, 2024, Investing in the Age of Generative AI, https://eastwind.substack.com/p/investing-in-the-age-of-generative
by Nicholas Carlini, 2024-08-01, How I Use "AI", https://nicholas.carlini.com/writing/2024/how-i-use-ai.html (Generative AI and LLM use cases are "unglamorous" but useful to software developers.)
Haolin Jin, Linghan Huang, Haipeng Cai, Jun Yan, Bo Li, Huaming Chen, 5 Aug 2024, From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future, https://arxiv.org/abs/2408.02479
Grant Gross, 30 Aug 2024, Agentic AI: Decisive, operational AI arrives in business, https://www.cio.com/article/3496519/agentic-ai-decisive-operational-ai-arrives-in-business.html
Hao Zhou, Chengming Hu, Ye Yuan, Yufei Cui, Yili Jin, Can Chen, Haolun Wu, Dun Yuan, Li Jiang, Di Wu, Xue Liu, Charlie Zhang, Xianbin Wang, Jiangchuan Liu, 17 May 2024, Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities, https://arxiv.org/abs/2405.10825
Yaroslav Zharov, Yury Khudyakov, Evgeniia Fedotova, Evgeny Grigorenko, Egor Bogomolov, 18 Feb 2024, Tool-Augmented LLMs as a Universal Interface for IDEs, https://arxiv.org/abs/2402.11635
Advait Sarkar, 1 Nov 2023, Will Code Remain a Relevant User Interface for End-User Programming with Generative AI Models? https://arxiv.org/abs/2311.00382
Liwenhan Xie, Chengbo Zheng, Haijun Xia, Huamin Qu, Chen Zhu-Tian, 3 Aug 2024, WaitGPT: Monitoring and Steering Conversational LLM Agent in Data Analysis with On-the-Fly Code Visualization, https://arxiv.org/abs/2408.01703
Madhumita Murgia, August 23 2024, AI-powered coding pulls in almost $1bn of funding to claim ‘killer app’ status, https://www.ft.com/content/4868bd38-613c-4fa9-ba9d-1ed8fa8a40c8
Hesam Sheikh, Aug 2024, The Smarter Way of Using AI in Programming, https://towardsdatascience.com/the-smarter-way-of-using-ai-in-programming-0492ac610385
Pragmatic Coders, Sep 2024, Best AI tools for developers in 2024: AI-powered coding, https://medium.com/@pragmaticcoders/best-ai-tools-for-developers-in-2024-ai-powered-coding-32e31dff6024
Zheyuan (Kevin) Cui, Mert Demirer, Sonia Jaffe, Leon Musolff, Sida Peng, Tobias Salz, September 03, 2024, The Effects of Generative AI on High Skilled Work: Evidence from Three Field Experiments with Software Developers, https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4945566 https://papers.ssrn.com/sol3/Delivery.cfm/4945566.pdf?abstractid=4945566&mirid=1
Asif Razzaq, September 5, 2024, Yi-Coder Released by 01.AI: A Powerful Small-Scale Code LLM Series, Delivering Exceptional Performance in Code Generation, Editing, and Long-Context Comprehension, https://www.marktechpost.com/2024/09/05/yi-coder-released-by-01-ai-a-powerful-small-scale-code-llm-series-delivering-exceptional-performance-in-code-generation-editing-and-long-context-comprehension/
OpenAI, September 12, 2024, Learning to Reason with LLMs, https://openai.com/index/learning-to-reason-with-llms/
Grant Gross, 12 Sep 2024, AI coding assistants wave goodbye to junior developers, https://www.cio.com/article/3509174/ai-coding-assistants-wave-goodbye-to-junior-developers.html
Evan Wang, Federico Cassano, Catherine Wu, Yunfeng Bai, Will Song, Vaskar Nath, Ziwen Han, Sean Hendryx, Summer Yue, Hugh Zhang, 5 Sep 2024, Planning In Natural Language Improves LLM Search For Code Generation, https://arxiv.org/abs/2409.03733
Michael Nuñez, September 19, 2024, Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks, https://venturebeat.com/ai/microsofts-grin-moe-ai-model-takes-on-coding-and-math-beating-competitors-in-key-benchmarks/
Yanxian Huang, Wanjun Zhong, Ensheng Shi, Min Yang, Jiachi Chen, Hui Li, Yuchi Ma, Qianxiang Wang, Zibin Zheng, Yanlin Wang, 13 Sep 2024, Agents in Software Engineering: Survey, Landscape, and Vision, https://arxiv.org/abs/2409.09030 https://github.com/DeepSoftwareAnalytics/Awesome-Agent4SE
Grant Gross, 26 Sep 2024, Devs gaining little (if anything) from AI coding assistants, https://www.cio.com/article/3540579/devs-gaining-little-if-anything-from-ai-coding-assistants.html
Ulyana Piterbarg, Lerrel Pinto, Rob Fergus, 3 Oct 2024, Training Language Models on Synthetic Edit Sequences Improves Code Synthesis, https://arxiv.org/abs/2410.02749
https://www.cio.com/article/3567138/ai-native-software-engineering-may-be-closer-than-developers-think.html
C Thiede, M Taeumel, L Böhme, R Hirschfeld, 2024, Talking to Objects in Natural Language: Toward Semantic Tools for Exploratory Programming, Onward! ’24, October 23–25, 2024, Pasadena, CA, USA, https://dl.acm.org/doi/pdf/10.1145/3689492.3690049
Aki Ranin, Sep 2, 2024, The Code Canaries Are Singing — Our Path Toward AGI: How the fate of human software developers reveals our path toward AGI, https://akiranin.medium.com/the-code-canaries-are-singing-our-path-toward-agi-6c234cae0189
Jose Yapur, 29 OCT 2024, Introducing the next-level of AI-powered workflows with Amazon Q Developer inline chat, https://aws.amazon.com/blogs/devops/amazon-q-developer-inline-chat/
GitHub, Oct 2024, Bringing developer choice to Copilot with Anthropic’s Claude 3.5 Sonnet, Google’s Gemini 1.5 Pro, and OpenAI’s o1-preview, https://github.blog/news-insights/product-news/bringing-developer-choice-to-copilot/
John Wang, Oct 2024, How we saved hundreds of engineering hours by writing tests with LLMs, https://www.assembled.com/blog/how-we-saved-hundreds-of-engineering-hours-by-writing-tests-with-llms
Shihan Dou, Jiazheng Zhang, Jianxiang Zang, Yunbo Tao, Haoxiang Jia, Shichun Liu, Yuming Yang, Shenxi Wu, Shaoqing Zhang, Muling Wu, Changze Lv, Limao Xiong, Wenyu Zhan, Lin Zhang, Rongxiang Weng, Jingang Wang, Xunliang Cai, Yueming Wu, Ming Wen, Rui Zheng, Tao Ji, Yixin Cao, Tao Gui, Xipeng Qiu, Qi Zhang, Xuanjing Huang, 30 Oct 2024, Multi-Programming Language Sandbox for LLMs, https://arxiv.org/abs/2410.23074
David Gewirtz, September 27, 2024, The best AI for coding, and a bunch that failed miserably, https://www.zdnet.com/article/the-best-ai-for-coding/
Jason Perlow, Nov. 6, 2024, The best open-source AI models: All your free-to-use options explained: Here are the best open-source and free-to-use AI models for text, images, and audio, organized by type, application, and licensing considerations. https://www.zdnet.com/article/the-best-open-source-ai-models-all-your-free-to-use-options-explained/
Fali Wang, Zhiwei Zhang, Xianren Zhang, Zongyu Wu, Tzuhao Mo, Qiuhao Lu, Wanjing Wang, Rui Li, Junjie Xu, Xianfeng Tang, Qi He, Yao Ma, Ming Huang, Suhang Wang, 4 Nov 2024, A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness, https://arxiv.org/abs/2411.03350
Qwen Team, November 12, 2024, Qwen2.5-Coder Series: Powerful, Diverse, Practical, https://qwenlm.github.io/blog/qwen2.5-coder-family/
Evan Doyle, Nov 14, 2024, AI Makes Tech Debt More Expensive, https://www.gauge.sh/blog/ai-makes-tech-debt-more-expensive
Haoxiang Zhang, Shi Chang, Arthur Leung, Kishanthan Thangarajah, Boyuan Chen, Hanan Lutfiyya, Ahmed E. Hassan, 14 Nov 2024, Software Performance Engineering for Foundation Model-Powered Software (FMware), https://arxiv.org/abs/2411.09580
Josh Fruhlinger, Dec 02, 2024, Refactoring AI code: The good, the bad, and the weird, https://www.infoworld.com/article/3610521/refactoring-ai-code-the-good-the-bad-and-the-weird.html
Joe McKendrick, Nov. 27, 2024, Gen AI gives software developers surge in productivity - but it's not for everyone, https://www.zdnet.com/article/gen-ai-gives-software-developers-surge-in-productivity-but-its-not-for-everyone/
Cory Hymel, Dec 02, 2024, 5 ways AI will change the software development life cycle, https://www.infoworld.com/article/3609988/5-ways-ai-will-change-the-software-development-life-cycle.html
Paul Heltzel, 03 Dec 2024, 5 dead-end IT skills — and how to avoid becoming obsolete, https://www.cio.com/article/188985/6-dead-end-it-skills-and-how-to-avoid-becoming-obsolete.html
Google, Dec 2024, Welcome to Project IDX, a new web-based development workspace from Google. IDX is designed to make it faster and easier to build, ship, and manage full-stack, multiplatform apps from the comfort of your browser. https://idx.google.com/
Giordano d'Aloisio, Luca Traini, Federica Sarro, Antinisca Di Marco, 18 Dec 2024, On the Compression of Language Models for Code: An Empirical Study on CodeBERT, https://arxiv.org/abs/2412.13737 (Quantization, pruning and distillation on code generation models.)
Francisco Durán, Matias Martinez, Patricia Lago, Silverio Martínez-Fernández, 19 Dec 2024, Energy consumption of code small language models serving with runtime engines and execution providers, https://arxiv.org/abs/2412.15441
David Gewirtz, Nov. 27, 2024, 25 AI tips to boost your programming productivity with ChatGPT. With ChatGPT in your toolkit, coding can be faster and smoother. I share the best ways of using AI to overcome common coding challenges, so you can streamline your development projects. https://www.zdnet.com/article/25-ai-tips-to-boost-your-programming-productivity-with-chatgpt/
Dewu Zheng, Yanlin Wang, Ensheng Shi, Hongyu Zhang, Zibin Zheng, 24 Dec 2024, How Well Do LLMs Generate Code for Different Application Domains? Benchmark and Evaluation, https://arxiv.org/abs/2412.18573
Aman, May 14, 2024, Near-Instant Full-File Edits, Cursor, https://cursor.sh/blog/instant-apply (A type of speculative decoding for code editing called "speculative edits.")
Lucas Mearian, 03 Apr 2024 Just how good is AI-assisted code generation? Computer World, https://www.computerworld.com/article/2077802/just-how-good-is-ai-assisted-code-generation.html (Notes issues with code quality, security, and reuse.)
Yao Wan, Yang He, Zhangqian Bi, Jianguo Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Hai Jin, Philip S. Yu, 30 Dec 2023, Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit, https://arxiv.org/abs/2401.00288 Code: https://xcodemind.github.io/
Xuanle Zhao, Xianzhen Luo, Qi Shi, Chi Chen, Shuo Wang, Wanxiang Che, Zhiyuan Liu, Maosong Sun, 11 Jan 2025, ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation, https://arxiv.org/abs/2501.06598
Tari Ibaba, Jan 2025, This new IDE just destroyed VS Code and Copilot without even trying, https://medium.com/coding-beauty/windsurf-ide-0678288ce0a4
Sida Peng, Eirini Kalliamvakou, Peter Cihon, Mert Demirer, 13 Feb 2023, The Impact of AI on Developer Productivity: Evidence from GitHub Copilot, https://arxiv.org/abs/2302.06590
Paul Sawers, February 6, 2025, GitHub Copilot brings mockups to life by generating code from images, https://techcrunch.com/2025/02/06/github-copilot-brings-mockups-to-life-by-generating-code-from-images/
Daniel Delaney, Feb 2025, Chat is a bad UI pattern for development tools, https://danieldelaney.net/chat/
Dacheng Li, Shiyi Cao, Chengkun Cao, Xiuyu Li, Shangyin Tan, Kurt Keutzer, Jiarong Xing, Joseph E. Gonzalez, Ion Stoica, 20 Feb 2025, S*: Test Time Scaling for Code Generation, https://arxiv.org/abs/2502.14382 https://github.com/NovaSky-AI/SkyThought
David Gewirtz, Feb. 25, 2025, Google just made AI coding assistance free for everyone - with very generous limits,] https://www.zdnet.com/article/google-just-made-ai-coding-assistance-free-for-everyone-with-very-generous-limits/
Qianhui Zhao, Li Zhang, Fang Liu, Xiaoli Lian, Qiaoyuanhe Meng, Ziqian Jiao, Zetong Zhou, Borui Zhang, Runlin Guo, Jia Li, 24 Feb 2025, CodeSwift: Accelerating LLM Inference for Efficient Code Generation, https://arxiv.org/abs/2502.17139 (Using draft sequences from a datastore of code, to achieve parallel inference, similar to prompt looking decoding or retrieval lookup decoding.)
alexp, February 19, 2025, Vibe Coding and the Future of Software Engineering, https://alexp.pl/2025/02/19/vibe-coding.html
Kate Rooney, Mar 15 2025, Y Combinator startups are fastest growing, most profitable in fund history because of AI, https://www.cnbc.com/2025/03/15/y-combinator-startups-are-fastest-growing-in-fund-history-because-of-ai.html
David Gewirtz, March 18, 2025, What is AI vibe coding? It's all the rage but it's not for everyone - here's why: Caution: Experience required. Vibe coding feels like magic, until your AI assistant starts overwriting your work, https://www.zdnet.com/article/what-is-ai-vibe-coding-its-all-the-rage-but-its-not-for-everyone-heres-why/
Bill Doerrfeld, Mar 17, 2025, Why AI-generated code isn’t good enough (and how it will get better), https://www.infoworld.com/article/3844363/why-ai-generated-code-isnt-good-enough-and-how-it-will-get-better.html
Kathy Korevec, May 20, 2025, Build with Jules, your asynchronous coding agent: Our autonomous coding agent, Jules, is now in public beta, https://blog.google/technology/google-labs/jules/
Yihong Dong, Yuchen Liu, Xue Jiang, Zhi Jin, Ge Li, 15 May 2025, Rethinking Repetition Problems of LLMs in Code Generation, https://arxiv.org/abs/2505.10402
Gergely Orosz, Jul 02, 2025, Software engineering with LLMs in 2025: reality check: How are devs at AI startups and in Big Tech using AI tools, and what do they think of them? A broad overview of the state of play in tooling, with Anthropic, Google, Amazon, and others, https://newsletter.pragmaticengineer.com/p/software-engineering-with-llms-in-2025
Ben Dickson, April 10, 2025, DeepCoder delivers top coding performance in efficient 14B open model, https://venturebeat.com/ai/deepcoder-delivers-top-coding-performance-in-efficient-14b-open-model/

Code Checker Applications

Aman, May 14, 2024, Near-Instant Full-File Edits, Cursor, https://cursor.sh/blog/instant-apply (A type of speculative decoding for code editing called "speculative edits.")
Ansong Ni, Miltiadis Allamanis, Arman Cohan, Yinlin Deng, Kensen Shi, Charles Sutton, Pengcheng Yin, 23 Apr 2024, NExT: Teaching Large Language Models to Reason about Code Execution, https://arxiv.org/abs/2404.14662
David Spuler, March 2024, Chapter 40. Reliability, Generative AI in C++: Coding Transformers and LLMs, https://www.amazon.com/dp/B0CXJKCWX9
Yingbing Huang, Lily Jiaxin Wan, Hanchen Ye, Manvi Jha, Jinghua Wang, Yuhong Li, Xiaofan Zhang, Deming Chen, 16 Jun 2024, New Solutions on LLM Acceleration, Optimization, and Application, https://arxiv.org/abs/2406.10903 (A survey of inference optimization methods and further analysis of Medusa-type speculative decoding and KV cache compression. Also explores hardware co-design, ML compilers and LLM-assisted code debugging.)
Nat McAleese, Rai (Michael Pokorny), Evgenia Nitishinskaya, Jan Leike, Juan Felipe Cerón Uribe, Maja Trebacz, 2024, LMCritics Help Catch LLM Bugs, https://cdn.openai.com/llm-critics-help-catch-llm-bugs-paper.pdf
Patrick J. Chapman, Cindy Rubio-González, and Aditya V. Thakur. 2024. Interleaving Static Analysis and LLM Prompting. In Proceedings of the 13th ACM SIGPLAN International Workshop on the State Of the Art in Program Analysis (SOAP 2024). Association for Computing Machinery, New York, NY, USA, 9–17. https://doi.org/10.1145/3652588.3663317 https://dl.acm.org/doi/abs/10.1145/3652588.3663317
Junwei Liu, Yixuan Chen, Mingwei Liu, Xin Peng, Yiling Lou, 14 Jun 2024, STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis, https://arxiv.org/abs/2406.10018
Shaojian Qiu, Huihao Huang, Jianxiang Luo, Yingjie Kuang, Haoyu Luo, 11 Feb 2024, BAFLineDP: Code Bilinear Attention Fusion Framework for Line-Level Defect Prediction, https://arxiv.org/pdf/2402.07132
Pragmatic Coders, Sep 2024, Best AI tools for developers in 2024: AI-powered coding, https://medium.com/@pragmaticcoders/best-ai-tools-for-developers-in-2024-ai-powered-coding-32e31dff6024
Tom Ganz, April 2024, Software Defect Localization Using Explainable Deep Learning, Master's Thesis, Master of Science, der Technischen Universität Berlin, https://api-depositonce.tu-berlin.de/server/api/core/bitstreams/308879e0-b14b-4baf-a0c3-19067184ef50/content (AI-based security vulnerability code checker.)
Francisco Ribeiro, José Nuno Castro de Macedo, Kanae Tsushima, Rui Abreu, João Saraiva, 2023, GPT-3-Powered Type Error Debugging: Investigating the Use of Large Language Models for Code Repair, SLE 2023: Proceedings of the 16th ACM SIGPLAN International Conference on Software Language Engineering, October 2023, Pages 111–124, https://doi.org/10.1145/3623476.3623522 (Code corrections are a type of GEC.)
Jiawei Guo, Ziming Li, Xueling Liu, Kaijing Ma, Tianyu Zheng, Zhouliang Yu, Ding Pan, Yizhi LI, Ruibo Liu, Yue Wang, Shuyue Guo, Xingwei Qu, Xiang Yue, Ge Zhang, Wenhu Chen, Jie Fu, 4 Apr 2024, CodeEditorBench: Evaluating Code Editing Capability of Large Language Models, https://arxiv.org/abs/2404.03543
David Spuler, June 2024, Aussie AI, Optimizing On-Device Transformer Inference for Source Code Checking: IP Australia, https://ipsearch.ipaustralia.gov.au/patents/2024901675
Ulyana Piterbarg, Lerrel Pinto, Rob Fergus, 3 Oct 2024, Training Language Models on Synthetic Edit Sequences Improves Code Synthesis, https://arxiv.org/abs/2410.02749
Albin Johansson, Carl Holmberg, Francisco Gomes De Oliveira Neto, and Philipp Leitner. 2024. The Impact of Compiler Warnings on Code Quality in C++ Projects. In Proceedings of the 32nd IEEE/ACM International Conference on Program Comprehension (ICPC '24). Association for Computing Machinery, New York, NY, USA, 270–279. https://doi.org/10.1145/3643916.3644410 https://dl.acm.org/doi/abs/10.1145/3643916.3644410 (Using compiler warnings correlations with higher quality metrics.)
Fang Liu, Zhenwei Liu, Qianhui Zhao, Jing Jiang, Li Zhang, Zian Sun, Ge Li, Zhongqi Li, and Yuchi Ma. 2024. FastFixer: An Efficient and Effective Approach for Repairing Programming Assignments. In Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE '24). Association for Computing Machinery, New York, NY, USA, 669–680. https://doi.org/10.1145/3691620.3695062 https://dl.acm.org/doi/abs/10.1145/3691620.3695062
Andrea Lepori, Alexandru Calotoiu, and Torsten Hoefler. 2024. Iterating Pointers: Enabling Static Analysis for Loop-based Pointers. ACM Trans. Archit. Code Optim. Just Accepted (October 2024). https://doi.org/10.1145/3701993 https://dl.acm.org/doi/pdf/10.1145/3701993
A Hück, T Ziegler, S Schwitanski, J Jenke, C Bischof, Nov 2024, Compiler-Aided Correctness Checking of CUDA-Aware MPI Applications, https://conferences.computer.org/sc-wpub/pdfs/SC-W2024-6oZmigAQfgJ1GhPL0yE3pS/555400a204/555400a204.pdf
Zeyu Chen, Daiping Liu, Jidong Xiao, and Haining Wang. 2023. All Use-After-Free Vulnerabilities Are Not Created Equal: An Empirical Study on Their Characteristics and Detectability. In Proceedings of the 26th International Symposium on Research in Attacks, Intrusions and Defenses (RAID '23). Association for Computing Machinery, New York, NY, USA, 623–638. https://doi.org/10.1145/3607199.3607229 https://dl.acm.org/doi/10.1145/3607199.3607229 https://vtechworks.lib.vt.edu/bitstream/handle/10919/116595/3607199.3607229.pdf
B. Gui, W. Song, H. Xiong and J. Huang, "Automated Use-After-Free Detection and Exploit Mitigation: How Far Have We Gone?," in IEEE Transactions on Software Engineering, vol. 48, no. 11, pp. 4569-4589, 1 Nov. 2022, doi: 10.1109/TSE.2021.3121994. https://ieeexplore.ieee.org/document/9583875
H. Wei, L. Chen, X. Nie, Z. Zhang, Y. Zhang and G. Shi, "An Efficient Metric-Based Approach for Static Use-After-Free Detection," 2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), Melbourne, Australia, 2022, pp. 58-65, doi: 10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00015. https://ieeexplore.ieee.org/document/10070682
Paul E Black. 2018. Juliet 1.3 test suite: Changes from 1.2. US Department of Commerce, National Institute of Standards and Technology. https://samate.nist.gov/SARD/test-suites/112 (Test suite from US Govt for testing static analyzers and sanitizers, that is copyright-free.)
Andrea Hrckova, Robert Moro, Ivan Srba, Jakub Simko, Maria Bielikova, 4 Sep 2025, Autonomation, Not Automation: Activities and Needs of European Fact-checkers as a Basis for Designing Human-Centered AI Systems, https://arxiv.org/abs/2211.12143
Chenyuan Yang, Zijie Zhao, Zichen Xie, Haoyu Li, Lingming Zhang, 3 Sep 2025, KNighter: Transforming Static Analysis with LLM-Synthesized Checkers, https://arxiv.org/abs/2503.09002
Ming Zhong, Xiang Zhou, Ting-Yun Chang, Qingze Wang, Nan Xu, Xiance Si, Dan Garrette, Shyam Upadhyay, Jeremiah Liu, Jiawei Han, Benoit Schillings, Jiao Sun, 8 Oct 2025, Vibe Checker: Aligning Code Evaluation with Human Preference, https://arxiv.org/abs/2510.07315

User Interface (UI) Issues for AI Apps

Li Zhang, Shihe Wang, Xianqing Jia, Zhihan Zheng, Yunhe Yan, Longxi Gao, Yuanchun Li, Mengwei Xu, 12 Apr 2024, LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Automation Task Evaluation, https://arxiv.org/abs/2404.16054
Jiachen Liu, Zhiyu Wu, Jae-Won Chung, Fan Lai, Myungjin Lee, Mosharaf Chowdhury, 25 Apr 2024, Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services, https://arxiv.org/abs/2404.16283 (Scheduling GPU activity for multiple queries to ensure good UI experience for text-streaming outputs like chatbots.)
NLUX: The 𝗣𝗼𝘄𝗲𝗿𝗳𝘂𝗹 Conversational AI JavaScript Library, https://github.com/nlkitai/nlux
Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia, 7 Dec 2023, Prompt Highlighter: Interactive Control for Multi-Modal LLMs, https://arxiv.org/abs/2312.04302 Code: https://github.com/dvlab-research/Prompt-Highlighter/ (Allows users to highlight part of their prompt for more specificity.)
Michael Nuñez, June 21, 2024, Why Anthropic’s Artifacts may be this year’s most important AI feature: Unveiling the interface battle, https://venturebeat.com/ai/why-anthropics-artifacts-may-be-this-years-most-important-ai-feature-unveiling-the-interface-battle/
Paul DelSignore, Jul 5, 2024, From AI Models to Products: The Shift in AI Strategy: Why Model Performance No Longer Matters, https://generativeai.pub/from-ai-models-to-products-the-shift-in-ai-strategy-b377aeee3948
Vince Lam, Mar 12, 2024, 50+ Open-Source Options for Running LLMs Locally, https://medium.com/thedeephub/50-open-source-options-for-running-llms-locally-db1ec6f5a54f
Ethan Mollick, Aug 01, 2024, On speaking to AI: Voice changes a lot of things, https://www.oneusefulthing.org/p/on-speaking-to-ai
Arvind Narayanan and Sayash Kapoor, Aug 19, 2024, AI companies are pivoting from creating gods to building products. Good. Turning models into products runs into five challenges, https://www.aisnakeoil.com/p/ai-companies-are-pivoting-from-creating
Lance Whitney, Aug. 28, 2024, Why Claude's Artifacts is the coolest feature I've seen in generative AI so far, https://www.zdnet.com/article/why-claudes-artifacts-is-the-coolest-feature-ive-seen-in-generative-ai-so-far/
Kevin Lin, Sumant Guha, Joe Spaniac, Andy Zheng, 13 Nov 2020 (v3), Nifty Web Apps: Build a Web App for Any Text-Based Programming Assignment, https://arxiv.org/abs/2010.04671
Songqin Nong, Jiali Zhu, Rui Wu, Jiongchao Jin, Shuo Shan, Xiutian Huang, Wenhao Xu, 7 Aug 2024 (v2), MobileFlow: A Multimodal LLM For Mobile GUI Agent, https://arxiv.org/abs/2407.04346
Dongping Chen, Yue Huang, Siyuan Wu, Jingyu Tang, Liuyi Chen, Yilin Bai, Zhigang He, Chenlong Wang, Huichi Zhou, Yiqiang Li, Tianshuo Zhou, Yue Yu, Chujie Gao, Qihui Zhang, Yi Gui, Zhen Li, Yao Wan, Pan Zhou, Jianfeng Gao, Lichao Sun, 16 Jun 2024, GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents, https://arxiv.org/abs/2406.10819 https://gui-world.github.io/
Kristian Kolthoff, Felix Kretzer, Christian Bartelt, Alexander Maedche, Simone Paolo Ponzetto, 12 Jun 2024, Interlinking User Stories and GUI Prototyping: A Semi-Automatic LLM-based Approach, https://arxiv.org/abs/2406.08120
Abdur Rahman, Rajat Chawla, Muskaan Kumar, Arkajit Datta, Adarsh Jha, Mukunda NS, Ishaan Bhola, 21 Jul 2024 (v2), V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM, https://arxiv.org/abs/2405.15341
Danyang Zhang, Zhennan Shen, Rui Xie, Situo Zhang, Tianbao Xie, Zihan Zhao, Siyuan Chen, Lu Chen, Hongshen Xu, Ruisheng Cao, Kai Yu, 13 Jun 2024 (v4), Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction, https://arxiv.org/abs/2305.08144
Quanfeng Lu, Wenqi Shao, Zitao Liu, Fanqing Meng, Boxuan Li, Botong Chen, Siyuan Huang, Kaipeng Zhang, Yu Qiao, Ping Luo, 12 Jun 2024, GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices, https://arxiv.org/abs/2406.08451 https://github.com/OpenGVLab/GUI-Odyssey
Shengcheng Yu, Chunrong Fang, Ziyuan Tuo, Quanjun Zhang, Chunyang Chen, Zhenyu Chen, Zhendong Su, 20 Oct 2023, Vision-Based Mobile App GUI Testing: A Survey, https://arxiv.org/abs/2310.13518
Jieshan Chen, Chunyang Chen, Zhenchang Xing, Xiwei Xu, Liming Zhu, Guoqiang Li, Jinshui Wang, 2 Jul 2020 (v2), Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning, https://arxiv.org/abs/2003.00380
Carlos Bernal-Cardenas, Kevin Moran, Michele Tufano, Zichang Liu, Linyong Nan, Zhehan Shi, Denys Poshyvanyk, 3 Jan 2019, Guigle: A GUI Search Engine for Android Apps, https://arxiv.org/abs/1901.00891
Yijie Guo, Zhenhan Huang, Ruhan Wang, Zhihao Yao, Tianyu Yu, Zhiling Xu, Xinyu Zhao, Xueqing Li, Haipeng Mi, 24 Jul 2024, AI-Gadget Kit: Integrating Swarm User Interfaces with LLM-driven Agents for Rich Tabletop Game Applications, https://arxiv.org/abs/2407.17086
Harry Li, Gabriel Appleby, Ashley Suh, 7 Jun 2024, LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering, https://arxiv.org/abs/2406.06621
William Seymour, Emilee Rader, 23 May 2024, Speculating About Multi-user Conversational Interfaces and LLMs: What If Chatting Wasn't So Lonely? https://arxiv.org/abs/2405.14390
Daniel Chin, Yuxuan Wang, Gus Xia, 19 May 2024, Human-Centered LLM-Agent User Interface: A Position Paper, https://arxiv.org/abs/2405.13050
Yaroslav Zharov, Yury Khudyakov, Evgeniia Fedotova, Evgeny Grigorenko, Egor Bogomolov, 18 Feb 2024, Tool-Augmented LLMs as a Universal Interface for IDEs, https://arxiv.org/abs/2402.11635
Syed Mekael Wasti, Ken Q. Pu, Ali Neshati, 16 Apr 2024 (v2), Large Language User Interfaces: Voice Interactive User Interfaces powered by LLMs, https://arxiv.org/abs/2402.07938
Qirui Huang, Min Lu, Joel Lanir, Dani Lischinski, Daniel Cohen-Or, Hui Huang, 24 Jan 2024, GraphiMind: LLM-centric Interface for Information Graphics Design, https://arxiv.org/abs/2401.13245
Yue Jiang, Changkong Zhou, Vikas Garg, Antti Oulasvirta, 21 Apr 2024, Graph4GUI: Graph Neural Networks for Representing Graphical User Interfaces, https://arxiv.org/abs/2404.13521
Daniel Buschek, 27 May 2024, Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools, https://arxiv.org/abs/2405.17217
Abdallah Namoun, Ahmed Alrehaili, Zaib Un Nisa, Hani Almoamari, Ali Tufail, 5 May 2024, Predicting the usability of mobile applications using AI tools: the rise of large user interface models, opportunities, and challenges, https://arxiv.org/abs/2405.03716
Zijian Ding, 2 May 2024 (v2), Towards Intent-based User Interfaces: Charting the Design Space of Intent-AI Interactions Across Task Types, https://arxiv.org/abs/2404.18196
Patrick Ebel, 16 Feb 2024, Generative AI and Attentive User Interfaces: Five Strategies to Enhance Take-Over Quality in Automated Driving, https://arxiv.org/abs/2402.10664
Advait Sarkar, 1 Nov 2023, Will Code Remain a Relevant User Interface for End-User Programming with Generative AI Models? https://arxiv.org/abs/2311.00382
Alex Renda, Harrison Goldstein, Sarah Bird, Chris Quirk, Adrian Sampson, 14 Sep 2017, Abstractions for AI-Based User Interfaces and Systems, https://arxiv.org/abs/1709.04991
Thomas Mildner, Orla Cooney, Anna-Maria Meck, Marion Bartl, Gian-Luca Savino, Philip R. Doyle, Diego Garaialde, Leigh Clark, John Sloan, Nina Wenig, Rainer Malaka, Jasmin Niess, 26 Jan 2024, Listening to the Voices: Describing Ethical Caveats of Conversational User Interfaces According to Experts and Frequent Users, Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA, https://arxiv.org/abs/2401.14746 https://doi.org/https://doi.org/10.1145/3613904.3642542
Andreas Liesenfeld, Alianda Lopez, Mark Dingemanse, 28 Jul 2023, The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems, https://arxiv.org/abs/2307.15493
William Seymour, Xiao Zhan, Mark Cote, Jose Such, 8 Jun 2023, Who are CUIs Really For? Representation and Accessibility in the Conversational User Interface Literature, https://arxiv.org/abs/2306.05228
Open WebUI, 2024, Open WebUI (Formerly Ollama WebUI), https://github.com/open-webui/open-webui
Xhoni Shollaj, 2024, Awesome LLM WebUIs, https://github.com/JShollaj/Awesome-LLM-Web-UI
Sujeet Kumar, May 20, 2024, 14 Best Software for Running local LLM, https://scifilogic.com/interface-for-running-local-llm/
Mauro Sicard, Miguel Joya, LanguageGUI is the UI Kit for LLMs, 2024, https://languagegui.com/
Reddit, 2024, New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio, https://www.reddit.com/r/LocalLLaMA/comments/12yc8op/new_open_source_framework_and_nocode_gui_for/
LLM-UI, 2024, The React library for LLMs, https://llm-ui.com/
Reddit, 2024, LLM Web-UI recommendations, https://www.reddit.com/r/LocalLLaMA/comments/1847qt6/llm_webui_recommendations/
Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Jonathan Tien, Nan Duan, Furu Wei, 1 Apr 2024 (v3), Low-code LLM: Graphical User Interface over Large Language Models, https://arxiv.org/abs/2304.08103 https://github.com/chenfei-wu/TaskMatrix/tree/main/LowCodeLLM https://www.youtube.com/watch?v=jb2C1vaeO3E
Ramalingame, Hari, May 2024, Deployable Web GUI for LLM Applications, Thesis, Arizona State University, https://keep.lib.asu.edu/items/192554
by Jarrett Yeo and Tammy Lim , 12 DEC 2023, Create a web UI to interact with LLMs using Amazon SageMaker JumpStart, https://aws.amazon.com/blogs/machine-learning/create-a-web-ui-to-interact-with-llms-using-amazon-sagemaker-jumpstart/
Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou; 2024, AssistGUI: Task-Oriented PC Graphical User Interface Automation, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 13289-13298, https://openaccess.thecvf.com/content/CVPR2024/html/Gao_AssistGUI_Task-Oriented_PC_Graphical_User_Interface_Automation_CVPR_2024_paper.html https://openaccess.thecvf.com/content/CVPR2024/papers/Gao_AssistGUI_Task-Oriented_PC_Graphical_User_Interface_Automation_CVPR_2024_paper.pdf https://openaccess.thecvf.com/content/CVPR2024/supplemental/Gao_AssistGUI_Task-Oriented_PC_CVPR_2024_supplemental.pdf
Jie Gao, Simret Araya Gebreegziabher, Kenny Tsu Wei Choo, Toby Jia-Jun Li, Simon Tangi Perrault, Thomas W. Malone, 30 Mar 2024, A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration, https://arxiv.org/abs/2404.00405 https://dl.acm.org/doi/abs/10.1145/3613905.3650786
Prakash Joshi Pax, Aug 26, 2024, Fabric: The Best AI Tool That Nobody is Talking About. An open-source AI tool to automate every day tasks https://beingpax.medium.com/why-fabric-ai-can-change-the-way-you-use-ai-973e725354da
Nick: The AI Guru, Aug 15, 2024, Why Perplexity AI Has Been a Game Changer For Me, https://medium.com/@nickm9/why-perplexity-ai-has-been-a-game-changer-for-me-b38976bdc1b4
Michal Malewicz, Sep 3, 2024, Ugly websites sell better. Web design is getting out of hand again. https://michalmalewicz.medium.com/ugly-websites-sell-better-0b0354ebff10
Yicheng Fu, Raviteja Anantha, Prabal Vashisht, Jianpeng Cheng, Etai Littwin, 6 Sep 2024, UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity, https://www.arxiv.org/abs/2409.04081
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Simon Willison, Sep 2024, How streaming LLM APIs work, https://til.simonwillison.net/llms/streaming-llm-apis
Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
Mareike Hartmann, Alexander Koller, 27 Sep 2024, A Survey on Complex Tasks for Goal-Directed Interactive Agents, https://arxiv.org/abs/2409.18538 https://coli-saar.github.io/interactive-agents
Emilia David, October 3, 2024, OpenAI launches ChatGPT Canvas, challenging Claude Artifacts, https://venturebeat.com/ai/openai-launches-chatgpt-canvas-challenging-claude-artifacts/
Sabrina Ortiz, Oct. 7, 2024, I test ChatGPT features for a living, and this new one really did supercharge my productivity. If you use OpenAI's generative AI tool to co-edit code or text, Canvas will take your work to a whole new level, https://www.zdnet.com/article/i-test-chatgpt-features-for-a-living-and-this-new-one-really-did-supercharge-my-productivity/
Emilia David, October 17, 2024, Google launches NotebookLM Business to make enterprise AI audio, text, https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/
David Gewirtz, Oct. 25, 2024, I wrote half this article on Apple Watch, thanks to this under-the-radar iOS 18 feature: Here's how to transform your writing workflow and turn your Apple Watch into a productivity powerhouse, https://www.zdnet.com/article/i-wrote-half-this-article-on-apple-watch-thanks-to-this-under-the-radar-ios-18-feature/
LangChain, Jul 26, 2024, UX for Agents, Part 1: Chat, https://blog.langchain.dev/ux-for-agents-part-1-chat-2/
LangChain, Aug 2, 2024, UX for Agents, Part 2: Ambient, https://blog.langchain.dev/ux-for-agents-part-2-ambient/
LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
Lance Whitney, Oct. 30, 2024, Apple Watch lets you translate your conversations in real-time. Here's how: WatchOS 11's Translate app lets you have a live conversation in two languages with another person - right from your wrist, https://www.zdnet.com/article/apple-watch-lets-you-translate-your-conversations-in-real-time-heres-how/
Julia Winn, Oct 2024, The AI Productivity Paradox: Why Aren’t More Workers Using ChatGPT? The real barrier isn’t technical skills — it’s time to think. https://towardsdatascience.com/the-ai-productivity-paradox-why-arent-more-workers-using-chatgpt-a1dfe96a9460
Lance Whitney, Oct. 31, 2024, Claude AI adds desktop apps and dictation mode – here's how to use them, https://www.zdnet.com/article/claude-ai-adds-desktop-apps-and-dictation-mode-heres-how-to-use-them/
K. Balázs Neszlényi, A. Milos and A. Kiss, "AssistantGPT: Enhancing User Interaction with LLM Integration," 2024 IEEE 22nd Jubilee International Symposium on Intelligent Systems and Informatics (SISY), Pula, Croatia, 2024, pp. 000619-000624, doi: 10.1109/SISY62279.2024.10737548. https://ieeexplore.ieee.org/abstract/document/10737548
OpenAI, October 3, 2024, Introducing canvas: A new way of working with ChatGPT to write and code, https://openai.com/index/introducing-canvas/
Emilia David, November 14, 2024, OpenAI launches ChatGPT desktop integrations, rivaling Copilot, https://venturebeat.com/ai/openai-launches-chatgpt-desktop-integrations-rivaling-copilot/
swyx, Sep 2024, What Works in AI UX (lightning talk + Q&A), https://www.youtube.com/watch?v=PkHjoihjo6U
swyx & Alessio, Maggie Appleton, Linus Lee, and Geoffrey Litt, Apr 27, 2023, It's Time To Build AI | UX. Bridging the Capability Overhang from Generative AI to Generative UI, https://www.latent.space/p/build-ai-ux
Akash Bajwa, Nov 18, 2024, Opinionated AI Products: Strong Technologies Forms Beliefs, https://akashbajwa.substack.com/p/opinionated-ai-products
Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
Tiernan Ray, Nov. 21, 2024 , Even Nvidia's CEO is obsessed with Google's NotebookLM AI tool, https://www.zdnet.com/article/even-nvidias-ceo-is-obsessed-with-googles-notebooklm-ai-tool/
Ethan Mollick, Nov 24, 2024, Getting started with AI: Good enough prompting. Don't make this hard. https://www.oneusefulthing.org/p/getting-started-with-ai-good-enough
Charlie Guo, Nov 15, 2024, The Chatbot Trap. Why AI products really need some better UX. https://www.ignorance.ai/p/the-chatbot-trap
Christian Swinehart, Dec 2024, Skia-Canvas: A GPU-accelerated 2D graphics environment for Node.js, https://github.com/samizdatco/skia-canvas
Charles Rollet, December 4, 2024, Key leaders behind Google’s viral NotebookLM are leaving to create their own startup, https://techcrunch.com/2024/12/04/key-leaders-behind-googles-viral-notebooklm-are-leaving-to-create-their-own-startup/ ("As the frontier models and their capabilities continue to grow, thoughtful products are required to make the benefits of this technology accessible, useful, and obvious to everyday people — so our team is going to be focused on building a user-first AI product...the team wanted to create something that leverages the latest AI models to build something useful to regular people.")
Ian Drosos, Jack Williams, Advait Sarkar, Nicholas Wilson, 3 Dec 2024, Dynamic Prompt Middleware: Contextual Prompt Refinement Controls for Comprehension Tasks, https://arxiv.org/abs/2412.02357
Emilia David, December 10, 2024, OpenAI expands ChatGPT Canvas to all users, https://venturebeat.com/ai/openai-expands-chatgpt-canvas-to-all-users/
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Sabrina Ortiz, Dec. 13, 2024, ChatGPT finally gets easier to organize on the 7th day of OpenAI, https://www.zdnet.com/article/chatgpt-finally-gets-easier-to-organize-on-the-7th-day-of-openai/
Maxwell Zeff, November 20, 2024, Current AI scaling laws are showing diminishing returns, forcing AI labs to change course, https://techcrunch.com/2024/11/20/ai-scaling-laws-are-showing-diminishing-returns-forcing-ai-labs-to-change-course/ ("at least 10 to 20x gains in model performance ...intelligent prompting, UX decisions, and passing context at the right time into the models...")
Google, Dec 2024, Welcome to Project IDX, a new web-based development workspace from Google. IDX is designed to make it faster and easier to build, ship, and manage full-stack, multiplatform apps from the comfort of your browser. https://idx.google.com/
Avi Siegel, Dec 2024, Features shouldn’t feel like features: Why (and how) to craft product experiences that feel inevitable, https://uxdesign.cc/features-shouldnt-feel-like-features-fba44644f961
Kartik Hosanagar, Daehwan Ahn, 14 Dec 2024, Designing Human and Generative AI Collaboration, https://arxiv.org/abs/2412.14199
Google, Dec 2024, multimodal-live-api-web-console: A react-based starter app for using the Multimodal Live API over websockets with Gemini, https://github.com/google-gemini/multimodal-live-api-web-console
Will Whitney, Dec 2024, Computing inside an AI: What would it mean to treat AI as a tool instead of a person? https://willwhitney.com/computing-inside-ai.html
Latent Space, Jan 05, 2025, AI Engineering for Art — with comfyanonymous, of ComfyUI, Using models for "Art Engineering", building hard to use UIs, and how image generation is moving from text boxes to DAGs https://www.latent.space/p/comfyui
comfyanonymous, Jan 2025, ComfyUI: The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface, https://github.com/comfyanonymous/ComfyUI
Boqiang Liang, Jan 2025, SaaS is Dead, Says Microsoft CEO, https://medium.com/@lbq999/saas-is-dead-says-microsoft-ceo-a8cff2a516c4
Ori Ziv, Jan 2025, How AI Agents Will Disrupt SaaS in 2025, https://medium.com/@oriziv4/how-ai-agents-will-disrupt-saas-in-2025-7567d793ca68
Johan Uddståhl, Jan 2, 2025, …when all we ever needed was a text box, or how 2025 will be back to basics for the web, https://medium.com/@baktakt/when-all-we-ever-needed-was-a-text-box-c672c52a0dca
Tari Ibaba, Jan 2025, AI is killing apps, https://medium.com/coding-beauty/ai-is-killing-apps-868a7b59fafe
James Currier, Jan 2025, Consumer is Back – And Why It’s Been So Hard Since 2014, https://www.nfx.com/post/consumer-is-back
Akash Bajwa, Feb 03, 2025, Forward Deployed Engineers: A Means To An End For AI Startups: Capturing Business Logic And Expert Reasoning, https://akashbajwa.substack.com/p/forward-deployed-engineers-a-means (" AI truly is a new way of computing, and that means the better analogies are to computing itself. Transformers are the transistor, and mainframes are today’s models. The GUI is, arguably, still TBD.")
Olivia Moore, Feb 20025, AI Voice Agent Update - 2025, A16Z, https://a16z.com/ai-voice-agents-2025-update/ https://gamma.app/docs/a16z-AI-Voice-Update-2025--ttkorld8iy6wfnj?mode=doc (Thesis that voice will be the primary AI interface for consumers.)
Sharon Goldman, December 13, 2023, Lightning AI debuts ‘iPhone approach’ to new AI dev platform, https://venturebeat.com/ai/lightning-ai-debuts-iphone-approach-to-new-ai-dev-platform/
Daniel Delaney, Feb 2025, Chat is a bad UI pattern for development tools, https://danieldelaney.net/chat/
Jack Wallen, Feb. 6, 2025, I tried to replace my desktop with a phone for work - 5 frustrating lessons I learned As phones continue to win the consumer war against desktops and laptops, those who swear by our PCs will never give in to the lure of mobile-only. Here's why. https://www.zdnet.com/article/i-tried-to-replace-my-desktop-with-a-phone-for-work-5-frustrating-lessons-i-learned/
Alexander Deplov, Feb 12, 2025, How I Automated My Computer Routine With macOS Folder Actions, https://interfacecraft.online/posts/blog/2025/how-i-automated-my-computer-life-with-macos-folder-actions/
M.G. Siegler, Feb 14, 2025, The Great AI UI Unification. ChatGPT starts cleaning up the cruft..., https://spyglass.org/chatgpt-ai-ui/
Rex Woodbury, Feb 20, 2025, How Consumer Psychology Informs AI Product Design: The IKEA Effect, the Paradox of Choice, and AI's Interface Problem, https://www.digitalnative.tech/p/how-consumer-psychology-informs-ai
Yaakov Carno, Feb 24, 2025, The surprising patterns behind viral AI products: A deep dive into Bolt, Cursor, Granola, PhotoRoom, Replit and more, https://open.substack.com/pub/kylepoyar/p/ai-ux-patterns (The "surprising pattern" in successful AI products is that they all have a slick UI.)
Tetiana Sydorenko, Feb 2025, AI is reshaping UI — have you noticed the biggest change yet? https://uxdesign.cc/ai-is-reshaping-ui-have-you-noticed-the-biggest-change-yet-ee80efcbf8a5
Andrew Zuo, March 2025, Developers Are Keeping The Best AI Interface To Themselves, https://andrewzuo.com/developers-are-keeping-the-best-ai-interface-to-themselves-f558261ee109
Leixian Shen, Haotian Li, Yifang Wang, Xing Xie, Huamin Qu, 4 Mar 2025, Prompting Generative AI with Interaction-Augmented Instructions, https://arxiv.org/abs/2503.02874
Dom Couldwell, Mar 10, 2025, Building generative AI? Get ready for generative UI,https://www.infoworld.com/article/3834886/building-generative-ai-get-ready-for-generative-ui.html
Anthropic, Mar 2025, Text editor tool, https://docs.anthropic.com/en/docs/build-with-claude/tool-use/text-editor-tool
Dave Citron, Mar 18, 2025, New ways to collaborate and get creative with Gemini: Explore Gemini's latest features: Canvas, a new interactive space for refining your documents and code and Audio Overview, which transform your files into engaging podcast-style discussions, Google Blog, https://blog.google/products/gemini/gemini-collaboration-features/
RM Amin, OH Kühle, D Buschek, A Butz, 2025, Composable Prompting Workspaces for Creative Writing: Exploration and Iteration Using Dynamic Widgets, https://www.medien.ifi.lmu.de/pubdb/publications/pub/amin2025chi/amin2025chi.pdf
Nikunj Kothari. Mar 21, 2025, Beyond Chat: The New Patterns of AI Interfaces,https://writing.nikunjk.com/p/beyond-chat
Zachary DeWitt, May 21, 2025, Why Agents Break PLG (And How to Rebuild It), https://www.notoriousplg.ai/p/why-agents-break-plg-and-how-to-rebuild
Sarah Perez, June 10, 2025, Love it or hate it? Apple’s new ‘Liquid Glass’ design is getting mixed reviews, https://techcrunch.com/2025/06/10/love-it-or-hate-it-apples-new-liquid-glass-design-is-getting-mixed-reviews/
Rex Woodbury, Jun 18, 2025, The Opportunities in Consumer AI: Mapping Where to Build + Examining Changes in Product Design & Business Model, https://www.digitalnative.tech/p/the-opportunities-in-consumer-ai
Sameer Singh, June 4, 2025, Stop Asking, Start Showing: Why GUIs Still Win in the Age of AI, https://www.speedinvest.com/blog/consumer-ai-cognitive-load-and-the-gui
Tomasz Tunguz, Jun 27, 2025, Voice, Context & Control: The Three Pillars of Useful AI Email, https://tomtunguz.com/my-own-ai-email-generator/ ("We’re still in the horseless carriage era of AI applications. The breakthrough will come when software adapts to us instead of forcing us to adapt to it.")
LangDiff, Aug 2025, LangDiff: Progressive UI from LLM: LangDiff is a Python library that solves the hard problems of streaming structured LLM outputs to frontends, https://github.com/globalaiplatform/langdiff https://langdiff.readthedocs.io/en/latest/
Kenneth Wolters, Aug 12, 2025, No AGI in Sight: What This Means for LLMs, https://kennethwolters.com/posts/no-agi/
Google, Aug 2025, LangExtract: A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization, https://pypi.org/project/langextract/ https://github.com/google/langextract
Ivan Mehta, Aug 2025, Apple’s new Siri may allow users to operate apps just using voice, https://techcrunch.com/2025/08/11/apples-new-siri-may-allow-users-to-operate-apps-just-using-voice/
Matthew Tyson, Aug 13, 2025, Hands-on with Svelte: Build-time compilation in a reactive framework, https://www.infoworld.com/article/2265950/hands-on-with-svelte.html
MKWriteshere, Aug 2025, Microsoft Just Solved AI’s Biggest Problem: Why Magentic-UI Changes Everything: How human-AI collaboration beats pure automation every time with 71% better results, https://pub.towardsai.net/microsoft-just-solved-ais-biggest-problem-why-magentic-ui-changes-everything-ae09b5d09223
Ivan Mehta, August 18, 2025, Grammarly gets a design overhaul, multiple AI features, https://techcrunch.com/2025/08/18/grammarly-gets-a-design-overhaul-multiple-ai-features/
Fei Tang, Zhangxuan Gu, Zhengxi Lu, Xuyang Liu, Shuheng Shen, Changhua Meng, Wen Wang, Wenqi Zhang, Yongliang Shen, Weiming Lu, Jun Xiao, Yueting Zhuang, 22 Jul 2025, GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding, https://arxiv.org/abs/2507.15846
ZongHan Hsieh, Tzer-Jen Wei, ShengJing Yang, 18 Jul 2025, ZonUI-3B: A Lightweight Vision-Language Model for Cross-Resolution GUI Grounding, https://arxiv.org/abs/2506.23491
Ammar Ahmed, Ali Shariq Imran, 17 Jul 2025, The role of large language models in UI/UX design: A systematic literature review, https://arxiv.org/abs/2507.04469
Benjamin Raphael Ernhofer, Daniil Prokhorov, Jannica Langner and Dominik Bollmann, 20 Jul 2025, Leveraging Vision-Language Models for Visual Grounding and Analysis of Automotive UI, https://arxiv.org/abs/2505.05895
Shuquan Lian, Yuhang Wu, Jia Ma, Yifan Ding, Zihan Song, Bingqi Chen, Xiawu Zheng, Hui Li, 9 Aug 2025, UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding, https://arxiv.org/abs/2507.22025
Miaosen Zhang, Ziqiang Xu, Jialiang Zhu, Qi Dai, Kai Qiu, Yifan Yang, Chong Luo, Tianyi Chen, Justin Wagle, Tim Franklin, Baining Guo, 31 Jul 2025, Phi-Ground Tech Report: Advancing Perception in GUI Grounding, https://arxiv.org/abs/2507.23779
Hussein Mozannar, Gagan Bansal, Cheng Tan, Adam Fourney, Victor Dibia, Jingya Chen, Jack Gerrits, Tyler Payne, Matheus Kunzler Maldaner, Madeleine Grunde-McLaughlin, Eric Zhu, Griffin Bassman, Jacob Alber, Peter Chang, Ricky Loynd, Friederike Niedtner, Ece Kamar, Maya Murad, Rafah Hosn, Saleema Amershi, 30 Jul 2025, Magentic-UI: Towards Human-in-the-loop Agentic Systems, https://arxiv.org/abs/2507.22358
Zihan Zheng, Tianle Cui, Chuwen Xie, Jiahui Zhang, Jiahui Pan, Lewei He, Qianglong Chen, 2 Aug 2025, NatureGAIA: Pushing the Frontiers of GUI Agents with a Challenging Benchmark and High-Quality Trajectory Dataset, https://arxiv.org/abs/2508.01330
Zhihao Luo and Wentao Yan abd Jingyu Gong and Min Wang and Zhizhong Zhang and Xuhong Wang and Yuan Xie and Xin Tan, 4 Aug 2025, NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks, https://arxiv.org/abs/2508.02046
Zheng Wu and Pengzhou Cheng and Zongru Wu and Lingzhong Dong and Zhuosheng Zhang, 4 Aug 2025, GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection in GUI Agents, https://arxiv.org/abs/2505.12842
Jaehyun Jeon, Min Soo Kim, Jang Han Yoon, Sumin Shim, Yejin Choi, Hanbin Kim, Youngjae Yu, 4 Aug 2025, Do MLLMs Capture How Interfaces Guide User Behavior? A Benchmark for Multimodal UI/UX Design Understanding, https://arxiv.org/abs/2505.05026
Chao Hao, Shuai Wang and Kaiwen Zhou, 6 Aug 2025, Uncertainty-Aware GUI Agent: Adaptive Perception through Component Recommendation and Human-in-the-Loop Refinement, https://arxiv.org/abs/2508.04025
Weitai Kang, Bin Lei, Gaowen Liu, Caiwen Ding, Yan Yan, 6 Aug 2025, GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement Learning, https://arxiv.org/abs/2508.04389
Liujian Tang, Shaokang Dong, Yijia Huang, Minqi Xiang, Hongtao Ruan, Bin Wang, Shuo Li, Zhihui Cao, Hailiang Pang, Heng Kong, He Yang, Mingxu Chai, Zhilin Gao, Xingyu Liu, Yingnan Fu, Jiaming Liu, Tao Gui, Xuanjing Huang, Yu-Gang Jiang, Qi Zhang, Kang Wang, Yunke Zhang, Yuran Wang, 19 Jul 2025, MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning, https://arxiv.org/abs/2508.03700
Wenkang Han, Zhixiong Zeng, Jing Huang, Shu Jiang, Liming Zheng, Haibo Qiu, Chang Yao, Jingyuan Chen, Lin Ma, 6 Aug 2025, UITron-Speech: Towards Automated GUI Agents Based on Speech Instructions, https://arxiv.org/abs/2506.11127
Yong Du, Yuchen Yan, Fei Tang, Zhengxi Lu, Chang Zong, Weiming Lu, Shengpei Jiang, Yongliang Shen, 7 Aug 2025, Test-Time Reinforcement Learning for GUI Grounding via Region Consistency, https://arxiv.org/abs/2508.05615
Yuhang Liu, Zeyu Liu, Shuanghe Zhu, Pengxiang Li, Congkai Xie, Jiasheng Wang, Xueyu Hu, Xiaotian Han, Jianbo Yuan, Xinyao Wang, Shengyu Zhang, Hongxia Yang, Fei Wu, 7 Aug 2025, InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization, https://arxiv.org/abs/2508.05731
Songqin Nong, Jingxuan Xu, Sheng Zhou, Jianfeng Chen, Xiaoxuan Tang, Tao Jiang, Wenhao Xu, 15 Aug 2025, CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks, https://arxiv.org/abs/2508.11360
Jikai Chen, Long Chen, Dong Wang, Leilei Gan, Chenyi Zhuang, Jinjie Gu, 19 Aug 2025, V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task, https://arxiv.org/abs/2508.13634
Yutong Bian, Xianhao Lin, Yupeng Xie, Tianyang Liu, Mingchen Zhuge, Siyuan Lu, Haoming Tang, Jinlin Wang, Jiayi Zhang, Jiaqi Chen, Xiangru Tang, Yongxin Ni, Sirui Hong, Chenglin Wu, 17 Aug 2025, You Don't Know Until You Click:Automated GUI Testing for Production-Ready Software Evaluation, https://arxiv.org/abs/2508.14104
Jiabo Ye, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Zhaoqing Zhu, Ziwei Zheng, Feiyu Gao, Junjie Cao, Zhengxi Lu, Jitong Liao, Qi Zheng, Fei Huang, Jingren Zhou, and Ming Yan, 21 Aug 2025, Mobile-Agent-v3: Foundamental Agents for GUI Automation, https://arxiv.org/abs/2508.15144
Hayden Bleasel, Ryan Haraki, Aug 6, 2025, Introducing AI Elements: Prebuilt, composable AI SDK components, https://vercel.com/changelog/introducing-ai-elements
Yi Xu, Yesheng Zhang, jiajia Liu, Jingdong Chen, 22 Aug 2025, Structuring GUI Elements through Vision Language Models: Towards Action Space Generation, https://arxiv.org/abs/2508.16271
El Hassane Ettifouri and Jessica L\'opez Espejel and Laura Minkova and Tassnim Dardouri and Walid Dahhane, 18 Jul 2025, Visual Grounding Methods for Efficient Interaction with Desktop Graphical User Interfaces, https://arxiv.org/abs/2407.01558
Wanfu Wang, Qipeng Huang, Guangquan Xue, Xiaobo Liang, Juntao Li, 4 Sep 2025, Learning Active Perception via Self-Evolving Preference Optimization for GUI Grounding, https://arxiv.org/abs/2509.04243
Hongyi Jing, Jiafu Chen, Chen Rao, Ziqiang Dang, Jiajie Teng, Tianyi Chu, Juncheng Mo, Shuo Fang, Huaizhong Lin, Rui Lv, Chenguang Ma, Lei Zhao, 5 Sep 2025, SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing, https://arxiv.org/abs/2509.04908
Dang Nguyen, Jian Chen, Yu Wang, Gang Wu, Namyong Park, Zhengmian Hu, Hanjia Lyu, Junda Wu, Ryan Aponte, Yu Xia, Xintong Li, Jing Shi, Hongjie Chen, Viet Dac Lai, Zhouhang Xie, Sungchul Kim, Ruiyi Zhang, Tong Yu, Mehrab Tanjim, Nesreen K. Ahmed, Puneet Mathur, Seunghyun Yoon, Lina Yao, Branislav Kveton, Thien Huu Nguyen, Trung Bui, Tianyi Zhou, Ryan A. Rossi, Franck Dernoncourt, 4 Sep 2025, GUI Agents: A Survey, https://arxiv.org/abs/2412.13501
Gaole Dai, Shiqi Jiang, Ting Cao, Yuanchun Li, Yuqing Yang, Rui Tan, Mo Li, Lili Qiu, 5 Sep 2025, Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment, https://arxiv.org/abs/2503.15937
Hang Wu, Hongkai Chen, Yujun Cai, Chang Liu, Qingwen Ye, Ming-Hsuan Yang, Yiwei Wang, 5 Sep 2025, DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning, https://arxiv.org/abs/2507.00008
Haoming Wang, Haoyang Zou, Huatong Song, Jiazhan Feng, Junjie Fang, Junting Lu, Longxiang Liu, Qinyu Luo, Shihao Liang, Shijue Huang, Wanjun Zhong, Yining Ye, Yujia Qin, Yuwen Xiong, Yuxin Song, Zhiyong Wu, Aoyan Li, Bo Li, Chen Dun, Chong Liu, Daoguang Zan, Fuxing Leng, Hanbin Wang, Hao Yu, Haobin Chen, Hongyi Guo, Jing Su, Jingjia Huang, Kai Shen, Kaiyu Shi, Lin Yan, Peiyao Zhao, Pengfei Liu, Qinghao Ye, Renjie Zheng, Shulin Xin, Wayne Xin Zhao, Wen Heng, Wenhao Huang, Wenqian Wang, Xiaobo Qin, Yi Lin, Youbin Wu, Zehui Chen, Zihao Wang, Baoquan Zhong, Xinchun Zhang, Xujing Li, Yuanfan Li, Zhongkai Zhao, Chengquan Jiang, Faming Wu, Haotian Zhou, Jinlin Pang, Li Han, Qi Liu, Qianli Ma, Siyao Liu, Songhua Cai, Wenqi Fu, Xin Liu, Yaohui Wang, Zhi Zhang, Bo Zhou, Guoliang Li, Jiajun Shi, Jiale Yang, et al. (45 additional authors not shown), 5 Sep 2025, UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning, https://arxiv.org/abs/2509.02544
Yuyang Zhao, Wentao Shi, Fuli Feng, and Xiangnan He, 26 Aug 2025, AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance, https://arxiv.org/abs/2508.18689
Maximilian Frank and Simon Lund, 26 Aug 2025, Insights into User Interface Innovations from a Design Thinking Workshop at deRSE25, https://arxiv.org/abs/2508.18784
Quanfeng Lu, Zhantao Ma, Shuai Zhong, Jin Wang, Dahai Yu, Michael K. Ng, Ping Luo, 27 Aug 2025, SWIRL: A Staged Workflow for Interleaved Reinforcement Learning in Mobile GUI Control, https://arxiv.org/abs/2508.20018
Jaewoo Ahn, Junseo Kim, Heeseung Yun, Jaehyeon Son, Dongmin Park, Jaewoong Cho, Gunhee Kim, 1 Sep 2025, FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games, https://arxiv.org/abs/2509.01052
Pengxiang Zhao, Guangyi Liu, Yaozhen Liang, Weiqing He, Zhengxi Lu, Yuehao Huang, Yaxuan Guo, Kexin Zhang, Hao Wang, Liang Liu, Yong Liu, 8 Sep 2025, MAS-Bench: A Unified Benchmark for Shortcut-Augmented Hybrid Mobile GUI Agents, https://arxiv.org/abs/2509.06477
Yannick Kalff, Katharina Simbeck, 8 Sep 2025, Explained, yet misunderstood: How AI Literacy shapes HR Managers' interpretation of User Interfaces in Recruiting Recommender Systems, https://arxiv.org/abs/2509.06475
Anthropic, 10 Sept 2025, Claude can now create and edit files, https://www.anthropic.com/news/create-files
Dr. Derek Austin, Oct 2025, The Frontend Stack Wars Are Over. OpenAI Just Picked a Winner, https://medium.com/according-to-context/the-frontend-stack-wars-are-over-openai-just-picked-a-winner-099fa716f3f0
Jungjae Lee, Dongjae Lee, Chihun Choi, Youngmin Im, Jaeyoung Wi, Kihong Heo, Sangeun Oh, Sunjae Lee, Insik Shin, 11 Sep 2025, VeriSafe Agent: Safeguarding Mobile GUI Agent via Logic-based Action Verification, https://arxiv.org/abs/2503.18492
Musen Lin, Minghao Liu, Taoran Lu, Lichen Yuan, Yiwei Liu, Haonan Xu, Yu Miao, Yuhao Chao, Zhaojian Li, 19 Sep 2025, GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning, https://arxiv.org/abs/2509.15738
Xianhang Ye, Yiqing Li, Wei Dai, Miancan Liu, Ziyuan Chen, Zhangye Han, Hongbo Min, Jinkui Ren, Xiantao Zhang, Wen Yang, Zhi Jin, 19 Sep 2025, GUI-ARP: Enhancing Grounding with Adaptive Region Perception for GUI Agents, https://arxiv.org/abs/2509.15532
Shaojie Zhang, Ruoceng Zhang, Pei Fu, Shaokang Wang, Jiahui Yang, Xin Du, Shiqi Cui, Bin Qin, Ying Huang, Zhenbo Luo, Jian Luan, 19 Sep 2025, BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent, https://arxiv.org/abs/2509.15566
Zhengxi Lu, Jiabo Ye, Fei Tang, Yongliang Shen, Haiyang Xu, Ziwei Zheng, Weiming Lu, Ming Yan, Fei Huang, Jun Xiao, Yueting Zhuang, 15 Sep 2025, UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning, https://arxiv.org/abs/2509.11543
Jingyu Tang, Chaoran Chen, Jiawen Li, Zhiping Zhang, Bingcan Guo, Ibrahim Khalilov, Simret Araya Gebreegziabher, Bingsheng Yao, Dakuo Wang, Yanfang Ye, Tianshi Li, Ziang Xiao, Yaxing Yao, Toby Jia-Jun Li, 12 Sep 2025, Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight, https://arxiv.org/abs/2509.10723
Jingyu Xiao and Zhongyi Zhang and Yuxuan Wan and Yintong Huo and Yang Liu and Michael R.Lyu, 15 Sep 2025, EfficientUICoder: Efficient MLLM-based UI Code Generation via Input and Output Token Compression, https://arxiv.org/abs/2509.12159
Aryan Garg, Yue Jiang, Antti Oulasvirta, 13 Sep 2025, Controllable GUI Exploration, https://arxiv.org/abs/2502.03330
Zongru Wu, Rui Mao, Zhiyuan Tian, Pengzhou Cheng, Tianjie Ju, Zheng Wu, Lingzhong Dong, Haiyue Sheng, Zhuosheng Zhang, Gongshen Liu, 17 Sep 2025, See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles, https://arxiv.org/abs/2509.13615
Liangtao Lin, Zhaomeng Zhu, Tianwei Zhang and Yonggang Wen, 17 Sep 2025, InfraMind: A Novel Exploration-based GUI Agentic Framework for Mission-critical Industrial Management, https://arxiv.org/abs/2509.13704
OpenAI, October 21, 2025 Introducing ChatGPT Atlas: The browser with ChatGPT built in, https://openai.com/index/introducing-chatgpt-atlas/
Louis Columbus, September 19, 2025, Legacy UI is dead: Shadow AI is how real work gets done now, https://venturebeat.com/security/legacy-ui-is-dead-shadow-ai-is-how-real-work-gets-done-now
Jenny T. Liang, Titus Barik, Jeffrey Nichols, Eldon Schoop, Ruijia Cheng, 14 Oct 2025, AgentBuilder: Exploring Scaffolds for Prototyping User Experiences of Interface Agents, https://arxiv.org/abs/2510.04452
Shuqing Li, Binchang Li, Yepang Liu, Cuiyun Gao, Jianping Zhang, Shing-Chi Cheung, Michael R. Lyu, 1 Oct 2025, Grounded GUI Understanding for Vision-Based Spatial Intelligent Agent: Exemplified by Extended Reality Apps, https://arxiv.org/abs/2409.10811
Ziang Ye, Yang Zhang, Wentao Shi, Xiaoyu You, Fuli Feng, Tat-Seng Chua, 24 Sep 2025, VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation, https://arxiv.org/abs/2507.06899
Weihua Cheng, Ersheng Ni, Wenlong Wang, Yifei Sun, Junming Liu, Wangyu Shen, Yirong Chen, Botian Shi, Ding Wang, 28 Oct 2025, MGA: Memory-Driven GUI Agent for Observation-Centric Interaction, https://arxiv.org/abs/2510.24168
Qiushi Sun, Mukai Li, Zhoumianze Liu, Zhihui Xie, Fangzhi Xu, Zhangyue Yin, Kanzhi Cheng, Zehao Li, Zichen Ding, Qi Liu, Zhiyong Wu, Zhuosheng Zhang, Ben Kao, Lingpeng Kong, 28 Oct 2025, OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows, https://arxiv.org/abs/2510.24411
Liangyu Chen, Hanzhang Zhou, Chenglin Cai, Jianan Zhang, Panrong Tong, Quyu Kong, Xu Zhang, Chen Liu, Yuqi Liu, Wenxuan Wang, Yue Wang, Qin Jin, Steven Hoi, 23 Oct 2025, UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning, https://arxiv.org/abs/2510.20286
Smit Desai, Jessie Chin, Dakuo Wang, Benjamin Cowan, Michael Twidale, 23 Oct 2025, Toward Metaphor-Fluid Conversation Design for Voice User Interfaces, https://arxiv.org/abs/2502.11554
Sofiya Garkot, Maksym Shamrai, Ivan Synytsia, Mariya Hirna, 16 Oct 2025, GUIrilla: A Scalable Framework for Automated Desktop UI Exploration, https://arxiv.org/abs/2510.16051
Pei Yang, Hai Ci, and Mike Zheng Shou, 18 Oct 2025, macOSWorld: A Multilingual Interactive Benchmark for GUI Agents, https://arxiv.org/abs/2506.04135
Junyu Lu, Songxin Zhang, Zejian Xie, Zhuoyang Song, Jiaxing Zhang, 22 Sep 2025, Orcust: Stepwise-Feedback Reinforcement Learning for GUI Agent, https://arxiv.org/abs/2509.17917
Jason Wu and Amanda Swearngin and Arun Krishna Vajjala and Alan Leung and Jeffrey Nichols and Titus Barik, 20 Sep 2025, Improving User Interface Generation Models from Designer Feedback, https://arxiv.org/abs/2509.16779
Tianbao Xie, Jiaqi Deng, Xiaochuan Li, Junlin Yang, Haoyuan Wu, Jixuan Chen, Wenjing Hu, Xinyuan Wang, Yuhui Xu, Zekun Wang, Yiheng Xu, Junli Wang, Doyen Sahoo, Tao Yu, Caiming Xiong, 24 Oct 2025, Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis, https://arxiv.org/abs/2505.13227
Hongze Mi, Yibo Feng, Wenjie Lu, Yuqi Wang, Jinyuan Li, Song Cao, He Cui, Tengfei Tian, Xuelin Zhang, Haotian Luo, Di Sun, Naiqiang Tan, Gang Pan, 26 Sep 2025, D-Artemis: A Deliberative Cognitive Framework for Mobile GUI Multi-Agents, https://arxiv.org/abs/2509.21799
Gaole Dai, Shiqi Jiang, Ting Cao, Yuqing Yang, Yuanchun Li, Rui Tan, Mo Li, Lili Qiu, 26 Sep 2025, ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration, https://arxiv.org/abs/2509.21823
Renqi Chen, Zeyin Tao, Jianming Guo, Jingzhe Zhu, Yiheng Peng, Qingqing Sun, Tianyi Zhang, Shuai Chen, 26 Sep 2025, RISK: A Framework for GUI Agents in E-commerce Risk Management, https://arxiv.org/abs/2509.21982
Seoyoung Lee, Seonbin Yoon, Seongbeen Lee, Hyesoo Kim, Joo Yong Sim, 26 Sep 2025, Log2Plan: An Adaptive GUI Automation Framework Integrated with Task Mining Approach, https://arxiv.org/abs/2509.22137
Jiannan Xiang, Yun Zhu, Lei Shu, Maria Wang, Lijun Yu, Gabriel Barcik, James Lyon, Srinivas Sunkara, Jindong Chen, 26 Sep 2025, UISim: An Interactive Image-Based UI Simulator for Dynamic Mobile Environments, https://arxiv.org/abs/2509.21733
Hans G.W. van Dam, 31 Aug 2025, A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants, https://arxiv.org/abs/2510.06223
Suyuchen Wang, Tianyu Zhang, Ahmed Masry, Christopher Pal, Spandana Gella, Bang Liu, Perouz Taslakian, 3 Oct 2025, Improving GUI Grounding with Explicit Position-to-Coordinate Mapping, https://arxiv.org/abs/2510.03230
Tao Xiong, Xavier Hu, Yurun Chen, Yuhang Liu, Changqiao Wu, Pengzhi Gao, Wei Liu, Jian Luan, Shengyu Zhang, 3 Oct 2025, GUI-PRA: Process Reward Agent for GUI Tasks, https://arxiv.org/abs/2509.23263
Ho Fai Leung, Xiaoyan Xi, Fei Zuo, 21 Oct 2025, AndroidControl-Curated: Revealing the True Potential of GUI Agents through Benchmark Purification, https://arxiv.org/abs/2510.18488
Cong Chen, Kaixiang Ji, Hao Zhong, Muzhi Zhu, Anzhou Li, Guo Gan, Ziyuan Huang, Cheng Zou, Jiajia Liu, Jingdong Chen, Hao Chen, Chunhua Shen, 28 Sep 2025, GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks, https://arxiv.org/abs/2509.23738
Pengxiang Li, Zechen Hu, Zirui Shang, Jingrong Wu, Yang Liu, Hui Liu, Zhi Gao, Chenrui Shi, Bofei Zhang, Zihao Zhang, Xiaochuan Shi, Zedong YU, Yuwei Wu, Xinxiao Wu, Yunde Jia, Liuyu Xiang, Zhaofeng He, Qing Li, 28 Sep 2025, Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation, https://arxiv.org/abs/2509.23866
Ran Xu, Kaixin Ma, Wenhao Yu, Hongming Zhang, Joyce C. Ho, Carl Yang, Dong Yu, 29 Sep 2025, Retrieval-augmented GUI Agents with Generative Guidelines, https://arxiv.org/abs/2509.24183
Hao Yang, Weijie Qiu, Ru Zhang, Zhou Fang, Ruichao Mao, Xiaoyu Lin, Maji Huang, Zhaosong Huang, Teng Guo, Shuoyang Liu, Hai Rao, 29 Sep 2025, UI-UG: A Unified MLLM for UI Understanding and Generation, https://arxiv.org/abs/2509.24361
Yan Yang and Dongxu Li and Yutong Dai and Yuhao Yang and Ziyang Luo and Zirui Zhao and Zhiyuan Hu and Junzhe Huang and Amrita Saha and Zeyuan Chen and Ran Xu and Liyuan Pan and Silvio Savarese and Caiming Xiong and Junnan Li, 29 Sep 2025, GTA1: GUI Test-time Scaling Agent, https://arxiv.org/abs/2507.05791
Bin Lei, Nuo Xu, Ali Payani, Mingyi Hong, Chunhua Liao, Yu Cao, Caiwen Ding, 5 Oct 2025, \textsc{GUI-Spotlight}: Adaptive Iterative Focus Refinement for Enhanced GUI Visual Grounding, https://arxiv.org/abs/2510.04039
Wenyi Wu, Kun Zhou, Ruoxin Yuan, Vivian Yu, Stephen Wang, Zhiting Hu, Biwei Huang, 10 Oct 2025, Auto-scaling Continuous Memory for GUI Agent, https://arxiv.org/abs/2510.09038
Reuben A. Luera, Ryan Rossi, Franck Dernoncourt, Samyadeep Basu, Sungchul Kim, Subhojyoti Mukherjee, Puneet Mathur, Ruiyi Zhang, Jihyung Kil, Nedim Lipka, Seunghyun Yoon, Jiuxiang Gu, Zichao Wang, Cindy Xiong Bearfield, Branislav Kveton, 9 Oct 2025, MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces, https://arxiv.org/abs/2510.08783
Longxi Gao, Li Zhang, Pengzhi Gao, Wei Liu, Jian Luan, Mengwei Xu, 10 Oct 2025, GUI-Shift: Enhancing VLM-Based GUI Agents through Self-supervised Reinforcement Learning, https://arxiv.org/abs/2505.12493
Yifan Xu, Xiao Liu, Xinghan Liu, Jiaqi Fu, Hanchen Zhang, Bohao Jing, Shudan Zhang, Yuting Wang, Wenyi Zhao, Yuxiao Dong, 24 Oct 2025, MobileRL: Online Agentic Reinforcement Learning for Mobile GUI Agents, https://arxiv.org/abs/2509.18119
Sanjari Srivastava, Gang Li, Cheng Chang, Rishu Garg, Manpreet Kaur, Charlene Y. Lee, Yuezhang Li, Yining Mao, Ignacio Cases, Yanan Xie, Peng Qi, 10 Oct 2025, WARC-Bench: Web Archive Based Benchmark for GUI Subtask Executions, https://arxiv.org/abs/2510.09872
Haitao Jia, Ming He, Zimo Yin, Likang Wu, Jianping Fan, Jitao Sang, 9 Oct 2025, ReInAgent: A Context-Aware GUI Agent Enabling Human-in-the-Loop Mobile Task Navigation, https://arxiv.org/abs/2510.07988
Zhen Yang, Zi-Yi Dou, Di Feng, Forrest Huang, Anh Nguyen, Keen You, Omar Attia, Yuhao Yang, Michael Feng, Haotian Zhang, Ram Ramrakhya, Chao Jia, Jeffrey Nichols, Alexander Toshev, Yinfei Yang, Zhe Gan, 30 Sep 2025, Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents, https://arxiv.org/abs/2509.26539
Flutter, Nov 2025 (accessed), Generative UI SDK for Flutter (genui), https://github.com/flutter/genui

Workflow

Research paper on workflow interfaces for AI applications:

Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
Orlando Marquez Ayala, Patrice Béchard, 29 Nov 2024, Generating a Low-code Complete Workflow via Task Decomposition and RAG, https://arxiv.org/abs/2412.00239
Laura Minkova, Jessica López Espejel, Taki Eddine Toufik Djaidja, Walid Dahhane, El Hassane Ettifouri, 4 Dec 2024, From Words to Workflows: Automating Business Processes, https://arxiv.org/abs/2412.03446
Isaac Sacolick, Jul 29, 2024, How to choose the right low-code, no-code, or process automation platform, https://www.infoworld.com/article/3476848/how-to-choose-the-right-low-code-no-code-or-process-automation-platform.html
Wes Brewer, Ana Gainaru, Frédéric Suter, Feiyi Wang, Murali Emani, Shantenu Jha, 20 Jun 2024, AI-coupled HPC Workflow Applications, Middleware and Performance, (Examines integrations of various workflows into LLMs.) https://arxiv.org/abs/2406.14315
Vishal Rajput, Apr 11, 2024, What’s next for AI: AI agentic workflows? https://medium.com/aiguys/next-for-llms-and-rag-ai-agentic-workflows-1869ba0a6796
Ben Sherry, August 15, 2024, The 3 Top AI Use Cases, According to Inc.5000 CEOs, https://www.inc-aus.com/ben-sherry/3-ways-inc-5000-companies-are-using-ai.html (Workflow automation, content creation, and "marketing" are the three use cases at over 50% penetration for businesses using AI.)
Lakshmi narayana .U, Jul 28, 2024, STORM: Stanford’s Revolutionary Research Tool Harnessing the Power of Agents and Agentic Workflows, https://blog.stackademic.com/storm-stanfords-revolutionary-research-tool-harnessing-the-power-of-agents-and-agentic-workflows-a2fa0e1a7fe3
Hao Wu, Yue Yu, and Junxiao Deng, Shadi Ibrahim, Inria; Song Wu and Hao Fan, Ziyue Cheng, Hai Jin, Huazhong, 2024, StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow, Usenix 2024, https://www.usenix.org/conference/atc24/presentation/wu-hao PDF: https://www.usenix.org/system/files/atc24-wu-hao.pdf
Ozgur Ozan Kilic, Tianle Wang, Matteo Turilli, Mikhail Titov, Andre Merzky, Line Pouchard, Shantenu Jha, 26 Mar 2024, Workflow Mini-Apps: Portable, Scalable, Tunable & Faithful Representations of Scientific Workflows, https://arxiv.org/abs/2403.18073
Zelong Li, Shuyuan Xu, Kai Mei, Wenyue Hua, Balaji Rama, Om Raheja, Hao Wang, He Zhu, Yongfeng Zhang, 1 Jul 2024, AutoFlow: Automated Workflow Generation for Large Language Model Agents, https://arxiv.org/abs/2407.12821 https://github.com/agiresearch/AutoFlow
Lukas Teufelberger, Xintong Liu, Zhipeng Li, Max Moebus, Christian Holz, 31 Jul 2024, LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows, https://arxiv.org/abs/2407.21593
Emilia David, September 10, 2024, ServiceNow introduces a library of enterprise AI agents you can customize to fit your workflow, https://venturebeat.com/ai/servicenow-introduces-a-library-of-enterprise-ai-agents-you-can-customize-to-fit-your-workflow/
Rinon Gal, Adi Haviv, Yuval Alaluf, Amit H. Bermano, Daniel Cohen-Or, Gal Chechik, 2 Oct 2024, ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation, https://arxiv.org/abs/2410.01731 https://comfygen-paper.github.io/
David Gewirtz, Oct. 25, 2024, I wrote half this article on Apple Watch, thanks to this under-the-radar iOS 18 feature: Here's how to transform your writing workflow and turn your Apple Watch into a productivity powerhouse, https://www.zdnet.com/article/i-wrote-half-this-article-on-apple-watch-thanks-to-this-under-the-radar-ios-18-feature/
Arun Shankar, Oct 2024, Designing Cognitive Architectures: Agentic Workflow Patterns from Scratch, https://medium.com/google-cloud/designing-cognitive-architectures-agentic-workflow-patterns-from-scratch-63baa74c54bc
AI Agent Workflows: A Complete Guide on Whether to Build With LangGraph or LangChain, Sandi Besen, Oct 2024, https://towardsdatascience.com/ai-agent-workflows-a-complete-guide-on-whether-to-build-with-langgraph-or-langchain-117025509fa0
Anita Kirkovska, David Vargas, Jul 11, 2024, Agentic Workflows in 2024: The ultimate guide, https://www.vellum.ai/blog/agentic-workflows-emerging-architectures-and-design-patterns
Shuofei Qiao, Runnan Fang, Zhisong Qiu, Xiaobin Wang, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen, 10 Oct 2024, Benchmarking Agentic Workflow Generation, https://arxiv.org/abs/2410.07869
A. Singh, A. Ehtesham, S. Kumar and T. T. Khoei, "Enhancing AI Systems with Agentic Workflows Patterns in Large Language Model," 2024 IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA, 2024, pp. 527-532, doi: 10.1109/AIIoT61789.2024.10578990. https://ieeexplore.ieee.org/abstract/document/10578990
Chawla, Chhavi; Chatterjee, Siddharth; Gadadinni, Sanketh Siddanna; Verma, Pulkit; Banerjee, Sourav, 2024, Agentic AI: The building blocks of sophisticated AI business applications, Journal of AI, Robotics & Workplace Automation, Volume 3 / Number 3 / Summer 2024, pp. 1-15(15), Henry Stewart Publications, DOI: https://doi.org/10.69554/XEHZ1946 https://www.ingentaconnect.com/content/hsp/airwa/2024/00000003/00000003/art00001
Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, Xionghui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu, 14 Oct 2024, AFlow: Automating Agentic Workflow Generation, https://arxiv.org/abs/2410.10762 https://github.com/geekan/MetaGPT
Amy Nichol Smith, Lauren Holznienkemper, Aug 25, 2024, Best Workflow Apps, https://www.forbes.com/advisor/business/software/best-workflow-app/
Kyle Wiggers, March 17, 2025, OpenAI to start testing ChatGPT connectors for Google Drive and Slack, https://techcrunch.com/2025/03/17/openai-to-start-testing-chatgpt-connectors-for-google-drive-and-slack/

Consoles

Anthropic, 21 May 2024, Generate better prompts in the developer console, https://www.anthropic.com/news/prompt-generator
Michael Nuñez, September 10, 2024, Is Anthropic’s new ‘Workspaces’ feature the future of enterprise AI management? https://venturebeat.com/ai/is-anthropics-new-workspaces-feature-the-future-of-enterprise-ai-management/
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Ellie Ko, Sep 25, 2024, A Survey of Python Frameworks, https://ploomber.io/blog/survey-python-frameworks/
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Emilia David, October 3, 2024, OpenAI launches ChatGPT Canvas, challenging Claude Artifacts, https://venturebeat.com/ai/openai-launches-chatgpt-canvas-challenging-claude-artifacts/
Sabrina Ortiz, Oct. 7, 2024, I test ChatGPT features for a living, and this new one really did supercharge my productivity. If you use OpenAI's generative AI tool to co-edit code or text, Canvas will take your work to a whole new level, https://www.zdnet.com/article/i-test-chatgpt-features-for-a-living-and-this-new-one-really-did-supercharge-my-productivity/
Emilia David, October 17, 2024, Google launches NotebookLM Business to make enterprise AI audio, text, https://venturebeat.com/ai/googles-notebooklm-will-expand-to-business-use-cases-soon/
Jason Perlow, Nov. 8, 2024, How to manage Bluesky, Mastodon, and Threads all from one free app Openvibe simplifies social media management with unified timelines, cross-posting, and customizable feeds for easier navigation of the digital landscape. Here's why you should try it. https://www.zdnet.com/article/how-to-manage-bluesky-mastodon-and-threads-all-from-one-free-app/
OpenAI, October 3, 2024, Introducing canvas: A new way of working with ChatGPT to write and code, https://openai.com/index/introducing-canvas/
swyx & Alessio, Maggie Appleton, Linus Lee, and Geoffrey Litt, Apr 27, 2023, It's Time To Build AI | UX. Bridging the Capability Overhang from Generative AI to Generative UI, https://www.latent.space/p/build-ai-ux
Jared Spataro, November 19, 2024, Introducing Copilot Actions, new agents, and tools to empower IT teams, https://www.microsoft.com/en-us/microsoft-365/blog/2024/11/19/introducing-copilot-actions-new-agents-and-tools-to-empower-it-teams/ ("Copilot is the UI for AI")
Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
Emilia David, December 10, 2024, OpenAI expands ChatGPT Canvas to all users, https://venturebeat.com/ai/openai-expands-chatgpt-canvas-to-all-users/
Anshul Ramachandran, Jul 08, 2023, How to Make AI UX Your Moat. Design great AI Products that go beyond "just LLM Wrappers": make AI more present, more practical, and then more powerful. https://www.latent.space/p/ai-ux-moat
Sabrina Ortiz, Dec. 13, 2024, ChatGPT finally gets easier to organize on the 7th day of OpenAI, https://www.zdnet.com/article/chatgpt-finally-gets-easier-to-organize-on-the-7th-day-of-openai/
Google, Dec 2024, multimodal-live-api-web-console: A react-based starter app for using the Multimodal Live API over websockets with Gemini, https://github.com/google-gemini/multimodal-live-api-web-console
Latent Space, Jan 05, 2025, AI Engineering for Art — with comfyanonymous, of ComfyUI, Using models for "Art Engineering", building hard to use UIs, and how image generation is moving from text boxes to DAGs https://www.latent.space/p/comfyui
comfyanonymous, Jan 2025, ComfyUI: The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface, https://github.com/comfyanonymous/ComfyUI
Sharon Goldman, December 13, 2023, Lightning AI debuts ‘iPhone approach’ to new AI dev platform, https://venturebeat.com/ai/lightning-ai-debuts-iphone-approach-to-new-ai-dev-platform/
Anthropic, 7 Mar 2025, Get to production faster with the upgraded Anthropic Console, https://www.anthropic.com/news/upgraded-anthropic-console
Rex Woodbury, Jun 18, 2025, The Opportunities in Consumer AI: Mapping Where to Build + Examining Changes in Product Design & Business Model, https://www.digitalnative.tech/p/the-opportunities-in-consumer-ai
Sameer Singh, June 4, 2025, Stop Asking, Start Showing: Why GUIs Still Win in the Age of AI, https://www.speedinvest.com/blog/consumer-ai-cognitive-load-and-the-gui
Maximilian Schreiner, Jun 26, 2025, Anthropics Claude can now build AI apps, https://the-decoder.com/anthropics-claude-can-now-build-ai-apps/
Kenneth Wolters, Aug 12, 2025, No AGI in Sight: What This Means for LLMs, https://kennethwolters.com/posts/no-agi/
MKWriteshere, Aug 2025, Microsoft Just Solved AI’s Biggest Problem: Why Magentic-UI Changes Everything: How human-AI collaboration beats pure automation every time with 71% better results, https://pub.towardsai.net/microsoft-just-solved-ais-biggest-problem-why-magentic-ui-changes-everything-ae09b5d09223

Declarative Programming

Declarative programming is the method of creating apps by defining what to do, rather than how to do it. The language to define a declarative app is more like a configuration file, rather than a procedural programming language like C++.

Research on declarative programming issues:

S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
Manpreet Singh, Oct 31, 2024, Let's Simplifying How We Talk to AI Using Prompt Declaration Language (PDL), https://pub.towardsai.net/lets-simplifying-how-we-talk-to-ai-using-prompt-declaration-language-pdl-b1824c4de833
Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product
C Liu, M Russo, M Cafarella, L Cao, PB Chen, Z Chen, Jan 2025, Palimpzest: Optimizing AI-Powered Analytics with Declarative Query Processing, https://vldb.org/cidrdb/papers/2025/p12-liu.pdf
Geoffrey Huntley AGENT.md: The Universal Agent Configuration File, July 2025 Request for Comments, https://ampcode.com/AGENT.md
Mariya Mansurova, Aug 18, 2025, Programming, Not Prompting: A Hands-on Guide to DSPy: A practical deep dive into declarative AI programming, https://miptgirl.medium.com/programming-not-prompting-a-hands-on-guide-to-dspy-04ea2d966e6d
Sohaib Imran, Rob Lamb, Peter M. Atkinson, 1 Aug 2025, Out-of-Context Abduction: LLMs Make Inferences About Procedural Data Leveraging Declarative Facts in Earlier Training Data, https://arxiv.org/abs/2508.00741
Alexander W. Lee, Justin Chan, Michael Fu, Nicolas Kim, Akshay Mehta, Deepti Raghavan, Ugur Cetintemel, 7 Aug 2025, Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems, https://arxiv.org/abs/2503.00600
Parker Glenn, Alfy Samuel, Daben Liu, 24 Sep 2025, Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative Programs, https://arxiv.org/abs/2509.20208
Mostapha Kalami Heris, 21 Oct 2025, Prompt Decorators: A Declarative and Composable Syntax for Reasoning, Formatting, and Control in LLMs, https://arxiv.org/abs/2510.19850
Elham Khabiri, Jeffrey O. Kephart, Fenno F. Heath III, Srideepika Jayaraman, Fateh A. Tipu, Yingjie Li, Dhruv Shah, Achille Fokoue, Anu Bhamidipaty, 18 Oct 2025, Declarative Techniques for NL Queries over Heterogeneous Data, https://arxiv.org/abs/2510.16470
Yuan Wang, Mingyu Li and Haibo Chen, 6 Oct 2025, A Case for Declarative LLM-friendly Interfaces for Improved Efficiency of Computer-Use Agents, https://arxiv.org/abs/2510.04607

Script Languages

L. Zheng, L. Yin, Z. Xie, J. Huang, C. Sun, C. H. Yu, S. Cao, C. Kozyrakis, I. Stoica, J. E. Gonzalez et al., Dec 2023, Efficiently programming large language models using SGLang, arXiv preprint arXiv:2312.07104, 2023, https://arxiv.org/abs/2312.07104 (Uses a radix attention method, a trie or prefix tree, for KV caching.)
Hongzheng Chen, Niansong Zhang, Shaojie Xiang, Zhichen Zeng, Mengjia Dai, Zhiru Zhang, 7 Apr 2024, Allo: A Programming Model for Composable Accelerator Design, https://arxiv.org/abs/2404.04815
Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts, 5 Oct 2023, DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines, https://arxiv.org/abs/2310.03714 Code: https://github.com/stanfordnlp/dspy
Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si, 19 Jun 2024, APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts, https://arxiv.org/abs/2406.13161 Code: https://github.com/appl-team/appl (A Python-like script language for prompt engineering integration into applications and agents.)
Till Döhmen, 2024/10/17, Introducing the prompt() Function: Use the Power of LLMs with SQL! https://motherduck.com/blog/sql-llm-prompt-function-gpt-models/
Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
Yuka Ikarashi, Kevin Qian, Samir Droubi, Alex Reinking, Gilbert Bernstein, Jonathan Ragan-Kelley, 14 Nov 2024 (v2), Exo 2: Growing a Scheduling Language, https://arxiv.org/abs/2411.07211

API Architectures

Kyle Wiggers, September 16, 2024, Runway announces an API for its video-generating AI models, https://techcrunch.com/2024/09/16/runway-announces-an-api-for-its-video-generating-models/
Mistral, Sep 2024, AI in abundance. Introducing a free API, improved pricing across the board, a new enterprise-grade Mistral Small, and free vision capabilities on le Chat. https://mistral.ai/news/september-24-release/
Luma Labs, Sep 2024, Creative Intelligence platform for magical AI products, https://lumalabs.ai/dream-machine/api (API to access video models.)
Simon Willison, Sep 2024, How streaming LLM APIs work, https://til.simonwillison.net/llms/streaming-llm-apis
Carl Franzen, September 27, Cohere updates APIs to make it easier for devs to switch from other models, https://venturebeat.com/ai/cohere-updates-apis-to-make-it-easier-for-devs-to-switch-from-other-models/
Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, 25 Sep 2024, Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents, https://arxiv.org/abs/2409.17140
Michael Nuñez, September 25, 2024, AI for all: Meta’s ‘Llama Stack’ promises to simplify enterprise adoption, https://venturebeat.com/ai/ai-for-all-meta-llama-stack-promises-to-simplify-enterprise-ai-adoption/
Kyle Wiggers, October 3, 2024, Black Forest Labs, the startup behind Grok’s image generator, releases an API, https://techcrunch.com/2024/10/03/black-forest-labs-the-startup-behind-groks-image-generator-releases-an-api/
Kyle Wiggers, October 21, 2024, xAI, Elon Musk’s AI startup, launches an API, https://techcrunch.com/2024/10/21/xai-elon-musks-ai-startup-launches-an-api/
X AI, November 4, 2024 API Public Beta, https://x.ai/blog/api
Gemini is now accessible from the OpenAI Library NOV 08, 2024 Logan Kilpatrick, https://developers.googleblog.com/en/gemini-is-now-accessible-from-the-openai-library/
Kwindla Hultman Kramer and swyx & Alessio, Nov 22, 2024, OpenAI Realtime API: The Missing Manual, Latent Space, https://www.latent.space/p/realtime-api
Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su, 22 Feb 2024, Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments, https://arxiv.org/abs/2402.14672
Andrew Ng, Nov 2024, Simple, unified interface to multiple Generative AI providers, https://github.com/andrewyng/aisuite
Asif Razzaq, November 29, 2024, Andrew Ng’s Team Releases ‘aisuite’: A New Open Source Python Library for Generative AI, https://www.marktechpost.com/2024/11/29/andrew-ngs-team-releases-aisuite-a-new-open-source-python-library-for-generative-ai/
Paul Krill Dec 05, 2024, OpenAI unveils API for tracking OpenAI API usage, costs, https://www.infoworld.com/article/3618202/openai-unveils-api-for-tracking-openai-api-usage-costs.html
Outlore, Dec 14, 2024, Reflections on building with Model Context Protocol (MCP), https://outlore.dev/blog/model-context-protocol/
Google, Dec 2024, multimodal-live-api-web-console: A react-based starter app for using the Multimodal Live API over websockets with Gemini, https://github.com/google-gemini/multimodal-live-api-web-console
Anthropic, 24 Jan 2025, Introducing Citations on the Anthropic API, https://www.anthropic.com/news/introducing-citations-api
OpenVINO™ toolkit, Nov 22, 2024, How to generate images locally on AI PC with OpenVINO GenAI API, https://medium.com/openvino-toolkit/how-to-generate-images-locally-on-ai-pc-with-openvino-genai-api-220d08370958
Anirban Ghoshal, 06 Feb 2025, NetSuite adds new AI capabilities to improve enterprise workflows, https://www.cio.com/article/3818405/netsuite-adds-new-ai-capabilities-to-improve-enterprise-workflows.html
Mandar Karhade, Feb 2025, Tired of LLM Chaos? LiteLLM Should Be Your Default. Stop juggling multiple LLM APIs and their “standards”. https://pub.towardsai.net/tired-of-llm-chaos-litellm-should-be-your-default-e04730b3c33c
Reuters, February 26, 2025, DeepSeek cuts off-peak pricing for developers by up to 75%, https://www.reuters.com/technology/chinas-deepseek-cuts-off-peak-pricing-by-up-75-2025-02-26/
Alex Fazio, Feb 2025, How to Build an LLM Chat App: The New Litmus Test for Junior Devs, https://x.com/alxfazio/status/1893242657331101976 (How to build a wrapper chat app that scales by taking care of message queueing, with RabbitMQ or Kafka API rate limits, history database management, in-memory caching with Redis, load balancing, and other real-world deployment issues.)
Chaoyun Zhang, Shilin He, Liqun Li, Si Qin, Yu Kang, Qingwei Lin, Dongmei Zhang, 14 Mar 2025, API Agents vs. GUI Agents: Divergence and Convergence, https://arxiv.org/abs/2503.11069
Marcus Mendes, May 20 2025, Apple to let developers build with its AI models starting at WWDC 2025, https://9to5mac.com/2025/05/20/apple-to-let-developers-build-with-its-own-ai-models/
Carl Franzen, May 21, 2025, OpenAI updates its new Responses API rapidly with MCP support, GPT-4o native image gen, and more enterprise features, https://venturebeat.com/programming-development/openai-updates-its-new-responses-api-rapidly-with-mcp-support-gpt-4o-native-image-gen-and-more-enterprise-features/
Kyle Wiggers, April 29, 2025, Meta previews an API for its Llama AI models, https://techcrunch.com/2025/04/29/meta-previews-an-api-for-its-llama-ai-models/
Anthropic, May 2025, New capabilities for building agents on the Anthropic API, https://www.anthropic.com/news/agent-capabilities-api
Peter Wayner, Jun 9, 2025, 9 APIs you’ll love for AI integrations and automated workflows, https://www.infoworld.com/article/3999600/9-apis-youll-love-for-ai-integrations-and-automated-workflows.html
Peer Trilcke, Ingo B\"orner, Henny Sluyter-G\"athje, Daniil Skorinkin, Frank Fischer, Carsten Milling, 19 Aug 2025, Agentic DraCor and the Art of Docstring Engineering: Evaluating MCP-empowered LLM Usage of the DraCor API, https://arxiv.org/abs/2508.13774
Zhenchang Xing, Yang Liu, Zhuo Cheng, Qing Huang, Dehai Zhao, Daniel Sun, Chenhua Liu, 9 Aug 2025, When Prompt Engineering Meets Software Engineering: CNL-P as Natural and Robust "APIs'' for Human-AI Interaction, https://arxiv.org/abs/2508.06942
Ziyao Wang, Guoheng Sun, Yexiao He, Zheyu Shen, Bowei Tian, Ang Li, 29 Jul 2025, Predictive Auditing of Hidden Tokens in LLM APIs via Reasoning Length Estimation, https://arxiv.org/abs/2508.00912
Zainab Khan, Ahmed Hussain, Mukesh Thakur, Arto Hellas, and Panos Papadimitratos, 12 Aug 2025, NEFMind: Parameter-Efficient Fine-Tuning of Open-Source LLMs for Telecom APIs Automation, https://arxiv.org/abs/2508.09240
Ajoy Das, Gias Uddin, Shaiful Chowdhury, Mostafijur Rahman Akhond, Hadi Hemmati, 22 Aug 2025, Applications and Challenges of Fairness APIs in Machine Learning Software, https://arxiv.org/abs/2508.16377
Jack Youstra, Mohammed Mahfoud, Yang Yan, Henry Sleight, Ethan Perez, Mrinank Sharma, 23 Aug 2025, Towards Safeguarding LLM Fine-tuning APIs against Cipher Attacks, https://arxiv.org/abs/2508.17158
Yuanchun Wang, Jifan Yu, Zijun Yao, Jing Zhang, Yuyang Xie, Shangqing Tu, Yiyang Fu, Youhe Feng, Jinkai Zhang, Jingyao Zhang, Bowen Huang, Yuanyao Li, Huihui Yuan, Lei Hou, Juanzi Li and Jie Tang, 28 Aug 2025, SoAy: A Solution-based LLM API-using Methodology for Academic Information Seeking, https://arxiv.org/abs/2405.15165
Seungkyu Lee, Nalim Kim, Yohan Jo, 1 Sep 2025, In-N-Out: A Parameter-Level API Graph Dataset for Tool Agents, https://arxiv.org/abs/2509.01560
Thiago Barradas, Aline Paes and V\^ania de Oliveira Neves, 5 Sep 2025, Combining TSL and LLM to Automate REST API Testing: A Comparative Study, https://arxiv.org/abs/2509.05540
AmirHossein Naghshzan, 6 Sep 2025, Automating API Documentation with LLMs: A BERTopic Approach, https://arxiv.org/abs/2509.05749
Jayachandu Bandlamudi, Ritwik Chaudhuri, Neelamadhav Gantayat, Sambit Ghosh, Kushal Mukherjee, Prerna Agarwal, Renuka Sindhgatta, Sameep Mehta, 12 Sep 2025, A Framework for Testing and Adapting REST APIs as LLM Tools, https://arxiv.org/abs/2504.15546
Prerna Agarwal, Himanshu Gupta, Soujanya Soni, Rohith Vallam, Renuka Sindhgatta, Sameep Mehta, 15 Sep 2025, Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools, https://arxiv.org/abs/2509.11626
Daniel Maninger, Leon Chemnitz, Amir Molzam Sharifloo, Jannis Brugger, Mira Mezini, 24 Sep 2025, Benchmarking Web API Integration Code Generation, https://arxiv.org/abs/2509.20172
Juhyeong Kim, Yejin Kim, Youngbin Lee and Hyunwoo Byun, 21 Oct 2025, FinAI Data Assistant: LLM-based Financial Database Query Processing with the OpenAI Function Calling API, https://arxiv.org/abs/2510.14162
Zishuo Xu, Yuhong Gu, Dezhong Yao, 27 Sep 2025, WARBERT: A Hierarchical BERT-based Model for Web API Recommendation, https://arxiv.org/abs/2509.23175
Will Cai, Tianneng Shi, Xuandong Zhao, Dawn Song, 29 Sep 2025, Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs, https://arxiv.org/abs/2504.04715
Chenwei Tang, Jingyu Xing, Xinyu Liu, Zizhou Wang, Jiawei Du, Liangli Zhen, Jiancheng Lv, 17 Oct 2025, Experience-Driven Exploration for Efficient API-Free AI Agents, https://arxiv.org/abs/2510.15259
Xander Davies, Eric Winsor, Alexandra Souly, Tomek Korbak, Robert Kirk, Christian Schroeder de Witt, Yarin Gal, 24 Oct 2025, Fundamental Limitations in Pointwise Defences of LLM Finetuning APIs, https://arxiv.org/abs/2502.14828
Hua Zhong, Shan Jiang, Sarfraz Khurshid, 29 Aug 2025, APRIL: API Synthesis with Automatic Prompt Optimization and Reinforcement Learning, https://arxiv.org/abs/2509.25196
Esakkivel Esakkiraja, Denis Akhiyarov, Aditya Shanmugham, Chitra Ganapathy, 30 Sep 2025, DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation, https://arxiv.org/abs/2509.25716
Hudson de Martim, 7 Oct 2025, Deterministic Legal Retrieval: An Action API for Querying the SAT-Graph RAG, https://arxiv.org/abs/2510.06002

Plugins

Reyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang, 2024, INFERCEPT: Efficient Intercept Support for Augmented Large Language Model Inference, https://openreview.net/pdf?id=wDDGQabYPQ
Zile Qiao, Wei Ye, Yong Jiang, Tong Mo, Pengjun Xie, Weiping Li, Fei Huang, Shikun Zhang, 12 Jun 2024, Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling, https://arxiv.org/abs/2406.08116
Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman, 1 Jun 2022 (v3), WebGPT: Browser-assisted question-answering with human feedback, https://arxiv.org/abs/2112.09332
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Sreedevi Gogusetty, Dec 6, 2024, From RAG to TAG: Leveraging the Power of Table-Augmented Generation (TAG): A Leap Beyond Retrieval-Augmented Generation (RAG), https://ai.plainenglish.io/from-rag-to-tag-leveraging-the-power-of-table-augmented-generation-tag-a-leap-beyond-54d1cfadb994 (TAG for augmenting LLMs with queries from database tables, similar to data source plugins.)
Kyoungmin Kim, Anastasia Ailamaki, 23 Dec 2024, Trustworthy and Efficient LLMs Meet Databases, https://arxiv.org/abs/2412.18022
Connor Shorten, Charles Pierse, Thomas Benjamin Smith, Karel D'Oosterlinck, Tuana Celik, Erika Cardenas, Leonie Monigatti, Mohd Shukri Hasan, Edward Schmuhl, Daniel Williams, Aravind Kesiraju, Bob van Luijt, 23 Jan 2025, Querying Databases with Function Calling, https://arxiv.org/abs/2502.00032
Dr. Ashish Bamania, Feb 2025, The Open Source “Agentic Reasoning” Beats Google Gemini Deep Research. A deep dive into how the “Agentic Reasoning” framework works and the techniques behind it that make it outperform the most advanced reasoning LLMs today. https://levelup.gitconnected.com/the-open-source-agentic-reasoning-beats-google-gemini-deep-research-8ed8d9d07176
Minhua Lin, Hui Liu, Xianfeng Tang, Jingying Zeng, Zhenwei Dai, Chen Luo, Zheng Li, Xiang Zhang, Qi He, Suhang Wang, 26 Feb 2025 (v2), How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities, https://arxiv.org/abs/2502.18387
Krish Arvapally, Mar 2025, The End of AI Scraping? A Better Way to Unlock Data at the Point of Inference with RAG & MCP, https://medium.com/@arvapallykrish/the-end-of-ai-scraping-a-better-way-to-unlock-data-at-the-point-of-inference-with-rag-mcp-6cbb141a5765
Kyle Wiggers, March 17, 2025, OpenAI to start testing ChatGPT connectors for Google Drive and Slack, https://techcrunch.com/2025/03/17/openai-to-start-testing-chatgpt-connectors-for-google-drive-and-slack/
Michael Banf and Johannes Kuhn, 22 Aug 2025, Tripartite-GraphRAG via Plugin Ontologies, https://arxiv.org/abs/2504.19667

Custom AI Apps

Gino Zambe, Feb 1, 2024, Was The GPT store a failure? https://medium.com/@ginozambe/was-the-gpt-store-a-failure-d2a2379fdfc1
OpenAI, November 6, 2023 Introducing GPTs, OpenAI Blog, https://openai.com/blog/introducing-gpts
Lance Whitney, June 12, 2024, Microsoft scraps Copilot Pro GPT Builder after just 3 months - how to save your work, https://www.zdnet.com/article/microsoft-scraps-copilot-pro-gpt-builder-after-just-3-months-how-to-save-your-work/
Reuters, July 30, 2024, Meta to let users to create custom AI characters, https://www.reuters.com/technology/artificial-intelligence/meta-let-users-create-custom-ai-characters-2024-07-29/
Lucas Mearian, 27 Aug 2024, BCG execs: AI across the company increased productivity, ‘employee joy’, https://www.computerworld.com/article/3491334/bcg-execs-ai-across-the-company-increased-productivity-employee-joy.html
Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu, 8 May 2024 (v2), Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security, https://arxiv.org/abs/2401.05459 https://github.com/MobileLLM/Personal_LLM_Agents_Survey
Emilia David, August 30, 2024, OpenAI gives developers more control over AI assistants, https://venturebeat.com/ai/openai-gives-developers-more-control-over-ai-assistants/
Henrique Centieiro & Bee Lee, Aug 2024, Build Your Own Money-Making Personal AI Bot: An Easy Step-by-Step Guide to Creating and Monetizing Your Personal AI Bot on Poe, https://medium.com/limitless-investor/build-your-own-money-making-personal-ai-bot-9810e3175699
OpenAI, January 10, 2024, Introducing the GPT Store, https://openai.com/index/introducing-the-gpt-store/
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
Xinyi Hou, Yanjie Zhao, Haoyu Wang, 3 Aug 2024, Voices from the Frontier: A Comprehensive Analysis of the OpenAI Developer Forum, https://arxiv.org/abs/2408.01687
OpenAI, 2024, GPT Builder: What is the GPT Builder for in ChatGPT and why did we make it? https://help.openai.com/en/articles/8770868-gpt-builder
Xiang Chen, Chaoyang Gao, Chunyang Chen, Guangbei Zhang, Yong Liu, 12 Aug 2024 (v2), An Empirical Study on Challenges for LLM Developers, https://arxiv.org/abs/2408.05002
Nick: The AI Guru, Aug 15, 2024, Why Perplexity AI Has Been a Game Changer For Me, https://medium.com/@nickm9/why-perplexity-ai-has-been-a-game-changer-for-me-b38976bdc1b4
https://levelup.gitconnected.com/zero-to-hero-crafting-a-custom-gpt-e2ef22653b1f
Tiernan Ray, Sept. 4, 2024, Google's Gems are a gentle introduction to AI prompt engineering: Google's pre-built Gems offer prompt examples you can modify to get started with your own custom bot, https://www.zdnet.com/article/googles-gems-are-a-gentle-introduction-to-ai-prompt-engineering/
Chaojun Xiao, Zhengyan Zhang, Chenyang Song, Dazhi Jiang, Feng Yao, Xu Han, Xiaozhi Wang, Shuo Wang, Yufei Huang, Guanyu Lin, Yingfa Chen, Weilin Zhao, Yuge Tu, Zexuan Zhong, Ao Zhang, Chenglei Si, Khai Hao Moo, Chenyang Zhao, Huimin Chen, Yankai Lin, Zhiyuan Liu, Jingbo Shang, Maosong Sun, Sep 2024, Configurable Foundation Models: Building LLMs from a Modular Perspective, https://arxiv.org/pdf/2409.02877
Emilia David, September 10, 2024, ServiceNow introduces a library of enterprise AI agents you can customize to fit your workflow, https://venturebeat.com/ai/servicenow-introduces-a-library-of-enterprise-ai-agents-you-can-customize-to-fit-your-workflow/
Kyle Wiggers, February 25, 2025, Quora’s Poe now lets users create and share custom AI-powered apps, https://techcrunch.com/2025/02/25/quoras-poe-now-lets-users-create-and-share-custom-ai-powered-apps/

No Code/Low Code for AI Apps

Writer, Aug 2024 (accessed), Writer AI Studio: The fastest way to build AI apps, https://writer.com/product/ai-studio/
Isaac Sacolick, How to choose the right low-code, no-code, or process automation platform, Jul 29, 2024, https://www.infoworld.com/article/3476848/how-to-choose-the-right-low-code-no-code-or-process-automation-platform.html
Rebekah Carter, 2023, Gartner Magic Quadrant for Enterprise Low-Code Application Platforms 2023, https://www.cxtoday.com/loyalty-management/gartner-magic-quadrant-for-enterprise-low-code-application-platforms-2023/
Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi, 9 Aug 2024, AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems, https://arxiv.org/abs/2408.15247
Yanxi Chen, Yaliang Li, Bolin Ding, Jingren Zhou, 20 Jul 2024, On the Design and Analysis of LLM-Based Algorithms, https://arxiv.org/abs/2407.14788 https://github.com/modelscope/agentscope/tree/main/examples/paper_llm_based_algorithm
Reddit, 2024, New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio, https://www.reddit.com/r/LocalLLaMA/comments/12yc8op/new_open_source_framework_and_nocode_gui_for/
Yuzhe Cai, Shaoguang Mao, Wenshan Wu, Zehua Wang, Yaobo Liang, Tao Ge, Chenfei Wu, Wang You, Ting Song, Yan Xia, Jonathan Tien, Nan Duan, Furu Wei, 1 Apr 2024 (v3), Low-code LLM: Graphical User Interface over Large Language Models, https://arxiv.org/abs/2304.08103 https://github.com/chenfei-wu/TaskMatrix/tree/main/LowCodeLLM https://www.youtube.com/watch?v=jb2C1vaeO3E
Zelong Li, Shuyuan Xu, Kai Mei, Wenyue Hua, Balaji Rama, Om Raheja, Hao Wang, He Zhu, Yongfeng Zhang, 1 Jul 2024, AutoFlow: Automated Workflow Generation for Large Language Model Agents, https://arxiv.org/abs/2407.12821 https://github.com/agiresearch/AutoFlow
Xin Pang, Zhucong Li, Jiaxiang Chen, Yuan Cheng, Yinghui Xu, Yuan Qi, 7 Apr 2024, AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications, https://arxiv.org/abs/2404.04902
Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxiao Dong, Ming Ding, Jie Tang; 2024, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 14281-14290, https://arxiv.org/abs/2312.08914 https://openaccess.thecvf.com/content/CVPR2024/html/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.html https://openaccess.thecvf.com/content/CVPR2024/papers/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.pdf
Chuan Yan, Ruomai Ren, Mark Huasong Meng, Liuhuo Wan, Tian Yang Ooi, Guangdong Bai, 26 Aug 2024, Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security, https://arxiv.org/abs/2408.14357
S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
Google, Sep 2024, Supercharge your work with no-code. AppSheet helps you build powerful applications and automations that boost productivity. No coding required., https://about.appsheet.com/home/ (Google AppSheet no code platform.)
Matt Asay, Sep 23, 2024, Too much assembly required for AI, https://www.infoworld.com/article/3536292/too-much-assembly-required-for-ai.html
Shubham Sharma, October 8, 2024, Databricks now lets developers create AI apps in 5 minutes: Here’s how, https://venturebeat.com/data-infrastructure/databricks-now-lets-developers-create-ai-apps-in-5-minutes-heres-how/
Dr. Marcel Müller, Oct 18, 2024, No-Code Generative AI: How Companies Can Build Without Data Scientists, https://medium.com/deep-tech-innovation/no-code-generative-ai-how-companies-can-build-without-data-scientists-7e5ca851f2ba
Mandana Vaziri, Louis Mandel, Claudio Spiess, Martin Hirzel, 24 Oct 2024, PDL: A Declarative Prompt Programming Language, https://arxiv.org/abs/2410.19135
Saksham Goel, October 29, 2024, Build LLM/RAG pipelines with YAML templates by Pathway, https://pathway.com/blog/llm-yaml-templates
Microsoft, Dec 2024, Power Automate: A comprehensive, end-to-end cloud automation platform powered by low code and AI. https://www.microsoft.com/en-us/power-platform/products/power-automate
Orlando Marquez Ayala, Patrice Béchard, 29 Nov 2024, Generating a Low-code Complete Workflow via Task Decomposition and RAG, https://arxiv.org/abs/2412.00239
Iván Alfonso, Aaron Conrardy, Jordi Cabot, 6 Dec 2024, Towards the interoperability of low-code platforms, https://arxiv.org/abs/2412.05075
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product
Latent Space, Jan 05, 2025, AI Engineering for Art — with comfyanonymous, of ComfyUI, Using models for "Art Engineering", building hard to use UIs, and how image generation is moving from text boxes to DAGs https://www.latent.space/p/comfyui
comfyanonymous, Jan 2025, ComfyUI: The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface, https://github.com/comfyanonymous/ComfyUI
Dawei Gao, Zitao Li, Xuchen Pan, Weirui Kuang, Zhijian Ma, Bingchen Qian, Fei Wei, Wenhao Zhang, Yuexiang Xie, Daoyuan Chen, Liuyi Yao, Hongyi Peng, Zeyu Zhang, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou, 20 May 2024 (v2), AgentScope: A Flexible yet Robust Multi-Agent Platform, https://arxiv.org/abs/2402.14034 https://github.com/modelscope/agentscope
Joe McKendrick, Feb. 20, 2025, Brace yourself: The era of 'citizen developers' creating apps is here, thanks to AI, https://www.zdnet.com/article/brace-yourself-the-era-of-citizen-developers-creating-apps-is-here-thanks-to-ai/
Craig Le Clair, Oct 23 2024, Predictions 2025: GenAI, Citizen Developers, And Caution Influence Automation, https://www.forrester.com/blogs/predictions-2025-automation/
Hyunjn An, Yongwon Kim, Wonduk Seo, Joonil Park, Daye Kang, Changhoon Oh, Dokyun Kim, Seunghyun Lee, 4 Aug 2025, AIAP: A No-Code Workflow Builder for Non-Experts with Natural Language and Multi-Agent Collaboration, https://arxiv.org/abs/2508.02470
Jiexi Xu, 24 Sep 2025, Agentic Metacognition: Designing a "Self-Aware" Low-Code Agent for Failure Prediction and Human Handoff, https://arxiv.org/abs/2509.19783
Minh Nguyen, 14 Oct 2025, SpareCodeSearch: Searching for Code Context When You Have No Spare GPU, https://arxiv.org/abs/2510.12948
Sathvik Joel, Jie JW Wu, Fatemeh H. Fard, 26 Sep 2025, A Survey on LLM-based Code Generation for Low-Resource and Domain-Specific Programming Languages, https://arxiv.org/abs/2410.03981
Jiexi Xu, Jiaqi Liu, Lanruo Wang, Su Liu, 8 Oct 2025, Toward Causal-Visual Programming: Enhancing Agentic Reasoning in Low-Code Environments, https://arxiv.org/abs/2509.25282
Gursimran Singh, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Luca Longo, 31 Aug 2025, PyNoetic: A modular python framework for no-code development of EEG brain-computer interfaces, https://arxiv.org/abs/2509.00670
Shiyuan Guo, Henry Sleight, Fabien Roger, 10 Oct 2025, All Code, No Thought: Current Language Models Struggle to Reason in Ciphered Language, https://arxiv.org/abs/2510.09714
Jiexi Xu, 27 Sep 2025, Memory Management and Contextual Consistency for Long-Running Low-Code Agents, https://arxiv.org/abs/2509.25250

Miniapps

Kevin Lin, Sumant Guha, Joe Spaniac, Andy Zheng, 13 Nov 2020 (v3), Nifty Web Apps: Build a Web App for Any Text-Based Programming Assignment, https://arxiv.org/abs/2010.04671
Yuyang Han, Xu Ji, Zhiqiang Wang, Jianyi Zhang, 19 Nov 2023, Systematic Analysis of Security and Vulnerabilities in Miniapps, https://arxiv.org/abs/2311.11382
Shenao Wang, Yuekang Li, Kailong Wang, Yi Liu, Hui Li, Yang Liu, Haoyu Wang, 16 Jan 2024 (v2), MiniScope: Automated UI Exploration and Privacy Inconsistency Detection of MiniApps via Two-phase Iterative Hybrid Analysis, https://arxiv.org/abs/2401.03218
Chao Wang, Yue Zhang, Zhiqiang Lin, 13 Jun 2023, Uncovering and Exploiting Hidden APIs in Mobile Super Apps, https://arxiv.org/abs/2306.08134
Yuqing Yang, Chao Wang, Yue Zhang, Zhiqiang Lin, 13 Jun 2023, SoK: Decoding the Super App Enigma: The Security Mechanisms, Threats, and Trade-offs in OS-alike Apps, https://arxiv.org/abs/2306.07495
Ozgur Ozan Kilic, Tianle Wang, Matteo Turilli, Mikhail Titov, Andre Merzky, Line Pouchard, Shantenu Jha, 26 Mar 2024, Workflow Mini-Apps: Portable, Scalable, Tunable & Faithful Representations of Scientific Workflows, https://arxiv.org/abs/2403.18073
Liming Jiang, 12 Feb 2024, Utilizing Large LanguageModels to Detect Privacy Leaks in Mini-App Code, https://arxiv.org/abs/2402.07367
Yin Wang, Ming Fan, Junfeng Liu, Junjie Tao, Wuxia Jin, Qi Xiong, Yuhao Liu, Qinghua Zheng, Ting Liu, 27 Feb 2023, Do as You Say: Consistency Detection of Data Practice in Program Code and Privacy Policy in Mini-App, https://arxiv.org/abs/2302.13860
Thomas Steiner, 2024, What are mini apps? https://web.dev/articles/mini-apps/mini-app-about
Boxo, 2024, What is a Miniapp? A New Era for Apps, https://www.boxo.io/blog/what-is-a-miniapp
Electrode Native, 2024, What is a MiniApp, https://native.electrode.io/introduction/what-is-ern/what-is-a-miniapp
W3C, 2024, MiniApps Working Group, https://www.w3.org/2021/miniapps/
GMO Research, 22 March, 2023, The Rise of Super Apps , https://gmo-research.ai/en/news-events/articles/rise-super-apps
Grand View Research, 2023, Super Apps Market Size, Share & Trends Analysis Report By Platform (iOS, Android), By Device (Smartphone, Tablets), By Application, By End-user, By Region, And Segment Forecasts, 2023 - 2030, Report ID: GVR-4-68040-036-1, https://www.grandviewresearch.com/industry-analysis/super-apps-market-report
Lee Ying Shan, Nov 18 2024, Tencent challenges Amazon and Microsoft’s cloud dominance by tapping into its WeChat ecosystem, CNBC, https://www.cnbc.com/2024/11/18/tencent-is-contesting-microsoft-googles-cloud-dominance-with-wechat.html
Nicolás Cerdeira, December 17, 2024, The Rise of Mini Tools: Why AI-powered tools are the new go-to growth strategy. https://newsletter.failory.com/p/the-rise-of-mini-tools-
Nicolás Cerdeira, April 02, 2024, The Day Wise Created 250K Variants of a Calculator. How Wise used Programmatic SEO to obtain over 40.7M users/mo, https://newsletter.failory.com/p/day-wise-created-250k-variants-calculator
Tari Ibaba, Jan 2025, AI is killing apps, https://medium.com/coding-beauty/ai-is-killing-apps-868a7b59fafe
Jeff Huang, Jan 21 2025, Why U.S. tech companies struggle to replicate China’s WeChat ‘super app’ model, https://www.cnbc.com/2025/01/21/why-us-companies-struggle-to-replicate-chinas-wechat-super-app-.html https://www.cnbc.com/video/2025/01/21/why-the-us-doesnt-have-super-apps.html
Alex Heath, May 3, 2025, Sam Altman and Elon Musk are racing to build an ‘everything app’, https://www.theverge.com/command-line-newsletter/660674/sam-altman-elon-musk-everything-app-worldcoin-x
Jay Peters, Jun 26, 2025, Anthropic now lets you make apps right from its Claude AI chatbot, https://www.theverge.com/news/693342/anthropic-claude-ai-apps-artifact
Chris Welch, November 14, 2025, Apple Aims to Capitalize on Mini App Trend With New Program, https://www.bloomberg.com/news/articles/2025-11-13/apple-aims-to-capitalize-on-mini-app-trend-with-new-program (Apple "Mini Apps Partner Program" partnership 15% revenue of miniapps.)

Tabular Data Applications

Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, Christos Faloutsos, 1 Mar 2024 (v2), Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey, https://arxiv.org/abs/2402.17944
Weijia Wang, 2023, Efficient and Explainable Machine Learning Ph.D. thesis, University of California San Diego, https://escholarship.org/content/qt9q52g27p/qt9q52g27p_noSplash_70dba1eae3531240d1fec8e0cdaf1be2.pdf (Processing of tabular data is a weakness of GenAI models, and this thesis examines various issues of tabular data and rules-based processing.)
David Bonet, Daniel Mas Montserrat, Xavier Giró-i-Nieto, Alexander G. Ioannidis, HyperFast: Instant Classification for Tabular Data, 2023, NeurIPS 2023, https://openreview.net/pdf?id=VRBhaU8IDz
Irwin Deng, Kushagra Dixit, Vivek Gupta, Dan Roth, 22 Jul 2024, Enhancing Temporal Understanding in LLMs for Semi-structured Tables, https://arxiv.org/abs/2407.16030
Liang, X., Hu, R., Liu, Y., Zhu, K. (2024). Open-Domain Question Answering over Tables with Large Language Models. In: Huang, DS., Pan, Y., Guo, J. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science, vol 14873. Springer, Singapore. https://doi.org/10.1007/978-981-97-5615-5_28 https://link.springer.com/chapter/10.1007/978-981-97-5615-5_28
Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, Xinrun Du, Di Liang, Daixin Shu, Xianfu Cheng, Tianzhen Sun, Guanglin Niu, Tongliang Li, Zhoujun Li, 17 Aug 2024, TableBench: A Comprehensive and Complex Benchmark for Table Question Answering, https://www.arxiv.org/abs/2408.09174
Asim Biswal, Liana Patel, Siddarth Jha, Amog Kamsetty, Shu Liu, Joseph E. Gonzalez, Carlos Guestrin, Matei Zaharia, 27 Aug 2024, Text2SQL is Not Enough: Unifying AI and Databases with TAG, https://arxiv.org/abs/2408.14717 https://github.com/TAG-Research/TAG-Bench
Shubham Sharma, September 2, 2024, Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL, https://venturebeat.com/data-infrastructure/table-augmented-generation-shows-promise-for-complex-dataset-querying-outperforms-text-to-sql/
S Madden, M Cafarella, M Franklin, T Kraska, 2024, Databases Unbound: Querying All of the World’s Bytes with AI, https://www.vldb.org/pvldb/vol17/p4546-madden.pdf
Shubham Sharma, September 12, 2024, Google’s DataGemma AI is a statistics wizard, https://venturebeat.com/ai/datagemma-googles-open-ai-models-mitigate-hallucination-on-statistical-queries/
David Gewirtz, Sept. 16, 2024, Why natural language AI scripting in Microsoft Excel could be a game changer. What if you could run advanced Excel analyses with no coding skills? Here's how Microsoft's Copilot in Excel could use Python to allow you to do just that, https://www.zdnet.com/article/why-natural-language-ai-scripting-in-microsoft-excel-could-be-a-game-changer/
Xinyuan Lu, Liangming Pan, Yubo Ma, Preslav Nakov, Min-Yen Kan, 18 Sep 2024, TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning, https://arxiv.org/abs/2409.11724 https://github.com/XinyuanLu00/TART
Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang, 12 Jul 2024, SpreadsheetLLM: Encoding Spreadsheets for Large Language Models, https://arxiv.org/abs/2407.09025
Mukul Singh, Gust Verbruggen, Vu Le, and Sumit Gulwani. 2024. Tabularis Revilio: Converting Text to Tables. In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM '24). Association for Computing Machinery, New York, NY, USA, 4056–4060. https://doi.org/10.1145/3627673.3680000 https://dl.acm.org/doi/abs/10.1145/3627673.3680000
LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
Deyi Ji, Lanyun Zhu, Siqi Gao, Peng Xu, Hongtao Lu, Jieping Ye, Feng Zhao, 13 Nov 2024, Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding, https://arxiv.org/abs/2411.08516
Qwen: An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chengyuan Li, Dayiheng Liu, Fei Huang, Haoran Wei, Huan Lin, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Yang, Jiaxi Yang, Jingren Zhou, Junyang Lin, Kai Dang, Keming Lu, Keqin Bao, Kexin Yang, Le Yu, Mei Li, Mingfeng Xue, Pei Zhang, Qin Zhu, Rui Men, Runji Lin, Tianhao Li, Tingyu Xia, Xingzhang Ren, Xuancheng Ren, Yang Fan, Yang Su, Yichang Zhang, Yu Wan, Yuqiong Liu, Zeyu Cui, Zhenru Zhang, Zihan Qiu (additional authors not shown), 19 Dec 2024, Qwen2.5 Technical Report, https://arxiv.org/abs/2412.15115
Xiaoqiang Kang, Zimu Wang, Xiaobo Jin, Wei Wang, Kaizhu Huang, Qiufeng Wang, 20 Dec 2024, Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation, https://arxiv.org/abs/2412.15594 https://github.com/Jason8Kang/TELL
Zipeng Qiu, You Peng, Guangxin He, Binhang Yuan, Chen Wang, 29 Nov 2024, TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension, https://arxiv.org/abs/2411.19504
Mayi Xu, Yunfeng Ning, Yongqi Li, Jianhao Chen, Jintao Wen, Yao Xiao, Shen Zhou, Birong Pan, Zepeng Bao, Xin Miao, Hankun Kang, Ke Sun, Tieyun Qian, 2 Jan 2025, Reasoning based on symbolic and parametric knowledge bases: a survey, https://arxiv.org/abs/2501.01030 (Extensive survey of reasoning from CoT to knowledge graphs to table-based reasoning.)
FZ Subah, Oct 2025, Mitigating and Assessing Bias and Fairness in Large Language Model-Generated Synthetic Tabular Data, Masters Thesis, Department of Engineering, University of Cambridge, https://www.mlmi.eng.cam.ac.uk/files/2023-2024/fzs21_mitigating_2024.pdf
G Wang, S Zhang, T Zhan, Z Shen, J Li, X Hu, X Sun, Jan 2025, Unlocking the Mysteries of OpenAI o1: A Survey of the Reasoning Abilities of Large Language Models, https://openreview.net/pdf?id=J0ADLa2rNp
Connor Shorten, Charles Pierse, Thomas Benjamin Smith, Karel D'Oosterlinck, Tuana Celik, Erika Cardenas, Leonie Monigatti, Mohd Shukri Hasan, Edward Schmuhl, Daniel Williams, Aravind Kesiraju, Bob van Luijt, 23 Jan 2025, Querying Databases with Function Calling, https://arxiv.org/abs/2502.00032
Minchae Song, 21 May 2025, Enhancing RAG Performance by Representing Hierarchical Nodes in Headers for Tabular Data, IEEE Access, https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=11003975
Xiaohan Yu, Pu Jian, Chong Chen 12 Jun 2025, TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning, https://arxiv.org/abs/2506.10380 https://github.com/yxh-y/TableRAG/tree/main
Paul Gross, June 17, 2025, Double-Entry Ledgers: The Missing Primitive in Modern Software, https://pgrs.net/2025/06/17/double-entry-ledgers-missing-primitive-in-modern-software/
Eunbin Lee, Younghan Lee, Ho Bae, July 2025, A Systematic Framework for Enhancing Retrieval-Augmented Generation for Tabular Data, https://koreascience.kr/article/JAKO202519957603573.pdf
Daniel Beaglehole, David Holzm\"uller, Adityanarayanan Radhakrishnan, Mikhail Belkin, 12 Aug 2025, xRFM: Accurate, scalable, and interpretable feature learning models for tabular data, https://arxiv.org/abs/2508.10053
Lalitesh Morishetti, Abhay Kumar, Jonathan Scott, Kaushiki Nag, Gunjan Sharma, Shanu Vashishtha, Rahul Sridhar, Rohit Chatter, and Kannan Achan, 13 Aug 2025, Personalized Product Search Ranking: A Multi-Task Learning Approach with Tabular and Non-Tabular Data, https://arxiv.org/abs/2508.09636
Jihye Lee, Minseo Kang, and Dongha Kim, 14 Aug 2025, MIRRAMS: Learning Robust Tabular Models under Unseen Missingness Shifts, https://arxiv.org/abs/2507.08280
Jessup Byun, Xiaofeng Lin, Joshua Ward, Guang Cheng, 22 Jul 2025, Risk In Context: Benchmarking Privacy Leakage of Foundation Models in Synthetic Tabular Data Generation, https://arxiv.org/abs/2507.17066
Vinura Galwaduge, Jagath Samarabandu, 23 Jul 2025, Tabular Diffusion based Actionable Counterfactual Explanations for Network Intrusion Detection, https://arxiv.org/abs/2507.17161
Rafael Ayll\'on-Gavil\'an, David Guijo-Rubio, Antonio Manuel G\'omez-Orellana, David Guijo-Rubio, Francisco B\'erchez-Moreno, V\'ictor Manuel Vargas-Yun and Pedro A. Guti\'errez, 23 Jul 2025, TOC-UCO: a comprehensive repository of tabular ordinal classification datasets, https://arxiv.org/abs/2507.17348
Calvin McCarter, 23 Jul 2025, Unmasking Trees for Tabular Data, https://arxiv.org/abs/2407.05593
Chaoyi Zhu, Jiayi Tang, Juan F. P\'erez, Marten van Dijk, Lydia Y. Chen, 21 Jul 2025, DP-TLDM: Differentially Private Tabular Latent Diffusion Model, https://arxiv.org/abs/2403.07842
Eduardo Aguilar-Bejarano, Daniel Lea, Karthikeyan Sivakumar, Jimiama M. Mase, Reza Omidvar, Ruizhe Li, Troy Kettle, James Mitchell-White, Morgan R Alexander, David A Winkler, Grazziela Figueredo, 23 Jul 2025, Helix 1.0: An Open-Source Framework for Reproducible and Interpretable Machine Learning on Tabular Scientific Data, https://arxiv.org/abs/2507.17791
Shubham Mohole, Sainyam Galhotra, 23 Jul 2025, SIFOTL: A Principled, Statistically-Informed Fidelity-Optimization Method for Tabular Learning, https://arxiv.org/abs/2507.17979
Rana Alshaikh, Israa Alghanmi, Shelan Jeawak, 24 Jul 2025, AraTable: Benchmarking LLMs' Reasoning and Understanding of Arabic Tabular Data, https://arxiv.org/abs/2507.18442
Zheyu Zhang, Shuo Yang, Bardh Prenkaj, Gjergji Kasneci, 24 Jul 2025, Not All Features Deserve Attention: Graph-Guided Dependency Learning for Tabular Data Generation with Language Models, https://arxiv.org/abs/2507.18504
Aleksey Lapin, Igor Hromov, Stanislav Chumakov, Mile Mitrovic, Dmitry Simakov, Nikolay O. Nikitin, Andrey V. Savchenko, 17 Jul 2025, LightAutoDS-Tab: Multi-AutoML Agentic System for Tabular Data, https://arxiv.org/abs/2507.13413
Anh Nguyen, Sam Schafft, Nicholas Hale, John Alfaro, 21 Jul 2025, FASTGEN: Fast and Cost-Effective Synthetic Tabular Data Generation with LLMs, https://arxiv.org/abs/2507.15839
Andrey Sidorenko and Paul Tiwald, 8 Aug 2025, Privacy-Preserving Tabular Synthetic Data Generation Using TabularARGN, https://arxiv.org/abs/2508.06647
Zilong Zhao, Robert Birke, Aditya Kunar, Lydia Y. Chen, 11 Aug 2025, Fed-TGAN: Federated Learning Framework for Synthesizing Tabular Data, https://arxiv.org/abs/2108.07927
Md Rakibul Hasan, Md Zakir Hossain, Aneesh Krishna, Shafin Rahman, Tom Gedeon, 9 Aug 2025, TFMPathy: Tabular Foundation Model for Privacy-Aware, Generalisable Empathy Detection from Videos, https://arxiv.org/abs/2504.10808
Yaobin Ling, Xiaoqian Jiang, Yejin Kim, 28 Jul 2025, MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data, https://arxiv.org/abs/2406.10521
Xuechen Li, Yupeng Li, Jian Liu, Xiaolin Jin and Xin Hu, 29 Jul 2025, Multi-branch of Attention Yields Accurate Results for Tabular Data, https://arxiv.org/abs/2502.12507
Sophie Kearney, Shu Yang, Zixuan Wen, Bojian Hou, Duy Duong-Tran, Tianlong Chen, Jason Moore, Marylyn Ritchie, Li Shen, 31 Jul 2025, Enabling Few-Shot Alzheimer's Disease Diagnosis on Tabular Biomarker Data with LLMs, https://arxiv.org/abs/2507.23227
Patricia A. Apell\'aniz and Ana Jim\'enez and Borja Arroyo Galende and Juan Parras and Santiago Zazo, 31 Jul 2025, Artificial Inductive Bias for Synthetic Tabular Data Generation in Data-Scarce Scenarios, https://arxiv.org/abs/2407.03080
Leonidas Akritidis, Panayiotis Bozanis, 1 Aug 2025, A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces, https://arxiv.org/abs/2508.00472
Ivona Krchova, Mariana Vargas Vieyra, Mario Scriminaci, Andrey Sidorenko, 1 Aug 2025, Democratizing Tabular Data Access with an Open$\unicode{x2013}$Source Synthetic$\unicode{x2013}$Data SDK, https://arxiv.org/abs/2508.00718
Timur Sattarov, Marco Schreyer, Damian Borth, 1 Aug 2025, Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data, https://arxiv.org/abs/2508.00758
Xiaofeng Wu, Alan Ritter, Wei Xu, 31 Jul 2025, Tabular Data Understanding with LLMs: A Survey of Recent Advances and Challenges, https://arxiv.org/abs/2508.00217
Patrik Kenfack, Samira Ebrahimi Kahou, Ulrich A\"ivodji, 1 Aug 2025, Towards Fair In-Context Learning with Tabular Foundation Models, https://arxiv.org/abs/2505.09503
Mengshi Chen, Yuxiang Sun, Tengchao Li, Jianwei Wang, Kai Wang, Xuemin Lin, Ying Zhang, Wenjie Zhang, 3 Aug 2025, Empowering Tabular Data Preparation with Language Models: Why and How?, https://arxiv.org/abs/2508.01556
Siyi Liu, Yujia Zheng, Yongqi Zhang, 4 Aug 2025, StructSynth: Leveraging LLMs for Structure-Aware Tabular Data Synthesis in Low-Data Regimes, https://arxiv.org/abs/2508.02601
Riccardo Francia, Maurizio Leone, Giorgio Leonardi, Stefania Montani, Marzio Pennisi, Manuel Striani, Sandra D'Alfonso, 4 Aug 2025, AutoML-Med: A Framework for Automated Machine Learning in Medical Tabular Data, https://arxiv.org/abs/2508.02625
Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen, Xiaoge Zhang, Kaiyu Tang, Xiao Li, Jing Li, 2 Aug 2025, A Closer Look on Memorization in Tabular Diffusion Model: A Data-Centric Perspective, https://arxiv.org/abs/2505.22322
Youran Zhou, Mohamed Reda Bouadjenek, Sunil Aryal, 5 Aug 2025, MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation, https://arxiv.org/abs/2508.03083
Mengao Zhang, Jiayu Fu, Tanya Warrier, Yuwen Wang, Tianhui Tan, Ke-wei Huang, 7 Aug 2025, FAITH: A Framework for Assessing Intrinsic Tabular Hallucinations in finance, https://arxiv.org/abs/2508.05201
Yunbo Long, Liming Xu, Alexandra Brintrup, 7 Aug 2025, LLM-TabLogic: Preserving Inter-Column Logical Relationships in Synthetic Tabular Data via Prompt-Guided Latent Diffusion, https://arxiv.org/abs/2503.02161
Ruiyu Zhang, Ce Zhao, Xin Zhao, Lin Nie, Wai-Fung Lam, 8 Aug 2025, Structural Equation-VAE: Disentangled Latent Representations for Tabular Data, https://arxiv.org/abs/2508.06347
Arshia Ilaty, Hossein Shirazi, Hajar Homayouni, 11 Aug 2025, SynLLM: A Comparative Analysis of Large Language Models for Medical Tabular Synthetic Data Generation via Prompt Engineering, https://arxiv.org/abs/2508.08529
Adri\'an Gude, Roi Santos-R\'ios, Francisco Prado-Vali\~no, Ana Ezquerro, Jes\'us Vilares, 12 Aug 2025, LyS at SemEval 2025 Task 8: Zero-Shot Code Generation for Tabular QA, https://arxiv.org/abs/2508.09012
Peng Wang, Dongsheng Wang, He Zhao, Hangting Ye, Dandan Guo, Yi Chang, 12 Aug 2025, LLM Empowered Prototype Learning for Zero and Few-Shot Tasks on Tabular Data, https://arxiv.org/abs/2508.09263
Viacheslav Barkov, Jonas Schmidinger, Robin Gebbers, Martin Atzmueller, 13 Aug 2025, Modern Neural Networks for Small Tabular Datasets: The New Default for Field-Scale Digital Soil Mapping?, https://arxiv.org/abs/2508.09888
Nitish Nagesh, Salar Shakibhamedan, Mahdi Bagheri, Ziyu Wang, Nima TaheriNejad, Axel Jantsch, Amir M. Rahmani, 15 Aug 2025, FairTabGen: Unifying Counterfactual and Causal Fairness in Synthetic Tabular Data Generation, https://arxiv.org/abs/2508.11810
Andr\'es Guzm\'an-Cordero, Floor Eijkelboom, Jan-Willem van de Meent, 15 Aug 2025, Exponential Family Variational Flow Matching for Tabular Data Generation, https://arxiv.org/abs/2506.05940
Bastian Sch\"afer and Lennart Purucker and Maciej Janowski and Frank Hutter, 19 Aug 2025, How Usable is Automated Feature Engineering for Tabular Data?, https://arxiv.org/abs/2508.13932
Marco Spinaci, Marek Polewczyk, Maximilian Schambach, Sam Thelin, 19 Aug 2025, ConTextTab: A Semantics-Aware Tabular In-Context Learner, https://arxiv.org/abs/2506.10707
Anirudh Sundar, Christopher Richardson, Adar Avsian, Larry Heck, 19 Aug 2025, iTBLS: A Dataset of Interactive Conversations Over Tabular Information, https://arxiv.org/abs/2404.12580
Pablo G. Almeida, Guilherme A. L. Silva, Val\'eria Santos, Gladston Moreira, Pedro Silva and Eduardo Luz, 9 Aug 2025, Deep Learning for School Dropout Detection: A Comparison of Tabular and Graph-Based Models for Predicting At-Risk Students, https://arxiv.org/abs/2508.14057
Vishnou Vinayagame, Gregory Senay, and Luis Mart\'i, 20 Aug 2025, MATATA: Weakly Supervised End-to-End MAthematical Tool-Augmented Reasoning for Tabular Applications, https://arxiv.org/abs/2411.18915
Weijie Niu, Alberto Huertas Celdran, Karoline Siarsky, Burkhard Stiller, 22 Aug 2025, FEST: A Unified Framework for Evaluating Synthetic Tabular Data, https://arxiv.org/abs/2508.16254
Manar D. Samad, Kazi Fuad B. Akhter, Shourav B. Rabbani, Ibna Kowsar, 22 Aug 2025, Imputation Not Required in Incremental Learning of Tabular Data with Missing Values, https://arxiv.org/abs/2504.14610
Nikolaos Pavlidis, Vasilis Perifanis, Symeon Symeonidis, Pavlos S. Efraimidis, 24 Aug 2025, Large Language Models as Universal Predictors? An Empirical Study on Small Tabular Datasets, https://arxiv.org/abs/2508.17391
Kiran Madhusudhanan, Vijaya Krishna Yalavarthi, Jonas Sonntag, Maximilian Stubbemann, Lars Schmidt-Thieme, 23 Aug 2025, TabResFlow: A Normalizing Spline Flow Model for Probabilistic Univariate Tabular Regression, https://arxiv.org/abs/2508.17056
Yilang Ding, Jiawen Ren, Jiaying Lu, Gloria Hyunjung Kwak, Armin Iraji, Alex Fedorov, 25 Aug 2025, Longitudinal Progression Prediction of Alzheimer's Disease with Tabular Foundation Model, https://arxiv.org/abs/2508.17649
Jiyoon Myung, Jihyeon Park, Joohyung Han, 25 Aug 2025, HyST: LLM-Powered Hybrid Retrieval over Semi-Structured Tabular Data, https://arxiv.org/abs/2508.18048
Harshit Dhankhar and Kshitij Mishra and Tejas Bodas, 25 Aug 2025, Tabular and Deep Reinforcement Learning for Gittins Index, https://arxiv.org/abs/2405.01157
Beno\^it Ronval, Pierre Dupont, Siegfried Nijssen, 4 Sep 2025, TAGAL: Tabular Data Generation using Agentic LLM Methods, https://arxiv.org/abs/2509.04152
Nikolay Kartashev, Ivan Rubachev, Artem Babenko, 4 Sep 2025, Unveiling the Role of Data Uncertainty in Tabular Deep Learning, https://arxiv.org/abs/2509.04430
Zhenyu Wu, Jiaoyan Chen and Norman W. Paton, 4 Sep 2025, Schema Inference for Tabular Data Repositories Using Large Language Models, https://arxiv.org/abs/2509.04632
Chufan Gao, Jintai Chen, Jimeng Sun, 26 Aug 2025, Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding, https://arxiv.org/abs/2508.18676
Tianyu Wang, Jiashuo Liu, Peng Cui, Hongseok Namkoong, 25 Aug 2025, Rethinking Distribution Shifts: Empirical Analysis and Inductive Modeling for Tabular Data, https://arxiv.org/abs/2307.05284
A. Yark{\i}n Y{\i}ld{\i}z and Asli Kalayci, 25 Aug 2025, Gradient Boosting Decision Trees on Medical Diagnosis over Tabular Data, https://arxiv.org/abs/2410.03705
Kushal Raj Bhandari, Sixue Xing, Soham Dan, Jianxi Gao, 26 Aug 2025, Exploring the Robustness of Language Models for Tabular Question Answering via Attention Analysis, https://arxiv.org/abs/2406.12719
Wangyang Ying, Nanxu Gong, Dongjie Wang, Xinyuan Wang, Arun Vignesh Malarkkan, Vivek Gupta, Chandan K. Reddy, Yanjie Fu, 27 Aug 2025, Distribution Shift Aware Neural Tabular Learning, https://arxiv.org/abs/2508.19486
Aamod Khatiwada, Harsha Kokel, Ibrahim Abdelaziz, Subhajit Chaudhury, Julian Dolby, Oktie Hassanzadeh, Zhenhan Huang, Tejaswini Pedapati, Horst Samulowitz, Kavitha Srinivas, 26 Aug 2025, TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes, https://arxiv.org/abs/2407.01619
Dmitry Eremeev, Gleb Bazhenov, Oleg Platonov, Artem Babenko, Liudmila Prokhorenkova, 28 Aug 2025, Turning Tabular Foundation Models into Graph Foundation Models, https://arxiv.org/abs/2508.20906
Yifei Yuan, Jiatong Li, Weijia Zhang, Mohammad Aliannejadi, Evangelos Kanoulas, Renjun Hu, 29 Aug 2025, Summarize-Exemplify-Reflect: Data-driven Insight Distillation Empowers LLMs for Few-shot Tabular Classification, https://arxiv.org/abs/2508.21561
Timur Sattarov, Marco Schreyer, Damian Borth, 29 Aug 2025, Federated Diffusion Modeling with Differential Privacy for Tabular Data Synthesis, https://arxiv.org/abs/2412.16083
G. Charbel N. Kindji (MALT), Elisa Fromont (MALT), Lina Maria Rojas-Barahona, Tanguy Urvoy, 27 Aug 2025, Robust Detection of Synthetic Tabular Data under Schema Variability, https://arxiv.org/abs/2509.00092
Renat Sergazinov, Shao-An Yin, 30 Aug 2025, Chunked TabPFN: Exact Training-Free In-Context Learning for Long-Context Tabular Data, https://arxiv.org/abs/2509.00326
Wei Zhang, Brian Barr, John Paisley, 31 Aug 2025, Tabular Diffusion Counterfactual Explanations, https://arxiv.org/abs/2509.00876
Wei Zhang, Brian Barr, John Paisley, 31 Aug 2025, An Explainable Gaussian Process Auto-encoder for Tabular Data, https://arxiv.org/abs/2509.00884
Yevhen Havrylenko, Meelis K\"a\"arik and Artur Tuttar, 2 Sep 2025, Amputation-imputation based generation of synthetic tabular data for ratemaking, https://arxiv.org/abs/2509.02171
Yael Itzhakev, Amit Giloni, Yuval Elovici, Asaf Shabtai, 1 Sep 2025, Addressing Key Challenges of Adversarial Attacks and Defenses in the Tabular Domain: A Methodological Framework for Coherence and Consistency, https://arxiv.org/abs/2412.07326
Taiga Saito, Yu Otake, Stephen Wu, 3 Sep 2025, Tabular foundation model for GEOAI benchmark problems BM/AirportSoilProperties/2/2025, https://arxiv.org/abs/2509.03191
Yundi Zhang, Paul Hager, Che Liu, Suprosanna Shit, Chen Chen, Daniel Rueckert, Jiazhen Pan, 3 Sep 2025, Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond, https://arxiv.org/abs/2504.13037
Haoyu Dong, Pengkun Zhang, Mingzhe Lu, Yanzhen Shen, Guolin Ke, 8 Sep 2025, MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML, https://arxiv.org/abs/2509.06806
Joshua Ward, Yuxuan Yang, Chi-Hua Wang, Guang Cheng, 2 Sep 2025, Ensembling Membership Inference Attacks Against Tabular Generative Models, https://arxiv.org/abs/2509.05350
Yuntao Du, Ninghui Li, 7 Sep 2025, Systematic Assessment of Tabular Data Synthesis, https://arxiv.org/abs/2402.06806
Adrian Hayler, Xingyue Huang, \.Ismail \.Ilkan Ceylan, Michael Bronstein, and Ben Finkelshtein, 8 Sep 2025, Of Graphs and Tables: Zero-Shot Node Classification with Tabular Foundation Models, https://arxiv.org/abs/2509.07143
Mingxuan Jiang, Yongxin Wang, Ziyue Dai, Yicun Liu, Hongyi Nie, Sen Liu, and Hongfeng Chai, 12 Sep 2025, Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes, https://arxiv.org/abs/2509.09960
Madhushan Ramalingam, 12 Sep 2025, Uncertainty-Aware Tabular Prediction: Evaluating VBLL-Enhanced TabPFN in Safety-Critical Medical Data, https://arxiv.org/abs/2509.10048
Spencer King, Zhilu Zhang, Ruofan Yu, Baris Coskun, Wei Ding, Qian Cui, 10 Sep 2025, Deep Context-Conditioned Anomaly Detection for Tabular Data, https://arxiv.org/abs/2509.09030
Nazia Nafis, Inaki Esnaola, Alvaro Martinez-Perez, Maria-Cruz Villa-Uriol, Venet Osmani, 11 Sep 2025, Critical Challenges and Guidelines in Evaluating Synthetic Tabular Data: A Systematic Review, https://arxiv.org/abs/2504.18544
Yuyang Jiang, Chacha Chen, Shengyuan Wang, Feng Li, Zecong Tang, Benjamin M. Mervak, Lydia Chelala, Christopher M Straus, Reve Chahine, Samuel G. Armato III, Chenhao Tan, 19 Sep 2025, CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation, https://arxiv.org/abs/2505.16325
Liam Ressel and Hamza A. A. Gardi, 15 Sep 2025, Linear Dimensionality Reduction for Word Embeddings in Tabular Data Classification, https://arxiv.org/abs/2509.12346
Eyal German, Daniel Samira, Yuval Elovici, Asaf Shabtai, 16 Sep 2025, MIA-EPT: Membership Inference Attack via Error Prediction for Tabular Data, https://arxiv.org/abs/2509.13046
Shriyank Somvanshi and Pavan Hebli and Gaurab Chhetri and Subasish Das, 14 Sep 2025, Tabular Data with Class Imbalance: Predicting Electric Vehicle Crash Severity with Pretrained Transformers (TabPFN) and Mamba-Based Models, https://arxiv.org/abs/2509.11449
Xiangjian Jiang, Nikola Simidjievski, Mateja Jamnik, 15 Sep 2025, TabStruct: Measuring Structural Fidelity of Tabular Data, https://arxiv.org/abs/2509.11950
Ashlesha Akella, Akshar Kaul, Krishnasuri Narayanam, Sameep Mehta, 11 Sep 2025, Quality Assessment of Tabular Data using Large Language Models and Code Generation, https://arxiv.org/abs/2509.10572
Chaeyun Ko, 15 Sep 2025, STRIDE: Subset-Free Functional Decomposition for XAI in Tabular Settings, https://arxiv.org/abs/2509.09070
Khawla Elhadri, J\"org Schl\"otterer, Christin Seifert, 10 Sep 2025, Towards Interpretable Deep Neural Networks for Tabular Data, https://arxiv.org/abs/2509.08617
G. Charbel N. Kindji (LACODAM), Lina Maria Rojas-Barahona, Elisa Fromont (LACODAM), Tanguy Urvoy, 17 Sep 2025, Tabular Data Generation Models: An In-Depth Survey and Performance Benchmarks with Extensive Tuning, https://arxiv.org/abs/2406.12945
Bart Pleiter, Behrad Tajalli, Stefanos Koffas, Gorka Abad, Jing Xu, Martha Larson, Stjepan Picek, 17 Sep 2025, Backdoor Attacks on Transformers for Tabular Data: An Empirical Study, https://arxiv.org/abs/2311.07550
Sanghyu Yoon, Dongmin Kim, Suhee Yoon, Ye Seul Sim, Seungdong Yoa, Hye-Seung Cho, Soonyoung Lee, Hankook Lee, Woohyung Lim, 2 Oct 2025, ReTabAD: A Benchmark for Restoring Semantic Context in Tabular Anomaly Detection, https://arxiv.org/abs/2510.02060
Yannis Belkhiter, Seshu Tirupathi, Giulio Zizzo, Sachin Sharma, John D. Kelleher, 2 Oct 2025, Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets, https://arxiv.org/abs/2510.01842
Aida Tayebi, Ali Khodabandeh Yalabadi, Mehdi Yazdani-Jahromi, Ozlem Ozmen Garibay, 2 Oct 2025, FairContrast: Enhancing Fairness through Contrastive learning and Customized Augmenting Methods on Tabular Data, https://arxiv.org/abs/2510.02017
Erkan Karabulut, Paul Groth, Victoria Degeler, 2 Oct 2025, Neurosymbolic Association Rule Mining from Tabular Data, https://arxiv.org/abs/2504.19354
Chih-Chuan Cheng, Yi-Ju Tseng, 14 Oct 2025, SG-XDEAT: Sparsity-Guided Cross-Dimensional and Cross-Encoding Attention with Target-Aware Conditioning in Tabular Learning, https://arxiv.org/abs/2510.12659
Vincent Ochs, Florentin Bieder, Sidaty el Hadramy, Paul Friedrich, Stephanie Taha-Mehlitz, Anas Taha, Philippe C. Cattin, 1 Oct 2025, TabINR: An Implicit Neural Representation Framework for Tabular Data Imputation, https://arxiv.org/abs/2510.01136
Emmanouil Panagiotou, Beno\^it Ronval, Arjun Roy, Ludwig Bothmann, Bernd Bischl, Siegfried Nijssen, Eirini Ntoutsi, 24 Sep 2025, TABFAIRGDT: A Fast Fair Tabular Data Generator using Autoregressive Decision Trees, https://arxiv.org/abs/2509.19927
Erkan Karabulut, Daniel Daza, Paul Groth, Victoria Degeler, 24 Sep 2025, Discovering Association Rules in High-Dimensional Small Tabular Data, https://arxiv.org/abs/2509.20113
Jyotika Singh, Weiyi Sun, Amit Agarwal, Viji Krishnamurthy, Yassine Benajiba, Sujith Ravi, Dan Roth, 27 Oct 2025, Can LLMs Narrate Tabular Data? An Evaluation Framework for Natural Language Representations of Text-to-SQL System Outputs, https://arxiv.org/abs/2510.23854
Matteo Silvestri, Flavio Giorgi, Fabrizio Silvestri, Gabriele Tolomei, 23 Oct 2025, Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models, https://arxiv.org/abs/2510.20351
Zhipeng He, Alexander Stevens, Chun Ouyang, Johannes De Smedt, Alistair Barros, Catarina Moreira, 23 Oct 2025, Crafting Imperceptible On-Manifold Adversarial Attacks for Tabular Data, https://arxiv.org/abs/2507.10998
Pengxiang Cai, Zihao Gao, Jintai Chen, 23 Oct 2025, TabR1: Taming GRPO for tabular reasoning LLMs, https://arxiv.org/abs/2510.17385
Peini Cheng and Amir Bahmani, 16 Oct 2025, Membership Inference over Diffusion-models-based Synthetic Tabular Data, https://arxiv.org/abs/2510.16037
Zhining Liu, Zihao Li, Ze Yang, Tianxin Wei, Jian Kang, Yada Zhu, Hendrik Hamann, Jingrui He, Hanghang Tong, 20 Oct 2025, CLIMB: Class-imbalanced Learning Benchmark on Tabular Data, https://arxiv.org/abs/2505.17451
Josias K. Moukpe, Philip K. Chan, Ming Zhang, 19 Sep 2025, Highly Imbalanced Regression with Tabular Data in SEP and Other Applications, https://arxiv.org/abs/2509.16339
Sivan Sarafian, Yehudit Aperstein, 19 Sep 2025, Improving Deep Tabular Learning, https://arxiv.org/abs/2509.16354
Tianchun Li, Tianci Liu, Xingchen Wang, Rongzhe Wei, Pan Li, Lu Su, Jing Gao, 20 Sep 2025, Towards Universal Debiasing for Language Models-based Tabular Data Generation, https://arxiv.org/abs/2509.16475
Michelangelo Conserva, Remo Sasso, Paulo Rauber, 21 Sep 2025, On the Limits of Tabular Hardness Metrics for Deep RL: A Study with the Pharos Benchmark, https://arxiv.org/abs/2509.17092
Miao Li, Phuc Nguyen, Christopher Tam, Alexandra Morgan, Kenneth Ge, Rahul Bansal, Linzi Yu, Rima Arnaout, Ramy Arnaout, 22 Sep 2025, GEM-T: Generative Tabular Data via Fitting Moments, https://arxiv.org/abs/2509.17752
Julius Vetter, Manuel Gloeckler, Daniel Gedon, Jakob H. Macke, 27 Oct 2025, Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models, https://arxiv.org/abs/2504.17660
George Yakushev, Alina Shutova, Ivan Rubachev, Renat Sergazinov, Artem Babenko, 25 Sep 2025, Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data, https://arxiv.org/abs/2509.21465
Mohammadreza Ghaffarzadeh-Esfahani, Mahdi Ghaffarzadeh-Esfahani, Arian Salahi-Niri, Hossein Toreyhi, Zahra Atf, Amirali Mohsenzadeh-Kermani, Mahshad Sarikhani, Zohreh Tajabadi, Fatemeh Shojaeian, Mohammad Hassan Bagheri, Aydin Feyzi, Mohammadamin Tarighatpayma, Narges Gazmeh, Fateme Heydari, Hossein Afshar, Amirreza Allahgholipour, Farid Alimardani, Ameneh Salehi, Naghmeh Asadimanesh, Mohammad Amin Khalafi, Hadis Shabanipour, Ali Moradi, Sajjad Hossein Zadeh, Omid Yazdani, Romina Esbati, Moozhan Maleki, Danial Samiei Nasr, Amirali Soheili, Hossein Majlesi, Saba Shahsavan, Alireza Soheilipour, Nooshin Goudarzi, Erfan Taherifard, Hamidreza Hatamabadi, Jamil S Samaan, Thomas Savage, Ankit Sakhuja, Ali Soroush, Girish Nadkarni, Ilad Alavi Darazam, Mohamad Amin Pourhoseingholi, Seyed Amir Ahmad Safavi-Naini, 26 Sep 2025, Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular Data, https://arxiv.org/abs/2409.02136
M. Sajid, Mushir Akhtar, A. Quadir, M. Tanveer, 6 Oct 2025, RVFL-X: A Novel Randomized Network Based on Complex Transformed Real-Valued Tabular Datasets, https://arxiv.org/abs/2510.06278
Jie Li, Andrew McCarthy, Zhizhuo Zhang, Stephen Young, 2 Oct 2025, Uncertainty-Guided Model Selection for Tabular Foundation Models in Biomolecule Efficacy Prediction, https://arxiv.org/abs/2510.02476
Maria F. Davila R and Azizjon Turaev and Wolfram Wingerath, 25 Sep 2025, Measuring LLM Sensitivity in Transformer-based Tabular Data Synthesis, https://arxiv.org/abs/2509.20768
Yuchen Shen, Haomin Wen and Leman Akoglu, 24 Sep 2025, FoMo-0D: A Foundation Model for Zero-shot Tabular Outlier Detection, https://arxiv.org/abs/2409.05672
Ange-Cl\'ement Akazan and Verlon Roel Mbingui, 27 Sep 2025, Splines-Based Feature Importance in Kolmogorov-Arnold Networks: A Framework for Supervised Tabular Data Dimensionality Reduction, https://arxiv.org/abs/2509.23366
Diane Kim and Minh Nguyen Nhat To and Sherif Abdalla and Teresa S.M. Tsang and Purang Abolmaesumi and and Christina Luong, 28 Sep 2025, TREAT-Net: Tabular-Referenced Echocardiography Analysis for Acute Coronary Syndrome Treatment Prediction, https://arxiv.org/abs/2509.23999
Shi-Yu Tian, Zhi Zhou, Wei Dong, Kun-Yang Yu, Ming Yang, Zi-Jian Cheng, Lan-Zhe Guo, Yu-Feng Li, 27 Sep 2025, TabularGSM: Understanding the Limitations of LLMs in Tabular Math Reasoning, https://arxiv.org/abs/2505.19563
Kimberly Villalobos Carballo, Liangyuan Na, Yu Ma, L\'eonard Boussioux, Cynthia Zeng, Luis R. Soenksen, Dimitris Bertsimas, 26 Sep 2025, TabText: Language-Based Representations of Tabular Health Data for Predictive Modelling, https://arxiv.org/abs/2206.10381
Elias Dubbeldam, Reza Mohammadi, Marit Schoonhoven, S. Ilker Birbil, 6 Oct 2025, Graph-based Tabular Deep Learning Should Learn Feature Interactions, Not Just Make Predictions, https://arxiv.org/abs/2510.04543
Yuandou Wang, Filip Gunnarsson, Rihan Hai, 6 Oct 2025, IMLP: An Energy-Efficient Continual Learning Method for Tabular Data Streams, https://arxiv.org/abs/2510.04660
Guri Zab\"ergja, Arlind Kadra, Christian M. M. Frey, Josif Grabocka, 5 Oct 2025, Tabular Data: Is Deep Learning all you need?, https://arxiv.org/abs/2402.03970
Yihao Ang, Peicheng Yao, Yifan Bao, Yushuo Feng, Qiang Huang, Anthony K. H. Tung, Zhiyong Huang, 9 Oct 2025, RFOD: Random Forest-based Outlier Detection for Tabular Data, https://arxiv.org/abs/2510.08747
Yuting Yang, Gang Mei, Zhengjing Ma, Nengxiong Xu, Jianbing Peng, 10 Oct 2025, Simple and Robust Forecasting of Spatiotemporally Correlated Small Earth Data with A Tabular Foundation Model, https://arxiv.org/abs/2510.08920
Tao Feng, Lizhen Qu, Niket Tandon, Gholamreza Haffari, 10 Oct 2025, IRIS: An Iterative and Integrated Framework for Verifiable Causal Discovery in the Absence of Tabular Data, https://arxiv.org/abs/2510.09217
Xiyuan Zhang, Danielle C. Maddix, Junming Yin, Nick Erickson, Abdul Fatir Ansari, Boran Han, Shuai Zhang, Leman Akoglu, Christos Faloutsos, Michael W. Mahoney, Cuixiong Hu, Huzefa Rangwala, George Karypis, Bernie Wang, 24 Oct 2025, Mitra: Mixed Synthetic Priors for Enhancing Tabular Foundation Models, https://arxiv.org/abs/2510.21204
Hossein Amiri, Mohammad Hashemi, Andreas Z\"ufle, 24 Oct 2025, World-POI: Global Point-of-Interest Data Enriched from Foursquare and OpenStreetMap as Tabular and Graph Data, https://arxiv.org/abs/2510.21342
Patryk Marsza{\l}ek, Tomasz Ku\'smierczyk, Witold Wydma\'nski, Jacek Tabor, Marek \'Smieja, 24 Oct 2025, ZEUS: Zero-shot Embeddings for Unsupervised Separation of Tabular Data, https://arxiv.org/abs/2505.10704
Zhenjiang Fan, Zengyi Qin, Yuanning Zheng, Bo Xiong, Summer Han, 10 Oct 2025, CALM: A Causal Analysis Language Model for Tabular Data in Complex Systems with Local Scores, Conditional Independence Tests, and Relation Attributes, https://arxiv.org/abs/2510.09846
Md Ibrahim Shikder Mahin, Md Shamsul Arefin and Md Tanvir Hasan, 12 Oct 2025, A Hybrid Machine Learning Approach for Synthetic Data Generation with Post Hoc Calibration for Clinical Tabular Datasets, https://arxiv.org/abs/2510.10513
Kyla Chasalow, Skyler Wu, Susan Murphy, 12 Oct 2025, Missing Data Multiple Imputation for Tabular Q-Learning in Online RL, https://arxiv.org/abs/2510.10709
Zhipeng He, Chun Ouyang, Lijie Wen, Cong Liu, Catarina Moreira, 13 Oct 2025, TabAttackBench: A Benchmark for Adversarial Attacks on Tabular Data, https://arxiv.org/abs/2505.21027
S.R. Eshwar, Gugan Thoppe, Ananyabrata Barua, Aditya Gopalan, Gal Dalal, 11 Oct 2025, Monotone and Conservative Policy Iteration Beyond the Tabular Case, https://arxiv.org/abs/2506.07134
Prabhant Singh, Pieter Gijsbers, Elif Ceren Gok Yildirim, Murat Onur Yildirim, Joaquin Vanschoren, 13 Oct 2025, Automated Machine Learning for Unsupervised Tabular Tasks, https://arxiv.org/abs/2510.07569
Justus Viga, Penelope Mueck, Alexander L\"oser, and Torben Weis, 9 Oct 2025, FuelCast: Benchmarking Tabular and Temporal Models for Ship Fuel Consumption, https://arxiv.org/abs/2510.08217
Yebin Lim, Susik Yoon, 20 Sep 2025, Multi-level Diagnosis and Evaluation for Robust Tabular Feature Engineering with Large Language Models, https://arxiv.org/abs/2509.25207
Jiaru Zou, Soumya Roy, Vinay Kumar Verma, Ziyi Wang, David Wipf, Pan Lu, Sumit Negi, James Zou, Jingrui He, 7 Oct 2025, TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning, https://arxiv.org/abs/2510.06217
Praneeth Vepakomma, Kaustubh Ponkshe, 7 Oct 2025, Power Mechanism: Private Tabular Representation Release for Model Agnostic Consumption, https://arxiv.org/abs/2510.05581
Felix Koch, Marcel Wever, Fabian Raisch, Benjamin Tischler, 16 Oct 2025, State-Space Models for Tabular Prior-Data Fitted Networks, https://arxiv.org/abs/2510.14573

Microsoft Excel

Use of Microsoft Excel with AI:

Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang, 12 Jul 2024, SpreadsheetLLM: Encoding Spreadsheets for Large Language Models, https://arxiv.org/abs/2407.09025
David Gewirtz, Sept. 16, 2024, Why natural language AI scripting in Microsoft Excel could be a game changer. What if you could run advanced Excel analyses with no coding skills? Here's how Microsoft's Copilot in Excel could use Python to allow you to do just that, https://www.zdnet.com/article/why-natural-language-ai-scripting-in-microsoft-excel-could-be-a-game-changer/
Microsoft, Aug 22 2023, Announcing Python in Excel: Combining the power of Python and the flexibility of Excel, https://techcommunity.microsoft.com/t5/excel-blog/announcing-python-in-excel-combining-the-power-of-python-and-the/ba-p/3893439
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Cristian Challu, Oct 07, 2024, 5 ways companies can use time-series forecasting, https://www.infoworld.com/article/3543468/5-ways-companies-can-use-time-series-forecasting.html
LangChain, Aug 10, 2024, UX for Agents, Part 3: Spreadsheet, Generative, and Collaborative UI/UX, https://blog.langchain.dev/ux-for-agents-part-3/
Péter Harang, Dec 21, 2024, Building a Transformer in Excel. Pico-scale reference implementation of everyone’s favourite LLM architecture, for demostration purposes, https://medium.com/@harangpeter/building-a-transformer-in-excel-467a4a27608d
Rebecca Szkutak, August 18, 2025, Why Paradigm built a spreadsheet with an AI agent in every cell, https://techcrunch.com/2025/08/18/why-paradigm-built-a-spreadsheet-with-an-ai-agent-in-every-cell/
Ananya Singha, Harshita Sahijwani, Walt Williams, Emmanuel Aboah Boateng, Nick Hausman, Miguel Di Luca, Keegan Choudhury, Chaya Binet, Vu Le, Tianwei Chen, Oryan Rokeah Chen, Sulaiman Vesal, Sadid Hasan, 14 Aug 2025, Benchmark Dataset Generation and Evaluation for Excel Formula Repair with LLMs, https://arxiv.org/abs/2508.11715
Jenny T. Liang, Aayush Kumar, Yasharth Bajpai, Sumit Gulwani, Vu Le, Chris Parnin, Arjun Radhakrishna, Ashish Tiwari, Emerson Murphy-Hill, Guastavo Soares, 25 Aug 2025, TableTalk: Scaffolding Spreadsheet Development with a Language Agent, https://arxiv.org/abs/2502.09787
Wei Han, Geng Zhan, Sicheng Yu, Chenyu Wang, Bryan Hooi, 7 Sep 2025, From Long to Short: LLMs Excel at Trimming Own Reasoning Chains, https://arxiv.org/abs/2509.06174
Qin Chen, Yuanyi Ren, Xiaojun Ma, Mugeng Liu, Han Shi, and Dongmei Zhang, 9 Sep 2025, SheetDesigner: MLLM-Powered Spreadsheet Layout Generation with Rule-Based and Vision-Based Reflection, https://arxiv.org/abs/2509.07473
Amila Indika, Igor Molybog, 22 Oct 2025, SODBench: A Large Language Model Approach to Documenting Spreadsheet Operations, https://arxiv.org/abs/2510.19864

Copilot Apps

Research on "copilot" types of AI applications:

Chris Parnin, Gustavo Soares, Rahul Pandita, Sumit Gulwani, Jessica Rich, Austin Z. Henley, 21 Dec 2023, Building Your Own Product Copilot: Challenges, Opportunities, and Needs, https://arxiv.org/abs/2312.14231
Jared Spataro, Sep 16, 2024, Microsoft 365 Copilot Wave 2: Pages, Python in Excel, and agents, Microsoft blog, https://www.microsoft.com/en-us/microsoft-365/blog/2024/09/16/microsoft-365-copilot-wave-2-pages-python-in-excel-and-agents/
Jeremy Kahn, September 17, 2024, Microsoft introduces AI agents and updates to Copilot 365 apps as the war to make AI more useful intensifies, https://fortune.com/2024/09/16/microsoft-launches-ai-agents-updates-to-copilot-365-apps/
Tanay Jaipuria, Nov 12, 2024, Big Tech x Generative AI Q3 '24 Update (Part 2), How Meta and Microsoft's Generative AI investments are going so far, https://www.tanayj.com/p/big-tech-x-generative-ai-q3-24-update
Jason Redmond, Jan 2025, Microsoft CEO Nadella forms new AI group to build and run apps for customers. Microsoft hired DeepMind co-founder Mustafa Suleyman to lead Copilot AI initiatives last year. https://www.nbcnews.com/business/business-news/microsoft-ceo-nadella-forms-new-ai-group-build-run-apps-customers-rcna187506

AI Operating System

An AI operating system, or AI OS, is the idea of building an entire system on AI components. This is a generalization beyond just an AI framework or AI platform.

Research on an AI OS:

Simeon Emanuilov, Apr 4, 2024 LLM agent operating system (AIOS) and the future of LLM-powered agents, https://medium.com/@simeon.emanuilov/llm-agent-operating-system-aios-and-the-future-of-llm-powered-agents-3d08b4e91c34 https://unfoldai.com/aios-llm-powered-agents/
Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez, 12 Feb 2024 (v2), MemGPT: Towards LLMs as Operating Systems, https://arxiv.org/abs/2310.08560 https://memgpt.ai/
Sean Michael Kerner, September 25, 2024, How Intuit plans to use agentic AI to automate complex business tasks, https://venturebeat.com/ai/how-intuit-plans-to-use-agentic-ai-to-automate-complex-business-tasks/
Nicholas Grous, Andrew Kim, June 04, 2024, Generative AI: A New Consumer Operating System, https://www.ark-invest.com/articles/analyst-research/generative-ai-a-new-consumer-operating-system
Ignacio de Gregorio Noblejas, December 15, 2024, The AI Trillion-Dollar Product, https://thewhitebox.beehiiv.com/p/the-ai-trillion-dollar-product

Security Credential Management

Security credential management is an important part of productionizing AI apps. This includes both user login passwords and the security keys of commercial APIs.

Papers on security credentials: