[2510.04607] From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents
About this article
Abstract page for arXiv paper 2510.04607: From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents
Computer Science > Operating Systems arXiv:2510.04607 (cs) [Submitted on 6 Oct 2025 (v1), last revised 25 Mar 2026 (this version, v2)] Title:From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents Authors:Yuan Wang, Mingyu Li, Haibo Chen View a PDF of the paper titled From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents, by Yuan Wang and 1 other authors View PDF HTML (experimental) Abstract:Computer-use agents (CUAs) powered by large language models (LLMs) have emerged as a promising approach to automating computer tasks, yet they struggle with the existing human-oriented OS interfaces - graphical user interfaces (GUIs). GUIs force LLMs to decompose high-level goals into lengthy, error-prone sequences of fine-grained actions, resulting in low success rates and an excessive number of LLM calls. We propose Declarative Model Interface (DMI), an abstraction that transforms existing GUIs into three declarative primitives: access, state, and observation, thereby providing novel OS interfaces tailored for LLM agents. Our key idea is policy-mechanism separation: LLMs focus on high-level semantic planning (policy) while DMI handles low-level navigation and interaction (mechanism). DMI does not require modifying the application source code or relying on application programming interfaces (APIs). We evaluate DMI with Microsoft Office Suite (Word, PowerPoint, Excel) on Windows. Integrating DMI int...