ClawVM Harness-Managed Virtual Memory (Rafique and Bindschaedler, 2026)

URL: https://arxiv.org/abs/2604.10352

The paper proposes ClawVM, a runtime layer interposed within the agent harness that enforces three semantic properties at lifecycle boundaries: atomicity (three-phase staged-updates plus validation plus scoped-commit transaction preventing destructive overwrites), persistence guarantees (minimum-fidelity invariants ensure critical state survives destruction), and capability boundaries (pages carry scope metadata; the writeback protocol checks requested scope is allowed for the active principal). The load-bearing architectural claim, quoted: "when a runtime manages a fast, scarce tier alongside slow, durable storage, the answer is virtual memory, not best-effort heuristics." And: "Operating systems solved this decades ago... State management is reframed as an OS-level concern, not agent policy."

Adopted

ClawVM is the closest external evidence for this graph's [[Agent Runtimes Require Substrate Primitives, Not External Glue]] diagnostic, arrived at independently from the LLM-agents direction rather than the DGD-lineage direction. The empirical claims (ClawVM eliminates "all policy-controllable faults" producing up to 67.8 mean faults to zero across budgets; 100 percent task success vs 76.7 percent for practitioner-configured baselines at tight budget; less than 50 microseconds median policy-engine overhead per turn) validate the substrate-properties-belong-in-the-substrate position empirically. The paper's positioning vs MemGPT, A-MEM, and Mem0 is also adopted: MemGPT "leaves residency and writeback to model discretion; ClawVM moves both to the harness with replayable enforcement."

Not adopted (yet)

ClawVM is harness-LAYER. It enforces substrate properties on top of a Python plus storage substrate that does not natively carry them; it is a thin enforcement layer rather than a runtime where the properties are primitives. eOS Continuum's substrate-LAYER position takes the same diagnosis to its conclusion -- instead of building an OS-shaped enforcement layer on Python, the runtime IS the substrate, with [[Runtime State Is Persistent by Default, Not by Application Discipline|orthogonal persistence]] as a foundational primitive and the other seven primitives derived from it. The capability-OS lineage (KeyKOS, EROS, CapROS, Coyotos) and the DGD-on-iChat production-soak credential remain this project's specific architectural ground.

Sources

Relations