Towards Resource-Efficient Compound AI Systems (Chaudhry et al., 2025)

URL: https://arxiv.org/abs/2501.16634

The paper proposes a vision for resource-efficient compound AI systems through a declarative workflow programming model paired with an adaptive runtime system for dynamic scheduling and resource-aware decision-making. The contribution is the runtime framing: compound AI systems need a runtime that handles scheduling, resource allocation, and decision-making across components -- not just an orchestration layer over external services.

Adopted

The paper's runtime framing arrives close to this graph's substrate-vs-glue diagnostic. Compound AI systems integrating multiple components (LMs, retrievers, external tools) need a runtime that owns the scheduling and resource decisions; that runtime is in substrate territory, not orchestration territory. The eOS Continuum substrate goes further: the runtime owns not just scheduling and resource decisions but also state, atomicity, capability separation, multi-agent coherence, hot reload, sandboxed code load, asynchronous events, and direct state introspection.

Not adopted (yet)

The paper's runtime is concerned with scheduling and resource allocation; it does not name the eight runtime primitives this graph's [[Agent Runtimes Require Substrate Primitives, Not External Glue]] Conviction names. The runtime-LAYER framing is right; the specific substrate commitments (orthogonal persistence as foundational, single-coherence-domain by design, capability-OS-tradition lineage) are this project's contribution.

Sources

Relations