Compound AI Systems Optimization Survey (Lee et al., 2025)

URL: https://arxiv.org/abs/2506.08234

The paper surveys the recent compound-AI-systems literature -- the design and optimization of complex AI workflows where compound systems perform sophisticated tasks through integration of multiple components (multiple LMs, retrievers, tool calls, orchestration logic). The survey covers optimization methods, identifies open challenges, and points at future directions for the compound-AI-systems research community.

Adopted

The compound-AI-systems framing (popularized by Berkeley's "Shift from Models to Compound AI Systems" essay, Zaharia et al., 2024) is the higher-level abstraction over the harness-as-driver pattern this graph's [[Agent Harnesses Drive the Runtime, Not the Reverse]] Conviction names. Compound AI systems orchestrate multiple LMs and tools as a graph of operations; the survey's enumeration of optimization concerns (cost, latency, accuracy, model selection) is the kind of orchestration concern the harness owns. eOS Continuum's substrate is what makes those graphs of operations tractable: atomic operations across the graph, capability separation between components, persistent state shared across invocations.

Not adopted (yet)

The survey assumes a substrate; few of the compound-AI-systems papers it covers name what makes the substrate well-formed. eOS Continuum's substrate-LAYER position is upstream of the survey's optimization concerns -- the optimization questions become tractable once the substrate carries atomicity, persistence, and capability separation as primitives. Adopting the compound-AI-systems vocabulary verbatim would obscure the substrate-LAYER distinction.

Sources

Relations