Concepts
A scenario is a YAML file with a starting state, a task, a policy, and expectations. A run is one agent attempting one scenario, recorded as a branch of an AgentStateGraph store.
Every decision the agent makes is recorded as a decision commit. A judge agent scores the runs against each other. The complete bundle is then sealed as an epoch that is tamper-evident and independently replayable.
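
To make these terms concrete, here is a minimal Python sketch of how they could relate. It is not the AgentStateGraph API: the class names, field names, and the `seal_epoch` helper are hypothetical, and the use of SHA-256 over canonical JSON is an assumption chosen only to illustrate what "tamper-evident" can mean in practice.

```python
import hashlib
import json
from dataclasses import asdict, dataclass, field


def _digest(obj) -> str:
    """SHA-256 of a canonical (sorted-key) JSON encoding of obj."""
    return hashlib.sha256(json.dumps(obj, sort_keys=True).encode()).hexdigest()


@dataclass
class Scenario:
    # The four parts named in the text; the field types are assumptions.
    starting_state: dict
    task: str
    policy: dict
    expectations: list


@dataclass
class Run:
    # One agent attempting one scenario, modeled as a named branch of commits.
    agent: str
    branch: str
    commits: list = field(default_factory=list)

    def decide(self, decision: str) -> str:
        """Append a decision commit that points at the previous commit."""
        parent = self.commits[-1]["id"] if self.commits else None
        body = {"parent": parent, "decision": decision}
        commit_id = _digest(body)
        self.commits.append({"id": commit_id, **body})
        return commit_id


def seal_epoch(scenario: Scenario, runs: list, scores: dict) -> str:
    """Hash the whole bundle; any later change produces a different seal."""
    bundle = {
        "scenario": asdict(scenario),
        "runs": [asdict(run) for run in runs],
        "scores": scores,  # hypothetical shape for the judge's output
    }
    return _digest(bundle)
```

In this sketch, anyone holding the bundle can recompute the seal and compare it with the published value, which is one simple way a bundle can be checked without trusting whoever produced it. Replaying a run would additionally require re-executing the agent from the scenario's starting state, which the sketch does not attempt.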