Skip to content

Scenarios

Scenarios are YAML files in scenarios/. Required fields: scenario (id), starting_state, task, policy, expectations.

The same scenario is replayed against every candidate agent so judgment is apples-to-apples.