In our experience using step-scoped agents, the most useful “receipt” wasn’t a log, but the fact that each step had explicitly declared inputs, tools, and outputs, and that authority was destroyed before the next step ran. If something went wrong, you could point to the exact step boundary where the assumption or permission existed.
Signed receipts feel like a strong complement to that model: step-level determinism plus post-hoc verifiability. Curious whether you see this being used more for human audit trails, or as machine-checkable constraints that downstream agents must validate before acting.
1. Crash edge case: If an agent executes a side-effect and dies before signing the receipt, is that action orphaned? Any WAL-style intent/completion model?
2. Multi-step workflows: Do receipts chain natively (parent pointers/Merkle) or via external linking? (I see storage/ledgers are out of scope, but curious about the linkage design.)
The negative proof angle (proving AI didn't touch prod) is compelling for compliance.