Annotation queues are a great discovery layer. The enforcement question is separate — once the failure mode is named, what stops the model from producing it again? Evals catch it at runtime, hooks prevent it at the trust boundary. Different layer of the pipeline.