Thank you for sharing this. I'm going to implement this judgement loop in my AI agenting runtime. Worth to try on local LLM with cloud high-end judge model in a loop + QA model to build and run tests as the second judge. I think both judge and QA agents should have an access to the initial RFC requirements. Seems to be a good approach to save API tokens iterating local LLM as code-writer.