There are quota management tools out there (e.g. 9router) but all the ones I tested require running a proxy. Subtask is a pure Claude Code plugin: just bash and markdown. No proxy, no config, no manual invocation.
It hooks into PreToolUse, watches quota pressure, and denies calls when Claude is running hot — telling it to delegate via /subtask:delegate instead. Code changes go to Codex first, web research to Gemini. Requires Codex CLI and Gemini CLI installed and logged in.
We just started testing internally — not battle-tested yet. Hope someone finds it helpful. Would love feedback if you give it a try.