But mostly I just use Claude now since I already have a max subscription. I've done hybrid setups in the past where I had Claude do the hard stuff and a local LLM do the code monkey stuff, but with the right optimizations I'm no longer hitting the Claude token ceiling.