The reason people go custom is to craft very good instructions and tools, something a machine is not capable of.
Head-to-Head
┌──────────────┬─────────┬─────────────┬────────────┐
│ Metric │ Opty │ Traditional │ Ratio │
├──────────────┼─────────┼─────────────┼────────────┤
│ Input tokens │ ~13,500 │ ~39,408 │ 2.9x fewer │
├──────────────┼─────────┼─────────────┼────────────┤
│ Tool calls │ 21 │ 61 │ 2.9x fewer │
├──────────────┼─────────┼─────────────┼────────────┤
│ Round trips │ 5 │ 9 │ 1.8x fewer │
└──────────────┴─────────┴─────────────┴────────────┘
I had it run a separate analysis, traditional vs. Opty, and count the actual tool calls and input tokens for each. My prompt was basically, "do a full analysis of this entire codebase."
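As a quick sanity check on the Ratio column, here is a minimal Python sketch that recomputes each ratio as the traditional figure divided by the Opty figure. The numbers are copied from the table above; the names `MEASUREMENTS`, `reduction_ratio`, and `summarize` are made up for illustration and are not part of Opty.

```python
# Figures copied from the Head-to-Head table as (opty, traditional) pairs.
# The token counts are the approximate values reported in the table.
MEASUREMENTS = {
    "Input tokens": (13_500, 39_408),
    "Tool calls": (21, 61),
    "Round trips": (5, 9),
}


def reduction_ratio(opty, traditional):
    """Return how many times fewer units the Opty run used."""
    return traditional / opty


def summarize(measurements):
    """Map each metric name to its ratio, rounded to one decimal place."""
    return {
        metric: round(reduction_ratio(opty, traditional), 1)
        for metric, (opty, traditional) in measurements.items()
    }


if __name__ == "__main__":
    for metric, ratio in summarize(MEASUREMENTS).items():
        print(f"{metric}: {ratio}x fewer")
```

Rounded to one decimal place, these come out to 2.9, 2.9, and 1.8, matching the table. Keep in mind this is a single run on a single prompt, so the ratios are one data point and not a benchmark.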