4 pointsby miguelspizza14 hours ago2 comments

jtbetz2211 hours ago
Maintaining (and, ideally, improving!) code quality is, IMO, the biggest open problem in agentic development.
I've been coming at this from a slightly different angle, building a system that encodes checks that run against code authored by Claude, with escalating levels of enforcement.
It's still really early, but, among early users, we see that about 40% of agent turns produce code that violates the guidance in the project's own CLAUDE.md. We catch the violations at the end of the turn and direct Claude to correct them.
Still looking for early users and feedback. http://getcaliper.dev
WalterGR12 hours ago
Do you have examples that show concrete improvements when using this?