Hacker News
new
top
best
ask
show
job
Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents
(
thinkwright.ai
)
2 points
by
oceanwaves
12 hours ago
1 comment
12 hours ago
undefined