Hacker News
new
top
best
ask
show
job
LLM-test-kit – Test consistency, latency, cost and behavior of LLM apps
(
github.com
)
1 point
by
muskanjo
6 hours ago
2 comments
muskanjo
6 hours ago
I'm the author. Happy to answer questions about the methodology or how the consistency scoring works. Would love feedback on what tests would be most useful to add.
muskanjo
6 hours ago
[flagged]