Hacker News
new
top
best
ask
show
job
APIEval-20: A Benchmark for Black-Box API Test Suite Generation
(
huggingface.co
)
4 points
by
AkshatVirmani
6 hours ago
3 comments
riyajoshi
6 hours ago
Nice to see a benchmark in this space especially with black-box constraints.
akshay_93
5 hours ago
like that the scoring bias is toward bug detection & not test generation only. generating lots of tests with AI is easy but that doesn't necessarily mean they're good
saikia_
6 hours ago
curious.. let me see if this works for our internal setup